TY - JOUR
T1 - Population distribution models
T2 - Species distributions are better modeled using biologically relevant data partitions
AU - Gonzalez, Sergio C.
AU - Soto-Centeno, J. Angel
AU - Reed, David L.
N1 - Funding Information:
We thank R. Fletcher for comments on an early draft of this manuscript. JAS-C thanks RD Barrilito for support. This work was supported by grants to DLR from the University of Florida Research Opportunity SEED Fund and the National Science Foundation (DEB 0717165 and DEB 0845392). Publication of this article was funded in part by the University of Florida Open-Access Publishing Fund.
PY - 2011/9/19
Y1 - 2011/9/19
N2 - Background: Predicting the geographic distribution of widespread species through modeling is problematic for several reasons including high rates of omission errors. One potential source of error for modeling widespread species is that subspecies and/or races of species are frequently pooled for analyses, which may mask biologically relevant spatial variation within the distribution of a single widespread species. We contrast a presence-only maximum entropy model for the widely distributed oldfield mouse (Peromyscus polionotus) that includes all available presence locations for this species, with two composite maximum entropy models. The composite models either subdivided the total species distribution into four geographic quadrants or by fifteen subspecies to capture spatially relevant variation in P. polionotus distributions.Results: Despite high Area Under the ROC Curve (AUC) values for all models, the composite species distribution model of P. polionotus generated from individual subspecies models represented the known distribution of the species much better than did the models produced by partitioning data into geographic quadrants or modeling the whole species as a single unit.Conclusions: Because the AUC values failed to describe the differences in the predictability of the three modeling strategies, we suggest using omission curves in addition to AUC values to assess model performance. Dividing the data of a widespread species into biologically relevant partitions greatly increased the performance of our distribution model; therefore, this approach may prove to be quite practical and informative for a wide range of modeling applications.
AB - Background: Predicting the geographic distribution of widespread species through modeling is problematic for several reasons including high rates of omission errors. One potential source of error for modeling widespread species is that subspecies and/or races of species are frequently pooled for analyses, which may mask biologically relevant spatial variation within the distribution of a single widespread species. We contrast a presence-only maximum entropy model for the widely distributed oldfield mouse (Peromyscus polionotus) that includes all available presence locations for this species, with two composite maximum entropy models. The composite models either subdivided the total species distribution into four geographic quadrants or by fifteen subspecies to capture spatially relevant variation in P. polionotus distributions.Results: Despite high Area Under the ROC Curve (AUC) values for all models, the composite species distribution model of P. polionotus generated from individual subspecies models represented the known distribution of the species much better than did the models produced by partitioning data into geographic quadrants or modeling the whole species as a single unit.Conclusions: Because the AUC values failed to describe the differences in the predictability of the three modeling strategies, we suggest using omission curves in addition to AUC values to assess model performance. Dividing the data of a widespread species into biologically relevant partitions greatly increased the performance of our distribution model; therefore, this approach may prove to be quite practical and informative for a wide range of modeling applications.
UR - http://www.scopus.com/inward/record.url?scp=80052849534&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80052849534&partnerID=8YFLogxK
U2 - 10.1186/1472-6785-11-20
DO - 10.1186/1472-6785-11-20
M3 - Article
C2 - 21929792
AN - SCOPUS:80052849534
SN - 1472-6785
VL - 11
JO - BMC Ecology
JF - BMC Ecology
M1 - 20
ER -