TY - JOUR
T1 - A comparison of three empirically based, spatially explicit predictive models of residential soil Pb concentrations in Baltimore, Maryland, USA
T2 - Understanding the variability within cities
AU - Schwarz, Kirsten
AU - Weathers, Kathleen C.
AU - Pickett, Steward T.A.
AU - Lathrop, Richard G.
AU - Pouyat, Richard V.
AU - Cadenasso, Mary L.
N1 - Funding Information:
Acknowledgments We are especially grateful to the homeowners for access to their property and to the University of California, Davis for use of the XRF. The building footprint dataset was used with permission under a license agreement with Baltimore City. Our understanding of GIS and spatial statistics greatly benefited from conversations with John Bognar, Dr. Adele Cutler, Amanda Elliot Lindsey, Dr. Elizabeth Freeman, Scott Haag, David Lewis, Dr. Zewei Maio, Dr. Samuel Simkin, Jim Trimble, and Dr. Weiqi Zhou. This work is a contribution to the long-term ecological research program (a program of the National Science Foundation) and the Cary Institute of Ecosystem Studies and was supported by NSF grants DEB 042376 and 0808418.
PY - 2013/8
Y1 - 2013/8
N2 - In many older US cities, lead (Pb) contamination of residential soil is widespread; however, contamination is not uniform. Empirically based, spatially explicit models can assist city agencies in addressing this important public health concern by identifying areas predicted to exceed public health targets for soil Pb contamination. Sampling of 61 residential properties in Baltimore City using field portable X-ray fluorescence revealed that 53 % had soil Pb that exceeded the USEPA reportable limit of 400 ppm. These data were used as the input to three different spatially explicit models: a traditional general linear model (GLM), and two machine learning techniques: classification and regression trees (CART) and Random Forests (RF). The GLM revealed that housing age, distance to road, distance to building, and the interactions between variables explained 38 % of the variation in the data. The CART model confirmed the importance of these variables, with housing age, distance to building, and distance to major road networks determining the terminal nodes of the CART model. Using the same three predictor variables, the RF model explained 42 % of the variation in the data. The overall accuracy, which is a measure of agreement between the model and an independent dataset, was 90 % for the GLM, 83 % for the CART model, and 72 % for the RF model. A range of spatially explicit models that can be adapted to changing soil Pb guidelines allows managers to select the most appropriate model based on public health targets.
AB - In many older US cities, lead (Pb) contamination of residential soil is widespread; however, contamination is not uniform. Empirically based, spatially explicit models can assist city agencies in addressing this important public health concern by identifying areas predicted to exceed public health targets for soil Pb contamination. Sampling of 61 residential properties in Baltimore City using field portable X-ray fluorescence revealed that 53 % had soil Pb that exceeded the USEPA reportable limit of 400 ppm. These data were used as the input to three different spatially explicit models: a traditional general linear model (GLM), and two machine learning techniques: classification and regression trees (CART) and Random Forests (RF). The GLM revealed that housing age, distance to road, distance to building, and the interactions between variables explained 38 % of the variation in the data. The CART model confirmed the importance of these variables, with housing age, distance to building, and distance to major road networks determining the terminal nodes of the CART model. Using the same three predictor variables, the RF model explained 42 % of the variation in the data. The overall accuracy, which is a measure of agreement between the model and an independent dataset, was 90 % for the GLM, 83 % for the CART model, and 72 % for the RF model. A range of spatially explicit models that can be adapted to changing soil Pb guidelines allows managers to select the most appropriate model based on public health targets.
KW - Classification and regression trees
KW - Pb
KW - Random Forest
KW - Soil
KW - Spatial modeling
KW - Urban
UR - http://www.scopus.com/inward/record.url?scp=84879785384&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84879785384&partnerID=8YFLogxK
U2 - 10.1007/s10653-013-9510-6
DO - 10.1007/s10653-013-9510-6
M3 - Article
C2 - 23775390
AN - SCOPUS:84879785384
SN - 0269-4042
VL - 35
SP - 495
EP - 510
JO - Environmental Geochemistry and Health
JF - Environmental Geochemistry and Health
IS - 4
ER -