TY - JOUR
T1 - Flexible Bayesian Ensemble Machine Learning Framework for Predicting Local Ozone Concentrations
AU - Ren, Xiang
AU - Mi, Zhongyuan
AU - Cai, Ting
AU - Nolte, Christopher G.
AU - Georgopoulos, Panos G.
N1 - Funding Information:
This study was supported by the Ozone Research Center (funded by the State of New Jersey Department of Environmental Protection under grant AQ05-011), by the Center for Environmental Exposures and Disease at EOHSI (NIEHS grant P30ES005022), and by the NJ Alliance for Clinical Translational Science (NIH grant UL1TROO3017). The views expressed in this article are those of the authors and do not necessarily represent the views or policies of the USEPA or any of the funding agencies. We sincerely acknowledge the help from Adam Reff for running the validation experiment of the EPA-BSTH downscaler.
Publisher Copyright:
© 2022 American Chemical Society. All rights reserved.
PY - 2022/4/5
Y1 - 2022/4/5
N2 - 3D-grid-based chemical transport models, such as the Community Multiscale Air Quality (CMAQ) modeling system, have been widely used for predicting concentrations of ambient air pollutants. However, typical horizontal resolutions of nationwide CMAQ simulations (12 × 12 km2) cannot capture local-scale gradients for accurately assessing human exposures and environmental justice disparities. In this study, a Bayesian ensemble machine learning (BEML) framework, which integrates 13 learning algorithms, was developed for downscaling CMAQ estimates of ozone daily maximum 8 h averages to the census tract level, across the contiguous US, and was demonstrated for 2011. Three-stage hyperparameter tuning and targeted validations were designed to ensure the ensemble model's ability to interpolate, extrapolate, and capture concentration peaks. The Shapley value metric from coalitional game theory was applied to interpret the drivers of subgrid gradients. The flexibility (transferability) of the 2011-trained BEML model was further tested by evaluating its ability to estimate fine-scale concentrations for other years (2012-2017) without retraining. To demonstrate the feasibility of using the BEML approach to strictly "data-limited" situations, the model was applied to downscale CMAQ outputs for a future-year scenario-based simulation that considers effects of variations in meteorology associated with climate change.
AB - 3D-grid-based chemical transport models, such as the Community Multiscale Air Quality (CMAQ) modeling system, have been widely used for predicting concentrations of ambient air pollutants. However, typical horizontal resolutions of nationwide CMAQ simulations (12 × 12 km2) cannot capture local-scale gradients for accurately assessing human exposures and environmental justice disparities. In this study, a Bayesian ensemble machine learning (BEML) framework, which integrates 13 learning algorithms, was developed for downscaling CMAQ estimates of ozone daily maximum 8 h averages to the census tract level, across the contiguous US, and was demonstrated for 2011. Three-stage hyperparameter tuning and targeted validations were designed to ensure the ensemble model's ability to interpolate, extrapolate, and capture concentration peaks. The Shapley value metric from coalitional game theory was applied to interpret the drivers of subgrid gradients. The flexibility (transferability) of the 2011-trained BEML model was further tested by evaluating its ability to estimate fine-scale concentrations for other years (2012-2017) without retraining. To demonstrate the feasibility of using the BEML approach to strictly "data-limited" situations, the model was applied to downscale CMAQ outputs for a future-year scenario-based simulation that considers effects of variations in meteorology associated with climate change.
KW - data fusion
KW - environmental and climate justice
KW - exposure assessment
KW - interpretable machine learning
KW - ozone
UR - http://www.scopus.com/inward/record.url?scp=85127455859&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85127455859&partnerID=8YFLogxK
U2 - 10.1021/acs.est.1c04076
DO - 10.1021/acs.est.1c04076
M3 - Article
C2 - 35312316
AN - SCOPUS:85127455859
SN - 0013-936X
VL - 56
SP - 3871
EP - 3883
JO - Environmental Science & Technology
JF - Environmental Science & Technology
IS - 7
ER -