Tags

Type your tag names separated by a space and hit enter

Spatiotemporal continuous estimates of PM2.5 concentrations in China, 2000-2016: A machine learning method with inputs from satellites, chemical transport model, and ground observations.
Environ Int. 2019 02; 123:345-357.EI

Abstract

Ambient exposure to fine particulate matter (PM2.5) is known to harm public health in China. Satellite remote sensing measurements of aerosol optical depth (AOD) were statistically associated with in-situ observations after 2013 to predict PM2.5 concentrations nationwide, while the lack of surface monitoring data before 2013 have created difficulties in historical PM2.5 exposure estimates. Hindcast approaches using statistical models or chemical transport models (CTMs) were developed to overcome this limitation, while those approaches still suffer from incomplete daily coverage due to missing AOD data or limited accuracy due to uncertainties of CTMs. Here we developed a new machine learning (ML) model with high-dimensional expansion (HD-expansion) of numerous predictors (including AOD and other satellite covariates, meteorological variables and CTM simulations). Through comprehensive characterization of the nonlinear effects of, and interactions among different predictors, the HD-expansion parameterized the association between PM2.5 and AOD as a nonlinear function of space and time covariates (e.g., planetary boundary layer height and relative humidity). In this way, the PM2.5-AOD association can vary spatiotemporally. We trained the model with data from 2013 to 2016 and evaluated its performance using annually-iterated cross-validation, which iteratively held out the in-situ observations for a whole calendar year (as testing data) to examine the predictions from a model trained by the rest of the observations. Our estimates were found to be in good agreement with in-situ observations, with correlation coefficients (R2) of 0.61, 0.68, and 0.75 for daily, monthly and annual averages, respectively. To interpolate the missing predictions due to incomplete AOD data, we incorporated a generalized additive model into the ML model. The two-stage estimates of PM2.5 sacrificed the prediction accuracy on a daily timescale (R2 = 0.55), but achieved complete spatiotemporal coverage and improved the accuracy of monthly (R2 = 0.71) and annual (R2 = 0.77) averages. The model was then used to predict daily PM2.5 concentrations during 2000-2016 across China and estimate long-term trends in PM2.5 for the period. We found that population-weighted concentrations of PM2.5 significantly increased, by 2.10 (95% confidence interval (CI): 1.74, 2.46) μg/m3/year during 2000-2007, and rapidly decreased by 4.51 (3.12, 5.90) μg/m3/year during 2013-2016. In this study, we produced AOD-based estimates of historical PM2.5 with complete spatiotemporal coverage, which were evidenced as accurate, particularly in middle and long term. The products could support large-scale epidemiological studies and risk assessments of ambient PM2.5 in China and can be accessed via the website (http://www.meicmodel.org/dataset-phd.html).

Authors+Show Affiliations

BIC-ESAT and SKL-ESPC, College of Environmental Science and Engineering, Peking University, Beijing 100871, China; Department of Earth System Science, Tsinghua University, Beijing 100084, China.Department of Earth System Science, Tsinghua University, Beijing 100084, China.Department of Earth System Science, Tsinghua University, Beijing 100084, China.State Key Joint Laboratory of Environment Simulation and Pollution Control, School of Environment, Tsinghua University, Beijing 100084, China.Department of Earth System Science, Tsinghua University, Beijing 100084, China.BIC-ESAT and SKL-ESPC, College of Environmental Science and Engineering, Peking University, Beijing 100871, China.Department of Earth System Science, Tsinghua University, Beijing 100084, China. Electronic address: qiangzhang@tsinghua.edu.cn.

Pub Type(s)

Journal Article
Research Support, Non-U.S. Gov't

Language

eng

PubMed ID

30562706

Citation

Xue, Tao, et al. "Spatiotemporal Continuous Estimates of PM2.5 Concentrations in China, 2000-2016: a Machine Learning Method With Inputs From Satellites, Chemical Transport Model, and Ground Observations." Environment International, vol. 123, 2019, pp. 345-357.
Xue T, Zheng Y, Tong D, et al. Spatiotemporal continuous estimates of PM2.5 concentrations in China, 2000-2016: A machine learning method with inputs from satellites, chemical transport model, and ground observations. Environ Int. 2019;123:345-357.
Xue, T., Zheng, Y., Tong, D., Zheng, B., Li, X., Zhu, T., & Zhang, Q. (2019). Spatiotemporal continuous estimates of PM2.5 concentrations in China, 2000-2016: A machine learning method with inputs from satellites, chemical transport model, and ground observations. Environment International, 123, 345-357. https://doi.org/10.1016/j.envint.2018.11.075
Xue T, et al. Spatiotemporal Continuous Estimates of PM2.5 Concentrations in China, 2000-2016: a Machine Learning Method With Inputs From Satellites, Chemical Transport Model, and Ground Observations. Environ Int. 2019;123:345-357. PubMed PMID: 30562706.
* Article titles in AMA citation format should be in sentence-case
TY - JOUR T1 - Spatiotemporal continuous estimates of PM2.5 concentrations in China, 2000-2016: A machine learning method with inputs from satellites, chemical transport model, and ground observations. AU - Xue,Tao, AU - Zheng,Yixuan, AU - Tong,Dan, AU - Zheng,Bo, AU - Li,Xin, AU - Zhu,Tong, AU - Zhang,Qiang, Y1 - 2018/12/18/ PY - 2018/07/26/received PY - 2018/11/09/revised PY - 2018/11/29/accepted PY - 2018/12/19/pubmed PY - 2019/6/14/medline PY - 2018/12/19/entrez KW - Aerosol optical depth KW - Fine particulate matter KW - Machine learning KW - Satellite remote sensing SP - 345 EP - 357 JF - Environment international JO - Environ Int VL - 123 N2 - Ambient exposure to fine particulate matter (PM2.5) is known to harm public health in China. Satellite remote sensing measurements of aerosol optical depth (AOD) were statistically associated with in-situ observations after 2013 to predict PM2.5 concentrations nationwide, while the lack of surface monitoring data before 2013 have created difficulties in historical PM2.5 exposure estimates. Hindcast approaches using statistical models or chemical transport models (CTMs) were developed to overcome this limitation, while those approaches still suffer from incomplete daily coverage due to missing AOD data or limited accuracy due to uncertainties of CTMs. Here we developed a new machine learning (ML) model with high-dimensional expansion (HD-expansion) of numerous predictors (including AOD and other satellite covariates, meteorological variables and CTM simulations). Through comprehensive characterization of the nonlinear effects of, and interactions among different predictors, the HD-expansion parameterized the association between PM2.5 and AOD as a nonlinear function of space and time covariates (e.g., planetary boundary layer height and relative humidity). In this way, the PM2.5-AOD association can vary spatiotemporally. We trained the model with data from 2013 to 2016 and evaluated its performance using annually-iterated cross-validation, which iteratively held out the in-situ observations for a whole calendar year (as testing data) to examine the predictions from a model trained by the rest of the observations. Our estimates were found to be in good agreement with in-situ observations, with correlation coefficients (R2) of 0.61, 0.68, and 0.75 for daily, monthly and annual averages, respectively. To interpolate the missing predictions due to incomplete AOD data, we incorporated a generalized additive model into the ML model. The two-stage estimates of PM2.5 sacrificed the prediction accuracy on a daily timescale (R2 = 0.55), but achieved complete spatiotemporal coverage and improved the accuracy of monthly (R2 = 0.71) and annual (R2 = 0.77) averages. The model was then used to predict daily PM2.5 concentrations during 2000-2016 across China and estimate long-term trends in PM2.5 for the period. We found that population-weighted concentrations of PM2.5 significantly increased, by 2.10 (95% confidence interval (CI): 1.74, 2.46) μg/m3/year during 2000-2007, and rapidly decreased by 4.51 (3.12, 5.90) μg/m3/year during 2013-2016. In this study, we produced AOD-based estimates of historical PM2.5 with complete spatiotemporal coverage, which were evidenced as accurate, particularly in middle and long term. The products could support large-scale epidemiological studies and risk assessments of ambient PM2.5 in China and can be accessed via the website (http://www.meicmodel.org/dataset-phd.html). SN - 1873-6750 UR - https://www.unboundmedicine.com/medline/citation/30562706/Spatiotemporal_continuous_estimates_of_PM2_5_concentrations_in_China_2000_2016:_A_machine_learning_method_with_inputs_from_satellites_chemical_transport_model_and_ground_observations_ DB - PRIME DP - Unbound Medicine ER -