TY - JOUR
T1 - Revealing Drivers of Haze Pollution by Explainable Machine Learning
AU - Hou, Linlu
AU - Dai, Qili
AU - Song, Congbo
AU - Liu, Bowen
AU - Guo, Fangzhou
AU - Dai, Tianjiao
AU - Li, Linxuan
AU - Liu, Baoshuang
AU - Bi, Xiaohui
AU - Zhang, Yufen
AU - Feng, Yinchang
N1 - Funding Information:
This work was financially supported by the National Natural Science Foundation of China (42177085), the Tianjin Science and Technology Plan Project (PTZWHZ00120), the Tianjin Research Institute for the Development Strategy of China’s Engineering Science and Technology (2020C0-0002), and the National Key R&D Program of China (2016YFC0208505).
Publisher Copyright:
© 2022 American Chemical Society.
PY - 2022/8
Y1 - 2022/8
N2 - Many places on earth still suffer from a high level of atmospheric fine particulate matter (PM2.5) pollution. Formation of a particulate pollution event or haze episode (HE) involves many factors, including meteorology, emissions, and chemistry. Understanding the direct causes of and key drivers behind the HE is thus essential. Traditionally, this is done via chemical transport models. However, substantial uncertainties are introduced into the model estimation when there are significant changes in the emissions inventory due to interventions (e.g., the COVID-19 lockdown). Here we applied a Random Forest model coupled with a Shapley additive explanation algorithm, a post hoc explanation technique, to investigate the roles of major meteorological factors, primary emissions, and chemistry in five severe HEs that occurred before or during the COVID-19 lockdown in China. We discovered that, in addition to the high level of primary emissions, PM2.5 in these haze episodes was largely driven by meteorological effects (with average contributions of 30-65 μg m-3 for the five HEs), followed by chemistry (∼15-30 μg m-3). Photochemistry was likely the major pathway of formation of nitrate, while air humidity was the predominant factor in forming sulfate. Our results highlight that the machine learning driven by data has the potential to be a complementary tool in predicting and interpreting air pollution.
AB - Many places on earth still suffer from a high level of atmospheric fine particulate matter (PM2.5) pollution. Formation of a particulate pollution event or haze episode (HE) involves many factors, including meteorology, emissions, and chemistry. Understanding the direct causes of and key drivers behind the HE is thus essential. Traditionally, this is done via chemical transport models. However, substantial uncertainties are introduced into the model estimation when there are significant changes in the emissions inventory due to interventions (e.g., the COVID-19 lockdown). Here we applied a Random Forest model coupled with a Shapley additive explanation algorithm, a post hoc explanation technique, to investigate the roles of major meteorological factors, primary emissions, and chemistry in five severe HEs that occurred before or during the COVID-19 lockdown in China. We discovered that, in addition to the high level of primary emissions, PM2.5 in these haze episodes was largely driven by meteorological effects (with average contributions of 30-65 μg m-3 for the five HEs), followed by chemistry (∼15-30 μg m-3). Photochemistry was likely the major pathway of formation of nitrate, while air humidity was the predominant factor in forming sulfate. Our results highlight that the machine learning driven by data has the potential to be a complementary tool in predicting and interpreting air pollution.
UR - http://www.scopus.com/inward/record.url?scp=85122751971&partnerID=8YFLogxK
U2 - 10.1021/acs.estlett.1c00865
DO - 10.1021/acs.estlett.1c00865
M3 - Article
AN - SCOPUS:85122751971
SN - 2328-8930
VL - 9
SP - 112
EP - 119
JO - Environmental Science and Technology Letters
JF - Environmental Science and Technology Letters
IS - 2
ER -