Information-based approach to PM2.5 estimation and air quality assessment using statistical and deep learning models

Sehrish Khan; Maqbool Ahmad; Bahadar Zeb; Shahla Nazneen; Beenish Ali; Mubarak Ahmad; Khan Alam; Allah Ditta

doi:10.1039/D5VA00383K

Information-based approach to PM_2.5 estimation and air quality assessment using statistical and deep learning models

Sehrish Khan,^a Maqbool Ahmad,^b Bahadar Zeb,^c Shahla Nazneen,^a Beenish Ali,^d Mubarak Ahmad,^e Khan Alam*^f and Allah Ditta

*^g

Author affiliations

* Corresponding authors

^a Department of Environmental Sciences, University of Peshawar, Peshawar 25120, Khyber Pakhtunkhwa, Pakistan

^b Department of Elementary and Secondary Education, Peshawar, Khyber Pakhtunkhwa, Pakistan

^c Department of Mathematics, Shaheed Benazir Bhutto University, Sheringal, Dir (Upper), Pakistan

^d Department of Geology, Bacha Khan University Charsadda, Charsadda 24420, Khyber Pakhtunkhwa, Pakistan

^e School of Automation, Wuxi University, 333 Xishan Avenue, Xishan District, Wuxi, Jiangsu Province, China

^f Department of Physics, University of Peshawar, Peshawar, Pakistan
E-mail: khanalam@uop.edu.pk

^g Department of Environmental Sciences, Shaheed Benazir Bhutto University, Sheringal, Dir (U), Khyber Pakhtunkhwa 18000, Pakistan
E-mail: allah.ditta@sbbu.edu.pk

Abstract

In Pakistan, Peshawar City is persistently experiencing high concentrations of fine particulate matter (PM_2.5), frequently surpassing national as well as international air quality standards. For this purpose, the present study aims to enhance the accuracy of PM_2.5 estimation at the city scale through a data-driven and interdisciplinary modeling framework. To achieve this, a series of predictors, such as air pollutants (nitrogen dioxide (NO₂) and sulphur dioxide (SO₂)), meteorological conditions (temperature, wind speed, humidity), and satellite-based aerosol optical depth (AOD), were used to construct a multiple linear regression (MLR) model. Similarly, the Long Short-Term Memory (LSTM) and Convolutional Neural Network (CNN) were modeled to estimate PM_2.5 using historical ground-level PM_2.5 data in the year 2021, leveraging their capabilities to model temporal trends. The results revealed that estimated PM_2.5 levels using the CNN model were almost in the same range as the available measured concentrations, whereas MLR and LSTM models showed some variations against measured values. The insights about their comparative analysis showed that the CNN model could achieve better estimation than MLR and LSTM models. The CNN model achieved a root mean square error (RMSE) of 34.89 µg m⁻³ and coefficient of determination (R²) of 0.79, indicating higher estimation accuracy. Both the LSTM (R² = 0.74 and RMSE = 51.93 µg m⁻³) and MLR (R² = 0.46 and RMSE = 44.35 µg m⁻³) models underperformed. Based on the air quality index (AQI), the study region has experienced extremely unhealthy and healthy conditions, which may lead to the formation of visible haze and ultimately to the particulate component of smog. Generally, this study highlights the superior performance of deep learning approaches for urban air quality assessment. In conclusion, this study breaks new ground by applying and integrating MLR, CNN, and LSTM models in the study region. It will help in opening a promising direction for city-specific air quality modeling in any regional or local urban environment.

Environmental Science: Advances

Information-based approach to PM_2.5 estimation and air quality assessment using statistical and deep learning models

Abstract

Transparent peer review

Article information

Download Citation

Permissions

Information-based approach to PM_2.5 estimation and air quality assessment using statistical and deep learning models

Social activity

Search articles by author

Spotlight

Advertisements