Information-based Approach to PM 2.5 Estimation and Air Quality Assessment Using Statistical and Deep Learning Models

Abstract

In Pakistan, Peshawar City is persistently experiencing high concentrations of fine particulate matter (PM2.5), frequently surpassing national as well as international air quality standards. For this purpose, the present study aims to enhance the accuracy of PM2.5 estimation at the city scale through a data-driven and interdisciplinary modeling framework. To achieve this, a series of predictors, such as air pollutants of nitrogen dioxide (NO2) and sulphur dioxide (SO2), meteorological conditions (temperature, wind speed, humidity), and satellite-based aerosol optical depth (AOD), were used to construct the multiple linear regression (MLR) model. Similarly, the Long Short-Term Memory (LSTM) and Convolutional Neural Network (CNN) were modeled to estimate PM2.5 using historical ground-level PM2.5 data in the year 2021, leveraging their capabilities to model temporal trends. The results revealed that PM2.5 levels by the CNN model were almost in the same range as the available measured concentrations, whereas MLR and LSTM models showed some variations against measured values. The insights into their comparative analysis showed that the CNN model could achieve better estimation than MLR and LSTM models. The CNN model achieved a root mean square error (RMSE) of 34.89 μg/m³ and an R² of 0.79, indicating higher estimation accuracy. Both the LSTM (R2 = 0.74, RMSE = 51.93 μg/m³) and MLR (R2 = 0.46, RMSE = 44.35 μg/m³) models outperformed. Based on the air quality index (AQI), the study region has experienced extremely unhealthy and healthy conditions, which may lead to the formation of visible haze and ultimately to the particulate component of smog. Generally, this study highlights the superior performance of deep learning approaches for urban air quality assessment. In conclusion, this study breaks new ground by applying and integrating MLR, CNN, and LSTM models in the study region. It will help in opening a promising direction for city-specific air quality modeling in any regional or local urban environment.

Transparent peer review

To support increased transparency, we offer authors the option to publish the peer review history alongside their article.

View this article’s peer review history

Article information

Article type
Paper
Submitted
25 Oct 2025
Accepted
16 Feb 2026
First published
23 Feb 2026
This article is Open Access
Creative Commons BY license

Environ. Sci.: Adv., 2026, Accepted Manuscript

Information-based Approach to PM 2.5 Estimation and Air Quality Assessment Using Statistical and Deep Learning Models

S. Khan, M. Ahmad, B. Zeb, S. Nazaneen, B. Ali, M. Ahmad, K. Alam and A. Ditta, Environ. Sci.: Adv., 2026, Accepted Manuscript , DOI: 10.1039/D5VA00383K

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements