Enhancing spatial inference of air pollution using machine learning techniques with low-cost monitors in data-limited scenarios

Leonardo Y. Kamigauti; Gabriel M. P. Perez; Thomas C. M. Martin; Maria de Fatima Andrade; Prashant Kumar

doi:10.1039/D3EA00126A

Enhancing spatial inference of air pollution using machine learning techniques with low-cost monitors in data-limited scenarios†

Leonardo Y. Kamigauti,

*^ab Gabriel M. P. Perez,^cd Thomas C. M. Martin,^bd Maria de Fatima Andrade^b and Prashant Kumar

^ae

Author affiliations

* Corresponding authors

^a Departamento de Ciências Atmosféricas, Universidade de São Paulo, Brazil
E-mail: leonardo.kamigauti@usp.br

^b Global Centre for Clean Air Research (GCARE), School of Sustainability, Civil and Environmental Engineering, Faculty of Engineering & Physical Sciences, University of Surrey, Guildford GU2 7XH, Surrey, UK

^c Department of Meteorology, University of Reading, UK

^d MeteoIA, São Paulo, Brazil

^e Institute for Sustainability, University of Surrey, Guildford GU2 7XH, Surrey, UK

Abstract

Ensuring environmental justice necessitates equitable access to air quality data, particularly for vulnerable communities. However, traditional air quality data from reference monitors can be costly and challenging to interpret without in-depth knowledge of local meteorology. Low-cost monitors present an opportunity to enhance data availability in developing countries and enable the establishment of local monitoring networks. While machine learning models have shown promise in atmospheric dispersion modelling, many existing approaches rely on complementary data sources that are inaccessible in low-income areas, such as smartphone tracking and real-time traffic monitoring. This study addresses these limitations by introducing deep learning-based models for particulate matter dispersion at the neighbourhood scale. The models utilize data from low-cost monitors and widely available free datasets, delivering root mean square errors (RMSE) below 2.9 μg m⁻³ for PM₁, PM_2.5, and PM₁₀. The sensitivity analysis shows that the most important inputs to the models were the nearby monitors' PM concentrations, boundary layer dissipation and height, and precipitation variables. The models presented different sensitivities to each road type, and an RMSE below the regional differences, evidencing the learning of the spatial dependencies. This breakthrough paves the way for applications in various vulnerable localities, significantly improving air pollution data accessibility and contributing to environmental justice. Moreover, this work sets the stage for future research endeavours in refining the models and expanding data accessibility using alternative sources.

This article is part of the themed collection: The Use of Machine Learning in Atmospheric Science Research - Topic Highlight

Environmental Science: Atmospheres

Enhancing spatial inference of air pollution using machine learning techniques with low-cost monitors in data-limited scenarios†

Abstract

Supplementary files

Article information

Download Citation

Permissions

Enhancing spatial inference of air pollution using machine learning techniques with low-cost monitors in data-limited scenarios

Social activity

Search articles by author

Spotlight

Advertisements