Advances and innovations in machine learning-based spectral detection methods for trace organic pollutants

Abstract

The rapid and sensitive detection of trace organic pollutants in water is crucial for ensuring environmental safety. Traditional detection methods struggle to meet the demands of large-scale, real-time, and on-site detection. This paper reviews recent advances in the application of machine learning (ML) in spectral detection methods for trace organic pollutants. It introduces techniques such as data augmentation, intelligent feature extraction, and model construction, as well as their application in different spectral techniques, for example, generative adversarial networks (GANs) for data augmentation, convolutional neural networks (CNNs) for feature extraction, and random forests (RF) for classification and identification. It focuses on exploring the combination of different spectral techniques and ML methods, such as the antibiotic database established by combining surface-enhanced Raman spectroscopy (SERS) and CNNs, and the classification of microplastics using infrared spectroscopy combined with RF. Through these combinations, ML enhances the sensitivity, selectivity, and robustness of detection. Furthermore, it provides an in-depth analysis of model interpretability methods and cross-laboratory validation frameworks, emphasizing the importance of building standardized detection processes and evaluation systems. Looking ahead, research in this field will focus on more efficient ML algorithms, deep integration of hardware and algorithms, and the expansion of application scenarios, to build an AI-driven autonomous decision-making system for pollutant detection and treatment.

Supplementary files

Transparent peer review

To support increased transparency, we offer authors the option to publish the peer review history alongside their article.

View this article’s peer review history

Article information

Article type
Tutorial Review
Submitted
24 Aug 2025
Accepted
08 Dec 2025
First published
09 Dec 2025

Analyst, 2026, Accepted Manuscript

Advances and innovations in machine learning-based spectral detection methods for trace organic pollutants

Y. qin, Q. Duan, H. Wang, Y. bai, Y. qin, L. Yao, F. Song, M. Wu and J. Lee, Analyst, 2026, Accepted Manuscript , DOI: 10.1039/D5AN00903K

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements