Multimodal Prediction of Sludge Volume Index for Monitoring Sludge Settling Performance

Abstract

The sludge volume index (SVI) is a crucial parameter for evaluating the sludge settling performance in wastewater treatment plants. However, existing methods that utilize process parameters or visual features lack multi-modal collaborative modeling of macroscopic settling characteristics, which restricts the prediction accuracy of SVI. To address this issue, we propose a multi-modal framework specifically for SVI prediction that integrates process data (e.g., chemical oxygen demand; pH) and macroscopic visual features (e.g., floc color; 30-minute sludge volume) of sludge settling—parameters that are critical for SVI estimation. Firstly, a sludge settling visual dataset and a process-visual fusion database are constructed for SVI prediction scenarios. Secondly, an improved YOLO11 model is designed to achieve reliable detection and localization of floc state and supernatant state in sludge settling images. Thirdly, combining K-means clustering with geometric feature analysis, the output of the improved YOLO11 model is used to quantify the color and settling ratio of flocs. Finally, a stochastic configuration network is employed to fuse multi-source data for SVI prediction. Experiments show that the improved YOLO11 achieves a mAP@0.5 of 94.8% on the sludge settling visual dataset, with a 42.1% reduction in the number of parameters; the proposed multi-modal model achieves lower prediction errors than single-modal models, with a root mean square error (RMSE) of 7.31 and a mean absolute error (MAE) of 4.40, and the contribution of visual features to SVI prediction accuracy reaches 63.7%. This study provides an efficient solution for the real-time monitoring of SVI, a key indicator of sludge settling performance in wastewater treatment plants.

Transparent peer review

To support increased transparency, we offer authors the option to publish the peer review history alongside their article.

View this article’s peer review history

Article information

Article type
Paper
Submitted
23 May 2025
Accepted
13 Feb 2026
First published
18 Feb 2026

Environ. Sci.: Water Res. Technol., 2026, Accepted Manuscript

Multimodal Prediction of Sludge Volume Index for Monitoring Sludge Settling Performance

L. Zhao, Z. Li, M. Huang and Z. Gao, Environ. Sci.: Water Res. Technol., 2026, Accepted Manuscript , DOI: 10.1039/D5EW00470E

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements