An intelligent diagnostic algorithm for Raman spectroscopy of gastrointestinal cancer based on component modeling

Abstract

Background: Early diagnosis of gastrointestinal (GI) cancer is crucial for patient prognosis, yet conventional methods suffer from invasiveness and insufficient molecular sensitivity. Raman spectroscopy offers non-invasive molecular fingerprinting, but spectral overlap in complex biological samples poses a challenge. This study introduces a diagnostic framework that combines Raman spectroscopy with deep learning, specifically a convolutional neural network (CNN), to quantitatively resolve spectral components for improved GI cancer detection.

Results: Raman spectra were collected from 829 GI tissues (760 benign, 69 malignant) and from five pure components (DNA, triolein, histone, collagen, actin). An improved CNN regression model, trained on 100,000 simulated spectra derived from the pure components, accurately quantified the relative proportions of these five biochemicals within tissue spectra (R² values: 0.91-0.98). Quantitative analysis revealed significantly higher coefficients for DNA, collagen, and actin, and lower coefficients for triolein and histone, in malignant tissues compared to benign tissues (P < 0.01). Using these quantitative molecular features, a LightGBM classification model achieved an accuracy of 97.2%, a sensitivity of 90%, a specificity of 98.1%, and an AUC of 0.973 on an independent test set of 579 samples.

Significance: This work demonstrates an approach for discriminating benign and malignant GI tissues by quantitatively modeling key molecular alterations using Raman spectroscopy and a tailored CNN. The high classification accuracy supports the clinical translational potential of this non-invasive method for GI cancer screening. Furthermore, the combined framework for quantitative spectral decomposition and classification offers a generalizable strategy that can be extended to other complex biological analyses, multimodal diagnostics, and potentially cancer staging.
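The component-modeling idea in the abstract (simulate mixture spectra from pure-component spectra, then recover the mixing proportions) can be illustrated with a minimal sketch. Everything below is an assumption for illustration only: the pure spectra are synthetic Gaussian bands at hypothetical positions rather than measured spectra, the mixing is assumed linear with Gaussian noise, only 200 spectra are simulated, and ordinary least squares stands in for the paper's CNN regression model, whose architecture is not described here.

```python
import numpy as np

def gaussian_peaks(x, centers, widths, heights):
    """Sum of Gaussian bands: a toy stand-in for a measured pure-component spectrum."""
    centers, widths, heights = (np.asarray(a, dtype=float) for a in (centers, widths, heights))
    return (heights * np.exp(-0.5 * ((x[:, None] - centers) / widths) ** 2)).sum(axis=1)

def simulate_mixtures(pure, n_samples, noise_sd, rng):
    """Random convex combinations of the pure spectra plus Gaussian noise."""
    coeffs = rng.dirichlet(np.ones(pure.shape[0]), size=n_samples)  # each row sums to 1
    noise = rng.normal(0.0, noise_sd, size=(n_samples, pure.shape[1]))
    return coeffs @ pure + noise, coeffs

def estimate_coefficients(spectra, pure):
    """Least-squares unmixing: a simple baseline, NOT the paper's CNN."""
    est, *_ = np.linalg.lstsq(pure.T, spectra.T, rcond=None)
    return est.T

def r2_per_component(true, est):
    """Coefficient of determination for each component's estimated proportion."""
    ss_res = ((true - est) ** 2).sum(axis=0)
    ss_tot = ((true - true.mean(axis=0)) ** 2).sum(axis=0)
    return 1.0 - ss_res / ss_tot

# Hypothetical band positions (cm^-1), chosen only so the five toy spectra differ.
wavenumbers = np.linspace(400.0, 1800.0, 500)
bands = {
    "DNA":      ([785.0, 1090.0], [12.0, 15.0], [1.0, 0.6]),
    "triolein": ([1300.0, 1440.0], [15.0, 15.0], [0.7, 1.0]),
    "histone":  ([1000.0, 1660.0], [10.0, 20.0], [0.8, 1.0]),
    "collagen": ([855.0, 1245.0], [15.0, 20.0], [1.0, 0.8]),
    "actin":    ([940.0, 1550.0], [12.0, 18.0], [1.0, 0.5]),
}
pure = np.stack([gaussian_peaks(wavenumbers, *b) for b in bands.values()])

rng = np.random.default_rng(0)
spectra, true_coeffs = simulate_mixtures(pure, n_samples=200, noise_sd=0.01, rng=rng)
est_coeffs = estimate_coefficients(spectra, pure)
r2 = r2_per_component(true_coeffs, est_coeffs)
print(dict(zip(bands, np.round(r2, 3))))
```

In a pipeline of the kind the abstract describes, the estimated coefficients would then serve as the feature vector passed to a classifier. Real tissue spectra also carry baseline drift and fluorescence background, which this sketch does not model.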

Article information

Article type: Paper
Submitted: 17 Jul 2025
Accepted: 12 Oct 2025
First published: 15 Oct 2025

Anal. Methods, 2025, Accepted Manuscript

M. Wang, J. Li, W. Mo, D. Qi, S. Ni, F. Tang, X. Wang, C. Qing and M. Zhou, Anal. Methods, 2025, Accepted Manuscript, DOI: 10.1039/D5AY01178G
