Issue 27, 2025

Mid-level data fusion of pleural effusion SERS spectra and serum CEA levels using machine learning algorithms for precise lung cancer detection

Abstract

Accurate identification of clinically malignant pleural effusions is critical for cancer diagnosis and subsequent treatment planning. Here, surface-enhanced Raman spectroscopy (SERS) data of pleural effusions and serum carcinoembryonic antigen (CEA) levels were integrated to develop an innovative mid-level data fusion method combined with machine learning algorithms to improve the accuracy of cancer detection. SERS spectra of pleural effusions from 15 lung cancer patients, 10 other cancer patients, and 28 non-cancer patients were first acquired using a handheld Raman spectrometer. The principal component analysis (PCA) scores from the SERS spectra were merged with the digitized serum CEA values to generate a data fusion array. Machine learning algorithms such as linear discriminant analysis (LDA), k-nearest neighbor (KNN), and support vector machine (SVM) were applied to train the fused dataset using five-fold cross-validation. Notably, the fusion strategy achieved superior performance compared to the pure SERS spectral discrimination model, with the KNN algorithm demonstrating very high accuracy (>85%) in distinguishing the three clinical groups of lung cancer vs. non-cancer, other cancers vs. non-cancer, and lung cancer vs. other cancers. These results highlight the synergistic diagnostic capability of combining molecular spectroscopic fingerprints with tumor biomarkers for pleural effusion analysis, thereby providing a new strategy for rapid and accurate clinical cancer discrimination via liquid biopsy.

Graphical abstract: Mid-level data fusion of pleural effusion SERS spectra and serum CEA levels using machine learning algorithms for precise lung cancer detection

Article information

Article type
Paper
Submitted
07 Apr 2025
Accepted
11 Jun 2025
First published
12 Jun 2025

Nanoscale, 2025,17, 16349-16360

Mid-level data fusion of pleural effusion SERS spectra and serum CEA levels using machine learning algorithms for precise lung cancer detection

L. Wang, W. Hong, D. Fan, J. Lin, Z. Liu, M. Fan, X. Lin, D. Lin and S. Feng, Nanoscale, 2025, 17, 16349 DOI: 10.1039/D5NR01405K

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements