High-throughput DeepPRM-Stellar proteomics coupled with machine learning enables precise quantification of atherosclerosis-stroke progression biomarkers and risk prediction

Abstract

Predicting the progression of asymptomatic large-artery atherosclerosis (LAA) to acute ischemic stroke (AIS) remains a significant challenge when relying solely on anatomical stenosis. To address this clinical gap, we integrated discovery-phase serum proteomics with machine-learning techniques to identify circulating biomarkers capable of predicting atherosclerotic progression. Utilizing a dual-cohort design (Cohort I: discovery stage, n = 43; Cohort II: validation stage, n = 39), we established a Serum Protein Candidate Biomarker Bank (SPCBB) encompassing 1484 proteins by harmonizing literature-derived evidence (1369 proteins) with 222 differentially expressed proteins (DEPs) identified through mass spectrometry analysis. Global proteomics revealed that LAA-associated proteins were enriched in cholesterol metabolism, whereas AIS was characterized by the activation of complement/coagulation cascades. We performed targeted validation of 171 peptides (corresponding to 156 proteins) using DeepPRM on the Stellar platform, thereby facilitating machine learning-based optimization of the biomarker panel. The XGBoost algorithm identified two diagnostic signatures: a three-protein panel (RNASE4, HBA1, ATF6B) that differentiates AIS from LAA, with an area under the curve (AUC) of 0.917 and specificity of 80.0%; and a six-protein panel (MRC1, HBA1, GUC2A, HBD, CLEC3B, FLNA) that distinguishes AIS/LAA from healthy controls, achieving an AUC of 0.971 and specificity of 86.0%. To further validate key candidates, we performed ELISA assays for GUCA2A and FLNA, which confirmed their significant elevation in patients with AIS and LAA (p < 0.01), consistent with the proteomics findings. Both internal and external validations confirmed robust performance across cohorts. These validated biomarker panels establish a proteomics-driven framework for serum-based, dynamic monitoring of LAA progression, thereby supporting clinical decision-making aimed at optimizing early stroke prevention in asymptomatic individuals.

Graphical abstract: High-throughput DeepPRM-Stellar proteomics coupled with machine learning enables precise quantification of atherosclerosis-stroke progression biomarkers and risk prediction

Supplementary files

Article information

Article type
Paper
Submitted
09 Apr 2025
Accepted
30 Jun 2025
First published
12 Jul 2025

Analyst, 2025, Advance Article

High-throughput DeepPRM-Stellar proteomics coupled with machine learning enables precise quantification of atherosclerosis-stroke progression biomarkers and risk prediction

Y. Liu, O. Hu, Z. Wang, J. Wang, Y. Qiu, J. Xiao, X. Cheng, P. Yang, N. Xia, Y. Xiong and Q. Yuan, Analyst, 2025, Advance Article , DOI: 10.1039/D5AN00396B

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements