PepMSND: integrating multi-level feature engineering and comprehensive databases to enhance in vitro/in vivo peptide blood stability prediction†
Abstract
Deep learning has emerged as a transformative tool for peptide drug discovery, yet predicting peptide blood stability—a critical determinant of bioavailability and therapeutic efficacy—remains a major challenge. While such a task can be accomplished through experiments, it requires much time and cost. Here, to address this challenge, we collect extensive experimental data on peptide stability in blood from public databases and the literature and construct a database of peptide blood stability that includes 635 samples. Based on this database, we develop a novel model called PepMSND, integrating KAN, Transformer, GAT, and SE(3)-Transformer to perform multi-level feature engineering for peptide blood stability prediction. Our model can achieve an ACC of 0.867 and an AUC of 0.912 on average and outperforms the baseline models. We also develop a user-friendly web interface for the PepMSND model, which is freely available at http://model.highslab.com/pepmsnd. This research is crucial for the development of novel peptides with strong blood stability, as the stability of peptide drugs directly determines their effectiveness and reliability in clinical applications.