Machine learning-driven molecular engineering of nucleic acids

Qien Shi; Hui Lv; Fei Wang; Chunhai Fan; Mingqiang Li

doi:10.1039/D5CS01091H

Machine learning-driven molecular engineering of nucleic acids

Qien Shi,^a Hui Lv,^b Fei Wang,^a Chunhai Fan

*^ac and Mingqiang Li

*^a

Author affiliations

* Corresponding authors

^a State Key Laboratory of Synergistic Chem-Bio Synthesis, School of Chemistry and Chemical Engineering, Frontiers Science Center for Transformative Molecules, New Cornerstone Science Laboratory, Zhang Jiang Institute for Advanced Study, National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai, China
E-mail: limingqiang@sjtu.edu.cn, fanchunhai@sjtu.edu.cn

^b Institute of Materiobiology, College of Sciences, Shanghai University, Shanghai, China

^c Institute of Molecular Medicine, Shanghai Key Laboratory for Nucleic Acids Chemistry and Nanomedicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China

Abstract

Molecular engineering has played a pivotal role in biomedical fields, driving significant advancements in gene therapy, disease diagnosis, and biosensing. However, nucleic acid molecular engineering faces various challenges including vast design spaces, complex structure–function relationships, lengthy application validation cycles, and inefficient optimization processes. Machine learning (ML), with its superior pattern recognition, multidimensional data integration, and automated optimization capabilities, offers a unique opportunity to construct predictive models of sequence-structure–function relationships, thereby enabling a paradigm shift from empirically driven to data-driven approaches. This review systematically surveys recent progress in ML applications across three major domains: nucleic acid structure construction, performance modulation, and application expansion. It also explores core challenges such as data quality, model interpretability, and experimental validation efficiency, along with potential resolution strategies. These insights are poised to propel nucleic acid molecular engineering from static structure prediction toward dynamic behavior simulation, and from single-molecule design to complex system engineering, guiding future directions in hybrid ML-quantum models and expanded applications to non-canonical nucleic acids for transformative innovation in biomedicine, environmental monitoring, and information technology.

Chemical Society Reviews

Machine learning-driven molecular engineering of nucleic acids

Abstract

Supplementary files

Article information

Download Citation

Permissions

Machine learning-driven molecular engineering of nucleic acids

Social activity

Search articles by author

Spotlight

Advertisements