Themed collection Most popular 2025 machine learning and automation articles
Machine learning spectroscopy to advance computation and analysis
Spectroscopy enables studying matter via its interaction with electromagnetic radiation, supporting analysis, with machine learning further advancing its capabilities.
Chem. Sci., 2025,16, 21660-21676
https://doi.org/10.1039/D5SC05628D
AI and automation: democratizing automation and the evolution towards true AI-autonomous robotics
Through artificial intelligence and robotics, autonomous labs are transforming chemical and materials research by enabling high-throughput, data-driven experiments with minimal human input.
Chem. Sci., 2025,16, 15769-15780
https://doi.org/10.1039/D5SC03183D
Incorporating targeted protein structure in deep learning methods for molecule generation in computational drug design
This review examines current deep learning methods for structure-based drug discovery, detailing how various representations of protein pockets can be integrated to encode structural information and design new molecules.
Chem. Sci., 2025,16, 20677-20693
https://doi.org/10.1039/D5SC05748E
A review of machine learning methods for imbalanced data challenges in chemistry
Imbalanced data, where certain classes are significantly underrepresented in a dataset, is a widespread machine learning (ML) challenge across various fields of chemistry, yet it remains inadequately addressed.
Chem. Sci., 2025,16, 7637-7658
https://doi.org/10.1039/D5SC00270B
A review of large language models and autonomous agents in chemistry
This review examines the roles of large language models (LLMs) and autonomous agents in chemistry, exploring advancements in molecule design, property prediction, and synthesis automation.
Chem. Sci., 2025,16, 2514-2572
https://doi.org/10.1039/D4SC03921A
Towards automatically verifying chemical structures: the powerful combination of 1H NMR and IR spectroscopy
Experimental 1H NMR and IR spectra can be scored against calculated data to verify candidate molecules. We show that combining these techniques is significantly more powerful for automated structure verification than using either one individually.
Chem. Sci., 2025,16, 21590-21599
https://doi.org/10.1039/D5SC06866E
Data-driven recommendation of agents, temperature, and equivalence ratios for organic synthesis
A machine learning framework predicts suitable agents, temperature, and equivalence ratios for reactants and agents. The model consistently outperforms strong baselines, enabling more complete and automation-ready reaction protocols.
Chem. Sci., 2025,16, 18176-18189
https://doi.org/10.1039/D5SC04957A
Generative design of singlet fission materials leveraging a fragment-oriented database
Combining the FORMED database with a generative model and the prediction of excited state propertoes, we generate molecular candidates for singlet fission (SF). Amidst known candidates, we find a promising neocoumarin (2-benzopyran-3-one) scaffold.
Chem. Sci., 2025,16, 17956-17969
https://doi.org/10.1039/D5SC03184B
AIQM2: organic reaction simulations beyond DFT
AIQM2's high speed, competitive accuracy, and robustness enable organic reaction simulations beyond what is possible with the popular DFT methods. It can be used for TS structure search and reactive dynamics, often with chemical accuracy.
Chem. Sci., 2025,16, 15901-15912
https://doi.org/10.1039/D5SC02802G
Guided multi-objective generative AI to enhance structure-based drug design
IDOLpro is a modular framework for guided diffusion which can generate molecules with a plurality of optimized properties for structure-based drug design, accelerating the drug discovery process.
Chem. Sci., 2025,16, 13196-13210
https://doi.org/10.1039/D5SC01778E
A universal foundation model for transfer learning in molecular crystals
Multi-modal transfer learning for predicting and explaining universal properties of organic crystals.
Chem. Sci., 2025,16, 12844-12859
https://doi.org/10.1039/D5SC00677E
NMRExtractor: leveraging large language models to construct an experimental NMR database from open-source scientific publications
NMRExtractor is a large language model-powered pipeline that automatically extracts experimental NMR data from massive open-access publications, resulting in the construction of NMRBank—the largest open-access NMR dataset available to date.
Chem. Sci., 2025,16, 11548-11558
https://doi.org/10.1039/D4SC08802F
AIMNet2: a neural network potential to meet your neutral, charged, organic, and elemental-organic needs
Machine learned interatomic potentials (MLIPs) are reshaping computational chemistry practices because of their ability to drastically exceed the accuracy-length/time scale tradeoff.
Chem. Sci., 2025,16, 10228-10244
https://doi.org/10.1039/D4SC08572H
PepINVENT: generative peptide design beyond natural amino acids
PepINVENT introduces a generative model for designing peptides that extend beyond natural amino acids, enabling non-traditional peptide discovery and optimization.
Chem. Sci., 2025,16, 8682-8696
https://doi.org/10.1039/D4SC07642G
Accurate prediction of the kinetic sequence of physicochemical states using generative artificial intelligence
GPT-based generative modeling of MD trajectories enables efficient prediction of state transitions by capturing long-range correlations, offering accurate kinetic and thermodynamic forecasts for diverse physicochemical systems.
Chem. Sci., 2025,16, 8735-8751
https://doi.org/10.1039/D5SC00108K
Machine learning workflows beyond linear models in low-data regimes
This work presents automated non-linear workflows for studying problems in low-data regimes alongside traditional linear models.
Chem. Sci., 2025,16, 8555-8560
https://doi.org/10.1039/D5SC00996K
Chatbot-assisted quantum chemistry for explicitly solvated molecules
Virtual agents and cloud computing have enabled chemists to easily access automated simulations of explicitly solvated molecules.
Chem. Sci., 2025,16, 3852-3864
https://doi.org/10.1039/D4SC08677E
Grappa – a machine learned molecular mechanics force field
We propose Grappa, a machine learned molecular mechanics force field for proteins. Grappa, operating on the molecular graph, accurately predicts energies and forces and agrees with experimental data such as J-couplings and folding free energies.
Chem. Sci., 2025,16, 2907-2930
https://doi.org/10.1039/D4SC05465B
About this collection
This specially curated collection highlights some of our most popular articles from 2025 in the application of machine learning and automation towards advances in the chemical sciences.
The collection presents some outstanding contributions reviewing large language models and autonomous agents, neural networks for atoms-in-molecules, AI reaction simulations and force fields created through machine learning.
As with all Chemical Science articles, they are all completely free to access and read. We hope you enjoy browsing through this collection!