Issue 5, 2025

Challenges and opportunities for machine learning potentials in transition path sampling: alanine dipeptide and azobenzene studies

Abstract

The growing interest in machine learning (ML) tools within chemistry and material science stems from their novelty and ability to predict properties almost as accurately as underlying electronic structure calculations or experiments. Transition path sampling (TPS) offers a practical way to explore transition routes between metastable minima such as conformers and isomers on the multidimensional potential energy surface. However, TPS has historically suffered from the computational cost vs. accuracy trade-off between affordable force-field simulations and expensive high-fidelity quantum mechanical calculations. ML interatomic potentials combined with TPS offer a new approach for the exploration of transition pathways at near-quantum mechanical accuracy, while keeping the computational cost comparable to classical force fields. In this study, we employ the HIP-NN-TS and ANI-1x neural network-based ML potentials, both trained on the ANI-1x dataset of 5 million HCNO structures. We first verify the correctness of our approach by applying it to alanine dipeptide and compare the resulting energy surface and transition paths to the literature. Our findings suggest that proposed approach holds promise for conformational searches, as evidenced by the chemical accuracy (errors ≲1 kcal mol−1) for thermal molecular dynamics trajectories of alanine dipeptide. While we were able to successfully reconstruct alanine dipeptide's potential energy landscape using both HIP-NN-TS and ANI-1x frameworks, we observed that ML models with a lower accuracy may still locate additional important conformations. We also find that manual active learning, augmenting the training data by structures taken from TPS trajectories, improved the accuracy by ∼30% with small amounts of additional data. Finally, we evaluated a more intricate case, azobenzene, and observed that seemingly simple torsions may bear a challenge for ML potentials and limit their applications in TPS. Inability of HIP-NN-TS to correctly describe the energetics of major rotational pathway in azobenzene isomerization highlights deficiencies of the reference method in describing the electronic degrees of freedom. Our study underscores the importance of domain expertise in selecting physically meaningful pathways for benchmarking ML potentials, especially considering the intricacies of electronic structure in chemical dynamics and non-equilibrium processes.

Graphical abstract: Challenges and opportunities for machine learning potentials in transition path sampling: alanine dipeptide and azobenzene studies

Supplementary files

Article information

Article type
Paper
Submitted
15 Aug 2024
Accepted
07 Apr 2025
First published
07 Apr 2025
This article is Open Access
Creative Commons BY-NC license

Digital Discovery, 2025,4, 1158-1175

Challenges and opportunities for machine learning potentials in transition path sampling: alanine dipeptide and azobenzene studies

N. Fedik, W. Li, N. Lubbers, B. Nebgen, S. Tretiak and Y. W. Li, Digital Discovery, 2025, 4, 1158 DOI: 10.1039/D4DD00265B

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.

To request permission to reproduce material from this article in a commercial publication, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party commercial publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements