Generalization of Long-Range Machine Learning Potentials in Complex Chemical Spaces

Michał  Sanocki; Julija Zavadlav

doi:10.1039/D5DD00570A

Generalization of Long-Range Machine Learning Potentials in Complex Chemical Spaces

Michał Sanocki and Julija Zavadlav

Abstract

The vastness of chemical space makes generalization a central challenge in the development of machine learning interatomic potentials (MLIPs). While MLIPs could enable large-scale atomistic simulations with near-quantum accuracy, their usefulness is often limited by poor transferability to out-of-distribution samples. Here, we systematically evaluate different MLIP architectures with long-range corrections across diverse chemical spaces and show that such schemes are essential-not only for improving in-distribution performance but, more importantly, for enabling significant gains in transferability to unseen regions of chemical space. To enable a more rigorous benchmarking, we introduce biased train-test splitting strategies, which explicitly test the model performance in significantly different regions of chemical space. Together, our findings highlight the importance of long-range modeling for achieving generalizable MLIPs and provide a framework for diagnosing systematic failures across chemical space. Although we demonstrate our methodology on metalorganic frameworks, it is broadly applicable to other materials, offering insights into the design of more robust and transferable MLIPs.

Supplementary files

Article information

DOI: https://doi.org/10.1039/D5DD00570A
Article type: Paper
Submitted: 18 Dec 2025
Accepted: 12 Apr 2026
First published: 13 Apr 2026
This article is Open Access

Download Citation

Digital Discovery, 2025, Accepted Manuscript

Permissions

Request permissions

Generalization of Long-Range Machine Learning Potentials in Complex Chemical Spaces

M. Sanocki and J. Zavadlav, Digital Discovery, 2025, Accepted Manuscript , DOI: 10.1039/D5DD00570A

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Digital Discovery

Generalization of Long-Range Machine Learning Potentials in Complex Chemical Spaces

Abstract

Supplementary files

Article information

Download Citation

Permissions

Generalization of Long-Range Machine Learning Potentials in Complex Chemical Spaces

Social activity

Search articles by author

Spotlight

Advertisements