Issue 4, 2023

Machine learning reaction barriers in low data regimes: a horizontal and diagonal transfer learning approach

Abstract

Machine learning (ML) models can, once trained, make reaction barrier predictions in seconds, which is orders of magnitude faster than quantum mechanical (QM) methods such as density functional theory (DFT). However, these ML models need to be trained on large datasets of typically thousands of expensive, high accuracy barriers and do not generalise well beyond the specific reaction for which they are trained. In this work, we demonstrate that transfer learning (TL) can be used to adapt pre-trained Diels–Alder barrier prediction neural networks (NNs) to make predictions for other pericyclic reactions using horizontal TL (hTL) and additionally, at higher levels of theory with diagonal TL (dTL). TL-derived predictions are possible with mean absolute errors (MAEs) below the accepted chemical accuracy threshold of 1 kcal mol−1, a significant improvement on pre-TL prediction MAEs of >5 kcal mol−1, and in extremely low data regimes, with as few as 33 and 39 new datapoints needed for hTL and dTL, respectively. Thus, hTL and dTL are powerful options for providing insight into reaction feasibility without the need for extensive high-throughput experimental or computational screening or large dataset generation for training bespoke ML models.

Graphical abstract: Machine learning reaction barriers in low data regimes: a horizontal and diagonal transfer learning approach

Supplementary files

Article information

Article type
Paper
Submitted
04 May 2023
Accepted
23 May 2023
First published
31 May 2023
This article is Open Access
Creative Commons BY license

Digital Discovery, 2023,2, 941-951

Machine learning reaction barriers in low data regimes: a horizontal and diagonal transfer learning approach

S. G. Espley, E. H. E. Farrar, D. Buttar, S. Tomasi and M. N. Grayson, Digital Discovery, 2023, 2, 941 DOI: 10.1039/D3DD00085K

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements