Issue 5, 2024

How to actively learn chemical reaction yields in real-time using stopping criteria

Abstract

Chemical reactions are central for the creation of new materials, drug design and many more fields. Obtaining high reaction yields is of great importance to reduce cost and increase the efficiency and purity of the obtained product. To reduce the number of experiments for high reaction yield screening in organic chemistry, the use of active learning (AL) is an interesting approach. Unfortunately, the majority of AL is based on “retro-AL” where all the reactions are already available. One problem of “real-time” AL is determining when to stop the AL loop without creating an external labeled test set to analyze the performance of the model. The stopping procedure presented in this work is a stopping criterion, namely the stabilization prediction (SP) (Bloodgood et al., Proceedings of the Thirteenth Conference on Computational Natural Language Learning, 2009, 39–47). It uses an unlabeled equivalent of a test set called a stop set to indirectly evaluate the accuracy of the AL loop. To benchmark the stability of this method and investigate its applicability in chemistry, two datasets from the organic literature, four estimators, three types of descriptors, two sizes of queries per iteration (QPI) and stop set size were investigated. We determine that the present method is the most stable with a Support Vector Classification (SVC) estimator, 50 QPI and a stop set size containing 30% of the data. It produces the best compromise between an early stop (consumes less than 50% of the data) and a reliable accuracy over 10 different runs compared to the accuracy obtained with classical supervised machine learning. We do hope that this method would be of use to create “real-time” AL in chemistry.

Graphical abstract: How to actively learn chemical reaction yields in real-time using stopping criteria

Supplementary files

Article information

Article type
Paper
Submitted
21 Nov 2023
Accepted
25 Jan 2024
First published
26 Jan 2024

React. Chem. Eng., 2024,9, 1206-1215

How to actively learn chemical reaction yields in real-time using stopping criteria

V. Delmas, D. Jacquemin, A. Blondel, M. Vacher and A. D. Laurent, React. Chem. Eng., 2024, 9, 1206 DOI: 10.1039/D3RE00628J

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements