Reaction classification and yield prediction using the differential reaction fingerprint DRFP

Daniel Probst; Philippe Schwaller; Jean-Louis Reymond

doi:10.1039/D1DD00006C

Reaction classification and yield prediction using the differential reaction fingerprint DRFP

Daniel Probst,

*^a Philippe Schwaller

^b and Jean-Louis Reymond

*^a

Author affiliations

* Corresponding authors

^a Department of Chemistry and Biochemistry, University of Bern, Freiestrasse 3, 3012 Bern, Switzerland
E-mail: daniel.probst@dcb.unibe.ch, jean-louis.reymond@unibe.ch

^b IBM Research – Europe, Säumerstrasse 4, 8803 Rüschlikon, Switzerland

Abstract

Predicting the nature and outcome of reactions using computational methods is a crucial tool to accelerate chemical research. The recent application of deep learning-based learned fingerprints to reaction classification and reaction yield prediction has shown an impressive increase in performance compared to previous methods such as DFT- and structure-based fingerprints. However, learned fingerprints require large training data sets, are inherently biased, and are based on complex deep learning architectures. Here we present the differential reaction fingerprint DRFP. The DRFP algorithm takes a reaction SMILES as an input and creates a binary fingerprint based on the symmetric difference of two sets containing the circular molecular n-grams generated from the molecules listed left and right from the reaction arrow, respectively, without the need for distinguishing between reactants and reagents. We show that DRFP performs better than DFT-based fingerprints in reaction yield prediction and other structure-based fingerprints in reaction classification, reaching the performance of state-of-the-art learned fingerprints in both tasks while being data-independent.

This article is part of the themed collection: Machine Learning and Artificial Intelligence: A cross-journal collection

Article information

https://doi.org/10.1039/D1DD00006C

Article type

Paper

Submitted

26 Aug 2021

Accepted

12 Jan 2022

First published

21 Jan 2022

This article is Open Access

Download Citation

Digital Discovery, 2022,1, 91-97

Permissions

Request permissions

Reaction classification and yield prediction using the differential reaction fingerprint DRFP

D. Probst, P. Schwaller and J. Reymond, Digital Discovery, 2022, 1, 91 DOI: 10.1039/D1DD00006C

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Digital Discovery

Reaction classification and yield prediction using the differential reaction fingerprint DRFP

Abstract

Transparent peer review

Article information

Download Citation

Permissions

Reaction classification and yield prediction using the differential reaction fingerprint DRFP

Social activity

Search articles by author

Spotlight

Advertisements