Commit: Reaction classification and yield prediction using the differential reaction fingerprint DRFP

Abstract

In “Reaction classification and yield prediction using the differential reaction fingerprint DRFP”, we introduced a chemical reaction fingerprint based on the symmetric difference AΔB of two sets A and B. With DRFP, were present a reaction as the two sets R and P, where R contains the fragments of one or more reactants and P the fragments of one or more products. The SMILES strings of the fragments in the symmetric difference of fragments RΔP are then hashed and folded into a binary vector. We evaluated DRFP-trained models on high through put experiment data where it performed at least as well as DFT-based and learned fingerprints. In this commit, we present the evaluation of DRFP-trained XGBoost and Random Forest regressors on a recently released set of electronic laboratory notebook-extracted Buchwald–Hartwig reactions where it performs better than other methods by a wide margin. This result underlines the status of DRFP as a strong baseline for reaction representation and yield prediction.

Graphical abstract: Commit: Reaction classification and yield prediction using the differential reaction fingerprint DRFP

Transparent peer review

To support increased transparency, we offer authors the option to publish the peer review history alongside their article.

View this article’s peer review history

Article information

Article type
Technical Note
Submitted
06 Mar 2025
Accepted
23 Jun 2025
First published
03 Jul 2025
This article is Open Access
Creative Commons BY license

Digital Discovery, 2025, Advance Article

Commit: Reaction classification and yield prediction using the differential reaction fingerprint DRFP

D. Probst, Digital Discovery, 2025, Advance Article , DOI: 10.1039/D5DD00089K

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements