Issue 7, 2024

Expanding the chemical space using a chemical reaction knowledge graph

Abstract

In this work, we present a new molecular de novo design approach which utilizes a knowledge graph encoding chemical reactions, extracted from the publicly available USPTO (United States Patent and Trademark Office) dataset. Our proposed method can be used to expand the chemical space by performing forward synthesis prediction by finding new combinations of reactants in the knowledge graph and can in this way generate libraries of de novo compounds along with a valid synthetic route. The forward synthesis prediction of novel compounds involves two steps. In the first step, a graph neural network-based link prediction model is used to suggest pairs of existing reactant nodes in the graph that are likely to react. In the second step, product prediction is performed using a molecular transformer model to obtain the potential products for the suggested reactant pairs. We achieve a ROC–AUC score of 0.861 for link prediction in the knowledge graph and for the product prediction, a top-1 accuracy of 0.924. The method's utility is demonstrated by generating a set of de novo compounds by predicting high probability reactions in the USPTO. The generated compounds are diverse in nature and many exhibit drug-like properties. A brief comparison with a template-based library design is provided. Furthermore, evaluation of the potential activity using a quantitative structure–activity relationship (QSAR) model suggested the presence of potential dopamine receptor D2 (DRD2) modulators among the proposed compounds. In summary, our results suggest that the proposed method can expand the easily accessible chemical space, by combining known compounds, and identify novel drug-like compounds for a specific target.

Graphical abstract: Expanding the chemical space using a chemical reaction knowledge graph

Supplementary files

Article information

Article type
Paper
Submitted
24 Nov 2023
Accepted
24 May 2024
First published
11 Jun 2024
This article is Open Access
Creative Commons BY license

Digital Discovery, 2024,3, 1378-1388

Expanding the chemical space using a chemical reaction knowledge graph

E. Rydholm, T. Bastys, E. Svensson, C. Kannas, O. Engkvist and T. Kogej, Digital Discovery, 2024, 3, 1378 DOI: 10.1039/D3DD00230F

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements