Issue 4, 2025

GraphXForm: graph transformer for computer-aided molecular design

Abstract

Generative deep learning has become pivotal in molecular design for drug discovery, materials science, and chemical engineering. A widely used paradigm is to pretrain neural networks on string representations of molecules and fine-tune them using reinforcement learning on specific objectives. However, string-based models face challenges in ensuring chemical validity and enforcing structural constraints like the presence of specific substructures. We propose to instead combine graph-based molecular representations, which can naturally ensure chemical validity, with transformer architectures, which are highly expressive and capable of modeling long-range dependencies between atoms. Our approach iteratively modifies a molecular graph by adding atoms and bonds, which ensures chemical validity and facilitates the incorporation of structural constraints. We present GraphXForm, a decoder-only graph transformer architecture, which is pretrained on existing compounds and then fine-tuned using a new training algorithm that combines elements of the deep cross-entropy method and self-improvement learning. We evaluate GraphXForm on various drug design tasks, demonstrating superior objective scores compared to state-of-the-art molecular design approaches. Furthermore, we apply GraphXForm to two solvent design tasks for liquid–liquid extraction, again outperforming alternative methods while flexibly enforcing structural constraints or initiating design from existing molecular structures.

Graphical abstract: GraphXForm: graph transformer for computer-aided molecular design

Supplementary files

Article information

Article type
Paper
Submitted
22 Oct 2024
Accepted
13 Mar 2025
First published
14 Mar 2025
This article is Open Access
Creative Commons BY license

Digital Discovery, 2025,4, 1052-1065

GraphXForm: graph transformer for computer-aided molecular design

J. Pirnay, J. G. Rittig, A. B. Wolf, M. Grohe, J. Burger, A. Mitsos and D. G. Grimm, Digital Discovery, 2025, 4, 1052 DOI: 10.1039/D4DD00339J

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements