Enhancing deep chemical reaction prediction with advanced chirality and fragment representation

Abstract

This work focuses on organic reaction prediction with deep learning, with the recently introduced fragSMILES representation — which encodes molecular substructures and chirality, enabling compact and expressive molecular representation in a textual form. In a systematic comparison with well-established molecular notations — Simplified Molecular Input Line Entry System (SMILES), Self-Referencing Embedded Strings (SELFIES), Sequential Attachment-based Fragment Embedding (SAFE) and tree-based SMILES (t-SMILES) — fragSMILES achieved the highest performance across forward- and retro-synthesis prediction, with superior recognition of stereochemical reaction information. Moreover, fragSMILES enhances the capacity to capture stereochemical complexity — a key challenge in synthesis planning. Our results demonstrate that chirality-aware and fragment-level representations can advance current computer-assisted synthesis planning efforts.

Supplementary files

Article information

Article type
Communication
Submitted
13 May 2025
Accepted
25 Aug 2025
First published
26 Aug 2025
This article is Open Access
Creative Commons BY license

Chem. Commun., 2025, Accepted Manuscript

Enhancing deep chemical reaction prediction with advanced chirality and fragment representation

F. Mastrolorito, F. Ciriaco, O. Nicolotti and F. Grisoni, Chem. Commun., 2025, Accepted Manuscript , DOI: 10.1039/D5CC02641E

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements