Issue 11, 2025

GEOM-drugs revisited: toward more chemically accurate benchmarks for 3D molecule generation

Abstract

Deep generative models have shown significant promise in generating valid 3D molecular structures, with the GEOM-drugs dataset serving as a key benchmark. However, current evaluation protocols suffer from critical flaws, including incorrect valency definitions, bugs in bond order calculations, and reliance on force fields inconsistent with the reference data. In this work, we revisit GEOM-drugs and propose a corrected evaluation framework: we identify and fix issues in data preprocessing, construct chemically accurate valency tables, and introduce a GFN2-xTB-based geometry and energy benchmark. We retrain and re-evaluate several leading models under this framework, providing updated performance metrics and practical recommendations for future benchmarking. Our results underscore the need for chemically rigorous evaluation practices in 3D molecular generation. Our recommended evaluation methods and GEOM-drugs processing scripts are available at https://github.com/isayevlab/geom-drugs-3dgen-evaluation.

Graphical abstract: GEOM-drugs revisited: toward more chemically accurate benchmarks for 3D molecule generation

Transparent peer review

To support increased transparency, we offer authors the option to publish the peer review history alongside their article.

View this article’s peer review history

Article information

Article type
Paper
Submitted
19 May 2025
Accepted
18 Sep 2025
First published
02 Oct 2025
This article is Open Access
Creative Commons BY license

Digital Discovery, 2025,4, 3282-3291

GEOM-drugs revisited: toward more chemically accurate benchmarks for 3D molecule generation

F. Nikitin, I. Dunn, D. R. Koes and O. Isayev, Digital Discovery, 2025, 4, 3282 DOI: 10.1039/D5DD00206K

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements