Inconsistency of LLMs Across Molecular Representations
Abstract
Large language models (LLMs) have demonstrated remarkable capabilities in chemistry, yet whether they capture intrinsic chemical knowledge remains uncertain. Rigorous chemical reasoning should be representation-invariant: within any familiar family of chemically equivalent representations, a model should yield consistent predictions regardless of which representation it is given. Here, we introduce the first systematic benchmark to evaluate the consistency of LLMs across key chemistry tasks, curated from paired representations of SMILES strings and IUPAC names. We find that state-of-the-art general-purpose LLMs exhibit strikingly low consistency rates (≤1%). Even after fine-tuning on our dataset, models still generate inconsistent predictions. To address this, we incorporate a sequence-level symmetric Kullback–Leibler (KL) divergence loss as a consistency regularizer. While this intervention improves surface-level consistency, it fails to improve accuracy, suggesting that consistency and accuracy are orthogonal properties. These findings indicate that both consistency and accuracy must be considered to properly assess LLMs' capabilities in scientific reasoning.
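For concreteness, the sketch below illustrates one way a sequence-level symmetric KL consistency regularizer of this kind can be implemented. It is a minimal sketch assuming a PyTorch sequence-to-sequence setup; the function name, tensor shapes, masking convention, and loss weighting are illustrative assumptions, not the exact implementation used in this work.

```python
import torch
import torch.nn.functional as F

def symmetric_kl_consistency(logits_smiles: torch.Tensor,
                             logits_iupac: torch.Tensor,
                             target_mask: torch.Tensor) -> torch.Tensor:
    """Symmetric KL between output distributions for paired inputs.

    Assumed shapes (illustrative, not from the paper):
      logits_smiles, logits_iupac: (batch, seq_len, vocab) logits for the
        same target sequence, conditioned on the SMILES and IUPAC inputs.
      target_mask: (batch, seq_len), 1.0 for real target tokens, 0.0 for padding.
    """
    log_p = F.log_softmax(logits_smiles, dim=-1)
    log_q = F.log_softmax(logits_iupac, dim=-1)
    # F.kl_div(input, target, log_target=True) computes KL(target || input),
    # so this pair yields KL(P || Q) + KL(Q || P) at each token position.
    kl_pq = F.kl_div(log_q, log_p, reduction="none", log_target=True).sum(-1)
    kl_qp = F.kl_div(log_p, log_q, reduction="none", log_target=True).sum(-1)
    sym_kl = (kl_pq + kl_qp) * target_mask
    # Mean over non-padded tokens, then over the batch.
    return (sym_kl.sum(-1) / target_mask.sum(-1).clamp(min=1)).mean()

# Hypothetical usage: add the regularizer to the standard cross-entropy
# objective with a tunable weight (lambda_consistency is an assumed
# hyperparameter):
#   loss = ce_smiles + ce_iupac + lambda_consistency * symmetric_kl_consistency(
#       logits_smiles, logits_iupac, target_mask)
```

The symmetric form penalizes divergence in both directions, so neither representation's output distribution is treated as the fixed reference during fine-tuning.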