Benchmarking explainable AI methods for toxicophore detection and toxicity prediction

Abstract

Recent studies have reported inconsistent behavior across explainable AI (XAI) methods in molecular property prediction, raising concerns about their reliability. This work investigates whether such inconsistencies arise from the XAI methods themselves or from the accuracy of the underlying predictive model. A high-accuracy model was first trained on deterministic functional-group labels, where all evaluated XAI methods consistently highlighted the correct atoms corresponding to the true structural motifs. The analysis was extended to mutagenicity prediction, where the methods again identified known toxicophores and chemically meaningful scaffolds. Model performance was then systematically degraded by introducing controlled amounts of label noise. As predictive accuracy decreased, agreement between XAI methods weakened gradually, and the highlighted features became less chemically relevant. When accuracy reached around 0.65, this trend changed, with a much sharper loss of agreement, indicating an explainability cliff. These findings underline the importance of assessing model accuracy before drawing conclusions from XAI outputs.

Graphical abstract: Benchmarking explainable AI methods for toxicophore detection and toxicity prediction

Supplementary files

Article information

Article type
Paper
Submitted
22 Dec 2025
Accepted
05 May 2026
First published
14 May 2026
This article is Open Access
Creative Commons BY license

Digital Discovery, 2026, Advance Article

Benchmarking explainable AI methods for toxicophore detection and toxicity prediction

D. Khasanova and I. V. Tetko, Digital Discovery, 2026, Advance Article , DOI: 10.1039/D5DD00576K

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements