Issue 2, 2026

When machine learning models learn chemistry I: quantifying explainability with matched molecular pairs

Abstract

Explainability methods in machine learning-driven research are increasingly being used, but it remains challenging to assess their reliability without deeply investigating the specific problem at hand. In this work, we present a Python-based Workflow for Interpretability Scoring using matched molecular Pairs (WISP). This workflow can be applied to assess the performance of explainability methods on any given dataset containing SMILES and is model-agnostic, making it compatible with any machine learning model. Evaluation on two physics-based datasets demonstrates that the explanations reliably capture the predictions of the respective machine learning models. Furthermore, our workflow reveals that explainability methods can only meaningfully reflect the property of interest when the underlying models achieve high predictive accuracy. Therefore, the explainability performance on a test set can function as a quality measure of the underlying model. To ensure compatibility with any model type, we developed an atom attributor, which generates atom-level attributions for any model using any descriptor that can be obtained using SMILES representations. This method can also be applied as a standalone explainability tool, independently of WISP. WISP enables users to interpret a wide range of machine learning models in the chemical domain and gain valuable insights into how these models operate and the extent to which they capture underlying chemical principles.

Graphical abstract: When machine learning models learn chemistry I: quantifying explainability with matched molecular pairs

Supplementary files

Article information

Article type
Paper
Submitted
05 Sep 2025
Accepted
05 Dec 2025
First published
09 Feb 2026
This article is Open Access
Creative Commons BY license

Digital Discovery, 2026,5, 571-582

When machine learning models learn chemistry I: quantifying explainability with matched molecular pairs

K. Janssen, J. M. Wollschläger, J. Proppe and A. H. Göller, Digital Discovery, 2026, 5, 571 DOI: 10.1039/D5DD00398A

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements