Challenges in data-driven catalysis modelling: case study on palladium-NHC catalyzed Suzuki–Miyaura reactions

Abstract

In this study, we synthesized a set of 21 N-heterocyclic carbene (NHC)Pd complexes and evaluated them in a benchmark reaction for Suzuki–Miyaura coupling under 12 different conditions, resulting in a high-quality dataset tailored for machine learning applications. We present a detailed analysis of the data, enabling a thorough assessement of the various parameters (ligand structure and reaction parameters) influencing the reaction yield. We used a new workflow to select descriptors for building linear regression models. The models achieved satisfactory performance in interpolation across all reaction conditions. To ensure these results were not artifacts, we critically examined our models, assessing features explainability, featurization strategies, the impact of train-test splits, and the influence of conformer sets. This work highlights key practical considerations for modeling catalytic activity using machine learning.

Graphical abstract: Challenges in data-driven catalysis modelling: case study on palladium-NHC catalyzed Suzuki–Miyaura reactions

Supplementary files

Article information

Article type
Edge Article
Submitted
12 Aug 2025
Accepted
11 Nov 2025
First published
09 Jan 2026
This article is Open Access

All publication charges for this article have been paid for by the Royal Society of Chemistry
Creative Commons BY-NC license

Chem. Sci., 2026, Advance Article

Challenges in data-driven catalysis modelling: case study on palladium-NHC catalyzed Suzuki–Miyaura reactions

V. A. Voloshkin, C. Valsecchi, F. Medina, L. Lefort, M. Muuronen, M. Jouffroy and S. P. Nolan, Chem. Sci., 2026, Advance Article , DOI: 10.1039/D5SC06138E

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.

To request permission to reproduce material from this article in a commercial publication, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party commercial publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements