Mechanism-informed machine learning for rational catalyst design: application to regioselectivity of allyl acetate hydroformylation

Abstract

Hydroformylation is a key strategy for C–C bond formation and the synthesis of value-added aldehydes, and its regioselectivity critically determines downstream efficiency and product applicability. However, the design of highly regioselective catalysts still relies heavily on empirical knowledge from existing experiments, and quantitative predictive methods remain underdeveloped for less-studied functionalized olefin substrates. This work presents, for the first time, a mechanism-informed machine learning model for predicting regioselectivity in hydroformylation. The model identifies steric hindrance, Rh-centered electronic symmetry, hydride charge distribution, and dispersion interactions as the key cooperative factors governing selectivity. Notably, the Rh_Anisotropy descriptor is established as critical: it captures the geometric and electronic environment around the Rh center and provides a quantitative basis for understanding how ligand structure dictates selectivity. Using the trained model, low-cost commercial ligands with high linear-selectivity potential were identified and experimentally validated, with the optimal ligand achieving approximately 98% linear aldehyde selectivity under mild conditions. The model also identified structurally novel candidate ligands, including modifications on the xanthene moiety of the xantphos scaffold, providing clear guidance for future ligand design and optimization. This work establishes a computation-data fusion framework that bridges mechanistic understanding and predictive modeling, offering a general paradigm for the rational design of highly selective catalysts for olefin hydroformylation.
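To put the reported selectivity in energetic terms, the following minimal sketch converts a linear-aldehyde fraction into a linear:branched ratio and an apparent free-energy difference between the two competing pathways, using ΔΔG‡ = RT ln(l/b). It is an illustration only and not the authors' model. It assumes that the quoted percentage is the linear share of the aldehyde products, that selectivity is kinetically controlled by a single selectivity-determining step, and that the temperature is 298.15 K; the abstract states only "mild conditions", so that temperature is a hypothetical placeholder. The function names are likewise illustrative.

```python
import math

R_GAS = 8.314462618  # molar gas constant, J/(mol*K)


def linear_to_branched_ratio(linear_fraction):
    """Return the l:b ratio for a linear fraction strictly between 0 and 1."""
    if not 0.0 < linear_fraction < 1.0:
        raise ValueError("linear_fraction must lie strictly between 0 and 1")
    return linear_fraction / (1.0 - linear_fraction)


def apparent_ddg_kj_per_mol(linear_fraction, temperature_k):
    """Apparent free-energy gap (kJ/mol) favoring the linear pathway.

    Assumes kinetic control by one selectivity-determining step, so that
    l/b = exp(ddG / (R * T)). This is an assumption, not a claim of the paper.
    """
    ratio = linear_to_branched_ratio(linear_fraction)
    return R_GAS * temperature_k * math.log(ratio) / 1000.0


if __name__ == "__main__":
    fraction = 0.98        # approximate linear selectivity quoted in the abstract
    temperature = 298.15   # hypothetical temperature in K (not given in the abstract)
    print(f"l:b ratio = {linear_to_branched_ratio(fraction):.1f}")
    print(f"apparent ddG = {apparent_ddg_kj_per_mol(fraction, temperature):.2f} kJ/mol")
```

Under these assumptions, 98% linear selectivity corresponds to an l:b ratio of about 49:1 and an apparent gap on the order of 10 kJ/mol, which indicates how small an energy difference a descriptor-based model must resolve to rank ligands reliably.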

Article information

Article type: Paper
Submitted: 21 Nov 2025
Accepted: 15 Dec 2025
First published: 23 Dec 2025

Catal. Sci. Technol., 2026, Advance Article

M. Zhang, Y. Sun, Y. Chen, L. Wang, K. Song and H. Gong, Catal. Sci. Technol., 2026, Advance Article, DOI: 10.1039/D5CY01398D
