A machine learning-driven prediction of Hammett constants using quantum chemical and structural descriptors

Abstract

Understanding and predicting chemical reaction behavior is a fundamental challenge in chemistry. The Hammett equation, introduced in 1935, has been a cornerstone in modelling structure-activity relationships, particularly in physical organic chemistry. This study leverages machine learning (ML) to predict Hammett constants (σm and σp) for a diverse set of benzoic acid derivatives. We developed an open-source dataset of over 900 molecules, including meta-, para-, and symmetrically substituted variants, and employed various ML models to predict Hammett constants. Quantum chemical descriptors, combined with Mordred-based electronic, steric, and topological descriptors, were used to train models such as Extra Trees (ET) and Artificial Neural Networks (ANNs). The ANN model achieved the highest accuracy, with a test R2 of 0.935 and an RMSE of 0.084, outperforming other models and a previously developed graph neural networks. Feature importance analysis revealed key descriptors, including NBO charges and HOMO energies, driving the predictions. Applicability domain (AD) analysis identified outliers and compounds outside the AD, ensuring model reliability. This work highlights the potential of ML in predicting Hammett constants, offering a robust tool for chemical reactivity analysis and molecular design.

Supplementary files

Article information

Article type
Paper
Submitted
27 Mar 2025
Accepted
30 May 2025
First published
30 May 2025

Phys. Chem. Chem. Phys., 2025, Accepted Manuscript

A machine learning-driven prediction of Hammett constants using quantum chemical and structural descriptors

V. Saini and R. Kumar, Phys. Chem. Chem. Phys., 2025, Accepted Manuscript , DOI: 10.1039/D5CP01184A

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements