Bisphosphine ligand conformer selection to enhance descriptor database representation: improving statistical modelling outcomes

Jamie A. Cadge; Sierra D. Hart; Richard C. Walroth; Kyle A. Mack; Matthew S. Sigman

doi:10.1039/D5SC04691B

View PDF Version

Open Access Article

This Open Access Article is licensed under a
Creative Commons Attribution 3.0 Unported Licence

DOI: 10.1039/D5SC04691B (Edge Article) Chem. Sci., 2025, Advance Article

Bisphosphine ligand conformer selection to enhance descriptor database representation: improving statistical modelling outcomes

Jamie A. Cadge† ^ab, Sierra D. Hart†^a, Richard C. Walroth^c, Kyle A. Mack^c and Matthew S. Sigman*^a
^aDepartment of Chemistry, University of Utah, Salt Lake City, Utah 84112, USA. E-mail: sigman@chem.utah.edu
^bDepartment of Chemistry, University of Bath, Bath, BA2 7AY, UK
^cDepartment of Small Molecule Process Chemistry, Genentech Inc., San Francisco, California 94080, USA

Received 25th June 2025 , Accepted 29th September 2025

First published on 29th September 2025

Abstract

A foundational consideration in the development of computationally derived molecular feature libraries is the generation and selection of conformers. It has been shown that several feature values have a degree of conformer depencency – which may have significant mechanistic implications, partiticulary in the field of homogeneous enantioselective catalysis. However, the computational cost of calculating conformers often prohibits this analysis from being performed, especially when large flexible systems are involved. We report here a practical, chemically-intutive conformer selection tool for bisphosphine-ligated palladium(II) dichloride complexes that provide a good balance between representation and computational cost. Conformer-weighted features generated from this method were applied to two previous statistical modelling case studies, where weighted features improve model quality with respect to predictive power. This selection methodology has the potential to be applied to a range of complex molecular systems beyond bisphosphine-ligated organometallic complexes.

Introduction

The use of diverse computationally derived molecular descriptor libraries has been crucial for interfacing data science tools with organic reaction development.¹ Recent examples of such libraries from our group and others include organic compounds such as aryl halides,² carboxylic acids and amines³ as well as monodentate⁴ and bidentate⁵ ligands in organometallic chemistry. The extracted molecular features can be used in downstream machine learning (ML) and statistical modelling efforts. These applications can include the deconvolution of reaction mechanisms, definition of structure function relationships, and prediction of optimal reaction conditions.⁶ These libraries often consist of “ground-state” computed structures from which a series of molecular features can be extracted (Fig. 1A). A key assumption in these efforts is that ground-state features can be used to explain trends in reactivity or selectivity akin to the use of Hammett plots and other linear free-energy relationships in physical organic chemistry.⁷


	Fig. 1 Introduction.

A further consideration when constructing feature libraries is the inclusion of conformationally derived descriptors as both steric and electronic properties can be sensitive to dynamics and conformation. For example, Sterimol⁸ (a steric measurement of the maximum and minimum lengths perpendicular to a pre-defined axis)⁹ and V_min (an electronic feature commonly used to describe the σ-donating ability of monodentate phosphine ligands)¹⁰ can vary depending on the conformation a molecule adopts. Reaction energy barriers can have a range of up to 10 kcal mol⁻¹, as a result of structure conformation, and it is challenging to know a priori which conformer(s) are required to explain reactivity, i.e., in a transition state or intermediate involved in the turnover-limiting or selectivity-determining step.¹¹ This phenomenon is of particular significance in the field of homogeneous catalysis.

In this context, incorporation of conformational information into a descriptor library building campaign is dependent on two key factors: ease of automation and computational costs. The former has been enabled by packages such as AutoQChem,¹² AQME¹³ and molli¹⁴ from the Doyle, Paton and Denmark labs, respectively, as well as a recent workflow developed by our group.³ These methods often utilize density functional theory (DFT) to facilitate accurate featurization. However, as molecules increase in size and complexity (e.g., organometallic species), DFT incurs a significant computational cost that can be limiting on scale.¹⁵ This becomes especially challenging when incorporating conformational ensembles. Our previous approach to featurized bisphosphine ligands used a force-field approach followed by ensemble pruning to five conformers based on RMSD atomic positions. From these five conformers, the lowest energy conformer defined through DFT was used for downstream featurization and modelling efforts.^5e While this feature library has been successful in various optimisation and experimental design campaigns,^2c,5e we hypothesised this approach may be inadequate for the representation of the energetically accessible ligand conformer landscape. Additionally, this may limit the predictive ability and/or the domain of applicability of modelling tasks (Fig. 1B).

When considering the design of a DFT feature library for systems of high molecular complexity and conformational flexibility, there is clearly a need for effective conformer selection. Herein, we report the development of a feature-based approach for effectively sampling the bisphosphine ligand conformational landscape. Specifically, a privileged geometric feature, bite angle, was used to select conformers derived from a library of bisphosphine-ligated palladium(II) dichloride complexes. This strategy functions as an effective compromise between ensemble representation and computational cost. To evaluate this approach, two previous challenging statistical modelling campaigns were examined, where inclusion of conformer-derived features was found to improve predictive power. We anticipate that with the appropriate selection of a feature or features, they could be used for the sampling of other systems of high complexity.

Selection and calculation of complexes

Several strategies towards conformer selection have been reported, including tools such as CREGEN from Grimme et al.,¹⁶ COSMOconf from Klamt and co-workers,¹⁷ the ReSCoSS workflow (Udvarhelyi, Rodde and Wilcken)¹⁸ and CONFPASS from Goodman and co-workers.¹⁹ Many of these methods rely on sampling conformers by energy, RMSD atom deviations or changes in dihedral angles. For complex organometallic structures, this can result in impractically large ensembles depending on computational resources. To further down select and lower the computational cost, we hypothesized that conformer selection should be informed by a chemically intuitive feature(s). We chose to examine the accuracy trade-off between the computational approaches used, and thereby the cost incurred for calculation of conformation ensembles (vide infra).

To accomplish this, 12 palladium(II) dichloride complexes were selected from our group's previously computed bisphosphine descriptor library,^5e to represent scaffold variance across chemical space. This was visualized using t-distributed stochastic neighbour embedding (t-SNE) and hierarchal clustering (n = 10) (Fig. 2A).²⁰ This feature space was constructed with predominantly free ligand descriptors as this was presumed to be more generally representative of ligand diversity compared to metal complex descriptors. Pleasingly, these dimensionality reduction and clustering techniques intuitively categorized ligand backbones together and informed the subsequent selection of 12 bisphosphine ligands for calculation.²¹ Fig. 2B shows the diversity of ligands used in the following analyses, including ligands with alkyl backbones (L1, L8), aromatic backbones (L2, L3, L7 and L9–L12), as well as those based on ferrocene (L4–L6).


	Fig. 2 (A) Selection of representative ligand using clustered chemical space and (B) selected ligands by cluster. Ar¹ = 3,5-^tBu-4-OMe-Ph; Ar² = 4-(CF-di-CF₃)-Ph.

Using the initial library of DFT computed structures as starting coordinates, conformer ensembles for these ligands were generated using the open-source CREST program (where conformers are differentiated using rotational constants).¹⁶ A 5 kcal mol⁻¹ energy window using the GFN2-xTB//GFN-FF composite method (Fig. 3A) was applied to each conformer search.²² This method was chosen as structure optimization at the GFN-FF level has been shown to produce geometries with high accuracy when compared to other force field approaches.²³ Structurally redundant conformers were removed using a PCA/k-means clustering technique of relevant dihedral angles (as implemented in CREST).²⁴


	Fig. 3 (A) Computational methods for generating “Set 1” and “Set 2” structures and bisphosphine ligand features collected and (B) comparison of “Set 1” and “Set 2” features: (top) comparison of absolute average % differences in feature values; (middle) linear regression models comparing DFT-refined and non-DFT-refined structure feature values and (bottom) description of the three features used in the linear regression models. For a description of all other features used here, see the SI.

We then probed whether the geometries produced using this workflow were sufficiently accurate for generating ligand features, which would remove the costly DFT geometry refinement step. To evaluate this, subsequent DFT single point correction (SPC) calculations, required for property collection, were performed on the CREST geometries at the PBE(0)-D3(BJ)/def2-TZVP level of theory. The Perdew–Burke–Ernzerhof (PBE) functionals²⁵ were selected as they have been previously used in our library building campaigns^4b,5e and demonstrated to effectively predict ³¹P NMR chemical shifts.²⁶ These calculations form “Set 1”. These were then compared to calculations defined as “Set 2”, in which the CREST generated structures were calculated at the PBE0-D3(BJ)/def2-TZVP//PBE-D3(BJ)/def2-SVP level of theory prior to featurization.

For each Set, a representative series of steric, geometric, and electronic descriptors were collected for comparison.²⁷ In the case of steric features, ligand equivalent cone angle (derived from a solid angle calculation), percentage buried volume (%V_bur) at Pd (with radii, r, between 2 Å and 7 Å, in increments of 1 Å), total molecular volume, and the solvent accessible surface area (SASA) at Pd were selected. For geometric descriptors, Pd–Cl and Pd–P bond lengths, and bite angle (∠P–Pd–P) were selected. Frontier molecular Kohn–Sham orbital (HOMO and LUMO) energies and molecular dipole were chosen as examples of global electronic descriptors. Atom-specific properties included atomic partial charges (on Pd, P and Cl atoms) derived from a natural population analysis (NPA) calculation and isotropic and anisotropic ³¹P NMR chemical shifts. In instances where there can be more than one possible value for a descriptor (e.g., Pd–P bond length or P NPA charge), feature values were averaged.

This comparison provides information on whether the time-intensive geometry refinement step of this computational workflow could be circumvented, which would allow for more conformations to be computed.

Effect of DFT geometry refinement on GFN-FF optimized structures

To compare the quality of the features from calculation Set 1 and Set 2, the average (over the 12 ligands) percentage difference between the descriptors were initially evaluated. Fig. 3B (top) shows the comparison of lowest energy DFT-PBE0 feature values. Similar analyses for Boltzmann-weighted average descriptors (298 K) as well as the maximum and minimum values are given in the SI.

For steric and geometric features, there is generally good agreement between Set 1 and Set 2, where average differences are approximately 5% or less. In contrast, there is less agreement between feature values from Set 1 and Set 2 for electronic descriptors. For example, LUMO energy and anisotropic ³¹P NMR chemical shift show significant differences in their feature values (>15%). This indicates that both the global and atom-level electronic structure are poorly described in structures that did not undergo DFT geometry refinement.

While the absolute electronic feature values between the two calculation Sets are different, their relative values may have better agreement. To test this, the collinearity between the two Sets was considered (Fig. 3B, middle). However, in all cases, poor correlations (R² < 0.5) were observed between feature values in Set 1 and Set 2. Notably, the anisotropic ³¹P NMR chemical shift gave an R² value of 0.24. Similar results were obtained with HOMO energy, LUMO energy and isotropic ³¹P NMR chemical shift (giving R² = 0.64, 0.56, and 0.46, respectively). We suspect these differences result from a mismatch in the electronic structure theory methods used (i.e., different optimized local minima between semi-empirical vs. DFT and/or inaccurate bond lengths/angles with semi-empirical methods) to determine the complex geometries. Resultant minor structural changes between Set 1 and Set 2 are enough to cause divergences in the electronic structure of the metal complex and, therefore, electronic features obtained. Unsurprisingly, correlations observed with steric and geometric features were significantly better. These show a good linear relationship between descriptors in Set 1 and Set 2 (R² > 0.8, see SI for full set).

Given these significant differences observed between electronic descriptors obtained from DFT-level geometry refinement and geometries obtained from CREST, structures from the latter (Set 1) were deemed inappropriate, even in relative terms. Additionally, the accuracy of electronic features is crucial as these have been demonstrated to be key in previous modelling efforts for both mono-²⁸ and bisphosphine ligands.^5d,e However, due to the size and complexity of the bisphosphine complexes used in this study, as mentioned above, DFT-level geometry refinement on all complexes generated from a conformer search would incur significant computational cost if this approach was extended to the entire bisphosphine ligand library of >600 ligands. As the steric and geometric features obtained from CREST were in good agreement with those derived from a subsequent DFT refinement, we considered whether ensembles could be pruned based on one (or a combination of) steric and geometric features. Notably, a similar approach was taken in the kraken monophosphine ligand library, where representative conformers were selected based on a number of steric descriptors.^4b

Selection of conformers using geometric and steric features

In choosing features to use in the selection of conformers for calculation at the DFT level, we took inspiration from previous descriptors used for bisphosphine ligands in the literature. For example, bite angle has been an informative descriptor of bisphosphine ligands inspiring the development of many transition metal-catalysed processes such as hydroformylation.²⁹ Additionally, this geometric descriptor encompasses both an electronic and steric effect. With increasing bite angle, the symmetry and energy of the coordinated metal's frontier orbitals change leading to a subsequent destabilization of a metal complex.³⁰ This destabilization, in several cases, enhances reactivity such as a higher propensity to undergo oxidative addition. Bite angle also impacts the steric profile of the metal centre that substrates can access during catalysis (akin to a binding pocket).³¹ Narrow bite angles can be used to alleviate steric interactions which, for example, can lead to increased transition state stability. Other descriptors often used are the equivalent cone angle and %V_bur. The equivalent cone angle was originally developed by Weigand and co-workers and serves as a useful analogue to Tolman's cone angle at bisphosphine-ligated metal complexes.³² %V_bur pioneered by Nolan and Cavallo et al. has also been used to effectively describe the steric environment of both mono- and bisphosphine ligands.³³

Before examining steric and geometric features for choosing conformers, selection based on the GFN2-xTB energy was first investigated. This would provide a good comparison with selection based on steric and geometric features for all 12 bisphosphine palladium(II) dichloride complexes (Fig. 4A). Conformers were selected based on their equidistant GFN2-xTB energy values, i.e., taking the minimum and maximum values as well as evenly distributed points in between. For practical purposes, up to ten conformers in each selection was used to acquire the maximum conformer diversity at a reasonable computational cost. The use of five conformers was also investigated but gave inadequate representation – especially with larger conformer ensembles.


	Fig. 4 (A) Depiction of possible conformer selection methods investigated here and analysis of both methods for two ligands: (B) L11 with a total of 17 conformers with (top) selection based on GFN2-xTB energy and bite angle and (bottom) performance in the full calculated DFT ensemble. (C) Shows equivalent analysis with L9 with a total of 55 conformers.

Selections for complexes of ligand L11 and L9 are depicted in Fig. 4B and C, respectively as illustrative examples, with the average and largest conformer ensemble sizes, respectively (for analysis of the remaining ten ligands, see the SI). Selection based on GFN2-xTB energies gives a structurally limited ensemble. For example, the distribution of bite angles obtained is limited compared to the entire ensemble i.e., the range of the pruned ensemble is less than that of the full ensemble (4° vs. 6° for L9). Similar observations are made with ligand %V_bur values. To obtain better structural diversity, a similar selection methodology was applied using bite angle. In addition to increased structural diversity in the new conformer ensemble, this approach also resulted in a good distribution of GFN2-xTB energies across the full 5 kcal mol⁻¹ window. It was determined that the lowest GFN2-xTB energy conformer would also be included (if not one of those selected) to further increase the conformer energy span.³⁴

As this strategy provided a good coverage of the conformer ensemble in terms of energy and structural diversity, we then examined the effects of this treatment in the full DFT-generated conformer ensembles in calculation Set 2 (Fig. 4B and C, bottom). Given the diversity of ligands used in this study, this method provided a simulation if this selection method would be applied to the entire bisphosphine ligand library. DFT-level feature distributions for the Set 2 ensembles of ligands L11 and L9 are again analysed. Equivalent cone angle, bite angle, anisotropic ³¹P NMR chemical shift and the HOMO energy were chosen as representative features (see SI for analysis with all features). Dots highlighted on the distributions in Fig. 4B and C (bottom) show the feature values of bite angle-selected conformers to directly compare with the full DFT conformer ensemble. In the case of ligand L11 with an average size ensemble, there is an adequate distribution of selected conformers for all four features indicated by their ranges being nearly identical. However, for the larger ensemble of ligand L9, there is reduced coverage of the DFT ensemble by the selected conformers where some of the extrema of feature values are missed upon selection. For example, smaller values of solid cone angle and bite angle are not included in the selection method. Nevertheless, maintaining a practical view on what calculations are performed in the context of a large ligand library building campaign, we view this as a reasonable compromise between conformer feature representation and computational cost. For the ligands examined in this workflow preparation, a comparison was made between DFT optimization times required for the full ensemble of ligands and the representative selection of ligands. By reducing the conformational ensemble size to 11 or fewer ligands, the total time required for DFT geometry refinement was reduced by 70.1%.

Application of selection method in bisphosphine modelling campaigns

With a conformer selection methodology established, we generated new ligand features analogous to those from our previously published bisphosphine library which now include conformer-weighting. Conformer-weighted features include lowest energy conformer value, minimum and maximum feature values as well as Boltzmann-weighted averaged features (at 298 K) and conformer arithmetic mean values giving a total of 2088 features. This set was reduced to features that were conformationally dependent (i.e., those showing variance in feature value across an ensemble). For features that showed low variance across an ensemble, only the Boltzmann-weighted average value was retained, reducing the feature set to less than 1300 descriptors.³⁵ To test the performance of the new ligand features, they were applied in two bisphosphine modelling case studies previously reported by our group. Both of these examples represent small dataset sizes where model building can be a challenge but is often the reality the chemical sciences.³⁶

Hayashi–Heck cross-coupling regioselectivity

The first case study for feature comparison concerns a palladium-catalysed Hayashi–Heck cross-coupling reaction (Fig. 5A).^5e Here, the arylation regioselectivity (ΔΔG^‡) was modelled using multivariate linear regression (MLR). However, prediction of validation ligands led to mixed results with some examples giving a mean absolute error (MAE) of up to 1.8 kcal mol⁻¹. The reason we previously presented for these types of poor predictions was due to the structures being outside the domain of model applicability. Therefore, we hypothesised that our new conformer-weighted ligand features could enhance the predictive ability of validation ligands. With the full set of ensemble features (a combination of conformer-weighted and lowest energy conformer features), a three step forward stepwise model search was conducted using 22 ligands to train models and tested on 30 ligands, which were defined in the previous study.^5e This gave a model where ΔΔG^‡ is described by the conformer maximum ³¹P NMR chemical shift and the Boltzmann-weighted average of the Pd–P σ-bonding occupancy (from a NPA calculation). Interestingly, both of these terms are derived from the phosphorus atom with the smallest %V_bur (r = 3.0 Å). Additionally, the P–R σ-bonding energy from the free ligand is also present in the model equation. Together these features indicated a strong electronic influence of the ligand on the observed regioselectivity similar to the previously reported statistical model. Moreover, the steric dependence incorporated into the workflow to define these features may infer that experimental regioselectivity is based on binding or donating ability of the smaller of the two phosphorus donors. To assess the overall statistical model quality, we used the R², mean absolute error (MAE) and root mean squared error (RMSE) of both the training and test set ligands. Overall, the generated three-term MLR model gave adequate training (R² = 0.76, MAE = 0.30 kcal mol⁻¹, RMSE = 0.41 kcal mol⁻¹) and validation statistics (R² = 0.57, MAE = 0.51 kcal mol⁻¹, RMSE = 0.64 kcal mol⁻¹). The prediction error (the absolute difference between predicted and measured ΔΔG^‡) of the training and test split is also described in Fig. 5B, left. Ligands in the training and test sets are represented as a distribution (from a kernel density estimation) to allow ready visualization of the error of individual datapoints. Unsurprisingly, this analysis revealed a wider distribution of the test set of ligands compared to those in the training set.


	Fig. 5 (A) MLR model for Hayashi–Heck cross-coupling regioselectivity including examples of representative ligands, (B) analysis of model performance with and without conformer-weighted features represented as distributions of individual ligand absolute prediction error and (C) analysis of feature conformer dependence. For a full definition of model features see the SI.

To investigate the effect of conformer-weighting on model performance, a model was generated using the analogous lowest energy features. Compared to the model containing the conformer-weighted features, the statistics of the training set were somewhat preserved when using the lowest energy conformer features (R² = 0.46, MAE = 0.48 kcal mol⁻¹, RMSE = 0.61 kcal mol⁻¹). However, when evaluating the validation set the model statistics are significantly poorer (R² = 0.29, MAE = 0.69 kcal mol⁻¹, RMSE = 0.82 kcal mol⁻¹). This can also be seen in the prediction error distribution (Fig. 5B, right). Compared to the error distribution of the model generated from conformer-weighted features, there are a greater number of datapoints that have a higher prediction error.

From the overall model statistics and examining the individual prediction errors of ligands, there is improvement when including conformer-weighting into bisphosphine ligand features. To understand the origin of the conformational impact of the features on model performance, violin plots of the feature values were constructed (Fig. 5C). Four representative ligands are depicted with the highest and lowest ΔΔG^‡ as well as two mid-range ΔΔG^‡ values. For electronic features, one would expect a lower conformational dependency compared to steric or geometric features, which is consistent with the observed distributions. Nevertheless, there are modest distributions of feature values across the ensembles, in keeping with observations in Fig. 4B and C. In the case of the ³¹P NMR chemical shift, there is a small distribution of feature values for the most selective ligand (L13), where the least selective ligand has a range of chemical shift of ∼30 ppm. Similar trends are apparent with the P σ-bonding occupancy and P–R σ-bonding energy features. The conformational dependence of these features provided further evidence that incorporating dynamic information improves the MLR model performance.

Sulfonimidamide aryl carbonylation enantioselectivity

Following the increase in predictivity of an MLR model by including conformer-weighted features in the Hayashi–Heck cross-coupling case study, we turned our attention to a more complex statistical modelling challenge. In this second case study, a data-driven ligand selection was used to design a training set for evaluation against an enantioselective aryl carbonylation of sulfonimidamides (Fig. 6A). From this initial screen, a MandyPhos-type ligand L17 was identified to produce 100% conversion and excellent enantioselectivity (96 [thin space (1/6-em)]

4 er). However, a linear statistical model to explain the resultant enantioselectivity (and perhaps prediction of a better performer) was not found. In lieu of this, a second round of ligands was also evaluated using similarity to the best performing ligand as determined through chemical space analysis. A considerably more diverse set of bisphosphine ligands were selected for this reaction compared to the previous Hayashi–Heck example, the comparison of the two case study datasets represented in t-SNE chemical space (introduced in Fig. 2) is shown in Fig. 6B. Perhaps as a result of the increased complexity of this dataset, a linear statistical model to explain the resultant enantioselectivity was not found. A total of 53 ligands were evaluated for the reaction, and still with this larger dataset, no linear statistical models were found, which we assumed was a result of the increased complexity of the dataset. This provided us with an opportunity to directly compare how the newly developed conformer-weighted features would perform on a challenging dataset.


	Fig. 6 (A) Sulfonamidamide aryl carbonylation reaction used for modelling case study 2 with representative ligands show. (B) A comparison of the ligand diversity in the Hayashi–Heck and sulfonamidamide carbonylation datasets using the t-SNE chemical space from Fig. 2. Ar = 3,5-bis(trifluoromethyl)phenyl.

Recent analysis of the bisphosphine ligands in the sulfonamidamide carbonylation dataset showed that eight ligands originally used had oxidised (the majority giving low enantioselectivity) and these were removed from our analysis (see SI). Using the enantioselectivity data for the remaining 42 bisphosphine ligands with 4-methylmorpholine base, we opted for a decision tree regression modelling strategy (Fig. 7A). We speculated that the use of a non-linear modelling architecture would be able to provide a correlation required to include the high diversity of ligand structural types. This model was constructed following feature reduction from the original 1242 features down to four using the permutation feature importance metric (on the test set) and subsequent hyperparameter tuning (see SI for full details) with a randomized 33 [thin space (1/6-em)] :9 train:test split. The four features used in this model are: the Boltzmann-weighted average of the chloride NPA charge (maximum value of the two possible chloride ligands); the Boltzmann-weighted average of the phosphorus-backbone angle; the Boltzmann-weighted average of the P–Pd bond occupancy (average value of the two phosphine donors) and the conformer minimum value of the free ligand P–R bond occupancy (depicted in Fig. 7C). This combination of features indicates that there is an electronic and geometric dependence on the enantioselectivity for the evaluated reaction. For example, the chloride NPA partial charge provides an electronic readout of the phosphorus donor atom trans to the chloride. The two phosphorus-backbone angles could be related to the flexibility of the phosphorus donors with respect to the backbone that enforce the chiral metal environment. Details on the feature importance from the decision tree regression model can be gleaned by performing a SHAP analysis (Fig. 7B).³⁷ This ranks features in order of importance: Boltzmann-weighted average of the chloride NPA charge; Boltzmann-weighted average of the phosphorus–backbone angle; Boltzmann-weighted average of the average P–Pd bond occupancy and the conformer minimum value of the free ligand P–R bond occupancy


	Fig. 7 (A) Analysis of the decision tree regression model used for the sulfonamidamide carbonylation dataset. (B) SHAP analysis highlighting the relative importance of each feature. (C) Description of the features used in the decision tree regression model and analysis of their conformer dependence.

Using the same metrics used for the MLR model analysis with the Hayashi–Heck cross-coupling reaction, the decision tree regression model provided excellent statistics for the training set (R² = 0.96, MAE = 0.08 kcal mol⁻¹, RMSE = 0.10 kcal mol⁻¹). Aside from a clear outlier in the test set, the validation statistics were adequate (R² = 0.45, MAE = 0.36 kcal mol⁻¹, RMSE = 0.53 kcal mol⁻¹). Analysis was performed on this outlier ligand, compared to the better predicted ligands in the test set with respect to their SHAP values and decision tree paths (see SI, Section 6.2.3). Unfortunately, this did not provide conclusive reasoning for the origin of this outlier. Removing this outlier gave R² = 0.86, MAE = 0.22 kcal mol⁻¹ and RMSE = 0.26 kcal mol⁻¹. Conscious of the decision tree regression model overfitting the data, we performed a series of cross-validation analyses. A leave-one-out (LOO) cross-validation gave R² = 0.49, MAE = 0.31 kcal mol⁻¹ and RMSE = 0.31 kcal mol⁻¹. Additionally, a five-fold cross-validation gave an average R² = 0.31, MAE = 0.39 kcal mol⁻¹ and RMSE = 0.51 kcal mol⁻¹. Following these cross-validation tests, the effect of the random state and the training:test split ratio was examined. Using 100 randomly generated starts, an average test MAE = 0.38 kcal mol⁻¹ was obtained. Changing the split ratio from 0.1 to 0.5 in 0.1 increments, gave an average test MAE = 0.43 kcal mol⁻¹. While these analyses may suggest some degree of overfitting the data, we determined this model was adequate for the examination of conformer ensemble importance in bisphosphine ligand feature generation.

As before, the equivalent decision tree regression model was generated with the same hyperparameters and training:test split ratio was generated with the analogous lowest energy conformer features. In this model, almost identical training set statistics were obtained (R² = 0.94, MAE = 0.10 kcal mol⁻¹, RMSE = 0.12 kcal mol⁻¹). However, the test set statistics showed significant deterioration (R² = 0.06, MAE = 0.56 kcal mol⁻¹, RMSE = 0.70 kcal mol⁻¹). This can also be shown by comparing the absolute error distributions of both models in Fig. 7A. This indicated that use of lowest energy conformer features, compared to their conformer-weighted congeners, gave rise to a poorly predictive model.

Next, we investigated the degree of conformational dependence on the four features used in the decision tree regression model (Fig. 7C). This analysis used ligands L18, L19, L11 and L20, which are representative of the range of enantioselectivities obtained (full analysis of all ligands used in the data set is provided in the SI). With the exception of ligand L19, all ligands exhibit conformational flexibility. This is consistent with the hypothesis that including conformational flexibility in feature design can enhance model performance.

Conclusions

In conclusion, we have developed a method of selecting conformers for bisphosphine-ligated palladium(II) dichloride complexes that provides balance between representation and practicality in terms of computational time. In addition to this, strategy relies on chemical intuition by utilizing bite angle, a historically important stereo-electronic bisphosphine feature for conformer down selection. This was accomplished by the finding that geometries from the CREST conformer search were sufficiently accurate, with respect to their steric and geometric features, compared to DFT-refined geometries. Balance between representation and computational cost as well as providing a chemically intuitive means of conformer selection are critical considerations when constructing a DFT feature library. The new descriptor set was applied to statistical modelling campaigns of two previously analysed palladium-catalysed reactions which employ bisphosphine ligands. An MLR model was generated for a Hayashi–Heck cross-coupling case study which showed improved performance with the use of conformer-weighted features compared to their lowest energy conformer equivalents. The increased diversity in ligand sampling for the SIA carbonylation case study, compared to the Hayashi–Heck cross-coupling example, presented additional challenges in modelling campaigns and required the use of a decision tree regressor to describe ligand performance. Using this non-linear modelling technique, we were also able to showcase the improved model performance when using conformer-weighted features. Additional work on enhanced bisphosphine ligand training set design and featurisation are currently underway to address further challenges one might encounter when undertaking modelling campaigns with bisphosphine ligands (e.g., improved feature design) and will be reported in due course. Nevertheless, this chemically intuitive feature-based conformer selection methodology has the potential to be applied to complex molecular systems (i.e., one which is large and flexible) beyond bisphosphine ligands where an important steric or geometric feature is anticipated.

Author contributions

J. A. C., S. D. H. and M. S. S. conceptualised the project. J. A. C. and S. D. H. developed the methodology, ran the calculations and modelled the reaction data. J. A. C., S. D. H., R. C. W., K. A. M. and M. S. S. analysed the data and refined the methodology. J. A. C. wrote the initial draft that was reviewed and edited by J. A. C., S. D. H., R. C. W., K. A. M. and M. S. S. All authors have given approval to the final version of the manuscript.

Conflicts of interest

There are no conflicts of interest to declare.

Data availability

Data for this article, including Python code used for data analysis, conformer selection and modelling, are available on GitHub in this repository: https://github.com/SigmanGroup/BisphosphineConformerSelection. DFT calculation files used in both modelling case studies are available on Zenodo: https://doi.org/10.5281/zenodo.15690854.

Supplementary information is available containing full calculation, modelling details and additional plots depicting conformer selection. See DOI: https://doi.org/10.1039/d5sc04691b.

Acknowledgements

We acknowledge the financial support from the NSF under the CCI Center for Computer Assisted Synthesis (C-CAS) (CHE-2202693) for work completed in the Sigman lab. The support and resources from the Center for High Performance Computing (CHPC) at the University of Utah are gratefully acknowledged. We thank Dr Beck Miller for their helpful insights into non-linear modelling techniques.

Notes and references

(a) D. J. Durand and N. Fey, Computational Ligand Descriptors for Catalyst Design, Chem. Rev., 2019, 119, 6561–6594 CrossRef CAS PubMed; (b) D. J. Durand and N. Fey, Building a Toolbox for the Analysis and Prediction of Ligand and Catalyst Effects in Organometallic Catalysis, Acc. Chem. Res., 2021, 54, 837–848 CrossRef CAS PubMed; (c) W. L. Williams, L. Zeng, T. Gensch, M. S. Sigman, A. G. Doyle and E. V. Anslyn, The Evolution of Data-Driven Modeling in Organic Chemistry, ACS Cent. Sci., 2021, 7, 1622–1637 CrossRef CAS PubMed; (d) J. M. Crawford, C. Kingston, F. D. Toste and M. S. Sigman, Data Science Meets Physical Organic Chemistry, Acc. Chem. Res., 2021, 54, 3136–3148 CrossRef CAS PubMed; (e) Z. Tu, T. Stuyver and C. W. Coley, Predictive chemistry: machine learning for reaction deployment, reaction development, and reaction discovery, Chem. Sci., 2023, 14, 226–244 RSC; (f) P. Raghavan, B. C. Haas, M. E. Ruos, J. Schleinitz, A. G. Doyle, S. E. Reisman, M. S. Sigman and C. W. Coley, Dataset Design for Building Models of Chemical Reactivity, ACS Cent. Sci., 2023, 9, 2196–2204 CrossRef CAS PubMed.
(a) S. K. Kariofillis, S. Jiang, A. M. Żurański, S. S. Gandhi, J. I. Martinez Alvarado and A. G. Doyle, Using Data Science To Guide Aryl Bromide Substrate Scope Analysis in a Ni/Photoredox-Catalyzed Cross-Coupling with Acetals as Alcohol-Derived Radical Sources, J. Am. Chem. Soc., 2022, 144, 1045–1055 CrossRef CAS PubMed; (b) T. Tang, A. Hazra, D. S. Min, W. L. Williams, E. Jones, A. G. Doyle and M. S. Sigman, Interrogating the Mechanistic Features of Ni(I)-Mediated Aryl Iodide Oxidative Addition Using Electroanalytical and Statistical Modeling Techniques, J. Am. Chem. Soc., 2023, 145, 8689–8699 CrossRef CAS PubMed; (c) L. van Dijk, B. C. Haas, N.-K. Lim, K. Clagg, J. J. Dotson, S. M. Treacy, K. A. Piechowicz, V. A. Roytman, H. Zhang, F. D. Toste, S. J. Miller, F. Gosselin and M. S. Sigman, Data Science-Enabled Palladium-Catalyzed Enantioselective Aryl-Carbonylation of Sulfonimidamides, J. Am. Chem. Soc., 2023, 145, 20959–20967 CrossRef CAS PubMed.
B. C. Haas, M. A. Hardy, S. Sowndarya S. V, K. Adams, C. W. Coley, R. S. Paton and M. S. Sigman, Rapid prediction of conformationally-dependent DFT-level descriptors using graph neural networks for carboxylic acids and alkyl amines, Digit. Discov., 2025, 4, 222–233 RSC.
(a) J. Jover, N. Fey, J. N. Harvey, G. C. Lloyd-Jones, A. G. Orpen, G. J. J. Owen-Smith, P. Murray, D. R. J. Hose, R. Osborne and M. Purdie, Expansion of the Ligand Knowledge Base for Monodentate P-Donor Ligands (LKB-P), Organometallics, 2010, 29, 6245–6258 CrossRef CAS; (b) T. Gensch, G. dos Passos Gomes, P. Friederich, E. Peters, T. Gaudin, R. Pollice, K. Jorner, A. Nigam, M. Lindner-D’Addario, M. S. Sigman and A. Aspuru-Guzik, A Comprehensive Discovery Platform for Organophosphorus Ligands for Catalysis, J. Am. Chem. Soc., 2022, 144, 1205–1217 CrossRef CAS PubMed.
(a) J. Jover, N. Fey, J. N. Harvey, G. C. Lloyd-Jones, A. G. Orpen, G. J. J. Owen-Smith, P. Murray, D. R. J. Hose, R. Osborne and M. Purdie, Expansion of the Ligand Knowledge Base for Chelating P,P-Donor Ligands (LKB-PP), Organometallics, 2012, 31, 5302–5306 CrossRef CAS PubMed; (b) N. Fey, A. Koumi, A. V. Malkov, J. D. Moseley, B. N. Nguyen, S. N. G. Tyler and C. E. Willans, Mapping the properties of bidentate ligands with calculated descriptors (LKB-bid), Dalton Trans., 2020, 49, 8169–8178 RSC; (c) J. De Jesus Silva, N. Bartalucci, B. Jelier, S. Grosslight, T. Gensch, C. Schünemann, B. Müller, P. C. J. Kamer, C. Copéret, M. S. Sigman and A. Togni, Development and Molecular Understanding of a Pd-Catalyzed Cyanation of Aryl Boronic Acids Enabled by High-Throughput Experimentation and Data Analysis, Helv. Chim. Acta, 2021, 104, e2100200 CrossRef CAS; (d) J. Xu, S. Grosslight, K. A. Mack, S. C. Nguyen, K. Clagg, N.-K. Lim, J. C. Timmerman, J. Shen, N. A. White, L. E. Sirois, C. Han, H. Zhang, M. S. Sigman and F. Gosselin, Atroposelective Negishi Coupling Optimization Guided by Multivariate Linear Regression Analysis: Asymmetric Synthesis of KRAS G12C Covalent Inhibitor GDC-6036, J. Am. Chem. Soc., 2022, 144, 20955–20963 CrossRef CAS PubMed; (e) J. J. Dotson, L. van Dijk, J. C. Timmerman, S. Grosslight, R. C. Walroth, F. Gosselin, K. Püntener, K. A. Mack and M. S. Sigman, Data-Driven Multi-Objective Optimization Tactics for Catalytic Asymmetric Reactions Using Bisphosphine Ligands, J. Am. Chem. Soc., 2023, 145, 110–121 CrossRef CAS PubMed.
(a) F. Häse, L. M. Roch, C. Kreisbeck and A. Aspuru-Guzik, Phoenics: A Bayesian Optimizer for Chemistry, ACS Cent. Sci., 2018, 4, 1134–1145 CrossRef PubMed; (b) B. J. Shields, J. Stevens, J. Li, M. Parasram, F. Damani, J. I. M. Alvarado, J. M. Janey, R. P. Adams and A. G. Doyle, Bayesian reaction optimization as a tool for chemical synthesis, Nature, 2021, 590, 89–96 CrossRef CAS PubMed; (c) R. J. Hickman, M. Aldeghi, F. Häse and A. Aspuru-Guzik, Bayesian optimization with known experimental and design constraints for chemistry applications, Digit. Discov., 2022, 1, 732–744 RSC.
(a) L. P. Hammett, Some Relations between Reaction Rates and Equilibrium Constants, Chem. Rev., 1935, 17, 125–136 CrossRef CAS; (b) C. Hansch, A. Leo and R. W. Taft, A survey of Hammett substituent constants and resonance and field parameters, Chem. Rev., 1991, 91, 165–195 CrossRef CAS.
(a) A. Verloop, W. Hoogenstraaten and J. Tipker, in Drug Design, ed. E. J. Ariëns, Academic Press, Amsterdam, 1976, vol. 11, pp. 165–207 Search PubMed; (b) K. C. Harper, E. N. Bess and M. S. Sigman, Multidimensional steric parameters in the analysis of asymmetric catalytic reactions, Nat. Chem., 2012, 4, 366–374 CrossRef CAS PubMed.
A. V. Brethomé, S. P. Fletcher and R. S. Paton, Conformational Effects on Physical-Organic Descriptors: The Case of Sterimol Steric Parameters, ACS Catal., 2019, 9, 2313–2323 CrossRef.
C. H. Suresh, Molecular Electrostatic Potential Approach to Determining the Steric Effect of Phosphine Ligands in Organometallic Chemistry, Inorg. Chem., 2006, 45, 4982–4986 CrossRef CAS PubMed.
(a) A. K. Vitek, T. M. E. Jugovic and P. M. Zimmerman, Revealing the Strong Relationships between Ligand Conformers and Activation Barriers: A Case Study of Bisphosphine Reductive Elimination, ACS Catal., 2020, 10, 7136–7145 CrossRef CAS; (b) M. Besora, A. A. C. Braga, G. Ujaque, F. Maseras and A. Lledós, The importance of conformational search: a test case on the catalytic cycle of the Suzuki–Miyaura cross-coupling, Theor. Chem. Acc., 2011, 128, 639–646 Search PubMed; (c) R. Laplaza, M. D. Wodrich and C. Corminboeuf, Overcoming the Pitfalls of Computing Reaction Selectivity from Ensembles of Transition States, J. Phys. Chem. Lett., 2024, 15, 7363–7370 CrossRef CAS PubMed.
A. M. Żurański, J. Y. Wang, B. J. Shields and A. G. Doyle, Auto-QChem: an automated workflow for the generation and storage of DFT calculations for organic molecules, React. Chem. Eng., 2022, 7, 1276–1284 RSC.
J. V. Alegre-Requena, S. Sowndarya S. V., R. Pérez-Soto, T. M. Alturaifi and R. S. Paton, AQME: Automated quantum mechanical environments for researchers and educators, Wiley Interdiscip. Mol. Sci., 2023, 13, e1663 CrossRef CAS.
A. S. Shved, B. E. Ocampo, E. S. Burlova, C. L. Olen, N. I. Rinehart and S. E. Denmark, molli: A General Purpose Python Toolkit for Combinatorial Small Molecule Library Generation, Manipulation, and Feature Extraction, J. Chem. Inf. Model., 2024, 64, 8083–8090 CrossRef CAS PubMed.
(a) D. Balcells, G. Drudis-Solé, M. Besora, N. Dölker, G. Ujaque, F. Maseras and A. Lledós, Some critical issues in the application of quantum mechanics/molecular mechanics methods to the study of transition metal complexes, Faraday Discuss., 2003, 124, 429–441 RSC; (b) Y. Minenkov, D. I. Sharapa and L. Cavallo, Application of Semiempirical Methods to Transition Metal Complexes: Fast Results but Hard-to-Predict Accuracy, J. Chem. Theory Comput., 2018, 14, 3428–3439 CrossRef CAS PubMed; (c) A. Nandy, C. Duan, M. G. Taylor, F. Liu, A. H. Steeves and H. J. Kulik, Computational Discovery of Transition-metal Complexes: From High-throughput Screening to Machine Learning, Chem. Rev., 2021, 121, 9927–10000 CrossRef CAS PubMed.
P. Pracht, F. Bohle and S. Grimme, Automated exploration of the low-energy chemical space with fast quantum chemical methods, Phys. Chem. Chem. Phys., 2020, 22, 7169–7192 RSC.
A. Klamt, F. Eckert and M. Diedenhofen, Prediction of the Free Energy of Hydration of a Challenging Set of Pesticide-Like Compounds, J. Phys. Chem. B, 2009, 113, 4508–4510 CrossRef CAS PubMed.
A. Udvarhelyi, S. Rodde and R. Wilcken, ReSCoSS: a flexible quantum chemistry workflow identifying relevant solution conformers of drug-like molecules, J. Comput. Aided Mol. Des., 2021, 35, 399–415 CrossRef CAS PubMed.
C. C. Lam and J. M. Goodman, CONFPASS: Fast DFT Re-Optimizations of Structures from Conformation Searches, J. Chem. Inf. Model., 2023, 63, 4364–4375 CrossRef CAS PubMed.
L. van der Maaten and G. Hinton, Visualizing Data using t-SNE, J. Mach. Learn. Res., 2008, 9, 2579–2605 Search PubMed.
The bisphosphine ligand feature space was also represented using principal component analysis and clustered using k-means clustering. A comparison was made between this technique and the t-SNE space to select a total of 12 ligands (see the SI for more details).
An energy window of 10 kcal mol–1 was also investigated and gave similar results in terms of conformer diversity as 5 kcal mol–1 (see SI).
S. Spicher and S. Grimme, Robust Atomistic Modeling of Materials, Organometallic, and Biochemical Systems, Angew. Chem., Int. Ed., 2020, 59, 15665–15673 CrossRef CAS PubMed.
Comparison were made between conformer ensembles at a 10 kcal mol–1 window and those that were unclustered. In both cases, similar structures (from RSMD of coordinate positions) and feature values were obtained, but with an increased number of conformers. See SI for full details.
(a) J. P. Perdew, K. Burke and M. Ernzerhof, Generalized Gradient Approximation Made Simple, Phys. Rev. Lett., 1996, 77, 3865–3868 CrossRef CAS PubMed; (b) J. P. Perdew, K. Burke and M. Ernzerhof, Generalized Gradient Approximation Made Simple, Phys. Rev. Lett., 1997, 78, 1396 CrossRef CAS; (c) C. Adamo and V. Barone, Toward reliable density functional methods without adjustable parameters: The PBE0 model, J. Chem. Phys., 1999, 110, 6158–6170 CrossRef CAS.
(a) S. K. Latypov, F. M. Polyancev, D. G. Yakhvarov and O. G. Sinyashin, Quantum chemical calculations of 31P NMR chemical shifts: scopes and limitations, Phys. Chem. Chem. Phys., 2015, 17, 6976–6987 RSC; (b) S. K. Latypov, S. A. Kondrashova, F. M. Polyancev and O. G. Sinyashin, Quantum Chemical Calculations of 31P NMR Chemical Shifts in Nickel Complexes: Scope and Limitations, Organometallics, 2020, 39, 1413–1422 CrossRef CAS; (c) W. H. Hersh and T.-Y. Chan, Improving the accuracy of 31P NMR chemical shift calculations by use of scaling methods, Beilstein J. Org. Chem., 2023, 19, 36–56 CrossRef CAS PubMed.
Some of the features described here may be categorised as “stereoelectronic”, i.e., having both an electronic and steric component. However, for convinience we have broadly classed them as steric, geometric and electronic. Definitions of all the features used here are provided in the SI.
(a) S. Zhao, T. Gensch, B. Murray, Z. L. Niemeyer, M. S. Sigman and M. R. Biscoe, Enantiodivergent Pd-catalyzed C–C bond formation enabled through ligand parameterization, Science, 2018, 362, 670–674 CrossRef CAS PubMed; (b) T. P. McFadden, R. B. Cope, R. Muhlestein, D. J. Layton, J. J. Lessard, J. S. Moore and M. S. Sigman, Using Data Science Tools to Reveal and Understand Subtle Relationships of Inhibitor Structure in Frontal Ring-Opening Metathesis Polymerization, J. Am. Chem. Soc., 2024, 146, 16375–16380 CrossRef CAS PubMed.
(a) L. A. van der Veen, P. C. J. Kamer and P. W. N. M. van Leeuwen, Hydroformylation of Internal Olefins to Linear Aldehydes with Novel Rhodium Catalysts, Angew. Chem., Int. Ed., 1999, 38, 336–338 CrossRef CAS; (b) F. Agbossou, J.-F. Carpentier and A. Mortreux, Asymmetric Hydroformylation, Chem. Rev., 1995, 95, 2485–2506 CrossRef CAS; (c) S. Chakrabortty, A. A. Almasalma and J. G. de Vries, Recent developments in asymmetric hydroformylation, Catal. Sci. Technol., 2021, 11, 5388–5411 RSC.
(a) S. Otsuka, Chemistry of platinum and palladium compounds of bulky phosphines, J. Organomet. Chem., 1980, 200, 191–205 CrossRef CAS; (b) M.-D. Su and S.-Y. Chu, Theoretical Study of Oxidative Addition and Reductive Elimination of 14-Electron d10 ML2 Complexes: A ML2 + CH4 (M = Pd, Pt; L = CO, PH3, L2 = PH‘2CH2CH2PH2) Case Study, Inorg. Chem., 1998, 37, 3400–3406 CrossRef CAS.
Y. Koide, S. G. Bott and A. R. Barron, Alumoxanes as Cocatalysts in the Palladium-Catalyzed Copolymerization of Carbon Monoxide and Ethylene: Genesis of a Structure–Activity Relationship, Organometallics, 1996, 15, 2213–2226 CrossRef CAS.
T. Niksch, H. Görls and W. Weigand, The Extension of the Solid-Angle Concept to Bidentate Ligands, Eur. J. Inorg. Chem., 2010, 2010, 95–105 CrossRef.
A. C. Hillier, W. J. Sommer, B. S. Yong, J. L. Petersen, L. Cavallo and S. P. Nolan, A Combined Experimental and Theoretical Study Examining the Binding of N-Heterocyclic Carbenes (NHC) to the Cp*RuCl (Cp* = η5-C5Me5) Moiety: Insight into Stereoelectronic Differences between Unsaturated and Saturated NHC Ligands, Organometallics, 2003, 22, 4322–4326 CrossRef CAS.
Similar results were obtained when %Vbur and a combination of bite angle and %Vbur were used as selection criteria. However, due to the historical importance and relevance to bisphosphine ligands, bite angle was used for all subsequent conformer pruning.
A full description of this feature reduction process is given in the SI.
(a) H. Shimakawa, A. Kumada and M. Sato, Extrapolative prediction of small-data molecular property using quantum mechanics-assisted machine learning, npj Comput. Mater., 2024, 10, 11 CrossRef; (b) M. Dabros, H. Münkler, F. Yerly, R. Marti, M. Parmentier and A. Udvarhelyi, Quantum Descriptor-Based Machine-Learning Modeling of Thermal Hazard of Cyclic Sulfamidates, J. Chem. Inf. Model., 2025, 65, 8624–8636 CrossRef CAS PubMed.
S. M. Lundberg and S.-I. Lee, A Unified Approach to Interpreting Model Predictions, Adv. Neural Inf. Process., 2017, 30, 4768–4777 Search PubMed.

Footnote

† These authors contributed equally to this research.

Click here to see how this site uses Cookies. View our privacy policy here.