Open Access Article
R. A.
Talmazan
a,
J.
Gamper
b,
I.
Castillo
c,
T. S.
Hofer
*b and
M.
Podewitz
*a
aInstitute of Materials Chemistry, TU Wien, Getreidemarkt 9, A-1060 Wien, Austria. E-mail: maren.podewitz@tuwien.ac.at
bInstitute of General, Inorganic and Theoretical Chemistry, Leopold Franzens University of Innsbruck, Innrain 80/82, 6020 Innsbruck, Austria. E-mail: t.hofer@uibk.ac.at
cInstituto de Química, Universidad Nacional Autónoma de México, Ciudad Universitaria, Ciudad de México, 04510, Mexico
First published on 2nd September 2025
Supramolecular 3d transition-metal catalysts are large, flexible systems with intricate interactions, resulting in complex reaction coordinates. To capture their dynamic nature, we developed a broadly applicable, high-throughput workflow, that leverages quantum mechanics/molecular mechanics molecular dynamics (QM/MM MD) in explicit solvent, to investigate a Cu(I)-calix[8]arene-catalysed C–N coupling reaction. The system complexity and high amount of data generated from sampling the reaction requires automated analyses. To identify and quantify the reaction coordinate from noisy simulation trajectories, we applied interpretable machine learning techniques (Lasso, Random Forest, Logistic Regression) in a consensus model, alongside dimensionality reduction methods (PCA, LDA, tICA). By employing a Granger Causality model, we move beyond the traditional view of a reaction coordinate, by defining it instead as a sequence of molecular motions leading up to the reaction.
Complimentary to experimental advances, computational chemistry has played a key role in the design and understanding of catalytic systems, by elucidating reaction mechanisms.9–12 Despite considerable efforts, quantum chemistry is limited in its predictive abilities.13 As the systems grow in complexity, there is a need to improve the chemical model that describes the catalytic system in its environment.14 Often, the errors introduced by a too simplistic chemical model exceed those arising from the use of an approximate theoretical methodology such as Density Functional Theory (DFT).13,15 Consequently, the goal is to create a “digital twin” of the reaction flask that is an in silico model, which fully replicates experimental conditions: a catalyst in explicit solvent, at finite temperature and pressure. As a description with full ab initio quantum chemistry is not feasible, a tailored multiscale strategy is required, accounting for conformational flexibility, explicit solvation, and the dynamic nature of reactions.
Although many computational mechanistic studies are still performed on a single structure, conformer searches recently gained popularity14,16,17 thanks to easily accessible tools,18,19 facilitating a transition to structure ensembles, which provide a more complete picture of the reactivity.14,17 The standard implicit solvation model can favour unrealistic, very compact structures with many intramolecular hydrogen bonds.7,19 Ideally, a more realistic model would account for explicit solvent molecules either through a full condensed phase calculation or through a microsolvation approach.20,21
Molecular dynamics simulations can describe the dynamic nature of the catalysts bringing its description closer to operando conditions, which can reveal new insights into the reaction mechanism.22–25 While this treatment increases computational demands, by relying on multiscale methods, this hurdle can be greatly diminished. Quantum mechanics/molecular mechanics (QM/MM) models describe the catalytic centre and substrates at a QM level, while the surrounding environment is treated with MM.26–28 A transition to QM/MM molecular dynamics (MD) in explicit solvent allows for sampling timescales magnitudes higher than in a pure QM approach. Due to thermal fluctuations, repeated sampling is a requirement for statistically relevant information regarding energy barriers and structural information.
While the setup of such multiscale methods is a challenge in itself, a vast amount of data is generated from the simulations. Chemical knowledge can be extracted from these data, for example in the form of the reaction coordinate i.e., the movements of atoms which take place as the chemical reaction proceeds. Knowledge of the reaction coordinate allows for further reduction in computational costs, by enabling the use of enhanced sampling methods geared towards overcoming large energy barriers, such as those observed in chemical reactions.29 The identification of the reaction coordinate from simulation data can be performed in a variety of ways.30 While machine learning (ML) approaches can be used to evaluate the data and extract condensed results from simulations, such as the committor function,24,31 complex neural network approaches generally are not directly interpretable.32 Whether through the use of convolutional neural networks or graph neural networks,33,34 a subsequent analysis is still needed to render the reaction coordinate interpretable. Instead, interpretable, or explainable, machine learning techniques, such as Decision Trees, Random Forests or Logistic Regression, as well as path sampling-based methods, such as predictive power analysis,35,36 offer good performance in extracting relevant information from large datasets and presenting them in easily understandable ways. In addition, dimensionality reduction techniques, such as Principal Component Analysis37 (PCA) or time-lagged Independent Component Analysis38 (tICA) effectively detect combined coordinates from the trajectories, revealing the key motions of a system. Furthermore, methods such as k-nearest neighbour and t-distributed stochastic neighbour embedding have shown some promise in dealing with complex datasets.39–41 Yet these dimensionality reduction techniques have almost exclusively been applied to biomolecules42–44 with few exceptions.45,46 A combination of aforementioned methods offers great promise to detect a cumulative reaction coordinate from a multitude of independent trajectories, providing chemical insight into the mechanism and reactivity of the system. However, to the best of our knowledge these combined methods have not been applied to study reaction mechanisms in explicit solvent, let alone large supramolecular transition-metal catalysts.
Another aspect that has until now been neglected in chemistry is causality. While the concept is widespread across various scientific domains47 – ranging from economics48–52 and climate research53–57 to biology58,59 and medical studies60–62 – it remains surprisingly absent in the field of chemistry. Although a handful of precedents in biomolecular simulations exists,63–65 it has not been explored to study chemical reactions, not to mention transition-metal catalysis. As MD simulation trajectories are essentially discrete time series, containing the various degrees of freedom of the system, causality can be statistically inferred from the analysis of these trajectories. Consequently, the reaction coordinate can be decomposed into a sequence of motions leading up to the reaction, exposing the intricate interplay of functional groups of the system, offering an unprecedented view of reactivity.
A supramolecular catalyst that has shown remarkable catalytic efficacy for C–N coupling is the Cu(I)-1,5-(2,9-dimethyl-1,10-phenanthroyl)-2,3,4,6,7,8-hexamethyl-p-tert-butylcalix[8]arene, short noted as [Cu(C8PhenMe6)I].7 The macrocyclic ligand, sketched in Fig. 1, allows for usage of earth abundant metals, here Cu, which is an essential step towards more sustainable chemical processes.66–71 The investigated system necessitates explicit solvation for accurate results, as implicit solvation models lead to a collapse of the macrocyclic cage, which compromises catalytic activity (see also Fig. S14–S15, Table S5 in the SI).7 To account for the conformational flexibility of the supramolecular cage7,72 and the dynamic nature of the system as a whole, a dynamic, ensemble-based approach is required to study the reaction. While the mechanism of this catalyst was established to be a sequence of oxidative addition/reductive elimination,7 the dynamic effects of the system, in particular the contribution of the cage, are unknown. From previous studies, we know that explicit solvent molecules are crucial to maintain the shape and functionality of the cavity, however, we see no experimental or theoretical evidence of their participation in the reaction itself.7,72
We developed a multiscale QM/MM MD approach to understand the bond formation dynamics of the C–N coupling step with the Cu-calix[8]arene catalyst in explicit chloroform. By relying on the GFN2-xTB73 method to describe the QM part, we achieved massive sampling, resulting in 152 individual unbiased reaction trajectories. To extract chemically relevant information from these data, we employed supervised and unsupervised interpretable machine learning dimensionality reduction models, in order to identify the cumulative reaction coordinate and to detect critical movements in the structure. A consensus approach combining individual machine learning techniques improved the performance. The statistical Granger Causality analysis model74,75 was employed to decompose the reaction coordinate into a sequence of individual consecutive movements. Finally, Random Forest models and Decision Rules allowed us to quantify the reaction coordinate. This work serves as a broadly applicable template for any mechanistic investigation, revealing and quantifying complex reaction coordinates, along with causal effects derived from the individual movements leading up to the reaction.
The reaction energy was obtained by averaging the ensembles of the educt and product states and calculating the difference; it amounts to −212 ± 25 kJ mol−1 (See SI Sections 3 for details regarding uncertainty estimation). A sigmoid fit through the smoothened energy profile of each simulation (see SI, Fig. S2B and S2C) allowed identification and the calculation of the energy barrier to be 13 ± 9 kJ mol−1. These GFN2-xTB reaction energies and structures are in excellent agreement with full DFT data, obtained with PBE0/def2-SVP/D3 (see SI Section 4, Fig. S3 and Table S2).
To achieve good separation between the three states, educt, transition state, and product, we developed a reduced internal coordinate description of the system (see Fig. 2A) to minimize the noise from highly correlated coordinates.76 We used bond, angles, and torsion coordinates and described rigid fragments, such as the individual calixarene units (cX), the phenanthroline bridge (Phena), the phenyl (Phe) and aniline (NPh) moieties by their respective centres of mass (Fig. 2A). An overview over the distribution of the internal coordinates which define the reduced model can be found in the SI, Fig. S5. This internal coordinate set nicely separates educts and products in the PCA space (Fig. 2B), but still shows overlap between educts and transition states. Analysing the loadings of the principal components (see SI Tables S3 and S4), we can see that the main contributions belong to the coordinates defining the reaction centre (C–N, NPh–C, NPh–Phena distances), as well as to the distances between the product and the cage, describing changes in the coordination at the Cu centre as the product is formed and the settling of the product in the cavity.
A second method, Linear Discriminant Analysis (LDA),77 was also used for comparison. LDA is independent of feature selection and consistently yields excellent separation between groups (Fig. 3D and SI Fig. S6). While LDA produces compact, distinct ensembles, it does not provide separation within the ensembles themselves.
Analysis of the PCA loadings (see SI Table S3) revealed that amongst the top contributors to separating the three ensembles are the change in the distances defining the reaction centre (C–N, Phe–N, NPh–N, NPh-C and Phena-Cu-Br). Secondary features, such as the distance between the copper and O7, also plays an important role for the Logistic Regression (LR) classifier (Fig. 3A). The Random Forest (RF) identifies the C–N distance alongside Phe–N and NPh-C as important (Fig. 3B) with distinct product ensembles emerging. While in RF PC1 has the highest loading of all methods (0.68, see Table S3), the separation between the states is best for the LassoCV approach.
As the feature selection methods differ in the selected internal coordinates and performance (see also SI Fig. S7 for a visual display of features across the methods), we switched to a consensus model (see methods for details on the creation of the consensus model). This consensus approach retains smaller number of features (49 internal coordinates), namely only those that were found to be of high importance in 75% of all previously used ML methods (Fig. 4A), that is in PCA and LDA models. For sake of comparison, the highly important features of a PCA-only and LDA-only consensus model are depicted in the SI in Fig. S8. It is important to note that individual models assign varying importance scores to each internal coordinate. As a result, the feature importance ranking derived from the overall consensus model is likely to differ from those of the individual models. We performed an additional PCA on the 49 consensus features (Fig. 4B). As evident from Table S3, in this model PC1 shows the highest retained variance (0.76) of all PCAs.
![]() | ||
| Fig. 4 Analysis of internal coordinates deemed as highly important by the consensus model. (A) Internal coordinates and their contribution; (B) PCA of the consensus features. | ||
Judging the importance of the consensus model features (Fig. 4A), we see that the C–N bond distance is most important (in agreement with chemical intuition), alongside several distances and angles describing to the reaction centre. Notably, the c6–C distance is also deemed highly important, which indicates that the cage indeed plays a role in defining the reaction coordinate. This small set of internal coordinates yields almost perfect separation of the three states, as well as distinguishing the product conformations (Fig. 4B), thereby outperforming any of the individual feature selection methods.
Employing tICA on the initial reduced internal coordinate set (Fig. 2A), resulted in a good separation of the product and educt states, mainly across the first independent component (IC1), as shown in Fig. 5A, where the values of IC1 and IC2 are plotted separately, and Fig. 5B, where they are plotted against each other and the structures are color-coded according to their labels. Yet the transition state ensemble cannot be fully separated from the educts. When taking into consideration IC2 (Fig. 5A and B), we observe a broad distribution of the product ensemble, indicating significant conformational flexibility. The contributions to ICs can be traced down, by relating the contribution strength (Fig. 5C) to the degree of freedom it corresponds to (Fig. 5B insert). A positive contribution (coloured in red) means that the respective feature values increase as the values of the IC increases, while a negative contribution (coloured in blue) means the feature values decreases as the IC values increase. The absolute value of a contribution (colour intensity) represents the importance of the feature in defining the IC. Please note that going from the educt to the product corresponds to a decrease in IC1.
IC1 reveals the changes at the reaction centre related the Cu adopting a planar configuration upon product formation, as evident for example by the Phena-Cu-Br angle (1st column) that increases with decreasing IC1 (1st column). Of particular interest are the increase in Cu–C and the Cu–N bond distances (1st column) when going to the products, accompanied to a shortening of C–N (5th column), as indicative of the reductive elimination step, as well as the strong contributions of the distances between the aniline product moiety (NPh) and the c0, c2, c6 and c7 calixarene units (4th column). The later hint at the formation of π–π interactions between the cage and the product. Additionally, a tilting of the calixarene cage can be inferred, when looking at the changes in the Phena-Cu-cX and Phena-Cu-oX angles: For example, Phena-Cu-O2, Phena-Cu-O3, and Phena-Cu-O4 (1st column), all located at the lower rim of the calixarene cage (Fig. 5B (insert)), anticorrelate with IC1, hence, they increase when the product is formed (decreasing IC1), whereas those on the lower rim of the cage, such as Phena-Cu-c6 (4th column) as well as Phena-Cu-O7 and Phena-Cu-c7 (5th column) decrease. This finding indicates the movement in opposing directions, when looking at units on opposite sides of the cage. IC2 acts to separate various conformers within the product ensemble, where we can see a difference in the position of the product in the calixarene cage, inferred from the strength of the contributions of the product-calixarene unit distances. To further corroborate π–π interactions, we clustered the product ensemble and performed an NCIPlot analysis78 of the noncovalent interactions, which revealed weakly attracting van-der Waals interactions between the formed coupling product and the cage (see SI Fig. S16–S20).
With this causality model, we utilized the 49 consensus features (Fig. 4) and analysed each trajectory separately. Therefore, we can assess which each feature is influenced by which of the other 48 features; however, a complete analysis of all features quickly becomes impractical. Having identified C–N bond formation and changes in Cu-coordination as the most critical factors during product formation, we focused our GC analysis on the C–N feature. We analysed which of the remaining 48 features influenced the C–N feature and evaluated how often each of the 48 variables was found to Granger-cause C–N across all trajectories. For an overview over the 49 × 49 causality matrices generated for each reaction trajectory, the reader is referred to the GitHub.
In addition, we performed a hierarchical clustering of the consensus features (branches in Fig. 6), which tells us, what features are correlated. Further insight can be obtained by quantifying the correlation with a Pearson score, resulting in a clustermap (Fig. S9 SI), which also reorders the internal coordinates according to their correlation to each other. From this we can observe two distinct regions in the catalyst, corresponding largely to the upper and lower rim of the calixarene cage.
Combing the information of the correlation with Granger Causality, we analysed the causality for C–N coupling, i.e., C–N bond shortening, not only for the individual features but also for the combined branches (Fig. 6). While the red arrows in Fig. 6 show the direction of causality the percentages indicate in how many trajectories C–N bond formation was caused by the respective feature or group of features. Fig. 6 can be read as a map of movements leading to the coupling reaction, by choosing a starting point and following the arrows towards the highlighted C–N feature.
For example, in 11% of all trajectories C–N coupling is caused by a change in Phe-O4, O4-C, and c4-C distances (most left part of orange cluster, Fig. 6). Combining all features in the orange cluster, in 37% of all trajectories, the coupling is triggered by a move of features summarized in this cluster, consisting of the calixarene 2,3,4 and 7 to C distances as well as Cu-Phe distance. When we include information about the Phena-Cu-Br angle, alongside Cu and Br distances to NPh, from the purple cluster (right hand side of Fig. 6), as well as information from the red cluster, we can infer causality in 65% of all trajectories. The green cluster consists mostly of information related to the position of the calixarenes around the reaction centre. We see that changes in the Phena-Cu-calixarene angles (e.g., Phena-Cu-O7, Phena-Cu-O2, …) on the upper and lower rim are correlated. Combined with the information from the clustermap (Fig. S9), we see that the angle with the upper rim decreases (positive Pearson correlation with C–N coupling, coloured in red), whereas those on the lower rim increases (negative Pearson correlation, coloured in blue), indicative of a tilt of the calixarene cage. When we combine all information together, we can infer that the C–N coupling is caused by movements in the calixarene cage, alongside the Cu coordination change in 87% of the trajectories. Please note that because of the correlation of the individual features, percentages of the combined features do not necessarily equal the sum of the individual contribution.
In addition to C–N coupling, the Phena-Cu-Br angle was selected as a high importance feature by the consensus model (Fig. 4A), it's the 5th in the ranking and the first angle, and we subjected it to GC analysis. It also corresponds to an intuitive view of the change in coordination around the copper. From the correlation analysis and clustering, it relates closely to the C–N bond shortening and it directly Granger-causes C–N coupling 12% of the time. Conversely, we found that the C–N distance shortening does not cause the change in the Phena-Cu-Br angles, allowing us to deduce that the two movements happen either simultaneously or, more likely, the change in the angle precedes the C–N bond shortening. Furthermore, we observed that movements of the calixarene cage Granger-cause changes in the Phena-Cu-Br angle in 72% of all trajectories, establishing the sequence of cage movement – Phena-Cu-Br change – C–N-shortening. Due to the increasing complexity, we opted not to conduct further analysis of the GC matrices.
The Decision Tree in Fig. 7 has been trained on the whole dataset. To avoid biasing against the transition state ensemble, which contains significantly fewer structures, balanced weight is given to all classes via oversampling. For comparison, the results of an unbalanced tree can be found in Fig. S10.
Analysing the tree, we can see that the Cu–C distance plays a key role in the splitting of the educts and products, with the majority of transition states being grouped with the educt class, indicative of an early transition state. The remaining transition state contamination of the product ensemble can be separated by taking the angle determined by the phenanthroline bridge, Cu and Br atoms into account (Phena-Cu-Br), where values below 112.8° are indicative of a transition state.
To differentiate between the transition states and educts, the C–N bond represents an effective metric, where values higher than 2.33 Å indicate an educt, while distances below indicate a transition state. The educt states that do exhibit a C–N bond distance similar to that of the transition state can be identified by a smaller phenyl to calixarene unit 4 oxygen atom distance (Phe–O4). When transition states exhibit a C–N bond distance over 2.33 Å, they can be distinguished from the educts by the calixarene c0 and calixarene O5 distance greater than 11.79 Å and the angle defined by the phenanthroline bridge, Cu and c0 (Phena-Cu-c0) smaller than 110.5°.
A major shortcoming of Decision Trees is that their results depend on the initiation conditions. However, their reliability can be improved by utilizing many Decision Trees in a Random Forest (RF) classifier and averaging the results. In general, this improves accuracy, but reduces the interpretability. To overcome this limitation Decision Rules can be deduced from the results, providing a semantic understanding of the RF classifier. We used 30 Decision Trees, each trained on a subset of the data, to yield the RF. When applied to our dataset, this method provides rules for each of the three classes, as seen in Fig. 8, below. A complete diagram of the Decision Rules can be found in the SI, Fig. S11.
![]() | ||
| Fig. 8 Graphical depiction of the Decision Rules derived from the RF approach, Product A, B, C refers to different conformer ensembles within the product category. | ||
Notably, the Decision Rules approach identifies three distinct rule sets for defining a product. Color-coding these three product states in the PCA with consensus showed clear separation (see SI, Fig. S12), which was further confirmed by kmeans clustering, where only minor overlap occurred. Hence, these three states are distinguishable to some degree.
As we observed a convergence of the PCA with increased sampling (see SI, Fig. S13), we assumed the total simulation time to be sufficient. In addition, we observed spontaneous C–N coupling in 142 of the 152 trajectories, hence, we could directly analyse these unbiased simulations.
By analysing the reaction energy profiles, we were able to quantify the reaction energy, as well as the reaction barrier, including uncertainty values. The computed reaction barrier amounted to 13 ± 9 kJ mol−1, which is within 6 kJ mol−1 of that obtained from the full DFT trajectories (18 kJ mol−1). Notably, the static quantum chemical (DFT) approach yielded a slightly higher barrier of 23 kJ mol−1,7 indicating that dynamics lower the barrier. For the reaction energy, differences are more pronounced with −212 ± 25 kJ mol−1 obtained with QM/MM MD vs. −255 kJ mol−1 with static DFT.7 However, these differences are due to the different conditions modelled: In the static DFT model, the studied structure was obtained at 0 K, which represents the bottom of the potential energy surface, while in this QM/MM MD study not only the average over all conformations is taken into account, but also the thermal energy, so the structures are not 0 K structures and therefore no minima on the potential energy surface. In addition, we assume that the product can undergo slow relaxation to lower energy conformations, which are too slow to observe in the timescales investigated here and which further contribute to the difference in reaction energies decreasing the variance between the static and the dynamic approach. Besides energetics, the reaction profile allowed for the categorization of the structures into three ensembles, namely educt, transition state, and product. This step was a key in improving the reaction coordinate recognition, as it allowed for the use of supervised learning methods to reduce the coordinate space. Due to the large energy difference between reactants and products, which renders the transformation irreversible in the context of unbiased MD at 300 K, we sampled a single chemical reaction per trajectory. Thus, there is one transition state structure before the large drop in energy and, as we have not biased our simulation, this structure should be on the dividing surface. Naturally, structures very similar to the transition state are likely to be found in the sampling leading up to the identified transition state due to recrossing events. Such transition state and transition state-like structures could probably be identified using a structural criterion and, in an iterative process, be removed from the educt ensemble. However, in the present case, we observe a good separation between transition state and reactant ensembles, also visible in the analysis of the internal coordinate distribution (SI Fig. S5), where for example, the C–N or the Phe–N coordinate show excellent separation of the three states. This finding indicates that there is no large transition state structure contamination within the reactant ensemble.
While PCA has been used in the past to determine the reaction coordinates, we have demonstrated that it is insufficient for a highly complex system, with many degrees of freedom. To yield any separation between the three states in the PCA, we had to transform cartesian coordinates to a set of internal coordinates, which we further reduced to minimize the number of highly correlated coordinates in the data set, thus reducing noise. Although this single step of our workflow is not fully automated at the moment, clear guidelines to obtain the reduced coordinate set can be applied, (i) usage of internal coordinates and (ii) representation of rigid groups such as phenyl by their centre of mass, which are applicable to describe any chemical system. While product states could be separated from educts and transition states, the latter two still showed overlap. In contrast, standard PCA on the cartesian coordinates resulted in no separation of the three states. We suspect the poor performance stems from failure to fully eliminate rotational and translational degrees of freedom from the system.
Utilizing the labelled data in PCA and LDA combined with supervised ML approaches resulted in a much better separation of the three ensembles. While the performance of PCA was highly dependent on the internal coordinate set, LDA showed remarkable separation between the three ensembles, highlighting the robustness of the method. A consensus model developed to combine the performance of the various dimensionality and feature reduction methods, identified 49 internal coordinates to be relevant, with the C–N distance being the most prominent one, which is in agreement with chemical intuition.
From looking at the cumulative results of all analyses performed on the reaction trajectories we deduced which movements took place during the reaction. From the PCA, LDA and the correlation analysis (clustermap), we saw a tilt in the calixarene cage. Likewise, we observed changes in the coordination around the Cu centre, as indicated for example by Phena-Cu-Br angle changes and the C–N bond shortening upon product formation.
Complementary to PCA, tICA was used to identify slow movements of the system. While tICA provided structural insights into the product conformer ensemble, it failed to distinguish educts from transition states. Thus, tICA was insufficient fully identify the reaction coordinate Finally, the tICA results suggest a settling of the product in the calixarene cage, governed by π–π stacking interactions, which was further corroborated by analysis of non-covalent interactions of representative structures (SI Section 12 Fig. S16–S20). These correlations can be then temporally defined by interpreting the Granger causality analysis. Through the causality analysis of changes in the C–N bond, we deduce that the reaction is Granger-caused by a movement in the reaction centre (such as a change in the Phena-Cu-Br angle) or a change in the calixarene cavity conformation, particularly calixarene units 2,3 and 4, whereas in turn changes in the Phena-Cu-Br angle are Granger-caused by changes in the calixarene conformation. Finally, the π–π stacking increased once the formed product reoriented itself inside the cavity. Therefore, we may create a sequential image of the reaction, schematically depicted in Fig. 9 as follows: (i). The calixarene cage tilts perpendicularly to the phenanthroline; (ii). The change in Cu-coordination takes place before the C–N distance shortening, but after the tilt of the cage; (iii). The C–N bond distance shortens; (iv). π–π stacking effects drive the movement of the product inside the cage and below the phenanthroline.
To the best of our knowledge, this study represents the first application of Granger Causality analysis to a complex chemical reaction – specifically, the C–N coupling reaction step catalysed by [Cu(C8PhenMe6)I] – and serves as a proof of principle. However, the model provides a simple interpretation of causality, as it analyses pairs of variables and not more complex composite variables. Transitioning to more advanced causal discovery methods, such as PC Momentary Conditional Independence (PCMCI)-based approaches,79 would enable better interpretation of nonlinearity and the presence of hidden variables within the dataset. Nonetheless, such advancements are beyond the scope of this work.
To quantify the changes in internal coordinates, we employed a Decision Tree trained on the three ensembles, identifying the Cu–C and C–N distances as main feature for separating products from educts and transition states, respectively. While Decision Trees are sensitive on the initialisation condition and require size limitations to maintain interpretability, a Random Forest (RF) approach mitigates these limitations by aggregating multiple trees. Applying the Decision Rule method then allows for the semantic interpretation to a RF classifier, effectively separating the three ensembles. In addition, three distinct product conformers are identified, each defined by a unique set of decision rules and distinguishable in the PCA consensus features plot (Fig. S12) as well as in IC2 of the tICA analysis. Structural differences are related to changes in cage conformations. These conformations will likely converge as the product diffuses out of the cavity.
To the best of our knowledge there is only a single study of the reaction dynamics of a transition-metal complex with explicit solvent with repeated sampling of the reaction step.24 This study on Fe-oxo-mediated C–H functionalization reactions by Joy et al. used kinetic energies, quantum numbers and velocities to distinguish between two different dynamic reaction pathways.24 While they also used ML for feature selection, their focus is on physical chemical factors that impact reactivity. Our focus lies on the investigation of structural changes to ease interpretation and to facilitate causality analysis, which has never been attempted for chemical reactions, but opens a completely new angle on how to understand reactivity. From all statistical analyses, we can see that the transition state structures tend to be close to the educt or even overlay with the educt space: for example, this is visible in the unsupervised PCA (Fig. 2), supervised PCA (Fig. 3) or tICA (Fig. 5), as well as in the histograms of all internal coordinates (SI Fig. S5), where educt and transition state distributions overlap for many internal coordinates, while being clearly separated from the product. We surmise that this is an indication that we have an early transition state here – information that can be utilised in further optimisation of the catalyst. While the reaction step under investigation does not represent the rate-determining step, as previously reported, the energy for release of the product from the catalyst is notably reduced by the presence of the calixarene cage, allowing for improved catalytic performance.7 Our analysis revealed that the reaction starts with a tilt in the cage. Therefore, modification of this moiety distal to the reaction centre can affect reactivity and even selectivity, similar to the allosteric effects seen in natural enzymes. Experimentally, modifications could include changing the size and rigidity of the cage, or introducing asymmetric modifications to one side of the cage. All of these efforts can result in greater control over substrate selectivity. Identifying possible allosteric sites in other systems could help advance supramolecular chemistry in rational design, which is of particular interest for homogeneous catalysis, where such insights may lead to greater substrate selectivity or more efficient catalysts.
Our study presents a proof-of-concept approach to the discovery of complex reaction coordinates that is transferable to any chemical reaction. The presented methodology is not only applicable to the study of single reaction steps, but can be extended to reactions with competing reaction pathways that determine selectivity: If the reaction energy profile is different, then the energy can be used as a simple criterion to group the trajectories and separate the pathways. In case, the energy differences are similar and only the product structures are different, this will be highlighted by PCA or tICA, allowing separation using a clustering approach. This information can then be leveraged to either separate the sampling data and categorize the reaction trajectories or to perform new, biased or restrained simulations which direct the reaction mechanism through a single pathway.
Our protocol can be integrated into computational chemistry workflows to investigate the reactivity of complex systems under operando conditions. By identifying secondary contributions to the reaction coordinate, such as remote conformational changes, it can guide further studies and inform the design of reactivities and selectivity by tracing the impact of modifications on the reaction profile under operando conditions. Consequently, additional insights can be obtained from reaction trajectories with minimal additional computational effort.
By devising a high throughput QM/MM MD workflow, we were able to study the C–N coupling reaction dynamics of a supramolecular Cu-calix[8]arene catalyst under experimental conditions. This development is a crucial step towards a predictive operando model for complex catalytic reactions. It allowed us to extract not only reaction energies and barriers with uncertainties, but also provides insights into the intricate dynamic nature of the macrocyclic transition-metal catalyst in explicit solvent.
Interpretable machine learning techniques have proven invaluable in dealing with the vast amount of data, because of their ability to trace results back to structural changes. However, a consensus model is needed to identify the internal coordinates with the highest contribution to separating educt, transition state and product, and to eliminate the inherent variability and instability of individual ML approaches.
By performing a causality analysis of the internal coordinates of the consensus model, an extra temporal dimension can be added to the reaction coordinate, allowing us to explain the chemical reaction as a sequence of movements leading up to C–N bond formation. This information, pinpointing the exact source (group of atoms) that triggers the reaction, allows the experimental chemist to generate testable hypotheses to enhance reactivity, for example by suitable chemical modifications. By checking the outcome of Decision Trees and Decision Rules run on this modified system, we can gauge the impact of a specific change on the reaction coordinate. The methodology is implemented in a python script and presented in the Jupyter notebook provided on github (see Data availability Section). The analysis is largely automated, requiring only a few inputs from the user, namely: (i) system trajectories; (ii) coordinates which should be considered for dimensionality reduction e.g. centres of mass for rings; (iii) labelling criteria for the various states of the system; (iv) cutoff values for the consensus model. Notably, the labelling of the states can be done on any descriptor of the system, including energetic criteria, as well as partial charges or other molecular properties. As a result, it can be readily applied to other systems as a digital tool. By limiting the analysis to structural changes in the system, we believe this technique is accessible to non-expert users. The results reflect coordinate changes during the reaction, which makes them easy to understand for a general chemist.
Our methodology was demonstrated on a highly flexible Cu-calix[8]arene catalyst. However, the approach supports the exploration of both covalent and non-covalent interactions, allowing for the applicability to the investigation of other phenomena such as cluster formation and aggregation. Nevertheless, a requirement is that structures can be labelled, using either energetic criteria or any molecular feature. If that is given, it offers a broadly applicable framework for probing reaction coordinates in a variety of dynamic chemical systems, ranging from small (in)organic complexes to more intricate biomolecular structures.
![]() | ||
| Fig. 10 General workflow to identify and quantify the reaction coordinate by analyzing correlations and causality in an ensemble of reaction trajectories. | ||
We used chloroform in our simulations instead of toluene, which was chosen experimentally. Chloroform and toluene have comparable permittivities (εr 4.8 and 2.4),93 both typical of apolar solvents. Thus, both solvents provide similar apolar environments, consistent with the observation that the catalyst is active in different media. Experimentally, the reaction also works in tetrachloroethane but toluene was selected as a more benign and environmentally safe option. As there is no evidence for strong solvent effects on reactivity, the choice of solvent is unlikely to significantly alter the system. In simulations, chloroform was preferred because its smaller size enables faster equilibration than toluene.
To generate an appropriate starting structure the system was equilibrated and a 100 ns of a MM/MD simulation was carried out, using the multistep protocol implemented in PyConSolv.19 We conducted 152 simulation runs using the QM/MM MD protocol of the reductive elimination step, employing GFN2-xTB as the QM method. Among these runs, 142 simulations captured the reductive elimination step and were utilized for evaluating the reaction barrier and structural analysis of the reaction. All 152 trajectories were used for labelling and evaluation of the reaction energy (see below).
The reaction barrier was defined as the difference between the transition state structure energy and the maximum value of a sigmoid function fitted through the reaction profile (see Section 2 of the SI). As DFT sampling on a similar scale as GFN2-xTB was not achievable, the ensembles were too small to obtain the reaction energy by averaging over them. Thus, to evaluate the reaction energy from the DFT simulations, we fitted sigmoids between the educts and product. By subtracting the highest sigmoid value from the lowest we were able to calculate a reaction energy and compared it to that obtained from GFN2-xTB using the same approach. As further confirmation of the accuracy of the transition state labelling, we confirmed that the structures in the ensemble resemble that of a transition state optimized using eigenvector following in a static approach (see SI Section 4 and Fig. S3).
We utilized Principal Component Analysis (PCA)37 and Time-lagged Independent Component Analysis (tICA)38 as dimensionality reduction techniques to identify coordinates that separate the states. The two unsupervised methods complement each other in regards to addressing the variances present in the dataset.95,96 For supervised dimensionality reduction, we opted for Linear Discriminant Analysis (LDA)77 due to its efficacy in separating distinct classes within a given dataset. To further enhance the separation capability of PCA and LDA, several methods of automated feature selection were chosen and implemented, namely Recursive Feature Elimination with Cross Validation97 (RFECV) using both Random Forest98 (RF) and Logistic Regression (LR)99 as classifier models, and Lasso100 with Cross Validation (LassoCV), all using fivefold stratified cross-validation.101 The RF and Logistic Regression classifiers were additionally evaluated by selecting only the top 10% of features. When subjecting the highlighted features to dimensionality reduction using PCA or LDA, we observed varying importance of the features to the principal components (SI, Tables S3 and S4). To alleviate this discrepancy between the results and increase separation performance, we defined a consensus model, which combines the results of all previously mentioned approaches and highlights features that are consistently found across all models. These features were identified by performing a PCA and LDA on each of the models and examining the loadings. Subsequently, we computed the average contributions to the first three principal components, as well as the two LDA components. The selected features needed to appear in at least 75% of the elimination models to be deemed significant. This process yielded a set of 49 internal coordinates for the PCA. These coordinates were hierarchically clustered to group together correlated movements102 and then subjected to causal inference analysis (see statistical model for causality inference).102
In tandem with automated feature reduction methodologies, we leveraged Decision Trees to quantify the most relevant features from our dataset. These Decision Trees underwent training with class balancing that is augmenting the ensembles via oversampling. Furthermore, we adopted a Decision Rule framework, employing the skoperules103 library. Within this framework, a Random Forest bagging classifier, consisting of 30 Decision Trees, was used to provide a semantically quantitative characterization of the three ensembles.
Supplementary information: it contains the detailed simulation protocol, information on the labeling of data as well as statistics and performance of the methodology, robustness and performance of sampling, an overview over the distribution of all internal coordinates, details about the statistical analyses, detailed analysis of the product (distribution) and solvation effects. See DOI: https://doi.org/10.1039/d5dd00216h.
| This journal is © The Royal Society of Chemistry 2025 |