Open Access Article. This article is licensed under a Creative Commons Attribution 3.0 Unported Licence.

Low-cost prediction of molecular and transition state partition functions via machine learning

Evan Komp * and Stéphanie Valleau *
Chemical Engineering, University of Washington, 3781 Okanogan Ln, Seattle, WA 98195, USA. E-mail: evankomp@uw.edu; valleau@uw.edu

Received 5th March 2022 , Accepted 10th June 2022

First published on 14th June 2022


Abstract

We have generated an open-source dataset of over 30 000 organic chemistry gas phase partition functions. With this data, a machine learning deep neural network estimator was trained to predict partition functions of unknown organic chemistry gas phase transition states. This estimator relies only on reactant and product geometries and partition functions. A second deep neural network was trained to predict partition functions of chemical species from their geometry. Our models accurately predict the logarithm of test set partition functions with a maximum mean absolute error of 2.7%. Thus, this approach provides a means to reduce the cost of computing reaction rate constants ab initio. The models were also used to compute transition state theory reaction rate constant prefactors and the results were in quantitative agreement with the corresponding ab initio calculations, with an accuracy of 98.3% on the log scale.


Introduction

The reaction rate constant, or speed of a chemical reaction, defines its success. Evaluating reaction rate constants in organic chemistry is critical for drug design, catalyst design, and so forth. Unfortunately, reactions are usually part of large chemical networks. The challenge thus becomes evaluating the kinetics of these networks, i.e. all single reaction rate constants must be computed. The full ab initio calculation of a reaction rate constant can take several months of computational and human time.1 Indeed, most kinetic theories require the exploration of one or more potential energy surfaces and rely on the evaluation of a reactant and transition state partition function.2–4 These can only be computed when the structure of the corresponding species is known. In many cases, e.g. transition state theory (TST),5 the kinetics is represented by a single minimum energy path (MEP) which passes through a transition state. Finding the transition state structure is the most demanding part of the computation. Indeed, most MEP search methods6 require the iterative evaluation of gradients and energies of intermediate geometries until an MEP is found. Lastly, even converged MEPs are not guaranteed to include a true transition state, and one must use, e.g., eigenvector following to ensure a first-order saddle point is found. In this context, using machine learning (ML) to predict any information regarding the transition state without knowledge of its structure is of strong interest.

Once input features have been computed, a trained machine learning estimator can predict chemical properties within seconds.7–15 Hence, in the last few years, several efforts have been made to leverage ML for kinetics.21 Models have been trained to predict reaction energies,22 bond dissociation energies21 and, recently, activation energies.23,24 Machine learning has also been used to predict reaction products,25–28 yields29,30 and ideal synthesis conditions.31 Recently, ML estimators were trained to predict quantum reaction rate constants for one-dimensional minimum energy paths.20,32 All of the above is encouraging; the main limitation is the scarcity of large kinetic training datasets.19 ML has also been used to help accelerate the search for MEPs,16–18 and recently larger datasets of transition states33,34 and quantum rate constants20 have become available.

In this work we investigated the use of ML to predict transition state partition functions without knowledge of the transition state geometry, as well as to predict partition functions when geometries are known (Fig. 1). Together with an activation energy predictor,23,24 an accurate ML predictor of partition functions could circumvent the need to find a transition state structure when using TST to compute reaction rate constants. This would enable the rapid evaluation of kinetics for large networks of reactions. In addition, quantum reaction rate constant theories such as the flux–flux autocorrelation function approach still need the reactant partition function.35 Hence this predictor would help accelerate the calculation of quantum reaction rate constants. Lastly, it could be used to accelerate the evaluation of all other thermodynamic quantities which rely on knowledge of the partition function: free energy, entropy, pressure, heat capacity, etc. We generated a dataset of over 30 k partition functions for gas phase organic chemistry species extracted from an existing dataset36 of reactant, product, and transition state structures for unimolecular reactions. With this data, two deep neural network (DNN) partition function estimators were optimized. The first DNN, “Qest”, predicts the natural logarithm of the molecular partition function from the featurized molecular geometry and the inverse temperature, 1/T. The second DNN, “QesTS”, predicts the natural logarithm of the partition function of an unknown transition state, log QTS(T), from the difference between product and reactant featurized geometries, the reactant and product partition functions on the log scale, and 1/T. These models were then used to predict partition functions for the computation of TST reaction rate constants. Our reaction rate constant predictions were also compared to ab initio TST reaction rate constants. The various steps of our workflow, our results, and conclusions are discussed in the following subsections.


Fig. 1 (Panel I) Partition functions were computed for a set of DFT optimized reactant, product, and transition state geometries taken from a dataset of 11 961 unimolecular reactions.36 (Panel II) With this data, a deep neural network (DNN), Qest, was trained to predict a structure's partition function from its geometry and the inverse temperature, 1/T. A second DNN, QesTS, was trained to predict the partition function of an unknown transition state from the reactant and product structures and partition functions, along with 1/T.

Results and discussion

Generation of a partition function dataset

Canonical partition functions were computed for 35 883 reactant, product, and transition state geometries taken from a dataset33 which contained 11 961 gas phase organic chemistry reactions involving molecules with at most seven C, N, and O atoms. The partition functions were computed under the assumption that electronic, translational, rotational, and vibrational degrees of freedom are separable (eqn (1)), and by using the rigid-rotor and harmonic-oscillator approximations:
 
Q(T) ≈ Qel(T) Qtrans(T) Qrot(T) Qvib(T).  (1)

See Table S1 in the ESI for the equations of each partition function term in eqn (1). Although some products consisted of two or more distinct molecules, these had been considered as single structures when energies and hessians were computed in the original dataset.36 We used the single geometries and vibrational frequencies from ref. 33 to compute the partition functions. A more accurate representation would require the separation of the product geometry into the individual molecular geometries. When computing the vibrational partition functions, zero-point energies were included. For the rigid rotor rotational partition function, we computed the moments of inertia using the geometries and determined the symmetry number by identifying invariant symmetry operations using pymatgen,37 and by using our own software to determine which symmetry operations were proper. We accounted for the exception of linear molecules for which C2 rotations are returned as improper reflections by pymatgen.
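To make the computation concrete, the sketch below evaluates ln Q(T) under the rigid-rotor/harmonic-oscillator approximation of eqn (1) for a nonlinear molecule, with the symmetry number obtained by counting proper operations from pymatgen, as described above. The function names, the SI unit conventions, and the standard-state volume V = kBT/p° are our own assumptions; the exact expressions used in the paper are given in Table S1 of the ESI.

```python
# Minimal RRHO sketch of eqn (1) in SI units (assumed conventions, see ESI Table S1).
import numpy as np
from scipy import constants as const
from pymatgen.core import Molecule
from pymatgen.symmetry.analyzer import PointGroupAnalyzer

def symmetry_number(mol: Molecule) -> int:
    """Rotational symmetry number: count of proper operations (det(R) = +1)."""
    ops = PointGroupAnalyzer(mol).get_symmetry_operations()
    return sum(1 for op in ops if np.linalg.det(op.rotation_matrix) > 0.5)

def log_q_rrho(mass, inertia, freqs, sigma, T, g_el=1.0, p0=101325.0):
    """ln Q(T) = ln Qel + ln Qtrans + ln Qrot + ln Qvib for a nonlinear molecule.

    mass: total mass (kg); inertia: three principal moments (kg m^2);
    freqs: harmonic frequencies (s^-1); sigma: rotational symmetry number.
    """
    beta = 1.0 / (const.k * T)
    # Translational: ideal gas particle in a standard-state volume V = kB*T/p0.
    q_trans = (2.0 * np.pi * mass / (beta * const.h**2)) ** 1.5 * const.k * T / p0
    # Rotational: classical rigid rotor, divided by the symmetry number.
    q_rot = (np.sqrt(np.pi * np.prod(inertia))
             * (8.0 * np.pi**2 / (beta * const.h**2)) ** 1.5 / sigma)
    # Vibrational: product of harmonic oscillators with zero-point energy included.
    hv = const.h * np.asarray(freqs)
    log_q_vib = np.sum(-0.5 * beta * hv - np.log(1.0 - np.exp(-beta * hv)))
    return np.log(g_el) + np.log(q_trans) + np.log(q_rot) + log_q_vib
```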

Partition functions were computed at 50 temperatures per reaction, randomly sampled from a uniform distribution in 1/T over the range T = (50, 2000) K, such that each reaction was considered at a unique set of temperatures. A histogram of the natural logarithm of the partition functions for all structures at all temperatures is shown in Fig. 2. The histogram shows a smaller count of low values of the natural logarithm of the partition function; this is due to the dominating contribution of the vibrational partition function to eqn (1). Our data has been made open source and is available for download from ref. 38.
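The sampling scheme is simple to reproduce; a sketch under the stated range, with our own variable names, is:

```python
# 50 temperatures per reaction, uniform in 1/T over T in (50, 2000) K, so low
# temperatures are sampled more densely than a uniform-in-T scheme would give.
import numpy as np

rng = np.random.default_rng()  # a fresh generator per reaction yields unique sets
inv_T = rng.uniform(1.0 / 2000.0, 1.0 / 50.0, size=50)
temperatures = 1.0 / inv_T  # in K, unique to this reaction
```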


Fig. 2 Histogram of all partition function values computed in our dataset for 35 883 organic chemistry species. Each entry is either a reactant, product, or transition state at one of 50 temperatures T ∈ (50, 2000) K, sampled uniformly with respect to 1/T. Structures that exhibit the smallest and largest partition functions in the dataset are depicted in subpanels (A) and (B). The shape of this histogram is largely due to the dominating contribution of the vibrational partition function in eqn (1).

The entire dataset was split into a hold-out test set (10%) and a development set (90%). The hold-out set was used to test the final ML predictors. The development set was further split into 5 folds for cross validation during model optimization. Both the hold-out and the fold splitting were conducted using the Murcko scaffold of the reactant molecule.39 Here the scaffold is a representation of the structural backbone of the molecule. In splitting by scaffold we ensured that the same and similar molecules in the dataset were grouped together, producing a better estimate of the models' accuracy on unseen data when conducting cross validation or testing. For Qest, the input consisted of a molecule's featurized structure (reactant, product, or transition state) and the inverse temperature, while the target was the corresponding natural logarithm of the partition function at that temperature. For QesTS, the inputs were the difference between product and reactant featurized structures, the product and reactant partition functions, and the inverse temperature, while the target was the natural logarithm of the transition state partition function.
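A hedged sketch of the scaffold grouping step, here via RDKit's implementation of the Bemis–Murcko scaffold (ref. 39); `reactant_smiles` is a hypothetical list of reactant SMILES strings, and the splitting itself would then assign whole groups to the hold-out set or folds:

```python
# Group dataset indices by the Murcko scaffold of the reactant, so identical
# and similar backbones always land on the same side of a split.
from collections import defaultdict
from rdkit import Chem
from rdkit.Chem.Scaffolds import MurckoScaffold

def group_by_scaffold(reactant_smiles):
    groups = defaultdict(list)
    for idx, smi in enumerate(reactant_smiles):
        scaffold = MurckoScaffold.MurckoScaffoldSmiles(mol=Chem.MolFromSmiles(smi))
        groups[scaffold].append(idx)
    # Whole groups are then assigned to the ~10% hold-out set, and the
    # remaining groups distributed across the 5 cross-validation folds.
    return list(groups.values())
```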

Optimization and training of machine learning models

We first determined the optimal format of the input data using the cross-validated development set. This involved a screening of featurization methods and data standardization. Only 3D structure featurizers were investigated because 2D featurizers require bond connectivity, which is arbitrary for unstable structures such as transition states. Furthermore, 2D featurizers cannot resolve orientation differences in 3D geometries, such as those coming from different local minima for reactants, which means they cannot be used for the Qest model. Accordingly, we note that the partition functions and rate constants we computed in this work only consider the given reactant or product local minimum for each reaction from the original dataset; to compute a more accurate reaction rate constant, multiple local minima would need to be considered. We chose to use the difference between product and reactant input feature vectors as it describes the change in 3D structure over the reaction. Further, product and reactant feature differences have been shown to be successful when training other ML models to predict reaction quantities.19,23,24,40 Concatenating feature vectors may also be effective, but would require a larger DNN.

We found that the EncodedBonds41 featurizer with min–max scaling and target normalization was optimal for the Qest model. For the QesTS model we employed the same input feature representation as Qest, i.e. EncodedBonds. We also screened data standardization and found it was better not to standardize. We believe that this is because the distribution of values for the difference between product and reactant feature vectors is peaked around zero; also, the distribution of partition function values for transition states is narrower than the overall partition function distribution for all species. For more information see ESI, section 2.
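For concreteness, a minimal sketch of the Qest standardization with scikit-learn follows; `X_dev` and `y_dev` are hypothetical placeholders for the development-set EncodedBonds features and log Q targets, and the scalers are fit on development data only so no test-set statistics leak into training:

```python
# Min-max scaling of feature vectors and normalization of log Q targets.
import numpy as np
from sklearn.preprocessing import MinMaxScaler, StandardScaler

X_dev = np.random.rand(1000, 64)          # placeholder EncodedBonds features
y_dev = np.random.rand(1000, 1) * 200.0   # placeholder log Q targets

x_scaler = MinMaxScaler().fit(X_dev)
y_scaler = StandardScaler().fit(y_dev)
X_scaled, y_scaled = x_scaler.transform(X_dev), y_scaler.transform(y_dev)
# At prediction time the target transform is inverted:
# log_q = y_scaler.inverse_transform(model.predict(x_scaler.transform(X_new)))
```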

With these optimal input features, we carried out a search over DNN hyperparameters to identify the best activation function, regularization, bias and weight initialization, learning rate, and neuron configuration for Qest and QesTS. For more information see ESI, section 3.

Partition function prediction

The optimal featurizer, standardization, and hyperparameters (Table S4) were used to train the final DNN models on the development dataset and test their performance on the test set. To limit extrapolation in the final model, some outliers were dropped from the total dataset; see ESI, section 5.

In Fig. 3 we show parity plots of the final QesTS (panel A) and Qest (panel B) model predictions with respect to the test set. In both cases we see a low test set mean absolute error (MAE) of 4.01 (2.0%) and 4.37 (2.1%) on the logarithm of the chemical species or transition state partition function for QesTS and Qest, respectively. Percentages are the MAE relative to the test set standard deviation. Also, in both cases the spread of the distribution of predictions around the identity line is quite narrow. The distribution of errors is shown in ESI, section 4. This indicates that both DNN models accurately predict the partition functions.
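For clarity, the percentages quoted throughout are the MAE divided by the standard deviation of the test set targets; a one-line reconstruction of this convention (our reading, not the paper's code) is:

```python
# MAE as a percentage of the test set target standard deviation.
import numpy as np

def mae_percent(y_true, y_pred):
    return 100.0 * np.mean(np.abs(y_true - y_pred)) / np.std(y_true)
```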


Fig. 3 (Panel A) Parity plot of the final QesTS DNN predicted values of transition state log QTS (y-axis) with respect to exact unseen test set values (x-axis). A perfect model is represented by the identity line (red dashes). Predicted examples (blue dots) closely match the true value and the overall MAE on log Q is 4.01 (2.0%). (Panel B) Same as panel A but for the Qest prediction of log Q. Here the overall MAE on the test set log Q is 4.37 (2.1%). Percent error is relative to the test set standard deviation. We see a strong agreement of our DNN predictions with the test set values.

To verify that the Qest model was learning from the molecular structures in the dataset and not simply from the temperature, we created a “null” linear model log Q(T) = m × 1/T + b and fitted it to the development set. The use of this null model for both reactant and transition state partition functions corresponds to the commonly used assumption that the ratio of partition functions is equal to 1.0. The null model performed poorly on the test set with an MAE of 31.8%. Our Qest model error was much lower: 1.9%, confirming that the Qest model learned from the molecular structure.
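The null baseline amounts to a structure-blind linear regression of log Q against 1/T; a sketch with synthetic placeholder data (the real fit uses the development set) is:

```python
# Structure-blind baseline: log Q(T) = m * (1/T) + b fit by least squares.
import numpy as np

rng = np.random.default_rng(0)
inv_T = rng.uniform(1.0 / 2000.0, 1.0 / 50.0, 5000)
log_q = 60.0 - 2.0e3 * inv_T + rng.normal(0.0, 10.0, inv_T.size)  # synthetic stand-in

m, b = np.polyfit(inv_T, log_q, deg=1)                 # fit on the development set
null_mae = np.mean(np.abs(log_q - (m * inv_T + b)))    # evaluate on held-out data in practice
```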

The prediction MAEs on the test set for Qest, QesTS, and the null model are listed in Table 1.

Table 1 Mean absolute error of the null, Qest and QesTS models in predicting the test set molecular (reactant, product and transition state) or transition state partition functions on the log scale. Percent error is given relative to the test set standard deviation

          Null MAE        Qest MAE       QesTS MAE
log Q     72.5 (34.5%)    4.37 (2.1%)    N/A
log QTS   72.1 (35.1%)    4.25 (2.1%)    4.01 (2.0%)


Given that both models predicted with a low error on the test set, we combined these to predict transition state partition functions with machine learned reactant and product partition functions. We will discuss this “Double” model in the next section. The trained models are available on GitHub.42

Reaction rate constants computed with machine learned partition functions

With our Qest and QesTS partition function estimators, and the existing energies,36 we computed transition state theory reaction rate constants for 1086 test set reactions. We recall the expression for the transition state theory (TST)43,44 reaction rate constant, kTST(T) = (kBT/h)(QTS(T)/QR(T))exp(−βEa), where Ea is the activation energy, i.e. the difference in ground state energies between transition state and reactant, β = 1/(kBT), and QTS and QR are the transition state and reactant partition functions, including zero point energy contributions, at a given temperature T.
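Since both the partition functions and the rate constants are reported on the log scale, the TST expression is conveniently evaluated there; a sketch with our own naming (not the paper's code) is:

```python
# ln k_TST(T) = ln(kB*T/h) + (ln Q_TS - ln Q_R) - Ea/(kB*T), with Ea in joules.
import numpy as np
from scipy import constants as const

def log_k_tst(log_q_ts, log_q_r, Ea, T):
    """Natural log of the TST rate constant from log partition functions."""
    return np.log(const.k * T / const.h) + (log_q_ts - log_q_r) - Ea / (const.k * T)
```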

To understand how machine learned partition functions influence the accuracy of the reaction rate constant, three approaches were considered. In the first both reactant and transition state partition functions were predicted from their geometries by using Qest. In the second approach, the reactant and product geometries and computed ab initio partition functions were used to predict the transition state partition function with QesTS. This approach avoids the time demanding search for a transition state geometry. In the last approach, Qest was used to predict reactant and product partition functions which were used, together with their geometries, as input to QesTS to predict a transition state partition function. This method, hereafter referred to as the Double predictor, only requires knowledge of the reactant and product geometries and does not need computed partition functions or transition state geometries.
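The third approach chains the two estimators; a sketch of this "Double" pipeline is given below, where `qest` and `qests` stand for the trained models with a hypothetical predict() interface over concatenated inputs:

```python
# "Double" pipeline: Qest supplies reactant and product log Q, which feed
# QesTS together with the product-minus-reactant feature difference. No
# transition state geometry or ab initio partition function is required.
import numpy as np

def double_predict(qest, qests, feat_r, feat_p, inv_T):
    log_q_r = qest.predict(np.concatenate([feat_r, [inv_T]])[None, :])
    log_q_p = qest.predict(np.concatenate([feat_p, [inv_T]])[None, :])
    x = np.concatenate([feat_p - feat_r, log_q_r.ravel(), log_q_p.ravel(), [inv_T]])
    return qests.predict(x[None, :])  # predicted log Q_TS(T)
```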

The MAE of the three methods for predicting test set transition state partition functions and TST reaction rate constants is listed in Table 2. For the prediction of transition state partition functions (Table 2, column 1) all models have a similar MAE. The Double approach is the least accurate, with an MAE of 2.7%, while the QesTS approach has the lowest MAE, 2.0%.

Table 2 Performance of transition state partition function prediction and of TST reaction rate constant calculation using predicted partition functions, for our three ML based methods (Qest, QesTS and Double). Here Ea is the reaction activation energy. Percent error is given relative to the test set standard deviation

         log QTS(T) MAE   log kTST(T) MAE   Required inputs for k(T)
Qest     4.25 (2.1%)      4.50 (1.7%)       Reactant and transition state structures, temperature, Ea
QesTS    4.01 (2.0%)      4.01 (1.5%)       Reactant and product structures and partition functions, temperature, Ea
Double   5.58 (2.7%)      4.50 (1.7%)       Reactant and product structures, temperature, Ea


The fact that QesTS is the best predictor is not surprising given that it was trained specifically to predict transition state partition functions while Qest was not. Further, QesTS has knowledge of the reactant and product partition functions from its input. In column 2 of Table 2 we see that when computing reaction rate constants using the predicted partition functions, the Double and Qest approaches have the same MAE while the QesTS approach remains the most accurate with an MAE of 4.01. That the Double approach matches Qest on the rate constant, despite its larger partition function error, is most likely due to a cancellation of errors between the Qest predicted reactant and the QesTS predicted transition state partition functions. Nonetheless, we can achieve the same average accuracy in the reaction rate constant calculations when using only reactant and product structures and no information on the transition state geometry. The only remaining cost is providing the value of the activation energy. Here it is worth noting that machine learning has successfully been employed to predict activation energies.23,24

In Fig. 4, panel A, we plot the error of the predicted transition state partition function with respect to the test set partition functions as a function of temperature for Qest, QesTS, and Double. In panel B we show the logarithm of the ratio of predicted transition state theory prefactors, QTS/QR, with respect to computed prefactors. We see some error cancellation: the error in the ratio of partition functions (panel B) is smaller than that for the single values (panel A). Errors for all models tend to increase at lower temperatures even though low temperatures were sampled frequently when generating the data. This may be because the partition function is more sensitive to small changes in temperature at low temperatures, making it more difficult to learn. Regardless, these errors do not significantly impact the predicted value of the reaction rate constant, with average errors orders of magnitude lower than the value of the logarithm of the reaction rate constant. This is also seen in Fig. 5, where we show the average error compared to the reaction rate constant for the ring breaking of gamma-butyrolactone.


Fig. 4 (Panel A) MAE averaged over temperature bins on the test set for predictions of the logarithm of the transition state partition function using the Qest, QesTS, and Double models. While the error in the logarithm of the transition state partition function increases as temperature decreases, the ratio of transition state to reactant partition functions, which is the prefactor for TST rate constants, has low error. This is shown in the plot of the logarithm of the ratio of true to predicted prefactors binned over temperature (panel B). Here 0.0 indicates a perfect match between predicted and true values.

Fig. 5 (Panel A) Natural logarithm of the reaction rate constant for a randomly selected test set reaction: the ring breaking of gamma-butyrolactone. Here we show the logarithm of the reaction rate constant computed with predicted Qest (red), QesTS (orange), and Double (blue) partition functions as well as the reaction rate constant computed using the ab initio values of the partition functions (green). The error bars correspond to the error on the entire test set averaged over temperature bins. (Panel B) A zoom-in on the low temperature section of the data in panel A. Here the average error is plotted as dotted margins instead of error bars; one error bar (blue) is depicted. The error on k(T) for all prediction methods increases at low temperatures, however the average error is significantly smaller than the magnitude of the reaction rate constant.

In Fig. 6, panel A, we show a plot of the TST reaction rate constant for another reaction randomly selected from the test set, the ring opening of tetrahydro-2H-pyran-2-imine. Here the partition functions were computed using the original ab initio data (green line) and predicted with the ML models. In panel B, we show the absolute error as the normed difference between true and predicted values. We see that the error of the models is more than an order of magnitude smaller than the value of the logarithm of the reaction rate constant, which confirms the accuracy of our trained models.


Fig. 6 (Panel A) Natural logarithm of the TST reaction rate constant as a function of inverse temperature, 1/T, using Qest, QesTS, and Double predicted partition functions, and computed partition functions, for the ring opening of tetrahydro-2H-pyran-2-imine. This reaction was randomly selected from the test set. The predicted values are all close to the ab initio values. The absolute error of these models on the logarithm of the rate constant is shown in panel B. The Qest prediction has a larger error at low temperatures for this reaction, however all methods predict with an error an order of magnitude lower than the logarithm of the rate constant.

Conclusion

In this work we have computed rigid-rotor, harmonic-oscillator partition functions over a broad range of temperatures for a dataset of 11 961 organic chemistry unimolecular reactions with reactant, transition state, and product structures. With this dataset, two DNN based models were trained to predict partition functions at given input temperatures. Qest predicts a molecular partition function from the molecule's featurized geometry; QesTS predicts transition state partition functions from reactant and product featurized geometries and their partition functions; lastly, Double uses QesTS to predict QTS with Qest-estimated reactant and product partition functions as well as reactant and product geometries. The Double approach requires no knowledge of the transition state structure; it only requires reactant and product structures. We showed that these estimators are accurate in their predictions: the MAE on the logarithm of the partition function is on the order of 2% (Table 1) for structures containing no more than 7 C, N, and O atoms at temperatures between 50 K and 2000 K. With these predicted partition functions, we computed transition state theory reaction rate constants and found an MAE of 1.6 ± 0.1% on log k(T) (Table 2). Predictions for unseen test set reactions such as the ring opening of tetrahydro-2H-pyran-2-imine closely followed the exact test set values. For predictions outside of the temperatures explored here, or for structures involving more atoms, we recommend retraining; indeed, the models' accuracy can only be evaluated within the tested range.45,46 In the future, we plan to generate datasets for systems with more atoms, and to evaluate anharmonic partition functions to train new estimators. Transfer47 or delta48 learning might be used to accelerate these tasks.

The models we have created, in tandem with activation energy predictors, provide an approach to predict reaction rate constants without the need to search for minimum energy paths. Our models enable a more rapid estimation of reaction dynamics in the context of coupled reactions, reaction networks, and reactor design. We also note that very recent work has used ML to predict transition state structures directly.49,50 Should these models become accurate on a broader scale, they could provide an alternative path towards predicting TST rate constants.

In the future, we aim to move beyond unimolecular reactants and consider bimolecular reactions as well as reactions which occur in the presence of a solvent. We also plan to investigate other machine learning approaches, for instance end-to-end message-passing models, given their recent success in predicting adjacent molecular and reaction quantities.51,52

Data availability

The partition function dataset can be found on Zenodo.38 The trained ML estimators are available on GitHub.42

Author contributions

E. K. and S. V. generated the software to compute the rotation symmetry number and partition functions. E. K. carried out the calculations for the partition function dataset and machine learning estimators. E. K. and S. V. generated and analyzed the results and wrote the manuscript and supporting information.

Conflicts of interest

There are no conflicts to declare.

Acknowledgements

The authors would like to acknowledge the Hyak supercomputer system at the University of Washington for support of this research.

References

1. B. Fu, X. Shan, D. H. Zhang and D. C. Clary, Chem. Soc. Rev., 2017, 46, 7625–7649.
2. S. J. Klippenstein, V. S. Pande and D. G. Truhlar, J. Am. Chem. Soc., 2014, 136, 528–546.
3. E. Pollak and P. Talkner, Chaos, 2005, 15, 026116.
4. P. Hänggi, P. Talkner and M. Borkovec, Rev. Mod. Phys., 1990, 62, 251–341.
5. B. Peters, Reaction Rate Theory and Rare Events, Elsevier Science, 2017.
6. G. Henkelman, B. P. Uberuaga and H. Jónsson, J. Chem. Phys., 2000, 113, 9901–9904.
7. O. V. Prezhdo, J. Phys. Chem. Lett., 2020, 11, 9656–9658.
8. M. Haghighatlari and J. Hachmann, Curr. Opin. Chem. Eng., 2019, 23, 51–57.
9. F. Häse, S. Valleau, E. Pyzer-Knapp and A. Aspuru-Guzik, Chem. Sci., 2016, 7, 5139–5147.
10. F. Häse, C. Kreisbeck and A. Aspuru-Guzik, Chem. Sci., 2017, 8, 8419–8426.
11. M. Rupp, A. Tkatchenko, K.-R. Müller and O. A. von Lilienfeld, Phys. Rev. Lett., 2012, 108, 058301.
12. K. T. Schütt, F. Arbabzadah, S. Chmiela, K. R. Müller and A. Tkatchenko, Nat. Commun., 2017, 8, 6–13.
13. Z. W. Ulissi, A. J. Medford, T. Bligaard and J. K. Nørskov, Nat. Commun., 2017, 8, 14621.
14. Y. Wang, J. M. L. Ribeiro and P. Tiwary, Curr. Opin. Struct. Biol., 2020, 61, 139–145.
15. Y. Wang, J. M. L. Ribeiro and P. Tiwary, Nat. Commun., 2019, 10, 3573–3581.
16. A. A. Peterson, J. Chem. Phys., 2016, 145, 074106–074111.
17. O. P. Koistinen, V. Ásgeirsson, A. Vehtari and H. Jónsson, J. Chem. Theory Comput., 2020, 16, 499–509.
18. O. P. Koistinen, F. B. Dagbjartsdóttir, V. Ásgeirsson, A. Vehtari and H. Jónsson, J. Chem. Phys., 2017, 147, 152720–152721.
19. E. Komp, N. Janulaitis and S. Valleau, Phys. Chem. Chem. Phys., 2022, 24, 2692–2705.
20. E. Komp and S. Valleau, J. Phys. Chem. A, 2020, 124, 8607–8613.
21. M. Meuwly, Chem. Rev., 2021, 121, 10218–10239.
22. S. Stocker, G. Csányi, K. Reuter and J. T. Margraf, Nat. Commun., 2020, 11, 1–11.
23. C. A. Grambow, L. Pattanaik and W. H. Green, J. Phys. Chem. Lett., 2020, 11, 2992–2997.
24. K. Jorner, T. Brinck, P. O. Norrby and D. Buttar, Chem. Sci., 2021, 12, 1163–1175.
25. W. Jin, C. W. Coley, R. Barzilay and T. Jaakkola, Predicting Organic Reaction Outcomes with Weisfeiler–Lehman Network, arXiv, 2017, arXiv:1709.04555v3.
26. V. Mann and V. Venkatasubramanian, AIChE J., 2021, 67, e17190.
27. M. A. Kayala, C. A. Azencott, J. H. Chen and P. Baldi, J. Chem. Inf. Model., 2011, 51, 2209–2222.
28. C. W. Coley, W. Jin, L. Rogers, T. F. Jamison, T. S. Jaakkola, W. H. Green, R. Barzilay and K. F. Jensen, Chem. Sci., 2019, 10, 370–377.
29. P. Schwaller, A. C. Vaucher, T. Laino and J. L. Reymond, Mach. Learn.: Sci. Technol., 2021, 2, 015016.
30. F. Sandfort, F. Strieth-Kalthoff, M. Kühnemund, C. Beecks and F. Glorius, Chem, 2020, 6, 1379–1390.
31. H. Gao, T. J. Struble, C. W. Coley, Y. Wang, W. H. Green and K. F. Jensen, ACS Cent. Sci., 2018, 4, 1465–1476.
32. P. L. Houston, A. Nandi and J. M. Bowman, J. Phys. Chem. Lett., 2019, 10, 5250–5258.
33. C. A. Grambow, L. Pattanaik and W. H. Green, Sci. Data, 2020, 7, 1–8.
34. G. F. von Rudorff, S. N. Heinen, M. Bragato and O. A. von Lilienfeld, Mach. Learn.: Sci. Technol., 2020, 1, 045026.
35. W. H. Miller, S. D. Schwartz and J. W. Tromp, J. Chem. Phys., 1983, 79, 4889–4898.
36. C. A. Grambow, L. Pattanaik and W. H. Green, Sci. Data, 2020, 7, 1–8.
37. S. P. Ong, W. D. Richards, A. Jain, G. Hautier, M. Kocher, S. Cholia, D. Gunter, V. L. Chevrier, K. A. Persson and G. Ceder, Comput. Mater. Sci., 2013, 68, 314–319.
38. E. Komp and S. Valleau, Low-Cost Prediction of Molecular and Transition State Partition Functions via Machine Learning (1.0.0) [Data set], Zenodo, 2022, DOI: 10.5281/zenodo.6326560.
39. G. W. Bemis and M. A. Murcko, J. Med. Chem., 1996, 39, 2887–2893.
40. N. Schneider, D. M. Lowe, R. A. Sayle and G. A. Landrum, J. Chem. Inf. Model., 2015, 55, 39–53.
41. C. R. Collins, G. J. Gordon, O. A. von Lilienfeld and D. J. Yaron, J. Chem. Phys., 2018, 148, 241718.
42. E. Komp and S. Valleau, QuickQ, GitHub repository, https://github.com/valleau-lab/quickq.
43. H. Eyring, J. Chem. Phys., 1935, 3, 63–71.
44. M. G. Evans and M. Polanyi, Trans. Faraday Soc., 1935, 31, 875–894.
45. P. J. Haley and D. Soloway, Extrapolation limitations of multilayer feedforward neural networks, in Proceedings of the IJCNN International Joint Conference on Neural Networks, 1992, vol. 4, pp. 25–30, DOI: 10.1109/IJCNN.1992.227294.
46. K. T. Butler, D. W. Davies, H. Cartwright, O. Isayev and A. Walsh, Nature, 2018, 559, 547–555.
47. P. Schwaller, A. C. Vaucher, R. Laplaza, C. Bunne, A. Krause, C. Corminboeuf and T. Laino, Wiley Interdiscip. Rev.: Comput. Mol. Sci., 2022, e1604.
48. M. Bogojeski, L. Vogt-Maranto, M. E. Tuckerman, K. R. Müller and K. Burke, Nat. Commun., 2020, 11, 1–11.
49. L. Pattanaik, J. B. Ingraham, C. A. Grambow and W. H. Green, Phys. Chem. Chem. Phys., 2020, 22, 23618–23626.
50. R. Jackson, W. Zhang and J. Pearson, Chem. Sci., 2021, 12, 10022–10040.
51. J. Zhou, G. Cui, S. Hu, Z. Zhang, C. Yang, Z. Liu, L. Wang, C. Li and M. Sun, AI Open, 2020, 1, 57–81.
52. E. Heid and W. H. Green, Machine Learning of Reaction Properties via Learned Representations of the Condensed Graph of Reaction, J. Chem. Inf. Model., 2022, 62, 2101–2110.

Footnote

Electronic supplementary information (ESI) available: Model architecture, optimization, and additional performance plots. See https://doi.org/10.1039/d2sc01334g

This journal is © The Royal Society of Chemistry 2022