Daniel
Dalland
,
Linden
Schrecker
* and
King Kuok (Mimi)
Hii
Department of Chemistry, Imperial College London, Molecular Sciences Research Hub, 82, Wood Lane, London W12 0BZ, UK. E-mail: linden@solvechemistry.com
First published on 13th September 2024
The ability and desire to collect kinetic data has greatly increased in recent years, requiring more automated and quantitative methods for analysis. In this work, an automated program (Auto-VTNA) is developed, to simplify the kinetic analysis workflow. Auto-VTNA allows all the reaction orders to be determined concurrently, expediting the process of kinetic analysis. Auto-VTNA performs well on noisy or sparse data sets and can handle complex reactions involving multiple reaction orders. Quantitative error analysis and facile visualisation allows users to numerically justify and robustly present their findings. Auto-VTNA can be used through a free graphical user interface (GUI), requiring no coding or expert kinetic model input from the user, and can be customised and built on if required.
Rate = kobs[A]m[B]n[C]p | (1) |
A global rate law can be constructed empirically from experimental data, without explicit considerations of the reaction mechanism or mass transfer effects.11,12 Traditionally, kinetic experiments include “flooding”13,14 or the initial rates method.15,16 While these methods are generally easy to analyse (as the data can be linearised), the results must be treated with caution, as they are either performed under non-synthetically relevant conditions,4,17 or cannot detect changes in reaction orders associated with more complex mechanisms, such as catalyst deactivation or product inhibition.18,19
In the past few decades, modern advances in process analytical tools and computing power have greatly accelerated the development of data-rich kinetic experiments under synthetically relevant conditions. In the late 1990's, Blackmond pioneered the development of Reaction Progress Kinetic Analysis (RPKA),20 which greatly streamlines the determination of rate laws from a series of “same excess” and “different excess” experiments.21–23 The experimental data can be manipulated in an electronic spreadsheet to obtain overlays between rate vs. concentration plots to obtain individual reactant order. Later, a “time adjusted” concentration vs. time plot method was developed,7 followed by the Time Normalization Analysis (TNA) method for determining catalyst order, further expanding the suite of visual kinetic analysis tools.24 Later in the same year, this was further extended to all reacting species as “Variable Time Normalization Analysis” (VTNA).25 One of the most attractive features of these visual kinetic analysis tools is that reaction orders can be derived without the need for bespoke software or significant complex mathematical calculations, hence making them more accessible to the synthetic chemistry community than previously available kinetic tools.26–29
In the last few years, the widespread popularity of Python as a general programming language has greatly accelerated the automation of machine-readable tasks.30–33 Recently, Hein and co-workers have developed a Python package “Kinalite”, a simple API for performing VTNA.34–36 Kinalite requires kinetic data from each experiment to be imported as individual csv files, containing time–concentration data of different reaction species. The user selects a reaction species and two relevant experimental data to determine the reaction order automatically. For multiple species, or for more than two data series, this process must be performed sequentially, determining one species order at a time. The package presents the result as a plot of ‘errors’ associated with different order values for a specific reaction species, demonstrating the best order value. These individually calculated orders can be combined to yield the global rate law. By removing the need for visual inspection, Kinalite removes human bias. However, other challenges remain, such as:
(1) Analysing more than two experiments at a given time.
(2) Determining the order of multiple or all reaction species concurrently.
(3) Robustly quantifying error in the reaction orders computed.
In this work, the development of a new and more robust Python package to perform VTNA is described, which can elucidate the reaction orders of several species simultaneously. As well as enabling significant time saving, an unlimited number of initial concentrations can be altered between experiments by computationally assessing the overlay across a wide range of order value combinations. This presents a novel way of carrying out “different excess” experiments which improves the amount of kinetic information obtained per run.
(1) Manual and automatic VTNA in a more time efficient manner.
(2) VTNA with several normalised reaction species.
(3) Visualisation of overlay score across different reaction orders.
(4) Quantification of overlap and error.
(5) Improved accessibility.
Each of these features will be presented below as a ‘unique selling point’ (USP) of the package, followed by a Results and discussion, where the application of Auto-VTNA is showcased with several examples.
![]() | ||
Fig. 1 VTNA overlay plots from literature with the reported order values giving concentration profile overlay.39–42 |
To develop a robust automated method, a Python script was developed to emulate the manual VTNA routine which calculates the transformed time axis for a chosen reaction species and corresponding order value(s) using a selected subset of the kinetic data, tested on selected literature examples (see ESI Appendix SA2‡).25 Subsequently, a robust automatic method of assessing the overlay of reaction progress profiles was developed, removing the need for manual visual inspection. This method involves fitting all profiles to a common flexible function and utilising a goodness-of-fit score (e.g. RMSE), as an ‘overlay score’, to quantify the degree of overlay.
In order to process non-linear fitting, a 5th degree monotonic polynomial fitting, a statistical method with several scientific applications,43–45 was selected as the default method. Although a more computationally efficient non-monotonically constrained polynomial fitting can be selected that is circa. 10 times faster, there is an increased risk of overfitting effects for reaction profiles with few datapoints. Additionally, Auto-VTNA allows linear fitting, which is particularly useful when reaction profiles linearise upon complete normalisation of the time axis, and can evaluate the kobs of the global rate law (see ESI, Section S6.2‡).
The ‘overlay score’ of Auto-VTNA, based on total fitting to a flexible function, is different from that employed in Kinalite, which relies on the difference in y-axis values of datapoints when sorted by transformed time value (‘error’) (see ESI, Section S2.2‡). This ‘error’ measure can in some cases yield incorrect order values that do not reflect optimal concentration profile overlay, especially if the density of the two compared data series is different.36 In contrast, Auto-VTNA reliably yields the order value(s) that maximise concentration profile overlay as judged by visual inspection on all tested examples (see ESI, Section S6.3‡), thus successfully automating the visual part of VTNA.
Upon establishing a robust method for computationally assessing the degree of concentration profile overlay, an algorithm was developed to perform automatic VTNA and embedded into a Python package named “Auto-VTNA”.
Traditional “different excess” experiments only alter the initial concentration of one reaction species at a time. Auto-VTNA can simultaneously determine the reaction orders of several reaction species efficiently. This reduces researcher analysis time and facilitates more efficient “different excess” experiments, where the initial concentrations of several reaction species are altered between experiments at the same time. Potentially, such an experimental method could reduce the number of experiments required to determine all reaction species orders in a complex reaction mixture. However, the penalty on accuracy of determined orders with reduction in the number of experiments should be considered and could be assessed through Auto-VTNA.
![]() | ||
Fig. 3 Different representations of VTNA results: (a) Overlay plots for simulated data involving the catalytic conversion of reactants A + B → P, including catalyst decomposition.25 (b) Plot showing the overlay scores (RMSE of 5th degree monotonic polynomial fits) against the reaction order in catalyst obtained using Auto-VTNA. (c) Corresponding “error plot” obtained using Kinalite. |
Through its “error” metric, Kinalite can also produce a rudimentary overlay score (Fig. 3c and ESI Section S2.2‡), which produces almost the same order assignment as Auto-VTNA for the simulated kinetic data in Fig. 3a. However, its reduced reliability becomes evident from the erratic curve produced, particularly at negative order values.36
When utilising Auto-VTNA to determine the orders of two species, the correlation between the overlay score and the combinations of order values can be visualised by a contour plot (Fig. 4, central plot). If the algorithm is set to obtain the reaction orders of three or more species, overlay scores can be tabulated for a selected number of order value combinations. However, in this case, the results are challenging to visualise due to the higher dimensionality of the data. Alternatively, if a reaction order in a species is already known, this can be inputted as a fixed value, if desired. The algorithm then calculates the overlay score as a function of the remaining species. This lowers the dimensionality of the results obtained so that they can be visualised by a graph or contour plot.
![]() | ||
Fig. 4 VTNA overlay plots generated using kinetic data on the aza-Michael reaction between methyl itaconate (1) and piperidine (2).46 The overlay plots are generated by right-clicking on the contour plot. |
The graphs and contour plots generated by Auto-VTNA (Fig. 3 and 4, respectively), allow individual overlay plots to be produced in an interactive manner through mouse clicks (Fig. 4). By left-clicking on the graph or contour plot, a traditional overlay plot is generated, whereas right-clicks show the plot with normalised y axis, the fitted function, and the overlay score of the click order value(s). This allows the operator to interrogate the overlay score landscape obtained by Auto-VTNA and verify that the calculated reaction order(s) give optimal concentration profile overlay.
![]() | ||
Fig. 5 (a) Overlay score vs. order in catalyst for simulated kinetic data.25 (b) Contour plot illustrating the degree of concentration profile overlay across different reaction order values in 1 and 2 for data on the aza-Michael addition in EtOH (Scheme 1).27 The blue dots illustrate the order combination values with overlay scores within 15% of the optimal point at (1.03, 1.02). |
The absolute optimal overlay score can also provide valuable information about the reaction under study. High optimal RMSE overlay scores reflect effects that can lower the confidence in the global reaction order afforded by VTNA, such as noisy kinetic data, incorrectly cropped data, or insufficient data point density for accurate numerical integration, as well as more complex reaction mechanisms giving changing order values, e.g. catalyst deactivation. To improve the utility of absolute overlay score values, it is necessary to limit the influence of factors other than the degree of concentration profile overlay. For example, to prevent the concentration scale from influencing goodness-of-fit values, Auto-VTNA normalises the y-axis prior to generating the overlay score. Moreover, the effect of over- and under-fitting on the overlay score is limited by employing a flexible 5th or 7th degree monotonic polynomial fit (see ESI, Section S6‡). Lastly, to facilitate the comparison of optimal overlay scores obtained with different datasets, RMSE has been set as the default goodness-of-fit measure.
![]() | ||
Scheme 1 Aza-Micheal addition of piperidine 2 to dimethyl itaconate 1.46 |
Utilising the Auto-VTNA Calculator with default settings (5th degree monotonic fitting with an RMSE overlay score), reaction orders in 1 and 2 were identified for the ethanol, IPA, THF, and DMSO datasets (Table 1) within 3 minutes. Generally, the Auto-VTNA derived order values match the orders reported within 0.1 and gave equal or better concentration profile overlay than that obtained using reported orders (see ESI, Fig. SA38‡). An exception is the reaction in DMSO, for which the computed reaction orders of 0.77 and 1.86 deviated more significantly from the reported order values of 1 and 2. This is attributed to the sparsity of kinetic data obtained from the reaction DMSO (3 experiments with 4 time points each), which leads to a greater influence of random errors and inaccurate numerical integration. The results from each calculation can be represented as contour plots (Fig. 7), showing well-defined overlay score minima around the best order values, in particular for the ethanol dataset (Fig. 7a).
Solvent | Reported value | Auto-VTNA |
---|---|---|
EtOH | [1]1[2]1 | [1]1.03[2]1.01 |
IPA | [1]1[2]1.6 | [1]1.00[2]1.66 |
DMSO | [1]1[2]2 | [1]0.77[2]1.93 |
THF | [1]1[2]2 | [1]0.94[2]1.86 |
Overall, this example illustrates that Auto-VTNA reliably computes order values based on optimal concentration profile overlay that match reported orders obtained by visual inspection. It also emphasises the importance of a suitable quality and quantity of kinetic data for application of any kinetic analysis technique.
Entry | Reaction system | Reported orders | Calculated orders | Total VTNA overlay score with | ||
---|---|---|---|---|---|---|
Sequential VTNA | Total VTNA | Reported orders | Total Auto-VTNA orders | |||
156 |
![]() |
[4]1[5]1[6]1[7]0[8]1 | [4]1.00[5]1.13[6]0.82[7]−0.03[8]1.04 | [4]1.00[5]1.04[6]1.02[7]−0.02[8]1.01 | 0.0379 | 0.0378 |
248 |
![]() |
[11]1[12]1[13]−0.5 | [11]0.85[12]0.92[13]−0.48 | [11]0.94[12]0.99[13]−0.49 | 0.0174 | 0.0143 |
352 |
![]() |
[14]0.9[12]1.0[15]1.0[16]0 | [14]0.85[12]1.04[15]1.07[16]0.09 | [14]0.95[12]1.02[15]1.06[16]0.04 | 0.0178 | 0.0162 |
463 |
![]() |
[18]1[19]1[20]1 | [18]1.06[19]0.98[20]1.02 | [18]1.00[19]0.95[20]0.91 | 0.0356 | 0.0338 |
557 |
![]() |
[22]0.85[23]−0.6[24]0.9 | [22]0.95[23]−0.58[24]0.94 | [22]0.90[23]−0.58[24]0.94 | 0.0729 | 0.0590 |
654 |
![]() |
[27]1[28]1[29]1 | [27]0.91[28]0.97[29]0.90 | [27]0.92[28]0.97[29]0.96 | 0.0308 | 0.0231 |
753 |
![]() |
[31]1.5[32]1[33]0.15 | [31]1.59[32]0.89[33]0.15 | [31]1.51[32]1.18[33]0.20 | 0.0114 | 0.0055 |
86 |
![]() |
[34]−1[35]1[36]1 | [34]−1.10[35]1.05[36]1.14 | [34]−1.07[35]1.05[36]1.12 | 0.1157 | 0.0537 |
96 |
![]() |
[34]−1[35]1[36]1 | [34]−0.43[35]1.60[36]1.08 | [34]−0.44[35]0.99[36]1.19 | 0.3911 | 0.2866 |
1062 |
![]() |
[38]1[39]1[40]1 | [38]1.04[39]0.97[40]0.83 | [38]1.10[39]0.95[40]0.83 | 0.0557 | 0.0455 |
1110 |
![]() |
[42]1[43]0.75[44]1[45]0.5[46]0[47]1 | [42]1.05[43]0.77[44]1.02[45]0.36[46]0.04[47]0.94 | [42]1.07[43]0.60[44]1.02[45]0.30[46]−0.04[47]0.99 | 0.1163 | 0.1015 |
1256 |
![]() |
[49]0[50]0.9[51]1.1[52]1 | [49]0.11[50]0.79[51]1.24[52]0.91 | [49]−0.03[50]0.62[51]1.26[52]0.86 | 0.1250 | 0.0912 |
1350 |
![]() |
[54]1[55]0[56]0[57]1 | [54]0.85[55]0.13[56]−0.07[57]0.95 | [54]0.76[55]0.10[56]−0.10[57]1.07 | 0.0618 | 0.0359 |
1461 |
![]() |
[60]1[61]1[62]1 | [60]0.74[61]0.96[62]0.96 | [60]0.71[61]0.95[62]0.86 | 0.1412 | 0.1013 |
1541 |
![]() |
[64]0[65]1[66]1 | [64]0.32[65]1.02[66]0.98 | [64]0.53[65]0.92[66]0.79 | 0.1231 | 0.1102 |
1651 |
![]() |
[68]2.05[69]0.78[70]−0.03[71]0.20[23]0.02 | [68]0.97[69]0.60[70]0.11[71]0.18[23]0.10 | [68]1.05[69]0.62[70]−0.04[71]0.18[23]0.03 | 0.1275 | 0.0230 |
The current approach to VTNA experimentation involves performing a “standard” reaction, under synthetically relevant conditions, followed by “different excess” experiments, where only the initial concentration of each reaction species is altered. This experiment design we define as “sequential VTNA”, as the orders are obtained for one reaction species at a time by identifying the order that makes the standard and relevant different excess profiles overlay. Pleasingly, we found that Auto-VTNA could identify the order values giving optimal overlay in all cases using sequential VTNA. A monotonic polynomial degree of 5 was sufficient in most instances to avoid inaccurate order values due to underfitting. However, for concentration profiles exhibiting significant curvature, a 7th degree polynomial was necessary to ensure convergence to the optimal overlay order value (see ESI, Section S6.4‡).
In “total-VTNA”, all experiments are combined and normalised with respect to every species for which an order has been obtained via sequential VTNA. If every species contributing to the rate has been normalised, linearisation occurs so that the observed rate constant can be derived.8,10,48,50–53,63 However, despite previous VTNA methods supporting total-VTNA, such methods have been utilised only occasionally to derive reaction orders.51,56 As Auto-VTNA significantly reduced the time and effort to determine the reaction orders of an unlimited number of reaction species rapidly through one single calculation, it facilitates a simultaneous total-VTNA approach. Researchers can vary multiple initial concentration at a time, increasing the information content per reaction run, and lowering the signal to noise ratio for the total VTNA calculation. This change from sequential to total VTNA is analogous to the improvement offered by design of experiments (DoE) over one-factor-at-the-time (OFAT) approaches for reaction optimisation in terms of accuracy and speed.64
Currently, there are few reported kinetic studies with different excess experiments in which more than one initial concentration is varied at a time.46,56 Nonetheless, as long as the concentration profiles of every reaction species have been measured or can be inferred from the product profiles, total VTNA can also be applied to ordinary VTNA datasets. This was performed for kinetic data obtained from 17 publications.6,10,40,41,48–57,61–63 Two of these publications were omitted as sequential VTNA had revealed that reaction orders change with the initial concentrations of reactants.39,49 Kinetic data from the remaining 15 publications was re-analysed by Auto-VTNA sequentially and by automatic total VTNA (Table 2).
For entries 1–8 of Table 2, Auto-VTNA for both sequential and total VTNA yielded order values in close agreement with the reported values. The 15% overlay score intervals for the optimal order values were also moderate for these datasets (see ESI, Fig. S47‡). For entries 9–14 of Table 2, the order values obtained by automatic total VTNA deviate more significantly from the reported values. However, the calculated order values demonstrate closer overlay than the reported values, as evidenced by their superior overlay scores. This supports the claim that Auto-VTNA yields the order values that maximise concentration profile overlay without human bias. This allows calculated order values and a measure of error (15% overlay score intervals) to be reported alongside rounded order values, even if the latter is believed to be the “true” order (see ESI, Section S8‡). Entry 15 of Table 2 produced broad order intervals both in sequential and total VTNA, suggesting that more experiments may be needed to confirm the true order, particularly in the pyrrolidine substrate (see ESI, Fig. SA34‡). Entry 16 of Table 2 yielded order values in close agreement with those reported, with the exception of the order in the alkyne which was found to be 1.05 rather than 2.05 (see ESI, Fig. SA35 and SA36‡).
![]() | ||
Fig. 8 VTNA overlay plots illustrating the degree of concentration profile overlay achieved by total VTNA using: (a) the reported order values; (b) and those calculated using Auto-VTNA.6 Every colour refers to a different experiment. |
A kinetic study by Lancaster et al. on a catalytic asymmetric alkene bromoesterification reaction (Table 2, entry 1) is a good demonstrator of the power of automatic total VTNA for elucidating rate equations for catalytic reaction systems (Fig. 9a). Using Auto-VTNA the analysis could be expanded to include same excess and product doping experiments, revealing an order of −0.75 in the amide by-product and an order of 0 in the desired product (Fig. 9b). The linearisation achieved by also normalising the time axis with respect to the inhibitory by-product indicates that the complete rate equation has been elucidated.
![]() | ||
Fig. 9 VTNA overlay plots obtained utilising Auto-VTNA for the catalytic asymmetric alkene bromoesterification reaction reported by Lancaster et al.55 analysed for: (a) all reactants and catalyst, with the relevant different excess experiments; and (b) all species present in the reaction including product, by-product, and internal standard, with both different excess, same excess and product addition experiments. |
In the final example, Auto-VTNA was applied to analyse product concentration profiles for kinetic data by Newton et al. for the Heck reaction between iodobenzene and methyl acrylate catalysed by varying amounts of Pd(OAc)2,9 where it was reported that the catalyst order is dependent on catalyst loading, due to the presence of more than one active catalyst species and different catalyst deactivation rates. Investigation utilising Auto-VTNA emulated the reported results: for low catalyst loading experiments (30, 50, 100, and 200 ppm), a catalyst order of 1.73 was identified, close to the reported value of 1.7; while a lower catalyst order of 0.87 was obtained for higher catalyst loadings (200, 400, 600, 800, and 1000), similar to the reported value of 0.9. The concentration profiles with high catalyst loadings exhibited a wider interval of reaction order values with overlay scores only 15% above the minimum (0.79 to 0.94 vs. 1.70 to 1.77) as well as a higher overlay score at the optimal catalyst order (0.041 vs. 0.023) (Fig. 10).
![]() | ||
Fig. 10 Graphs illustrating overlay score versus order in catalyst and VTNA overlay plots with catalyst order values found to maximise concentration profile overlay for kinetic data on a Heck reaction reported by Newton et al.9 for (a) and (b) low catalyst loadings; and (c) and (d) high catalyst loadings. |
To investigate whether the reported catalyst order step change at 200 ppm is supported by Auto-VTNA, order values were calculated for pairs of experiments with consecutive catalyst loadings (Fig. 11). Indeed, a step change is observed at 200 ppm in catalyst order value, although the catalyst order obtained from the 600 and 800 ppm experiments appears to be an outlier. The equivalent plots for order values obtained from groups of 3 or 4 experiments with consecutive catalyst loadings also indicate a step change from high to low catalyst orders, although less pronounced as the mid-range order values are obtained from concentration profiles from both the high and low catalyst loading domains (see ESI, Fig. SA40–SA42‡). This demonstrates the power of Auto-VTNA to quickly and accurately provide kinetic analysis with robust quantified justification.
![]() | ||
Fig. 11 Calculated reaction orders in catalyst for pairs of reaction profiles with consecutively higher loadings of Pd(OAc)2 from the kinetic data collected by Newton et al. on a Heck reaction with low catalyst loadings.9 Open circles and dashed lines represent visual VTNA performed in the original work by Newton et al. on four “low catalyst loadings” and five “high catalyst loadings”. |
Auto-VTNA was tested on kinetic data obtained from literature to demonstrate the key features and advantages. The ability to compute several reaction orders, utilising kinetic data obtained from multiple experiments, (total-VTNA) is especially highlighted. The application of the method in a Heck reaction that involves changes in the catalyst order at different catalyst loadings was also demonstrated.
Last but not least, a graphical user interface has been constructed and made freely available, as well as the underlying code, to facilitate the use of Auto-VTNA by researchers. The program has been optimised for efficiency of use and does not require coding experience. It includes several features, such as removal of selected datapoints, kinetic data visualisation, and interactive representation of Auto-VTNA results. We hope that this work will encourage wider adoption of VTNA by researchers and enable the justification of assigned reaction order values in a more informative and efficient manner.
Footnotes |
† For any use of Auto-VTNA, the seminal paper by Búres should also be cited along with this paper.25 |
‡ Electronic supplementary information (ESI) available: Range of interests and levels of detail for Auto-VTNA from those wishing to use the Auto-VTNA GUI to those developing their own software in this research space. Sections include details on: the theory of VTNA; choices in the development of Auto-VTNA; algorithmic efficiency analysis; best practice for the reporting of order values and interpreting overlay scores; a user guide to the Auto-VTNA code; a user guide to the Auto-VTNA Calculator GUI; appendices including overlay graphs of all reactions in Table 2, more complex examples, and simulated data. A tutorial video for the Auto-VTNA Calculator GUI is also available, linked from https://github.com/ddalland/Auto-VTNA. See DOI: https://doi.org/10.1039/d4dd00111g |
This journal is © The Royal Society of Chemistry 2024 |