Oliver
Schilter
*ab,
Daniel Pacheco
Gutierrez
c,
Linnea M.
Folkmann
c,
Alessandro
Castrogiovanni
a,
Alberto
García-Durán
c,
Federico
Zipoli
ab,
Loïc M.
Roch
c and
Teodoro
Laino
ab
aIBM Research Europe, Säumerstrasse 4, 8803 Rüschlikon, Switzerland. E-mail: oli@zurich.ibm.com
bNational Center for Competence in Research-Catalysis (NCCR-Catalysis), Switzerland
cAtinary Technologies, Route de la Corniche 4, 1066 Epalinges, Switzerland
First published on 29th April 2024
Reaching optimal reaction conditions is crucial to achieve high yields, minimal by-products, and environmentally sustainable chemical reactions. With the recent rise of artificial intelligence, there has been a shift from traditional Edisonian trial-and-error optimization to data-driven and automated approaches, which offer significant advantages. Here, we showcase the capabilities of an integrated platform; we conducted simultaneous optimizations of four different terminal alkynes and two reaction routes using an automation platform combined with a Bayesian optimization platform. Remarkably, we achieved a conversion rate of over 80% for all four substrates in 23 experiments, covering ca. 0.2% of the combinatorial space. Further analysis allowed us to identify the influence of different reaction parameters on the reaction outcomes, demonstrating the potential for expedited reaction condition optimization and the prospect of more efficient chemical processes in the future.
Discovering, developing, and optimizing chemical reactions lead to the creation of new, more efficient, and more sustainable products, technologies, and processes, enabling the possibility of revolutionizing these industries.10–15 However, finding the most optimal reaction path and conditions is a complex problem. Traditionally, these processes relied on trial-and-error approaches, including techniques such as one factor at a time (OFAT16,17) or design of experiments (DoE18) and on the expertise of researchers and scientists.
As R&D undergoes continuous digitization and is fueled by the abundance of untapped data and impressive advancements in artificial intelligence (AI), there arises a growing imperative to adopt a data-driven methodology.19–22 This adoption will empower scientists and researchers to utilize AI and to learn from past experiments, recognize patterns in the data, and ultimately suggest the next best experiments expediting R&D.23
One prominent trend in this data-driven approach involves the rising popularity of Bayesian optimization24 (BO) and reinforcement learning in recent years.25–32 Notably, deep reinforcement learning techniques were used along with domain knowledge of chemistry to improve the yield of four reactions carried out in microdroplet reactors.28 Similarly, Taylor et al.33 used a flow-based reactor to optimize the Suzuki–Miyaura coupling (3 continuous variables and one categorical) and C–H activation (5 continuous variables) via multi-task BO in 5–22 experiments, thereby reducing R&D costs over conventional optimization techniques. In a different study, BO techniques were used to improve the yield of the Heck cyclization-deprotection34 in a flow system based on an automated continuous flow platform parametrized by 4 input control variables such as the residence time, equivalents and temperature. The authors were able to achieve a yield of 81% in just 14 h (13 total experiments) and discovered a favorable competing pathway. Such findings highlight the versatility and potency of BO in diverse R&D scenarios.
One key challenge of incorporating BO methods in R&D is that these methods are typically considered as black boxes with limited explainability and interpretability,35 hindering their widespread adoption. Additionally, when the search space is large, researchers face difficulties in visualizing and understanding the way that the parameters influence the objectives. Though non-linear dimensionality reduction techniques such as t-SNE (t-distributed stochastic neighbor embedding)36 or UMAP (uniform manifold approximation and projection)37 have been used in chemistry applications such as anomaly detection, process control,38 and biomolecule molecular dynamics simulations,39 their application in BO models is limited. For example, Shields et al.9 used t-SNE to cluster BO samples in optimization campaigns, but the approach did not address the objective space or surrogate model predictions, leaving the relationship between inputs and objectives unexplored. Moreover, chemistry applications involve categorical variables and constraints which can pose challenges to conventional dimensionality reduction techniques. Successful application of these techniques to BO methods can shed light on the mechanisms by which BO methods can efficiently optimize problems.
To further enhance the speed and efficiency of R&D, conducting suggested experiments using automated robotic hardware and automatically feeding their results back to the AI platform can be highly beneficial. This integration of AI and automation is commonly referred to as a self-driving lab.40,41 These self-driving labs mark a revolutionary advancement in R&D, redefining our perception of time efficiency and of what is possible. Self-driving labs are like time machines, presenting the chemical and advanced materials sector with an opportunity to reshape their R&D processes, accelerating their pipelines to address challenges across the board. Recently, national and international efforts have led to the development of a hub towards democratizing these smart automated and data-driven approaches.42–46
An area where these self-driving labs represent a transformative leap is reaction optimization, where researchers systematically explore reaction conditions to identify the optimal set of experimental conditions and parameters for a desired chemical transformation.11,26,27,47,48
In this study, we integrated the machine learning (ML) engine from Atinary Technologies (SDLabs) and the robotic platform from IBM Research (RoboRXN) to demonstrate the autonomous optimization of the iodination of terminal alkynes within a complex search space with categorical variables and several constraints.
This reaction was chosen due to its significance in enabling the synthesis of complex organic molecules by introducing an iodide group while preserving the alkyl group. The resulting compounds can undergo various subsequent transformations, such as nucleophilic additions,49 cross-coupling reactions,50 and cycloaddition,51 allowing for further functionalization of iodoalkynes.52,53 There are multiple chemical routes available for achieving iodoalkynes (see Fig. 1). In this study, we focused on two different routes simultaneously to determine the most effective reaction conditions: (i) the first route utilizes N-iodosuccinimide (NIS) as the iodinating agent,54 while (ii) the second route involves chloramine-B as the oxidant and iodine salts as the halogen source.55
![]() | ||
Fig. 1 Iodination of 1-chloro-2-ethynylbenzene using two different routes (A) and the alternative three different aromatic alkynes (B) that are simultaneously optimized. |
We optimized the reaction using three distinct Bayesian optimizers. Impressively, two out of the three optimizers achieved over 80% conversion for all four reactants within just 23 reactions. To gain insights into the underlying reasoning of BO algorithms, we employed various visualization techniques.
In particular, we introduced a contour plot based on t-SNE dimensionality reduction, specifically designed to visualize categorical spaces. This tool allowed us to enhance the interpretability of BO methods, shedding light on their ‘black box’ nature and providing a more transparent, comprehensible approach to self-driving labs.
Parameter | |
---|---|
Alkyne group | 2-Ethynyltoluene |
4-Ethynyltoluene | |
1-Chloro-2-ethynylbenzene | |
1-Chloro-4-ethynylbenzene | |
Chloramine [eq.] | 0, 1, 1.5, 2 |
Solvent | MeOH, CH3CN/H2O, THF, DMF, DMSO, DCM, EtOAc, 1,4-dioxane, MTBE, DCE |
Iodine source | KI, NaI, TBAI, NH4I, NIS |
Iodine source [eq.] | 0, 1, 1.5, 2 |
Catalyst | PSTA, AcOH, none |
Catalyst [eq.] | 0, 0.1, 1 |
Temperature [°C] | 25, 35, 45, 55, 65, 75, 85, 95 |
Solutions of the internal standard and starting alkynes were prepared in acetonitrile at a concentration of 1 M. 100 ml stainless steel reactors were preloaded with the required iodide salts and iodinating reagents. Subsequently, the robotic platform added the solvent, followed by 1 ml of a 1 M solution of the alkynes. If applicable, a catalyst was added by the robotic platform. The quantity of solvent was carefully adjusted to ensure a consistent reactor concentration of the alkyne, maintaining a total reaction volume of 20 ml. The mixture was then heated to the desired temperature and stirred for 2 hours under a nitrogen atmosphere. After 2 h, the mixture was cooled to 25 °C, and the reaction was quenched by adding 1 ml of a saturated thiosulfate solution (in acetonitrile) followed by 1 ml of an internal standard (1 M acetophenone in acetonitrile). A 0.3 ml sample of the reaction mixture was automatically diluted 50x with acetonitrile and filtered. From this diluted solution, a 5.0 μl sample was injected by the robotic platform into an HPLC/diode array detector (DAD) setup for analysis. A more detailed example synthesis procedure can be found in Section A.5 of the ESI.†
![]() | (1) |
![]() | (2) |
When new reaction parameters become available, the backend application queries the Nexus platform to process them into a synthesis procedure. This synthesized procedure is then automatically executed on the RoboRXN platform.
The operator can access the reaction status and relevant information through a frontend web application (see Fig. 3), which provides instructions on the amount of chemicals to load into the robot and enables the operator to return the result of the HPLC DAD-chromatogram. Once the peak area of the chromatogram is returned, the backend application calculates the conversion and yield based on before-run calibrations and returns these values to Atinary SDLabs. Overall, this workflow enables a seamless communication between the two platforms and facilitates the automation and optimization of chemical synthesis.
Three algorithms were used to optimize the experimental and process conditions of the iodination of terminal alkynes. These included three flavors of Atinary Falcon (Falcon Light, Falcon DNGO (deep neural network global optimization) and Falcon GPBO (Gaussian process Bayesian optimization)), a suite of general-purpose algorithms, which can solve optimization problems that include continuous, discrete and/or categorical variables with or without physicochemical descriptors, as well as batch-constrained optimization. These algorithms are defined in terms of their surrogate model and their acquisition function. The surrogate model is a probabilistic method that predicts the objective values and uncertainty from the input parameters. The acquisition function is a policy that will select the next points to evaluate given the surrogate model: it balances the trade-off between exploration (sampling from areas where the uncertainty is high) and exploitation (sampling from areas where the predicted value is optimal). The yield and conversion were jointly optimized for each of the alkyne groups. The stopping (convergence) criterion for each alkyne group was set to be surpassing an 80% conversion. In order to jointly optimize the yield and the conversion, a weighted-sum scalarization was implemented providing a weight of 0.9 to the normalized yield and 0.1 to the normalized conversion. This weighting scheme was designed to prioritize yield significantly more than conversion during the optimization process. However, conversion was still accorded a measure of importance to favor reactions showcasing the reactivity of the starting material over reaction conditions that experience no conversion at all. The result of this weighted-sum function is from now on referred to as merit. The obtained merit from the multi-objective function was chosen as the maximization target for all the optimizers.
Given that RoboRXN has a total of 6 reactors available anytime, we opted to utilize parallelization. Instead of sending a single set of parameters, we opted to send a set of two reaction parameters simultaneously, which enabled us to run two reactions in parallel for each of the three algorithms.
Fig. 4 illustrates the joint optimization of the conversion for the four alkynes in 23 experiments for the Falcon Light optimizer. The marker shapes and colors correspond to the alkyne group and the reaction pathway used (either NIS or chloramine). The figure shows a fairly low distribution of conversion for all alkyne groups in the initial 15 experiments (3 iterations), and then shows that the convergence criterion is reached for 2-ethynyltoluene at experiment 17, followed by 1-chloro-2-ethynylbenzene at experiment 19, 4-ethynyltoluene at experiment 20 and 1-chloro-4-ethynylbenzene at experiment 22. Regarding the reaction pathways, 4 out of the initial 11 points featured NIS (green), and the remaining used chloramine. After this initial set, only the chloramine route is exploited by this algorithm.
Even though most of the experiments were performed with 1-chloro-2-ethynylbenzene (circle in the Fig. 4), these optimality conditions were leveraged to optimize the other alkynes. In fact, only two experiments used 4-ethynyltoluene before converging and only three experiments used 1-chloro-4-ethynylbenzene and 2-ethynyltoluene before they converged, compared to 10 experiments required to optimize 1-chloro-2-ethynylbenzene. Similar results can be seen for the other two optimizers (see Fig. A.1 and A.2 in the ESI†). The following sections compare the optimal conditions across alkyne groups to further illustrate how their information was transferred by the optimizers.
Fig. 5 shows a parallel coordinates chart illustrating the global optimality regions from all the seen experiments in terms of their measured conversion, yield and computed weighted-sum merit to be maximized (0.9 yield + 0.1 conversion). Every path represents an experiment; the left columns display the parameters while the columns on the right show the measurements (objectives). High values for merit correspond to good performance and low values correspond to poor performance as per the color-coding.
![]() | ||
Fig. 5 Parallel coordinates of the 16 converged experiments (conversion above 80%) from the three optimization campaigns. For the sake of clarity, the remaining experimental conditions that lead to a conversion below 80% are not displayed. Each line corresponds to an experiment and the columns correspond to the different variables (parameters, objectives and weighted-sum merit). The highlighted traces correspond to the experiments with a conversion surpassing 80%. The merit corresponds to a normalized weighted sum; the higher the weighted sum the better the performance. The traces are color-coded by the merit (i.e. darker traces indicate better performance than lighter ones). The plot with all 49 observations is shown in Fig. A.24 in the ESI.† |
The parallel coordinates plot shows that there are multiple optimal conditions for the four alkyne groups, all using chloramine, and only one using NIS. The chloramine paths (chloramine amount above 1 eq.) feature at least 1 eq. of any reagent, 1 excluding NIS, polar solvents such as MeOH, CH3CN and CH3CN/H2O, temperatures below 65 °C, and no catalyst. These results correspond to the optimality values reported by Liu et al.,55 where they report the highest yield at room temperature in CH3CN. The only NIS route uses CH3CN/H2O as a solvent, a PTSA catalyst, 1.5 eq. of NIS and a temperature of 55 °C, which resulted in a conversion above 80%, but still resulted in a near-zero yield and a poor merit value. A plausible explanation for the low yield observed from this reaction condition is the fact that PTSA, as a relatively strong acid, can cause electrophilic hydration of the alkyne group to the corresponding ketone. This reaction is reported in the literature by Liu et al.,56 which explains the high conversion with corresponding low yield. The fact that uncatalyzed NIS reactions showed a low yield and conversion corresponds to the observation by Yao et al.,54 where they showed that the chemoselectivity and regioselectivity are driven by the presence of an acid catalyst. The absence of the catalyst can lead to the formation of a diiodovinyl substrate instead of the desired product. The literature-reported optimal conditions for the NIS route feature temperatures above 70 °C and the AcOH catalyst, which was not visited in any of the experiments. All of the seen NIS experiments resulted in poor performance, which seemed to direct the optimizers to focus on the more robust chloramine route. The following sections visualize the performance of the Bayesian optimizer as a function of the parameter space and the seen experiments.
A contour plot was generated to visualize the objective values and surrogate model predictions of the Bayesian methods. This contour plot is the result of reducing the parameter space to 2 dimensions and setting the elevation to be the objective values at the given iteration. Fig. 6 shows an example plot with the seen objective values, and Fig. 7 illustrates the same plot showing the surrogate-model predictions instead, which allows a visual qualification of the models' predictive performance when compared with the ground truth. Each experiment is shown as a marker in the map: similar experiments appear close to each other in the contour plot. To reduce the high-dimensional space to 2 (xy) dimensions, t-SNE (t-distributed stochastic neighbor embedding) was used due to its ability to capture non-linear similarities in the high-dimensional space and reflect them in the low-dimensional space.36 The resulting plot is shown in Fig. 6.
![]() | ||
Fig. 7 t-SNE plot for the three optimization campaigns (Falcon Light, DNGO and GPBO). The marker annotations correspond to the iteration number of the specific campaign. The color-code corresponds to the predicted surrogate model using only the labelled (closed) markers as training data. A fully annotated version of these plots is shown in the ESI.† |
Fig. 6 illustrates the t-SNE plot using all the measured values of the 49 experiments from the three independent campaigns. The markers are shaped according to the alkyne group of the experiment, and the annotations refer to the solvent, iodine source and temperature of the experiment. Green markers correspond to converged experiments surpassing a conversion of 80%.
The annotations of Fig. 6 help visualize the way that t-SNE reduced the parameter space into 2 dimensions. We see that the experiments are mainly clustered by their solvent and temperature. The vertical half is characterized by high temperatures and the lower half by low temperatures. The solvent clusters can be read from left to right as follows: DCM, DCE, DMSO, CH3CN/H2O, CH3CN and MeOH.
Overall, the best performing region of the contour locates at the right-hand side, and the majority of the converged points locate here. We can see more clearly here that none of the NIS experiments had high yields, and most of them concentrate on the left-hand side (poor performance) except the high-conversion point at higher temperature and CH3CN/H2O as shown in Fig. 5. Overall, the low-temperature chloramine route was preferred and shown to be more robust across reagents, solvents and temperatures.
The dimensionality reduction contour plot is also useful to visualize the optimization trajectory. Fig. 7 illustrates the sequence of experiments for the three optimization campaigns under the same dimensionality reduction as for Fig. 6. The color-code in Fig. 7 corresponds to the model predictions using only their individual campaign data as training, as opposed to Fig. 6 which shows all the experimental points color-coded by their true value. In Fig. 7, the evaluated experiments of each campaign are annotated with the iteration number at which they were visited, whereas the unlabeled open markers are only predicted and shown for comparison purposes with the ground-truth. Although the first iteration composed of 11 experiments is equivalent for all 3 algorithms, each of the optimization campaigns seems to exploit a particular region of the space and leverage this information for the remaining alkyne groups. For instance, the Falcon Light trajectory shows that the first iterations concentrate on the top left region for all three campaigns, but after discovering the MeOH solvent at iteration 3 and the TBAI reagent at iteration 4, the algorithm exploits this information to optimize the remaining alkyne groups (1-chloro-2-ethynylbenzene and 2-ethynyltoluene). Similar analysis can be done on the other two algorithms: Falcon DNGO concentrates on the bottom left and center regions (CH3CN/H2O|NaI/KI|25 °C), and Falcon GPBO concentrates on the top central region (MeOH solvent at higher temperature).
The t-SNE plots also permit comparing the predictive behavior of the different Falcon algorithms. When placing side-by-side the merit contour map of Fig. 6 – with the ground-truth merit values – with each of the contour maps of Fig. 7 – using only the campaign points as training data – it is possible to qualitatively compare the predictions of each of the algorithms with the real values by comparing the color schemes. This comparison reveals the deviated predictions in the unexplored regions for each of the algorithms. For example, while Falcon DNGO captures part of the local optimum around the top right of the contour plot, it mistakenly predicts a poor performance around the MeOH solvent and high temperature region for 2-ethynyltoluene, whereas this is an optimal region found by Falcon GPBO. For Falcon GPBO, the predicted merit follows a similar trend to that of Fig. 6, yet it mistakenly predicts a local optimum region at the bottom left (DCM solvent and NaI reagent for 2-ethynyltoluene). Falcon Light also identifies the optimality region at the top right close to the observed points yet fails to identify the optimal region at the bottom center. Qualitatively speaking, the surrogate of Falcon GPBO seems to capture the overall trend more accurately than the other algorithms given the explored points of each campaign. Gaussian processes are known to provide a good predictive power in the low-data regime, and visually the explored points of Falcon GPBO are more spread out in the space than for the other algorithms. Overall, the t-SNE plot can be used to observe mean-value predictions, uncertainty or acquisition function values of Bayesian optimizers to better explain their “reasoning” for experiment planning.
All the algorithms reached the goal of surpassing a conversion of 80% with Falcon GPBO and Falcon Light at 23 experiments (6 iterations) and Falcon DNGO at 25 experiments (7 iterations), which are a fraction of the total number of 12036 combinations. Considering the 8-dimensional parameter space, converging in 23–25 experiments (0.2% of the search space) and 6–7 BO loops is consistent with other referenced BO applications,9,33,34 showcasing the robust applicability of these methods to different chemistry challenges.
These algorithms leverage the observed data to carefully select the next points to evaluate, thereby augmenting the sample efficiency in large and constrained search spaces. While two reaction paths were available, all the algorithms selected the chloramine path as it proved to be more robust to the tested reagents, solvents and temperatures.
Our study focused on the iodination of terminal alkynes, which holds significant importance in synthesizing complex organic molecules while preserving the alkyl group and introducing an iodide group. By exploring two different routes, one involving chloramine salts and the other utilizing N-iodosuccinimide (NIS) as the iodinating agent, we sought to determine the most effective reaction conditions from a total of 12036 possible combinations. Through the use of three distinct BO optimizers, we achieved impressive results, with all optimizers reaching over 80% conversion for all four reactants within a minimal number of just 25 distinct reactions, covering only ca. 0.2% of the total number of possible combinations. These findings showcase promising opportunities for efficient and highly productive iodination of terminal alkynes.
Finally, we took a first step towards tackling the interpretability issues of BO models through the implementation of diverse visualization techniques. Specifically, the incorporation of a customized contour plot based on t-SNE dimensionality reduction proved to be instrumental in improving the comprehension of BO methods. This novel approach effectively illuminates the otherwise opaque nature of BO algorithms, rendering them more transparent and insightful as an integral part of the self-driving labs platform. By addressing the challenge of interpretability, our research makes a valuable contribution to the wider acceptance and utilization of BO methods in chemical synthesis and other research and development endeavors.
In conclusion, we believe that the integration of ML-driven experiment planning, automated equipment, and interpretability tools has the potential to change the way we identify optimal synthesis routes and conditions, allowing for a cleaner and more sustainable industry.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3sc05607d |
‡ https://rxn.res.ibm.com. |
§ https://atinary.com. |
This journal is © The Royal Society of Chemistry 2024 |