Kakasaheb Y.
Nandiwale
a,
Travis
Hart
a,
Andrew F.
Zahrt
a,
Anirudh M. K.
Nambiar
a,
Prajwal T.
Mahesh
a,
Yiming
Mo
a,
María José
Nieves-Remacha
b,
Martin D.
Johnson
c,
Pablo
García-Losada
b,
Carlos
Mateos
b,
Juan A.
Rincón
b and
Klavs F.
Jensen
*a
aDepartment of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, USA. E-mail: kfjensen@mit.edu
bCentro de Investigación Lilly S.A., Avda. de la Industria 30, Alcobendas-Madrid 28108, Spain
cSmall Molecule Design and Development, Eli Lilly and Company, Indianapolis, Indiana 46285, USA
First published on 30th March 2022
Continuous manufacturing of pharmaceuticals and fine chemicals is attractive due to its small physical footprint, consistent product quality, and demonstrated benefits from safety, economic, and environmental perspectives. However, handling solids in research-scale flow reactors creates hurdles, as the solids often lead to reactor channel clogging. To tackle this problem, we present a continuous stirred-tank reactor (CSTR) cascade that can handle slurries/solids during a chemical transformation in flow. Additionally, the design includes a light emitting photo diode (LED) array with photon flux characterized by chemical actinometry. We employ mixed-integer nonlinear programming (MINLP) and Bayesian optimization algorithms for single and multi-objective optimization, respectively. We demonstrate the autonomous optimization of three multiphase catalytic reactions of synthetic importance involving solid substrates, catalysts, and inorganic bases in an automated flow platform comprising a CSTR cascade, newly developed photoreactor, slurries feeding pumps, and an online process analytical technology (PAT). The two first case studies involve MINLP optimization of yield for a Pd-catalyzed Suzuki–Miyaura cross-coupling reaction involving both solid substrates and catalyst for the synthesis of an advanced intermediate, and a metallaphotoredox-catalyzed sp3–sp3 cross-coupling of carboxylic acids with alkyl halides in the presence of solid inorganic base. The third study presents Bayesian multi-objective optimization of a diastereoselective, metallaphotoredox cross-coupling reaction between trans-4-hydroxy proline and 4-bromoacetophenone for which both yield and amount of the trans isomer were optimized while handling potential formation of a solid product.
A significant limitation of such flow platforms based on tubular reactors is clogging of narrow channels due to the incompatibility of insoluble reaction components.42 Typically, when adapting a reaction protocol developed in batch for a flow system, it is common to replace solid reagents with soluble components, but such modifications can be sub-optimal for the chemistry of interest. As such, it would be preferable to develop flow platforms that tolerate solids, such as insoluble bases or precipitates formed as reactions progress. One method by which this has been achieved is by using a continuous stirred-tank reactor (CSTR) cascade,24,43–45 which have achieved long continuous run times and good mixing for heterogeneous mixtures.46–48 Alternative strategies include acoustic irradiation,49–51 continuous oscillatory baffled reactors (COBRs),24,52–54 and oscillatory flow reactor (OFR).55,56
Herein we report an automated CSTR cascade flow platform capable of leveraging a number of automated optimization protocols along with three case studies demonstrating handling of solid catalysts and bases, as well as precipitates formed during reaction. Additionally, automated single- and multi-objective optimization routines using mixed-integer nonlinear program (MINLP) and Bayesian optimization algorithms are employed. In particular we use two case studies to demonstrate MINLP optimization of yield for (1) a Pd-catalyzed Suzuki–Miyaura cross-coupling reaction involving both solid substrates and catalyst for the synthesis of an advanced intermediate, and (2) a metallaphotoredox-catalyzed sp3–sp3 cross-coupling of carboxylic acids with alkyl halides in the presence of different solid inorganic bases. The third study takes advantage of Bayesian multi-objective optimization to find optimum conditions for achieving high yield and amount of the trans-isomer for a diastereoselective, metallaphotoredox cross-coupling reaction between trans-4-hydroxy proline and 4-bromoacetophenone in which solid product can be formed.
![]() | ||
Fig. 1 (a) Schematic of cascade CSTR reactor. (b) Schematic of photoreactor with LED array and Peltier (thermoelectric) cooler, and (c) LED array, diode placement. CAD design, and (c) actual system. |
LabVIEW (National Instrument, NI ver. 19.0.1f3), MATLAB and SIMULINK (MathWorks, Inc., ver. R2019a), and Python were used to control all the hardware and perform the optimization campaign. The LabVIEW software was designed and written in-house for the purpose of on-demand reaction optimization, with the exception of some drivers provided from NI or supplier (Fig. S3 and S4†). Control and monitoring of the system were performed through USB communication with the control unit at the back side of the pumping platform. NI data acquisition (MCC DAQs) devices and serial to USB converters facilitated effective communication with the user's LabVIEW user interface. The control unit also provided power to all components on the pumping and reaction platforms, including motor controllers, three-way valves, LED array, CSTR cartridge heaters, temperature controllers, the selector valve, and line vibrators. The LabVIEW routine comprised a central virtual instrument (VI) that executed simultaneous loops for automated design of experiments (DoE) based on MINLP, flow rate manipulation, online monitoring, temperature control, HPLC sampling, HPLC analysis, and optimization (Fig. S5†).
The pumping platform consisted of pumps, temperature control, pressure transducers, reagent storage, degasser storage, three-way valves, and hardware controls. The external hardware components were powered from the control box at the back of the pumping platform and process analytical technology (PAT) passed data back to the LabVIEW user interface through the control box. Two types of pumps were housed on the platform, (1) six positive displacement pumps (Vici Valco M6/M50) and (2) two syringe pumps (Harvard Apparatus PhD Ultra).
The positive displacement pumps served as an all-purpose component for flow chemistry applications, utilizing a ceramic rotor and polytetrafluoroethylene (PTFE) cross-linked stator for excellent chemical compatibility. These pumps were able to deliver liquids from ∼5 nL min−1 to ∼5 mL min−1 up to ∼17.5 bar for over one million cycles. In the current platform, the number of these pumps dictate the number of discrete variables accessible in optimization campaigns. In future iterations, this number can be significantly increased by incorporating a selector valve between the pumps and reagent vials. The syringe pumps on the system allowed for the handling of slurry solutions in the system. The syringe pumps were mounted are on a variable angle shelf along with the magnetic tumbler/stirrers (V&P Scientific VP 710D3-4) and PTFE coated magnetic stir bars inside the syringe, which kept solids suspended in the solution inside the syringes. Moreover, enhanced transport of slurries, while avoiding clogging was accomplished by attaching an oscillator (Precision Microdrives 306-10H) at the connecting tube between the syringe and the CSTR cascade.47 This approach to delivering and handling slurries was based on our initial experience with optimization of Suzuki–Miyaura cross-coupling with solid substrates (discussed in the Results section). Proportional-integral-derivative (PID) temperature control (omega engineering) and inline pressure monitoring (DJ instruments) were part of the control unit on the pumping platform.
The reaction platform consisted of the CSTR cascade, stir plate, back pressure regulators (BPRs), three-way valves, and six-port two position valve. The combination of the three-way valve and six-port two position valve controlled by LabVIEW allowed for automated sampling and online HPLC analysis (Fig. S6†). Specifically, upon exiting the reactor the reaction mixture was diluted with solvent (the identity of which depends on the specific reaction) to solubilize bulk solids via a T-connection. This mixture was passed through a static mixer, which aids to dissolve solids and avoids clogging in the outlet tubes. The LabVIEW automation controls the openings of 3-way valve to the mixture then passed through an inline filter prior to reaching the sample loop in the 6-port 2-position valve leading to the HPLC. For further details, including system start-up and operation, please see the ESI.†
The Dragonfly62,63 open-source Bayesian optimization package constructs a Gaussian process surrogate model to describe the relationship between input variables and objective functions. Both continuous and discrete variables can be defined in the optimization domain. In the initialization phase, a space-filling design of experiments is generated by using Latin hypercube sampling (LHS) for continuous variables and random sampling for discrete variables. Once the initialization results are returned to the algorithm, either the upper confidence bound (UCB) or Thompson sampling (TS) acquisition functions are selected (with equal probability) to generate each refinement experiment. These acquisition functions balance an exploitative strategy (query regions where the output is expected to be high) with an explorative one (query regions where uncertainty is high) to help ensure that globally optimal points are found.62,63 In multi-objective optimization, the goal is to identify Pareto optimal points that represent the trade-off between potentially conflicting objectives.3 To find multiple Pareto optimal points, Dragonfly employs a random scalarization strategy, where different weights (relative importance) for each objective are sampled at each refinement iteration and the weighted sum is maximized.63 To assess algorithm convergence, the hypervolume indicator was utilized.3 For a two-objective optimization, the hypervolume corresponds to the area enclosed by the current Pareto points.
![]() | ||
Fig. 3 Reaction scheme (top) and optimization variables (bottom) for the Suzuki–Miyaura cross-coupling reaction example involving solid substrates and catalyst based on prior work by Cole et al.2 |
In the first refinement iteration (experiment number 18 to 20, first gray block containing 3 experiments), the algorithm proposed one experiment for each catalyst with similar conditions (0.3 mol% catalyst conc., 70 °C, and 30 minutes). The yields of 90%, 61%, and 45% going from the catalyst ID 1 to 3 confirmed the relative ranking observed during the initial experiments. For subsequent refinement iterations (alternating white and gray blocks), the algorithm fathomed the poor performing catalysts 2 and 3 from the consideration and focused on the highest performing catalyst 1. By increasing the catalyst concentration from 0.3 to 0.5 mol%, an optimum yield of ∼94% was achieved. Fig. 4(b–d) exhibits 3D plots with continuous variable values on the three axes and yield of 3 indicated by color. Fig. 5(a) focuses on the end of the campaign where the algorithm's predicted optimal yield converges and the 99% confidence interval gets progressively narrower as more data is collected close to the predicted optimum.
Fig. 5(b–d) shows the fitted quadratic response surface models for yields as a continuous function of temperature and residence time for each catalyst. The surfaces correspond to low (0.20 mol%), medium (0.35 mol%), and high (0.50 mol%) catalyst concentration. (Model parameters and their associated uncertainties can be found in Tables S4 and S5 in the ESI†). The response surface plots help to visualize the relative sensitivity of yield with respect to the continuous variables, with temperature having the greatest effect in this case. The ability to create mathematical models to quantitatively predict the reaction yield with good accuracy (for all three catalyst types) using a small number of efficiently designed experiments demonstrates the utility of this approach for achieving process understanding while decreasing the use of raw material and the number of experiments during process development.
![]() | ||
Fig. 6 Metallaphotoredox-catalyzed sp3–sp3 cross-coupling of carboxylic acids with alkyl halides1 used in the second case study (top). Continuous and discrete optimization variables (bottom). |
Fig. 7(b and c) exhibits 3D plots with continuous variable values on the three axes and yield of 6 indicated by color. During the initialization phase (yellow block, experiment number 1 to 15) within the explored conditions, the K2CO3 (base 2) exhibited superior performance compared to Cs2CO3 (base 1). The first refinement iteration (experiment number 16 and 17, first gray block) confirmed that K2CO3 has a better performance than the Cs2CO3 and the algorithm focused on K2CO3 for the subsequent optimization campaign. After 25 total experiments, the algorithm predicted an optimum yield of 68.9 ± 4.7% at the following conditions 38.7 °C, 29.9 min, and 100% LED brightness (Fig. 7 and Table S6†).
Fig. 7(d and e) shows the quadratic response surface plots of the predicted yield of 6 for each base. The model parameters and their associated uncertainties can be found in the ESI† (Tables S7 and S8). Predicted yield is plotted on the vertical axis as a continuous function of temperature and residence time for each base, whereas different surfaces correspond to low (60%), medium (80%), and high (100%) LED power. In contrast to the previous Suzuki–Miyaura example, the temperature did not have much of an effect on yield of 6 with either base. Although longer residence time resulted in higher yield, increasing the LED power had the most significant effect on yield of 6, demonstrated by the roughly 10% spacing in yield between the surfaces for K2CO3. By extrapolating this trend, it seems likely that if the photon flux was increased beyond the maximum capacity of the LED array used in this study, higher yields would be achieved. However, it might lead to heat generation and it would demand higher capacity heat removal system. Although this was not pursued, it is worth mentioning that including LED power as a variable was beneficial since it highlighted the possibility of achieving higher yields for this reaction with LED array improvements.
![]() | ||
Fig. 8 Model reaction for multi-objective Bayesian optimization (top) and optimization variables and objectives (bottom). |
At the outset of the Dragonfly Bayesian optimization, a number of discrete and continuous variables were defined, specifically, photocatalyst 1 and 2 were discrete variables, and residence time, temperature, and solvent mixture composition represented continuous variables (Fig. 8). The solvent mixture composition variable was defined as percent ethyl acetate in a mixture of ethyl acetate and dimethylacetamide (DMA). These solvents were selected with consideration given to a prior report in which DMA was determined as the optimal solvent for flow conditions.65 However, in the same study, preliminary batch reaction results indicated ethyl acetate might be a superior solvent but was not suitable for flow chemistry, owing to the formation of insoluble particulates over the course of the reaction. Because the developed experimental system in this report tolerates the formation of solids, examining this solvent was no longer problematic. Further, it was possible that tuning the solvent mixture could increase yield or diastereoselectivity. As such, the bounds of percent ethyl acetate were set from zero to 70%. The choice of nickel source, ligand, and base were kept consistent with previous studies.
The optimization process proceeds first with the selection of initialization points. Continuous variable values are selected using Latin hypercube sampling, while discrete variable values are randomly sampled. The selection process was repeated until four initialization reactions with each photocatalyst 1 and 2 were included in the initial set. In this way, equal coverage over the discrete variables is obtained. The initial reaction conditions and reaction outcomes are tabulated in runs 1–8 of Table 1. In order to provide a convenient continuous output for diastereoselectivity (dr), it was converted to the percent trans-isomer formed, whereas yield refers to the total yield of both diastereomers.
Initialization runs | Refinement runs | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Run no. | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 |
Photocatalyst ID | 2 | 1 | 2 | 1 | 1 | 1 | 2 | 2 | 2 | 1 | 2 | 1 | 2 | 2 | 2 |
Temperature (°C) | 41.3 | 52.8 | 37.1 | 38.3 | 43.7 | 48.1 | 50.8 | 45.5 | 44.9 | 54.2 | 45.1 | 54 | 36.1 | 45.6 | 45.3 |
Residence time (min) | 19.8 | 25.1 | 21.6 | 15.3 | 27.7 | 17.4 | 29.1 | 27.0 | 21.6 | 28.7 | 21.9 | 27.7 | 20.8 | 15.2 | 27.5 |
EtOAc (%) | 10.2 | 4.2 | 34.7 | 43.7 | 22.5 | 57.7 | 67.5 | 38.7 | 47.7 | 42.5 | 24.5 | 45.4 | 30.5 | 29.5 | 64.0 |
Yield (%) | 50.7 | 57.5 | 72.5 | 52.3 | 57.0 | 39.1 | 60.8 | 14.4 | 58.0 | 64.0 | 64.3 | 66.0 | 83.0 | 77.0 | 83.0 |
Selectivity (% trans) | 81.7 | 81.0 | 73.4 | 76.1 | 76.9 | 76.8 | 73.0 | 85.1 | 81.0 | 84.0 | 79.4 | 82.0 | 79.0 | 78.0 | 77.0 |
A number of interesting observations become evident upon examination of the initialization data. First, even at the outset of experimentation, higher yields were obtained with respect to the previous report, as observed in run 3. Second, it is clear that the multidimensional relationship governing how catalyst choice, solvent composition, reaction time, and temperature influence reaction outcome is not intuitive. For example, one would expect the diastereoselectivity to generally decrease as temperature increases. However, comparison of runs 1 and 2 with 3 and 4 defies this expectation, with reactions at higher temperature resulting in higher selectivity. It is likely that this is a result from the difference in solvent composition. Finally, when comparing runs 3 and 8, it is likely that an error occurred during run 8, as the reaction conditions are relatively too similar to reasonably account for the difference in reaction outcome. Rather than discarding or repeating these conditions, this run was left in to assess the tolerance of the optimization process to experimental error.
The refinement runs were executed using a batch size of 2 reactions. Notably, after completion of run 10, the next batch included two nearly identical reaction conditions within 1 unit of each other. In this case, only one condition was manually chosen and performed before selecting the next batch. This manual intervention can be automated by computing the distance between points to see if it falls below a certain threshold. The first four refinement runs (runs 9–12) select both catalysts, and it appears that a local maximum for catalyst 1 is identified in run 12. Run 13 discovers conditions which give higher yield but lower selectivity. To assess the convergence behaviour of the multi-objective algorithm, the hypervolume indicator (which corresponds to the area enclosed by the current Pareto points) was computed after each refinement experiment (Fig. 9b). While there was modest improvement during runs 9–12, the identification of Pareto optimal condition 13 led to a significant increase in the hypervolume. The optimization was terminated after run 15 since high-performing conditions that are likely sufficiently close to the true Pareto front had been identified within our experimental budget (Fig. S12†). Given the total quantity of the desired stereoisomer formed (i.e., yield x selectivity), condition 13 was selected as the optimal reaction conditions. Notably, the best reaction conditions (run 13) are very similar to the conditions evaluated in the erroneous reaction (run 8). Thus, despite this erroneous data point, the algorithm was still able to find a high-performing condition. This bodes well for future use of the algorithm, as it demonstrates robustness to error and noise arising from run-to-run reproducibility common to many optimization campaigns. This is clearly indicated when visualizing the results in a yield vs. selectivity plot (Fig. 9a). The initialization runs generally have lower yield and selectivity, whereas the refinement runs aggregate at the top right corner, optimally balancing yield and selectivity. To verify the conditions in run 13, the same reaction conditions were run for two residence times. The crude mixture was diluted with water, extracted with ethyl acetate, concentrated and purified to afford 252 mg of the desired product (78% yield) in 4:
1 dr.
Footnote |
† Electronic supplementary information (ESI) available: Details of photoreactor fabrication, experimental setup, operation of automated optimization system, design of experiments (DoE), experimental data and analytical methods. See DOI: https://doi.org/10.1039/d2re00054g |
This journal is © The Royal Society of Chemistry 2022 |