Open Access Article
Cameron T.
Armstrong‡
a,
Cailean Q.
Pritchard‡
b,
Daniel W.
Cook
a,
Mariam
Ibrahim
a,
Bimbisar K.
Desai
a,
Patrick J.
Whitham
c,
Brian J.
Marquardt
c,
Yizheng
Chen
a,
Jeremie T.
Zoueu
a,
Michael J.
Bortner
*b and
Thomas D.
Roper
*a
aChemical and Life Science Engineering, Virginia Commonwealth University, Richmond, VA, 23219 USA. E-mail: tdroper@vcu.edu
bDepartment of Chemical Engineering and Macromolecules Innovation Institute, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24061 USA. E-mail: mbortner@vt.edu
cMarqMetrix, Inc., Seattle, WA, 98103 USA
First published on 30th January 2019
Continuous flow chemistry has the potential to greatly improve efficiency in the synthesis of active pharmaceutical ingredients (APIs); however, the optimization of these processes can be complicated by a large number of variables affecting reaction success. In this work, a screening design of experiments was used to compare computational fluid dynamics (CFD) simulations with experimental results. CFD simulations and experimental results both identified the reactor residence time and reactor temperature as the most significant factors affecting product yield for this reaction within the studied design space. A point-to-point comparison of the results showed absolute differences in product yield as low as 2.4% yield at low residence times and up to 19.1% yield at high residence times with strong correlation between predicted and experimental percent yields. CFD was found to underestimate the product yields at low residence times and overestimate at higher residence times. The correlation in predicted product yield and the agreement in identifying significant factors in reaction performance reveals the utility of CFD as a valuable tool in the design of continuous flow tube reactors with significantly reduced experimentation.
In order to achieve these improved processes, it is critical that careful and thorough studies of reaction conditions are carried out.10,11 Fortunately, continuous flow chemistry lends itself to efficient experimentation as a result of its capability for changing multiple variables in sequence.1–5 For instance, the flow rate of a single reagent and the temperature of a reactor can be varied to collect data over a range of operating parameters without having to modify the experimental setup. Additionally, the ease of scalability of continuous flow chemistry allows for process development to be performed at lower scales before transitioning to a production scale, further reducing time and waste.2
Even with the use of continuous flow chemistry, experimental optimization can be tedious and potentially expensive if costly reagents must be used. This is particularly true when using the common one-variable-at-a-time (OVAT) approach.11 An alternative approach to OVAT-based experimental optimization is statistical design of experiments (DoE). DoE utilizes statistical modeling to identify the variables which have a significant effect on a chosen response. The implementation of fractional factorial design is particularly useful in screening studies as it reduces the number of experiments needed to identify those significant factors while elucidating interactions between the variables that the OVAT approach is unlikely to uncover.10,11 Even with the fractional factorial design, DoE requires significant experimentation, time and resources.
In contrast to experimental approaches, computational simulations allow for optimizing many variables without necessitating the consumption of chemical resources. The modeling of organic reactions can be challenging due to complex reaction profiles which can result in radial gradients of temperature and concentration complicating the use of commonly used flow reactor design equations.12 Nevertheless, the impact of these gradients can be determined through numerical methods such as computational fluid dynamics (CFD). These methods incorporate multiple coupled transport equations which govern the multiple reactant, diluent, and product species' behavior throughout the flow reactor.13 With the ever-increasing availability of computing power, CFD has been growing in popularity in recent years for simulating complex pharmaceutical synthesis reactions saving time and costs versus experimental based approaches.14–17
Both DoE and CFD are widely reported in the literature for process optimization;11,14,18–20 however, they have often been used tangentially, in that CFD is used to provide a possible theoretical justification for the experimental work.17,21–23 The aim of this work is to evaluate the capabilities of CFD for screening the effect of various experimental conditions on product yield. By comparison of CFD and an experimental DoE, the predictive capabilities of CFD are evaluated both in absolute prediction of product yields as well as the identification of the most significant experimental factors which impact the product yield.
For the current work, the comparison of results obtained for CFD and DoE was conducted for the reaction shown in Fig. 1. This reaction is the first step in the GlaxoSmithKline synthesis of cabotegravir24 as well as the recently reported flow synthesis of dolutegravir (DTG),25 an HIV integrase inhibitor that is recommended as part of a universal first line combination therapy for treatment of HIV-AIDS.26,27
![]() | ||
| Fig. 1 Reaction scheme studied for the synthesis of a DTG intermediate.24 | ||
| Factor | High | Low |
|---|---|---|
| Length, L (m) | 5 | 1 |
| Inner diameter, ID (mm) | 1 | 0.25 |
| Flow rate, Q (mL min−1) | 1 | 0.1 |
| Temperature, T (°C) | 40 | 10 |
| Molar ratio, χ | 1.5 | 0.95 |
Design-Expert software (Version 10, Stat-Ease, Minneapolis, MN) was utilized for the design and statistical analysis of the experiments. Each experiment was performed in triplicate (48 total runs) and completed in random order as specified by the design software. Center-points for this DoE could not be properly implemented due to a restraint on tubing ID; the true midpoint of 1.0 mm and 0.25 mm ID values is 0.625 mm which was not readily available.
CFD simulations were performed at the intended experimental DOE conditions so that direct comparisons could be made. While the factors and levels were identical, a full-factorial design was used for CFD simulations as the additional data points only required additional computation time and replicates were not needed.
For both the experimental and CFD DoE analyses, the percent yield of Enamine (3) was used as the response. The significant factors identified in the two DoE analyses were then compared to evaluate the applicability of CFD for optimization of reaction parameters.
:
1 and 2
:
1 DMF-DMA
:
M4MAA. Multivariate curve resolution-alternating least squares (MCR-ALS) was used to extract reaction trends for each reactant.29,30 The reaction was found to follow second-order reaction kinetics (first order in both reactants) and an Arrhenius plot (depicted in Fig. S3†) was used to determine the activation energy (Ea = 57.93 kJ mol−1) and the pre-exponential factor (A = 2.00 × 108 L mol−1 min−1). In situ infrared spectroscopy using a ReactIR 15 equipped with a DST series fiber conduit probe (Mettler Toledo, Columbus, OH) was used to validate the kinetic results.
The specific heat of each reaction component and the heat of reaction were experimentally determined using the EasyMax batch reactor and HfCal calorimetry unit (Mettler Toledo, Columbus, OH). The viscosities of each reaction component, listed in Table S2† were found using a Brookfield LVDV-E viscometer with a Brookfield SC4-18 spindle and 13R sample chamber. Samples were tested with 6.7 mL over a range of shear rates from 0.4 to 130 s−1.
| Reactor length (m) | Reactor ID (mm) | Mesh elements |
|---|---|---|
| 1 | 1 | 18 327 |
| 1 | 0.25 | 81 188 |
| 5 | 1 | 103 154 |
| 5 | 0.25 | 403 962 |
The simulations were carried out using a supercomputing cluster of 96 cores of Intel® Xeon® 2× E5-2680v3 @ 2.5 GHz CPU with 128GB RAM @ 2133 MHz and were completed in 360 CPU hours (Advanced Research Computing, Blacksburg, VA). Product yields were then determined by the ratio of Enamine (3) to the limiting reagent at the output of the reactor.
![]() | (1) |
![]() | (2) |
![]() | (3) |
, was applied to simulate the axial symmetry of the reactor, where f is each respective transport variable (T, u, and N) and r is the radial spatial coordinate. The outside walls of the tubing in the simulation were set to the desired reactor temperature and kept constant over the course of the reaction. This mimics the water bath used in the experimental reactor design.
:
1 DMF-DMA
:
M4MAA), and greatest temperature (40 °C) was subjected to a sweep of diffusivities ranging from 1 × 10−6 to 1 × 10−12 m2 s−1. As shown in Fig. S10,† product yield is independent of diffusion coefficient signifying that the system is not limited by mass transfer effects and is instead kinetically limited.
Low Reynolds numbers, on the order of unity, are observed within our simulated reactor signifying laminar flow throughout the straight channels. High Péclet numbers, 2.8 × 103–3.5 × 105, at the tee junction show that convection dominates significantly over diffusive transport regardless of the relatively slow flow rates. The combination of high Péclet number and low Reynolds numbers has been shown to exhibit stable chaotic advection arising from the sharp 90° turn in the tee.33 As a result, convective transport remains dominant over diffusive transport validating our assumption that mixing completes quickly after the tee junction. The three-dimensional simulation, Fig. 3, further confirms that the fluid was mixed approximately 2 cm after the tee. A comparison of reactors with and without this tee junction result in minimal deviation in product yield (<0.1%). Therefore, we neglected the tee junction in further simulations and assume completely and instantaneously mixed reactant flow through the inlet of the reactor.
At the inlet of the reactor, a radial and axial temperature gradient was observed due to the introduction of room temperature reactants into either a cooled or heated water bath per the design conditions of our experiments. These gradients dissipated after approximately 0.2 m and 0.5 m from the reactor inlet corresponding with simulations with low and high fluid velocity. At this point an isothermal reactor state was seen throughout the remainder of the reactor. The exothermic nature of our reaction had a meaningful impact on the heat transfer profile of our reactor near the inlet, however, the reduced reaction rate in the second half of the reactor led to a decrease in its significance.
Product yields were calculated as the ratio of Enamine (3) to the limiting reagent (as dictated by stoichiometry) at the output of the reactor at steady state. Transient data, Fig. 4, was also collected to observe the startup behavior of the reactor. It was found that a steady state solution was obtained after 2.5 residence times for all reaction conditions.
This CFD simulation was utilized to simulate a full factorial DoE to identify which reaction parameters significantly impact product yield. Representative results are shown in Fig. 5 to highlight the effects of molar ratio and temperature over the length of the reactor. Full results are shown in Fig. S6–S9.† Increasing temperature and molar ratio of DMF-DMA
:
M4MAA was found to increase product yield through an increase in reaction rate.
| Statistical term | Experimental | CFD |
|---|---|---|
| p-Value (F-test, α = 0.05) | <0.0001 | <0.0001 |
| F-Value (α = 0.05) | 56 | 96 |
| p-Value (lack-of-fit test, α = 0.05) | 0.27 | |
| Lack-of-fit F-value (α = 0.05) | 1.4 | |
| Pure error | 0.012 | |
| R 2 | 0.93 | 0.97 |
| Adj. R2 | 0.91 | 0.96 |
| Pred. R2 | 0.89 | 0.94 |
The DoE models containing the statistically significant factors for the experimental analysis and CFD simulations are shown in eqn (4) and (5), respectively. The relative importance of each factor can be deduced by the magnitude and sign of each coefficient. Considering that these are screening studies, the comparison between CFD and experimental model coefficients is limited to an order of magnitude analysis.
| Log10(%yieldExp.enamine) = 0.13L + 0.25ID − 0.19Q + 0.13T + 0.0013χ − 0.039ID × Q + 0.043ID × T − 0.041Q × χ + 0.047T × χ + 1.09 | (4) |
| Log10(%yieldCFDenamine) = 0.29L + 0.50ID − 0.41Q + 0.40T + 0.78 | (5) |
ANOVA for both the experimental and CFD models indicates that both are significant with no significant lack-of-fit (F-value ∼1) for the experimental model.10 Lack-of-fit was not calculated for CFD due to a lack of replicates since repeated simulations would result in identical values. The experimental and CFD models also have R2 and adjusted R2 values that are in good agreement (difference less than 0.2) as well as high predicted R2 values, indicating that the models are appropriate.10 The experimental and CFD model coefficients and their respective p-values are listed in Table 3. Both CFD and experimental outputs identified tubing length, tubing ID, flow rate and temperature to be significant. Of the four single-factor terms, length, ID, and flow rate are components of residence time. The magnitude by which each term affects residence time is reflected in the coefficient of each factor in Table 4. ID is related to reactor volume, and thus residence time, through a squared term. Flow rate is inversely related to residence time and tubing length is directly proportional to residence time as a multiplicative term. According to the DoE model, increasing ID and reactor length (i.e., increasing residence time) increases product yield. Thus, the residence time is the most impactful factor to product yield, along with temperature. Temperature is the fourth single-factor term, owing to faster reaction kinetics at higher temperatures. The disparity in the significance of temperature and reactor length between the two models may be explained by the CFD simulation boundary condition depicting perfect heating throughout the reactor, whereas in the experimental setup it is likely that temperature gradients and imperfect heating occurred within the water bath and reactor coil, reducing the significance of temperature.
| Experimental model | CFD model | |||
|---|---|---|---|---|
| Factor | Coefficientsa | p-Value prob. > F | Coefficientsa | p-Value prob. > F |
| a All coefficients are log-scaled. b Included to maintain hierarchy. | ||||
| A-length (L) | 0.13 | <0.0001 | 0.29 | <0.0001 |
| B-inner diameter (ID) | 0.24 | <0.0001 | 0.50 | <0.0001 |
| C-flow rate (Q) | −0.19 | <0.0001 | −0.41 | <0.0001 |
| D-temp (T) | 0.13 | <0.0001 | 0.40 | <0.0001 |
| E-molar ratio (χ) | 0.013 | 0.41b | ||
| BC | −0.039 | 0.020 | ||
| BD | 0.043 | 0.011 | ||
| CE | −0.041 | 0.016 | ||
| DE | 0.047 | 0.0061 | ||
| Intercept | 1.09 | 0.78 | ||
The experimental DoE identified four statistically significant (α < 0.05) interaction terms while the CFD results did not detect any significant inter-factorial interaction. Reactant molar ratio was found to be insignificant as a single factor in the experimental DoE model; however, it is included because of its interaction with both temperature and flow rate. Fig. 7 represents a surface and contour plot of the interaction between molar ratio and temperature. At lower temperatures, molar ratio has no effect on product yield; however, at higher temperatures product yield is influenced by molar ratio. Thus, this interaction term was included in the experimental model to accurately predict the product yield.
Fig. 8 illustrates the CFD predicted Enamine (3) yield versus the experimental yield. This prediction plot is best interpreted as two distinct linear trends, where the lower yield values show a slope of 0.85 and the higher yield values give a slope of 1.31. The critical point between underestimation and overestimation occurs at approximately 20% yield.
In order to understand the nature of these prediction errors, the percent yield of Enamine (3) along with the corresponding CFD simulation results with respect to residence time are shown in Fig. 9. As expected, increased temperatures and longer residence times lead to higher yields and higher absolute prediction errors. These errors were as low as 2.4% yield at low residence times (<5 min) and up to 19.1% yield at high residence times (>8 min). The crossover from under to overestimation is again observed to take place at approximately 20%. The abundance of the unknown impurity was also found to be the greatest (≥2%) at these longer residence times, suggesting that the increase in impurity concentration contributes to the deviation between the simulation and experimental results.
![]() | ||
| Fig. 9 Product yield as a function of residence time – CFD simulated data (■), experimental data (●), solid symbols = runs at 40 °C, open symbols = runs at 10 °C. | ||
Footnotes |
| † Electronic supplementary information (ESI) available: (1) Experimental determination of kinetic parameters, (2) molar ratio calculations, (3) CFD transport equations, (4) fluid and material properties, (5) mesh independence study, (6) numerical diffusion analysis, (7) complete CFD yield results, (8) DoE Pareto plots. See DOI: 10.1039/c8re00252e |
| ‡ These authors contributed equally to this work. |
| This journal is © The Royal Society of Chemistry 2019 |