Open Access Article
Alexandra P.
Platt
a,
Heidi
Klem
b,
Sam J. B.
Mallinson
c,
Yannick J.
Bomble
*c and
Robert S.
Paton
*a
aDepartment of Chemistry, Colorado State University, Fort Collins, CO 80523, USA. E-mail: robert.paton@colostate.edu
bThermodynamics Research Center, Applied Chemicals and Materials Division, National Institute of Standards and Technology, Boulder, CO 80305, USA
cBiosciences Center, National Renewable Energy Laboratory, Golden, CO 80401, USA. E-mail: yannick.bomble@nrel.gov
First published on 4th April 2025
Cell-free biocatalysis (CFB) is an efficient and environmentally friendly method to synthesize molecules such as pharmaceuticals, biochemicals, and biofuels through the in vitro use of enzyme cascades. These enzymes often require redox cofactors to drive chemical reactions. Natural redox cofactors (NAD(P)H) are expensive to isolate, motivating synthetic nicotinamide cofactor biomimetics (NCBs) as a cost-effective solution. A select handful of NCBs have been identified as potential NAD(P)H alternatives with comparable or improved redox capabilities, however, they display a tendency to degrade in common buffers. In this study, a library of 132 NCB candidates is systematically generated, over 85% of which have not been characterized in the literature, to expand the diversity of currently explored NCBs. The decomposition mechanism of NCBs in phosphate is evaluated using density functional theory (DFT), revealing protonation at the nicotinamide C5 position as a reporter of cofactor stability. Based on this result, we trained a linear regression model on DFT calculated descriptors to predict NCB stability in phosphate buffer, achieving mean absolute error (MAE) and root mean squared error (RMSE) values within computational accuracy. Analysis of key atomic descriptors and qualitative trends in our dataset informed the design of novel NCB candidates we propose with optimized stability. This work enables researchers to predict the relative stability of NCBs before synthesis, thereby streamlining the process to make CFB more affordable and viable at industry scales.
Green foundation1. This work systematically expands the chemical diversity of nicotinamide cofactor biomimetics (NCBs) and develops a computational model to predict chemical stability in phosphate buffer from calculated descriptors. The design of NCBs is crucial for large-scale implementation of cell-free biocatalysis, a chemical synthesis strategy popular for its low environmental impact and high yields.2. Our NCB stability model provides an efficient and accurate (MAE 0.61 kcal mol−1) process to filter the candidate pool, informing molecule selection for experimental synthesis and characterization. This work revealed important stability design features: an aryl N-substituent on the pyridine ring, an electron withdrawing C3-substituent, and a methyl C5-substituent. 3. Future work will explore additional properties of nicotinamide cofactor biomimetics, such as redox potentials and enzymatic binding affinity, to further promote cell-free biocatalysis at an industrially viable scale. |
NCBs are small molecules designed to mimic the redox role of natural nicotinamide cofactors, while providing advantages such as ease and low cost of manufacturing, and tunable enzyme specificity to enable bioorthogonal redox cascades.13,18–20 Simple NCBs contain a central pyridine ring to maintain the hydride transfer ability of their natural counterparts (Fig. 1A), and various ring substituents, especially at the nitrogen position, are introduced to tune chemical properties. Fig. 1B displays sixteen well-characterized cofactor mimics found in the literature,16,19,21 three of which contain variations in the R3 substituent such that they are not nicotinamide structures, however, they are still classified as NCBs for their ability to replace NAD(P)H. Not only are NCBs able to replicate the catalytic reductive properties of NAD(P)H, but they can, in principle, be used in key immobilization processes like those using nanostructures or hydrogels.22 Many ene-reductases and some oxidoreductase enzymes have been shown to readily accept NCBs as redox cofactors.23 Other types of enzymes do not always naturally accept NCBs, either due to poor binding or incompatible reduction potentials, but activity can be engineered for these synthetic cofactors.15,24–27
![]() | ||
| Fig. 1 (A) Reduction of NAD+. (B) Selected NCBs previously seen in literature.16,19,21 The 1,4-dihydropyridine in both NAD(P)H and NCBs, shown in teal, is featured as the one constant design component throughout this work. | ||
The use of NCBs makes CFB more industrially feasible due to their reduced cost compared to the natural nicotinamide cofactors. However, many NCBs tested to date are also at risk of decomposing at the nicotinamide moiety in buffers used in CFB.28,29 This decomposition risk also applies to immobilized NCBs in reactors. Due to the difficulty of replacing decomposed cofactors in reactors, closer examination of NCB decomposition is crucial to the implementation of NCBs for industrial uses. The proposed mechanism of nicotinamide decomposition by Alivisatos et al. is through addition across the 5,6-double bond in the 1,4-dihydropyridine ring, depicted in Scheme S1.†29 This decomposition occurs most quickly in a potassium phosphate buffer,14,16 which often yields high enzyme activity.23 To further promote the applicability of NCBs in CFB, we propose their stability should be included as a feature to optimize during engineering campaigns.
Another opportunity for improvement in the NCB field is expansion of the chemical space. Within the realm of simple NCBs, described by Black et al. as structures which do not contain nucleotide components, there are vast possibilities for the R1 and R2 substituents (N-substituents and C3-substituents, respectively).25 However, only a small number of NCBs have been synthesized and characterized in vitro. Tan et al. investigated chemical and physical properties of NCBs with a methyl R3 substituent (C5-substituent), pointing out the lack of research into modifications at the R3 position.16 However, they proposed adding an R3 methyl substituent to only two NCB structures of the eleven they report. Additionally, R2 variants have only been examined for NCBs with a benzyl R1 substituent. In our literature review, we found sixteen simple synthetic NCBs out of which eleven R1, four R2, and two R3 substituents are explored (Fig. 1B).16,19,21 By computationally expanding the possible NCB structures, we can examine a larger area of chemical space than previously available without the need for experimental characterization of potentially unstable structures.3,30,31
In this work, we expand the chemical space of NCB candidates through a combinatorial generation of simple NCBs from the previously evaluated substituents, resulting in a library of 132 NCB candidates (including the sixteen previously proposed) (Fig. 2, Step I). Quantum chemical calculations are used to provide a mechanistic understanding of NCB decomposition in phosphate and determine an approach to conduct high throughput estimates of NCB stability (Fig. 2, Steps II & III). Finally, we curate a dataset of quantum chemical descriptors for each NCB candidate (Fig. 2, Step IV). This dataset is subjected to qualitative and quantitative analysis and evaluated by numerous predictive model schemes (Fig. 2, Step V). We use the results of these analyses to design novel, more stable NCBs. Our work allows quick, high-throughput stability screening of an expanded space of NCB candidates to select top choices for downstream, more costly experimental validation.
36 was used to further refine the ensemble before clustering into 10 or fewer representative structures. Transition state conformers were generated using constrained CREST conformational sampling without CENSO to refine the ensembles. More details can be found in section 4 of the ESI.†
Density functional theory (DFT) calculations were performed using Gaussian 16, Revision C.01 (Fig. 2, Step III).37 All geometry optimizations used PBE0-D3(BJ)/6-31+G(d)/SMD(water) as the level of theory.38–44 Vibrational frequency analyses at the geometry optimization level of theory confirmed the nature of transition structures and ground state structures based on the presence of only one imaginary normal mode or none at all, respectively. Intrinsic Reaction Coordinate (IRC) calculations also used this level of theory to ensure the nature of transition structures during the mechanistic study. Additional single-point energy corrections with larger basis sets were used to obtain more accurate energies. For the calculation of all barrier heights, we used ωB97M-V/def2-TZVP/SMD(water) to obtain these energies.38,45,46 Calculations which used the ωB97M-V functional were done using Orca version 5.0.3
47 with support from libXC version 5.1.0.48 Energy corrections for structures in our library and the newly-designed NCBs were performed with PBE0-D3(BJ)/def2-TZVP/SMD(water). Free energies were calculated and potential energy surfaces constructed using GoodVibes version 3.2.49
Natural bonding orbital (NBO) program version 7.0.5
50 was used to obtain atomic partial charges. NBO partial charges at ten key atoms were gathered from the ground state structure of each reduced cofactor. Additionally, nucleophilic and electrophilic condensed Fukui indices were calculated using NBO partial charges for each after the subtraction and addition of an electron, respectively (Fig. 2, Step IV).51
Predictive modeling was done using ROBERT52 software version 1.0.6 (Fig. 2, Step V). Our final model was trained using DFT-level descriptors after testing numerous model architectures (Fig. S7†) before settling on the highest-performing one, a multivariate linear regression model with a 101
:
18
:
13 train
:
validation
:
test split. Default settings were used except permutation feature importance analysis (PFI). PFI scores were calculated in ROBERT and tabulated, then used to isolate the 8 most important descriptors to train the model. More details about the use of ROBERT software for predictive modeling, including model architectures and data splits tested, permutation feature importance analysis, and verification of model performance and generalizability, can be found in section 7 of the ESI.†
After SMILES (text-based) representations of each NCB in our library were generated, we followed an in-house automated protocol to perform conformational sampling and obtain optimized geometries at the DFT level of theory. This in-house script can be found at https://github.com/aplatt22/ncb_stability. Our conformational sampling procedure, using CREST to generate conformers and CENSO then CREST clustering to refine the ensemble, was used to isolate ten or fewer conformers of each NCB which were used for DFT optimizations and the curation of electronic-structure derived descriptors. A description of all calculated descriptors can be found in section 6 of the ESI.† Principal component analysis (PCA) was applied to visualize the variations within our dataset of NCBs featurized by calculated atomic descriptors (Fig. 3B). The first two principal components account for 71.55% of the total variance in the data, providing a sufficient representation of the diversity in our NCB library. Most of the experimental structures occupy a localized region of the chemical space, alluding to an improvement in molecular diversity. This robust library of structures allows more research into various substituent effects, opening the door for tuning specific properties of NCBs, such as reduction potential and stability in buffer.
![]() | ||
| Fig. 4 Potential energy surface for the decomposition reaction of NCBs for MNAH (A1a), BNAH (F1a), P2NAH (G1a), and P3NAH (H1a). Calculations were performed with the level of theory ωB97M-V/def2-TZVP/SMD(water)//PBE0-D3(BJ)/6-31+G(d)/SMD(water).38–46 | ||
After identifying TS I as a predictor of NCB stability, we sought a simpler stability representation that would be more suitable for high-throughput calculations. Optimization of transition state structures often requires more computationally intensive protocols and manual intervention. Therefore, we investigated whether the free energy difference of this first elementary step (ΔG) was an appropriate alternative. Calculating ΔG requires only the optimization of minima structures and is therefore computationally cheaper. Such an approach is well grounded in studies of catalytic reactivity, where the thermodynamics of an elementary step is used to quantitatively assess the kinetic feasibility, formalized by the Evans–Polanyi principle53 and linear (free) energy relationships (LFERs).54
To ensure that this assumption holds, we optimized transition state structures for a subset of 34 NCBs from our library. These 34 representative structures were selected using a binning technique based on important features of the molecules (details in section 3 of the ESI†). There is a highly correlated LFER between thermodynamic (ΔG) and kinetic (ΔG‡) quantities, demonstrated by a Pearson's correlation coefficient (R2) of 0.92 (Fig. 5 and Table S1†). Furthermore, we compared thermodynamic stabilities of five NCBs with rates of decomposition from literature. Though quantitative decomposition rates were not available, we achieved qualitative accuracy in ranking the stabilities of B1a, B1b, D1a, F1a, and F1b from Tan et al. with 100 mM NCB concentrations.16 Due to high correlation between ΔG and ΔG‡, as well as qualitative agreement with experiment, we decided to model NCB stability through only a thermodynamic analysis of the first step, a procedure that is more amenable to high-throughput computation and analysis.
![]() | ||
| Fig. 5 Correlation between ΔG and ΔG‡ for the first step of NCB decomposition with grey lines denoting 1 kcal mol−1 (4.184 kJ mol−1) greater or less than the linear fit (black) for these data. Values for ΔG were obtained with PBE0-D3(BJ)/def2-TZVP/SMD(water)//PBE0-D3(BJ)/6-31+G(d)/SMD(water).38–44 Values for ΔG‡ were computed with ωB97M-V/def2-TZVP/SMD(water)//PBE0-D3(BJ)/6-31+G(d)/SMD(water).38–46 Shown below the graph are schemes used to calculate each variable. | ||
Having determined an appropriate metric for NCB stability, we analyzed the relative stability of our structures (Fig. S4†). The stabilities of structures in our NCB library range from −141.2 to −121.2 kcal mol−1 (−591.0 to −507.2 kJ mol−1). The majority of our library shows increased stability when compared with a common mononucleotide NCB, nicotinamide mononucleotide (NMNH−), which has a stability of −134.1 kcal mol−1 (−560.9 kJ mol−1). These data suggest that we will be able to design NCBs with higher stability than those currently used in CFB.
To limit computational cost as we moved forward, descriptors were calculated only for the reduced species of each NCB. Using a single species for descriptor calculation also reduces the computational cost of making future predictions with out-of-sample NCBs. We selected the reduced cofactor species for descriptor generation due to its role as the “universal intermediate” for both decomposition and catalytic reduction, allowing this set of descriptors to be used in a future multi-objective optimization for both stability and reduction potential. Permutation feature importance (PFI) analysis was implemented to isolate and use only the most important descriptors for NCB stability to help prevent model overfitting. PFI begins with a model trained on all descriptors, then systematically permutates one descriptor at a time to determine the impact of that descriptor on the overall model. The worse a model performs when a descriptor is permutated, the more important that descriptor is to making predictions. This analysis is automated in ROBERT, resulting in a PFI score for each descriptor (Table S5†). The eight descriptors with the highest PFI scores were used in our model, including atomic charges at N1, C2, C3, C5, and C4H and electrophilic condensed Fukui indices at C3, R2, and C4H. We also performed a comparison of these properties calculated from the lowest energy conformer, which was used in model training, with the Boltzmann weighted properties for our representative subset of 34 NCBs. The maximum differences between values for all atomic charges and electrophilic condensed Fukui indices of these NCBs were 0.001 and 0.008, respectively. The differences are low relative to the magnitude of the values, suggesting we do not need to include the full conformational ensemble for our descriptors.
An array of four model types and seven train
:
validation
:
test splits were used to train possible DFT-level models for predicting stability using ROBERT (Fig. S7†). We selected the highest-performing model architecture after default PFI analysis based on the root mean squared error (RMSE) values of the validation set, a multivariate linear regression (MLR) employing a 101
:
18
:
13 train
:
validation
:
test split (Fig. 6). PFI scores were then further analyzed to limit model training to only the eight most important physical properties of the NCBs. This number of descriptors was selected because it was the lower limit at which we still had high model performance (Table S6†). The final descriptors included in our model are the NBO partial charges at N1, C2, C3, C5, and C4H, as well as f(−) at C3, R2, and C4H. While our list of key descriptors does not include direct contributions from the R1 or R3 substituents, properties from N1 and C5 are used to predict stability, which we believe incorporate effects from their respective substituents. In support of this, descriptors of R3 were highly correlated with the C5 NBO partial charge and not considered for model training. More details regarding model training can be found in section 7 of the ESI.†
![]() | ||
Fig. 6 (A) Parity plot and results for the best-performing predictive model trained with DFT-level descriptors, a MLR with a 101 : 18 : 13 train : validation : test split using the eight most important descriptors. Model performance metrics are shown at the bottom right of the graph for the validation and test sets. (B) Visual representation of the eight descriptors used for model training and MLR equation from standardized descriptor data. Raw weights can be found in the ESI.† | ||
This analysis led to a high-performing model, shown by a high correlation (test set R2 = 0.98), as well as mean absolute error (MAE) and RMSE within DFT accuracy levels (≤3 kcal mol−1, 12.552 kJ mol−1).55 Furthermore, ROBERT automates the additional evaluation of model performance, see https://github.com/aplatt22/ncb_stability/tree/main/Data/ROBERT_Results for results. This extra evaluation includes testing for worse performance when training with mean stability values, shuffled stability values or one-hot encoded descriptors, and performing 5-fold cross-validation to ensure our model is robust. Our model passed these tests, demonstrating a generalizable model with an appropriate train
:
validation
:
test split. The data was also checked for outliers and the ratio of data points to descriptors as part of this additional model evaluation. More details regarding this evaluation can be found in section 7 of the ESI.†
As further assessment of the performance of our model, we calculated Spearman rank-order correlations comparing stability from our model predictions with those from experiment. With the limited experimental stability data available, a problem we address with our computational NCB library, we are unable to compare our model predictions to many experimental stabilities. Using the limited data, we are able to examine rankings of the 5 NCB structures from Tan et al.16 Our model ranks these cofactors B1a < D1a < F1a < B1b < F1b, which is the same ordering found experimentally, earning a Spearman correlation of 1.000. When comparing our model predictions to experimental data of 4 NCBs from Nowak et al.,14 our model predictions showed rankings, in order of increasing stability, of H1a < A1a < G1a < F1a and the experimental rankings are A1a < H1a < F1a < G1a, earning a Spearman correlation of 0.600. The rankings for the Nowak et al. experimental results were not predicted as well as for the Tan et al. results, which can be explained by the nature of the experimental stability data. For the Nowak et al. set of NCBs, the range of calculated stabilities is only 1.9 kcal mol−1 (8.0 kJ mol−1), while the Tan et al. set has a range of 5.2 kcal mol−1 (22.0 kJ mol−1). Compared to our full computational library, which has a range in stabilities of 20.0 kcal mol−1 (83.8 kJ mol−1), this subset of experimental stability data is not very diverse, so this available experimental data gives limited insight for our chemically diverse dataset. Additionally, all of the structures for which we have experimental stability data are similar, each with 1 for the R2 substituent, only examining different R3 substituents for 2 R1 substituents, and only sampling 5 R1 substituents of the 11 possibilities, further demonstrating the limited insight from comparing model results with current experimental data.
Finally, we tested our model performance with a Spearman correlation over our full NCB library, which a gave rank-order correlation of 0.974 when comparing model predictions to calculated stabilities, demonstrating the ability of our model to correctly order NCBs with respect to their relative stabilities (Table S10†). This feature is ideal for an optimization protocol, where only the most stable NCBs are considered for experimental validation, so getting the correct stability ranking is as effective as predicting the actual value. We also explored a model trained with semi-empirical descriptors in an attempt to reduce the computational cost of making predictions, but the difference in performance relative to the cost of each method was not enough to favor the semi-empirical model (see section 7 of the ESI†).
We performed a similar analysis of the R2 substituent (Fig. 7B), showing that the aldehyde moiety (4) tends to have higher stability than the other functional groups represented in our library at this position. The five most stable NCB structures in our library, and fifteen of the twenty most stable, have 4 as the R2 substituent. We hypothesize that this trend is due to the highly electron-withdrawing nature of aldehydes. Though 4 is not a substituent previously evaluated in literature, we added it to our library as an additional substituent to determine if the electron-withdrawing characteristics made a large impact on NCB stability. Based on these data, this is a key design feature in optimizing stability in NCBs. The next most stable substituent, 2, is also highly electron-withdrawing as a nitrile, which has been evaluated previously.
Finally, it is clear when looking at the relationship between C4H partial atomic charge and stability colored by R3 substituent (Fig. 7C) that NCBs with an R3 methyl substituent have higher stability than those with only hydrogen at that position. While examining NCBs with similar C4H partial charges, those with b at R3 demonstrate greater stability. Furthermore, our eleven most stable NCBs have a methyl substituent at the R3 position. Tan et al. hypothesized that this improved stability is due to kinetic effects,16 and we are still able to replicate this trend even though our analysis is strictly thermodynamic. This observation could be caused by additional steric effects, such as lower flexibility in the 1,4-dihydropyridine ring with greater congestion at the C5 carbon with a methyl substituent rather than a hydrogen.
In addition to altering the R1 substituent, we also looked at how changing the R2 and R3 substituents impacted stability. To do this, we designed four structures with new R2 substituents. These moieties included a thioaldehyde (R2sub-I), thioketone (R2sub-II), trifluoromethyl (R2sub-III), and nitro (R2sub-IV). Thioamides have literature precedent as dinucleotide biomimetic cofactors, which share most of their structure with NAD(P)H, and are proposed to behave similarly to their amide counterparts but have higher reduction potentials than the natural cofactors.57 We selected a thioketone rather than a thioamide because ketones demonstrate higher stability than amides in our library. Additionally, the increased stabilities shown by R2 aldehydes led us to also choose a thioaldehyde due to the increased stability shown by aldehydes in our study. Electron-withdrawing groups seem to improve stability at the R2 position, with aldehydes performing well, so we selected a highly electron-withdrawing substituent, trifluoromethyl, to further investigate this trend. This reasoning also led us to design an NCB with a nitro R2 substituent. Additionally, we designed two structures with a trifluoromethyl R3 substituent (R3sub-I & R3sub-II). We selected this moiety as a possible R3 substituent to test if electron-withdrawing effects at this position would have any influence on stability in addition to the bulkiness of the substituent. Trifluoromethyl substituents have only a slightly larger size than the previously tested methyl substituent but are more electron-withdrawing.
The new proposed NCBs were among the most stable tested, with three structures surpassing stability of those currently in our library: R3sub-II (−117.1 kcal mol−1, −490.1 kJ mol−1), R2sub-IV (−120.0 kcal mol−1, −502.3 kJ mol−1), and R1sub-IV (−117.3 kcal mol−1, −490.9 kJ mol−1) (Fig. S11†). The increased stability of these NCBs relative to the remainder of the library is promising since it shows that we can use trends in existing data to propose and successfully design new NCBs with enhanced stability. Furthermore, the higher stability seen with the new substituents at each position supports our hypotheses that NCBs with aryl R1 substituents, electron-withdrawing R2 substituents, and bulky and/or electron-withdrawing R3 substituents are likely to be stable structures. With this new information about how substituent identity and properties at each position impact stability, we can make informed decisions to eventually suggest novel NCBs which will be more resistant to decomposition via C5–C6 saturation.
In addition to high thermodynamic stability, these out-of-sample NCBs also show high kinetic stability and correlation between ΔG and ΔG‡, as was determined with NCBs in our library (R2 = 0.92) (Fig. S12 and Table S12†). We also used these novel structures to test our model's performance with out-of-sample structures. All except three of these out-of-sample NCBs showed accurate predictions (<3 kcal mol−1, 12.552 kJ mol−1, prediction error), with each poor prediction occurring for structures with out-of-sample R2 groups. The accurate predictions on unseen substructures gives confidence in the predictive ability of our model. Additionally, the model tends to predict stabilities higher than the benchmark DFT calculations, especially for the three out-of-sample structures errors in stability prediction for these out-of-sample structures assume higher stability than calculated using DFT, a trend present for all except one structure. Errors tending this way reduce the chance that synthetic chemists disregard highly stable structures due to our model when choosing stability-ranked NCB candidates to characterize experimentally.
However, there were also some structures which showed low model accuracy (error >3 kcal mol−1, 12.552 kJ mol−1), likely due to the type of descriptors utilized to make predictions and absence of sufficient structure representations in the training data. When analysing these structures with poor model accuracy, R2sub-I (error of 13.8 kcal mol−1, 57.7 kJ mol−1), R2sub-II (error of 8.0 kcal mol−1, 33.5 kJ mol−1), and R2sub-IV (error of 10.2 kcal mol−1, 42.6 kJ mol−1), we found that each NCB had at least three descriptors used in the model which were outside the range seen by the initial library. For example, R2sub-I and R2sub-II each had f(−) values at the R2 substituent (−0.09 and −0.03, respectively) that were outside the observed range (−0.02 to 0.02), likely causing poor stability predictions in these structures. The other NCB which had a poor prediction was R2sub-IV. Our model likely failed in making an accurate prediction of stability for R2sub-IV because all other structures within our library have a neutral carbon atom at R2, so the electronic descriptors at this atom and C3 (directly bound to R2) are beyond the scope of our trained model.
While the expansion of our NCB space with these out-of-sample structures showed that our model carries predictive power even with substrates it was not trained with, the model can be adapted over time through expansion of the training data, which would be recommended if a substituent chemically distinct from our current library is found to be relevant in NCB design. Nevertheless, our model tends to predict stabilities higher than the benchmark DFT calculations, especially for the three high-error out-of-sample structures. This error trend reduces the chance that a highly stable structure would fail to advance to an experimental stage in a high-throughput NCB discovery pipeline, and even though some “false positives” might appear in the top ranks, our results show that is unlikely. For example, if an experimental group chose to synthesize and characterize the top 10% of stability-ranked structures, including the out-of-sample designs, this would result in two of the fifteen candidates (rounded up), with DFT calculated ΔG values that would have excluded them from the top 10%, but that still place them in the top 20%. The design of our computational workflow and ability of our model to predict the correct rankings of stability, even of newly designed NCB structures, reduces the chance that experimental chemists create additional reaction waste by synthesizing NCBs with low stability.
Using our NCB library and simplified model of stability, we fit a regression model on DFT-calculated electronic descriptors to predict NCB stability in phosphate buffer and used the results of our model to design new NCBs proposed to demonstrate higher stability. Our model has high accuracy, shown by low MAE and RMSE values, as well as high ranked-order correlations, demonstrating successful rankings of NCB stabilities, even if we cannot predict the correct quantitative stability. Using trends in key descriptors, we drew qualitative conclusions about designing stable cofactors and tested these hypotheses through the design of novel NCBs, most of which showed high stability. We also tested these structures as out-of-sample predictions for our predictive model, through which we discovered strengths and weaknesses of the model. In the future, we also hope to expand the library of NCBs to improve generalizability of our predictive models.
As shown by this work, the stability of NCBs can be successfully tuned. We now have an initial pass at designing more effective NCBs, narrowing the search to only NCBs which demonstrate stability. Following this work, we can continue down the molecular design pathway, focusing on cofactor activity and optimization of other desirable properties. Additionally, this work integrates only publicly available packages into an automated workflow available at https://github.com/aplatt22/ncb_stability, highlighting the transparency and reproducibility of this work. Future work in this area can be efficiently adapted to build predictive models for other key NCB properties, such as reduction potential or solubility. The binding affinity of NCBs within an enzyme active site has been studied through enzyme engineering,58 so provided that the NCBs demonstrate high stability and solubility, as well as an optimized reduction potential, we are confident that the NCBs can successfully be used for industrial CFB. This work provides a robust computational study of new NCB structures with the goal of motivating experimental validation, a direction currently pursued. Tuning NCBs and their properties on a case-by-case basis with workflows like ours will increase the usefulness of CFB, reducing waste from synthesis of NCBs known to have low stability and making the method more efficient and feasible for use in industrial settings.
This work was authored in part by the National Renewable Energy Laboratory, operated by Alliance for Sustainable Energy, LLC, for the U.S. Department of Energy (DOE) under Contract No. DE-AC36-08GO28308. Funding provided by U.S. Department of Energy Office of Energy Efficiency and Renewable Energy under the Cell Free and Immobilization Technologies initiative. The views expressed in the article do not necessarily represent the views of the DOE or the U.S. Government. The U.S. Government retains and the publisher, by accepting the article for publication, acknowledges that the U.S. Government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this work, or allow others to do so, for U.S. Government purposes.
H. Klem was supported by a NIST National Research Council (NRC) Research Postdoctoral Associateship Award in the Thermodynamics Research Group of the Materials Measurement Laboratory during the preparation of this manuscript.
We would like to thank Louis DeLescure (Sanofi Pharmaceuticals) and Andrei F. Kazakov (Thermodynamics Research Center, NIST) for their time and effort reviewing the manuscript during the internal NIST editorial review process.
C Bioreduction Using Enoate Reductases, Org. Lett., 2013, 15(1), 180–183, DOI:10.1021/ol303240a.Footnote |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5gc00351b |
| This journal is © The Royal Society of Chemistry 2025 |