Yan
Xie
and
Scott
Calabrese Barton
*
Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI 48824, USA. E-mail: scb@msu.edu
First published on 31st May 2021
The efficiency of cascade reactions, which consist of multiple chemical transformations that occur in a single pot without purification steps, is limited by the transport efficiency of intermediates between adjacent steps. Electrostatic channeling is a proven strategy for intermediate transfer in natural chemical cascades, but implementation into artificial cascades remains a challenge. Here, we combine infrequent metadynamics (InMetaD), umbrella sampling (US), and kinetic Monte Carlo (KMC) models to computationally study the transfer mechanism of glucose-6-phosphate (G6P) on a poly-arginine peptide bridging hexokinase (HK) and glucose-6-dehydrogenase (G6PDH). Transport of G6P by hopping in the presence of poly-arginine peptides is shown to be a rare event, and InMetaD is used to compute the hopping activation energy. US simulations capture the configurational change in the desorption process and enable the determination of the desorption energy. Parameterized by these results, a KMC model is used to estimate transport efficiency for the bridged enzyme complex. Results are compared to a similar complex using a poly-lysine bridge, using kinetic lag time as a metric. Even at a high ionic strength of 120 mM, poly-arginine peptides may be capable of more efficient transport as compared to poly-lysine, with a predicted lag time of 6 seconds for poly-arginine, compared to a previously reported lag time of 59 seconds for poly-lysine. This work indicates that poly-arginine peptides may be an improved bridge structure for electrostatic channeling of anionic intermediates.
Many examples are found in nature where substrate channeling improves the efficiency of cascade reactions, by enabling intermediate transport between adjacent reaction active sites without equilibrating with a bulk concentration.2 Intermediates can be thereby protected from unfavorable binding or reaction in the bulk, enhancing overall cascade reaction efficiency. Some strategies of substrate channeling observed in nature are intramolecular tunneling,3 chemical swing arms,4,5 spatial organization,4,6,7 and electrostatic channeling.7–9 Electrostatic channeling controls the transport of charged intermediates via electrostatic interaction with the oppositely charged surface between active sites. It is a promising way to channel artificial cascades because it is not limited by structural aspects, and most metabolic intermediates are charged.10
The transport efficiency of electrostatic channeling can be measured by the determination of the transient lag time—the time the system takes to reach steady-state flux. For a perfect channeling mechanism, the lag time is near zero; for an electrostatically channeled system, a lower lag time reflects increased channeling efficiency.11
In recent decades, considerable efforts have focused on two well-studied examples of electrostatic channeling in nature. The first example is the bifunctional enzyme dihydrofolate reductase-thymidylate synthase (DHFR-TS), where bounded diffusion guides negatively charged dihydrofolate intermediates on the positively charged protein structure between two active sites.10 The other often studied example is the super enzyme complex of malate dehydrogenase (MDH) and citrate synthase (CS) within the tricarboxylic acid (TCA) cycle, where oxaloacetate, carrying two negative charges, crosses the positively charged surface between MDH and CS via electrostatic interaction.12 Structural characterization of each of these systems has revealed a charged surface present between active sites, and lag time analyses have demonstrated the influence of this surface on kinetics.10,13 However, experimental approaches provide a limited view of how intermediates interact with these electrostatic surfaces. Thus, advanced computational simulations provide a view into the interaction mechanisms underlying electrostatic channeling.
![]() | ||
Scheme 1 Dual site conversion of glucose to gluconolactone via hexokinase (HK) and glucose-6-phosphate dehydrogenase (G6PDH). Glucose-6-phosphate (G6P) is a stable intermediate. |
Computational studies have addressed substrate channeling mechanisms, including DNA-scaffolded bienzyme systems,7,14,15 co-immobilization,16 and encapsulation,17–20 in artificial cascades. Our group and collaborators have recently reported studies of electrostatic channeling in an artificial cascade comprising hexokinase (HK) and glucose-6-phosphate dehydrogenase (G6PDH) where the reactions take place according to Scheme 1.21–23
In those works, an enzyme complex of HK and G6PDH was covalently conjugated by a positively charged poly-lysine peptide, and the negatively charged intermediate glucose-6-phosphate (G6P) was channeled through electrostatic interactions. A hopping transport mode was considered, and diffusion and desorption energy barriers were computed for intermediate transport along the poly-lysine bridge. Computational results along with experimental data were used to parameterize a kinetic Monte Carlo (KMC) model to compute the transport efficiency of G6P in terms of lag time. Computational results matched well with the experiment, and both results demonstrated the effectiveness of the poly-lysine bridge in decreasing lag time by the increasing transport efficiency of the intermediate. Electrostatic channeling efficiency was shown to decrease with increasing ionic strength, and loss of intermediate was found to be significant at each terminal enzyme, particularly for HK, which has a significant net negative charge.
As an alternative cationic amino acid to lysine, arginine residues are more polarized and can form multidentate hydrogen bonds with phosphate groups,24 enabling stronger interaction with charged intermediates. Because of the strong interaction, poly-arginine peptides are more electrostatically “sticky” than poly-lysine, increasing both the desorption energy and hopping activation energy of G6P on poly-arginine peptides. High desorption energy lowers desorption probability and prevents G6P's leakage at high ionic strength.
However, such high hopping activation energy traps G6P in energy wells on the poly-arginine peptide, increasing the time scale between observed hopping events to hundreds of nanoseconds. Hopping of G6P in the presence of arginine is thus a rare event in MD simulations. In contrast, the hopping process of G6P remains orders of magnitude faster than the HK enzyme's turnover frequency of 0.7 s−1.22
Based on these considerations, poly-arginine peptides represent a promising electrostatic bridge candidate, and the study of rare hopping events using poly-arginine bridges would benefit from advanced sampling techniques. Among the methods of advanced sampling of MD simulation, infrequent metadynamics (InMetaD)25 is efficient in calculating kinetic rates and has been successfully applied to ligand binding and unbinding studies of various systems.26–28 For example, InMetaD has successfully estimated the unbinding rate of an inhibitor from the p38 mitogen-activated protein (MAP) kinase to be 0.02 s−1, comparable to the experimental result 0.14 s−1,28 and reveals the unbinding mechanism of a ligand from α7 nicotinic acetylcholine receptor with the unbinding rate to be 0.0033 s−1, comparable with the experimental value of 0.0003 s−1.26
InMetaD enhances the sampling of a system by adding bias on predefined collective variables (CVs) to release the system from energy minima, enabling observation of system dynamics and kinetics. A key aspect of implementing InMetaD is the selection of CVs. Good CVs properly describe a system in terms of both thermodynamics and kinetics, and common CVs include separation distances between atoms and molecules, bond angles, and combinations of these. Here we use infrequent metadynamics to study the rate and energy barrier of hopping of G6P on a poly-arginine peptide. The desorption pathway and energy barrier were studied using umbrella sampling.29 Rate constants obtained by infrequent metadynamics and umbrella sampling were used as parameters for KMC studies to estimate lag time due to transport over the polypeptide surface. These studies provide an improved understanding of the influence of scaffolding architecture in electrostatic channeling.
![]() | (1) |
![]() | (2) |
![]() | (3) |
We adopted three CVs based on distances between the phosphate group of G6P and arginine residues. The first, labeled d246, represents the sum of the distances, di, between the center of mass of the G6P phosphate group and the center of mass of the guanidinium group of the ith arginine residue, for three arginine residues numbered 2, 4, and 6 in Fig. 1a. Initially, the phosphate group was closely associated with these residues, where a close association is defined as di ≤ 0.5 nm, and d246 = d2 + d4 + d6 = d246 ≈ 1.5 nm, which was considered to be a minimum value. Besides d246, 19 other possible triple association basins were defined; the details are in ESI.†
The second collective variable, nA, represents the number of arginine residues with which the phosphate group is closely associated at each time step. The following function defines nA continuously:
![]() | (4) |
The third and final CV is nS, representing the ligand's solvation number (i.e., the number of water oxygens near the ligand). The value nS can be defined continuously as:28
![]() | (5) |
A committer analysis was implemented in all simulations. Specifically, 32 simulations were initiated with d246 ≈ 1.5 nm and terminated once any of the 19 predefined triple association energy basins has been reached within a wall time of 48 hours (simulation time ∼24 ns). A committer basin is defined based on the sum of three associated distances, dijk where i, j, and k indicate an arginine residue as labeled in Fig. 1a. When the sum dijk is less than 1.5 nm for a specific basin, while dijk for all other basins is greater than 2.0 nm, that basin is considered to be occupied.
Once any of the predefined basins is occupied, the simulation was terminated, and the simulated time was converted to unbiased transition times using eqn (3). Assuming that the transitions are stochastically independent, they may be treated as Poisson processes, and the reaction times should follow an exponential distribution.38 Thus, the empirical cumulative distribution function (ECDF) was computed from the unbiased transition times, t, and compared with the theoretical cumulative distribution function (TCDF) of a homogeneous Poisson process:28
![]() | (6) |
To validate the precision of the recovered transition times, the two-sample Kolmogorov–Smirnov (KS) test was conducted, following the procedure of Salvalaglio et al.,38 to determine whether they were exponentially distributed. When the p-value of the KS test exceeds the threshold of 0.05, the dataset is deemed to conform to an exponential distribution. This is an important step in InMetaD analysis as it ensures that recovered times are uncorrelated and reliable.
From this data, over 20 US windows were chosen, with a higher window density below 1 nm, where the energy gradient was greatest. All US simulations were conducted at 300 K with a spring constant, k = 2000 kJ mol−1 nm−2 applied over two coordinates. The first US coordinate was the pull distance x1 as defined above. The second, x2, represents the pull vector, defined as the magnitude of (CA2 + CA4 + CA6)/3 − P where CAi is the position of the alpha carbon of the associated arginine residue i, and P is the position of the phosphorus atom of G6P.
MD sampling of each US window was conducted for 20 ns. Grossfield-WHAM39 code was used to calculate the 2D potential of mean force. The average of the area defined by |x1 − x2| = 0.05 nm was used as the projection of the PMF on the pull distance, x1 (Fig. S4, ESI†).
![]() | (7) |
Δt = −ln(ρ2)/Γtotal | (8) |
Due to this strong electrostatic interaction, escaping from the energy well of triple associations is rarely observed within the time scale of MD. Fig. 2a shows residues with which G6P associates (di < 0.5 nm) and the number of associated residues over time at 293 K. After an initial period of the double association with residues 6 and 7, and then with residues 3 and 5, G6P was trapped in the triple association with residues 1, 3, and 5 starting at ∼26 ns. As temperature increases (Fig. 2b), the occurrence of a single association remains relatively rare, while that of a double association decreases, and that of a triple association increases. In our observations, triple association dominates over the entire simulated temperature range from 293 K to 318 K. Thus, we define a full hop on the poly-arginine bridge as a transition between different triple association states. To increase the frequency of transitions, we seek advanced sampling techniques.
![]() | ||
Fig. 2 (a) Associated arginine residue indices (top) and evolution of association number (nA, bottom) over 100 ns. (b) Association state probability dependence on temperature. |
However, metadynamics is mainly used to quantify thermodynamic properties and is not generally applied to recover kinetic information. To calculate the rate of hopping between triple association states, we adopted infrequent metadynamics (InMetaD).25 The essential difference between metadynamics and InMetaD is that in InMetaD, the rate of bias deposition is much slower than in metadynamics, and is intended to move the system out low-energy basins without disturbing the kinetics in high-energy transition regions.25,38
The theoretical and empirical cumulative distributions of the transition times at IS = 0 mM, 310 K are shown in Fig. 3a, where the characteristic time, τ, was 126 ± 3 ns. The p-value was 0.95 as calculated by the Kolmogorov–Smirnov (KS) test.41 This value is much greater than the threshold of 0.05, indicating that the transition time conformed to an exponential distribution. This approach was subsequently applied to calculate values of τ for temperatures from 280 K to 320 K, for which all the p-values were above 0.05.
The hopping rate, khop, can be defined simply as khop = τ−1. At 310 K, khop is 7 ± 0.6 μs−1. Compared to the poly-lysine bridge, the hopping rate on the poly-arginine bridge is 85-fold slower at 310 K. Similarly, the hopping rates and their errors were estimated from the characteristic times and their errors across all the studied temperatures and ionic strengths. Overall, khop displays Arrhenius behavior, and the hopping activation energy was computed for a range of ionic strengths from 0 mM to 120 mM (Fig. 3b). No trend was observed for the dependence of either hopping rate or activation energy with respect to ionic strength. This observation that ionic strength had little impact on hopping kinetics of G6P on poly-arginine peptides is consistent with previous calculations using the poly-lysine peptide.21 In both cases, transport is influenced locally by short-range interactions within the Stern layer, where counterions have relatively little influence. Hopping activation energy of 25 ± 3.6 kJ mol−1 was obtained by simultaneous fitting of rate data at all ionic strengths and is used in the KMC model. Notably, this hopping activation energy value of poly-arginine is about double that observed for poly-lysine (13 ± 0.5 kJ mol−1).21
It is worth noting the feature that appears in the PMF traces of Fig. 4a near 0.72 nm, which varies with ionic strength. By inspection of trajectories, we observed that 0.72 nm is a position beyond which the system transitions from triple to double and single association. Generally speaking, this transition feature becomes less smooth and shifts to lower distances at higher ionic strength, reflecting the effect of counterions to weaken electrostatic interactions with the arginine bridge. As shown in Fig. 4b, the desorption energy declines significantly with increasing ionic strength. This is explained by the counterion shielding of long-range electrostatic forces between G6P and the arginine bridge.
Based on the above InMetaD and US results, we can calculate the rate ratio of hopping to desorption by:
![]() | (9) |
Stop-flow lag time analysis was employed to evaluate the channeling efficiency from all the KMC data. As shown in Fig. 5, the lag time for the poly-arginine bridge was 1.9 s at zero IS and increases slightly to 6.5 s at 120 mM. In comparison, the lag time for the poly-lysine bridge increases three-fold over the same range of ionic strength. Comparison to calculated results for the free enzyme,21 poly-lysine bridge,21 and poly-arginine bridge provides a clear indication that electrostatic channeling can significantly improve transfer efficiency as compared to a free enzyme system, and that poly-arginine represents a better bridge candidate compared to poly-lysine for this molecular motif.
![]() | ||
Fig. 5 Comparison of calculated lag time at varying ionic strength between the free enzyme system, the poly-lysine bridge channeled system, and the poly-arginine bridge channeling system. |
As the reaction system achieves steady state, desorption from the bridge approaches equilibrium, creating the possibility of multiple intermediates adsorbed there. To understand this, we used KMC model results to obtain the probability distribution for the number of the adsorbed intermediates on the poly-lysine and the poly-arginine bridges, each with six hopping sites (Fig. S6, ESI†). Our results show that the bridge was empty for about 80% of the time, the bridge adsorbed one intermediate for about 20% of the time, and the occurrence of more than two adsorbed intermediates was rare for both poly-lysine and poly-arginine bridges. This can be explained by the fact that the turnover frequency of G6PDH is much faster than that of HK and the transit time of the intermediate on either the poly-lysine bridge or the poly-arginine bridge is even faster, so there is relatively little accumulation of G6P intermediates on the bridge. Owing to its higher adsorption energy, the poly-arginine bridge does demonstrate a consistently higher probability for multiple adsorbed intermediates as compared to poly-lysine.
This work only accounts for the movement of G6P on the bridge and did not consider the desorption from HK active site to bridge and bridge to G6PDH active site. These phenomena lead to additional loss of intermediate transport efficiency and therefore higher lag times.23 Additionally, the model assumes that the hopping rate between the bridge and enzyme is the same as that on the bridge, which is only a rough approximation. A more realistic prediction of transfer efficiency and lag time would be obtained by studying the transport of intermediate over the enzyme surfaces in the presence of the poly-arginine bridge, as has been demonstrated for the poly-lysine bridge. We are also pursuing experimental validation of these results by synthesis of the HK-polyArg-G6PDH system followed by stop-flow experiments to determine lag time.
Kinetic Monte Carlo studies based on this rate ratio predict significantly lower lag times for systems built with the poly-arginine bridge as compared to the poly-lysine bridge, and lag time is more resistant to changes in ionic strength. These results will be ameliorated somewhat by including transport losses in the vicinity of the enzymes, an effect that can be studied computationally and validated experimentally.
This demonstrates the use of InMetaD to study transport phenomena at longer time scales, which may be applied to channeling phenomena in other multi-enzyme fusions including TS-DHFR and MDH-CS. This work also aids the design of efficient artificial cascade reactions using electrostatic bridges.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d1cp01304a |
This journal is © the Owner Societies 2021 |