Denis
Maag
a,
Marina
Putzu
a,
Claudia L.
Gómez-Flores
a,
Frauke
Gräter
b,
Marcus
Elstner
ac and
Tomáš
Kubař
*a
aInstitute of Physical Chemistry, Karlsruhe Institute of Technology (KIT), 76131 Karlsruhe, Germany. E-mail: tomas.kubar@kit.edu; Fax: +49 721 608 45710; Tel: +49 721 608 43574
bHeidelberg Institute for Theoretical Studies, 69118 Heidelberg, Germany
cInstitute of Biological Interfaces (IBG-2), Karlsruhe Institute of Technology (KIT), 76131 Karlsruhe, Germany
First published on 11th November 2021
The roles of structural factors and of electrostatic interactions with the environment on the outcome of thiol–disulfide exchange reactions were investigated in a mutated immunoglobulin domain (I27*) under mechanical stress. An extensive ensemble of molecular dynamics trajectories was generated by means of QM/MM simulations for a total sampling of 5.7 μs. A significant number of thiol–disulfide exchanges were observed, and the Cys32 thiolate preferred to attack Cys55 over Cys24, in agreement with previous experimental and computational studies. The structural features as well as electronic structures of the thiol–disulfide system along the reaction were analyzed, as were the electrostatic interactions with the environment. The previous findings of better accessibility of Cys55 were confirmed. Additionally, the reaction was found to be directed by the electrostatic interactions of the involved sulfur atoms with the molecular environment. The relationships of atomic charges, which stem from the electrostatic interactions, lead to the kinetic preference of the attack on Cys55. Further, QM/MM metadynamics simulations of thiol–disulfide exchange in a small model system with varied artificial external electric potentials revealed changes in reaction kinetics of the same magnitude as in I27*. Therefore, the electrostatic interactions are confirmed to play a role in the regioselectivity of the thiol–disulfide exchange reactions in the protein.
Thiol–disulfide exchange is an SN2 reaction between a thiolate anion R1–S− and a disulfide bond R2–S–S–R3, which results in a new disulfide bond, either R1–S–S–R2 or R1–S–S–R3.6 Notably, disulfide bonding patterns in proteins are not necessarily static and stable – rather, they can possess a dynamic character and rearrange spontaneously. Intra- or intermolecular shuffling of disulfide bonds can be triggered by mechanical stress, e.g., pulling the disulfide bond into opposite directions, which unfolds the protein and decreases the activation energy for a nucleophilic attack by a free thiolate.7,8
In a novel approach, Alegre-Cebollada et al. investigated protein unfolding and disulfide isomerization of a mutated I27 immunoglobulin domain (I27*) in real time with force-clamp atomic-force microscopy (AFM).9 I27* was engineered to have a disulfide bond between Cys24 and Cys55 and a free reactive cysteine Cys32, see Fig. 1A. Due to a constant pulling force of 250 pN, far below the force necessary to break covalent bonds (above 1 nN),10,11 the protein unfolded up to the disulfide bond. Residues 25 to 54, including Cys32, were located on a flexible loop which did not stretch because of the disulfide bond, see Fig. 1B. Thus, Cys32 remained in the vicinity of the disulfide bond after the first unfolding step and was able to engage in a nucleophilic attack on Cys24 or Cys55, see Fig. 1C and D. A reaction with Cys55 occurred 3.8 times more frequently than with Cys24. No conclusive explanation of this regioselectivity was found, and it was called for systematic studies on how the reactivity of disulfide bond is affected by their environments.
![]() | ||
Fig. 1 Constant pulling force on the termini of the I27* domain (A) leads to unfolding up to the disulfide bond Cys24–Cys55 (B). This uncages Cys32 so that it can perform a nucleophilic attack on Cys24 (C) or Cys55 (D). Sulfur atoms are depicted by yellow balls. (Image created by the authors, based on PDB ID 1WAA.)12 |
To this end, Kolšek et al. carried out force-clamp MD simulations on I27* using a molecular mechanics-based framework that allowed for disulfide bond rearrangements through Monte Carlo-controlled topology exchanges.13 This approach reproduced the regioselectivity observed by the experimental AFM setup with the advantage of an atomistic description of the process. In the simulations, Cys55 was more readily spatially accessible for a nucleophilic attack than Cys24, and consequently, it was approached by Cys32 more often, leading to the reaction Cys32 → Cys55 occurring more frequently. Thus, steric factors play an important role in disulfide shuffling.
So do electrostatic interactions. The rate of thiol–disulfide exchange, an SN2 reaction, depends on the nucleophilicity of the attacking thiolate Snuc, the electrophilicity of the attacked sulfur Sctr as well as the stability of the leaving group Slg.6,14 These factors are not solely determined by the reactive species themselves; rather, they are affected by the steric and electrostatic interactions with the environment. Notably, the lowest-energy state of a symmetric molecule (R1 = R2 = R3) in the gas phase is a linear trisulfide arrangement with the negative charge delocalized over all three sulfurs.15 This is reflected by the general observation that thiol–disulfide exchange is best catalyzed by hydrophobic, aprotic environments – conditions in which the charge is quite delocalized.16 On the other hand, polar solvents induce a localization of the charges, favoring arrangements with separated molecules, a thiolate and a disulfide.
The charge distribution on a thiol–disulfide center – and consequently, the nucleophilicity and the electrophilicity of the sulfur atoms – are modulated not only by the solvent but also by the microenvironment, e.g. the neighboring functional groups or amino acids. Wu et al. demonstrated that ionic residues in close proximity to the reactants have a major impact on disulfide exchange reaction rates.17 They investigated the reaction between a cysteine as a nucleophile and several small disulfide-bonded peptide homodimers. Net charges ranging from −2 to +2 were introduced in each peptide by putting glutamate or arginine residues in positions adjacent to the disulfide-bonded cysteines. The reactivity showed a linear dependence on the introduced net charges, −2 showing the least and +2 the highest reactivity. Similar effects of the environment on the reactivity were observed in other studies, also.18–21
This work aims to explain the regioselectivity of the disulfide shuffling in proteins, considering the mutated immunoglobulin domain I27* as an example, providing more detail than the previous work in ref. 13. To this end, we perform 334 QM/MM force-clamp simulations of I27*, with an accumulated sampling of ca.5.7 μs. The QM/MM simulations are set up in order to cover a possible nucleophilic attack of the deprotonated reduced Cys32, located on a flexible loop, on both Cys24 and Cys55. To elucidate the prerequisites for a successful disulfide exchange, we analyze 10 ps prior to the formation of transition state in trajectories where a reaction does take place, and analyze potentials of mean force as function of the sulfur–sulfur distances based on all of the trajectories. Also, we perform QM/MM metadynamics simulation of the two different disulfide exchange reactions in I27* but observe difficult convergence. Finally, we demonstrate the impact of electrostatics on disulfide exchange on the basis of metadynamics simulation of a small model system.
Images of proteins were created with VMD.34 Plots and histograms were generated with the Python library Matplotlib.35,36
![]() | (1) |
Also, in pilot metadynamics simulations, the molecules irreversibly reacted to one of several chemically non-sensical species whenever all three sulfur–sulfur distances were very short (below 3 Å), i.e. in a triangular configuration. These structures lie very high in energy and thus are irrelevant for the investigated disulfide shuffling. To prevent such erroneous reactions, restraints were placed on the coordination numbers that were introduced for every sulfur atom as the sum of the switching functions applied to the distances from each of the two other sulfurs, e.g., c(S32) = s(S32–S24) + s(S32–S55). Each of the three coordination numbers was restrained to values below 1.8 with a force constant of 50000 kJ mol−1 nm−2, which penalizes triangular structures with short S–S distances. The sum of all three sulfur–sulfur distances was restrained to values above 9 Å with a force constant of 100
000 kJ mol−1 nm−2, which also prevents the three sulfurs from approaching too closely.
In other high-energy conformations, the sulfur anion came very close to the carbons of the dimethyl disulfide and deprotonated them. Thus, additional restraints were employed to avoid such occurrences. The number of bonded hydrogen atoms was defined as the sum of the switching functions s(C–H) with r0 = 1.3 Å, n = 45 and m = 90, for each carbon atom separately. Each of these quantities was restrained to values above 2.5 with a force constant of 50000 kJ mol−1 nm−2. All non-covalent sulfur–carbon distances were restrained to values above 3 Å with a force constant of 50
000 kJ mol−1 nm−2. Sulfur–hydrogen distances further than two covalent bonds away, were also restrained to values above 3 Å with a force constant of 50
000 kJ mol−1 nm−2.
The input files used to carry out these simulations are available in the ESI† as well as on PLUMED-NEST (http://www.plumed-nest.org), the public repository of the PLUMED consortium,41 as plumID:21.045.
There are three minima of an equal depth that was set to 0 kJ mol−1. The long, narrow minimum at the upper edge of the energy profile corresponds to nearly-linear S32–S24–S55 configurations, while the similar minimum at the lower edge corresponds to nearly-linear S32–S55–S24 configurations. The third deep minimum is found around the distances S32–S24 and S32–S55 of 7 Å, and corresponds to a triangular configuration with S32 located in similar distances from S55 and S24.
Two shallower minima with free energies of ca. 10 kJ mol−1 are located at the distances S32–S24/S32–S55 of 2.7/5.4 Å and 5.4/2.7 Å. These correspond to the transition state structures of the two disulfide exchange reactions. As such, these should be saddle points on the free energy surface, and the observed shallow minima are an artifact of DFTB3/3OB, which underestimates the energy and overestimates the bond lengths of trisulfide species, as discussed previously.42 Nevertheless, this systematic error affects both reactions, S32→S24 and S32→S55, to exactly the same extent, therefore any qualitative conclusions from this study will be unaffected. The height of energy barriers to both reactions is similar, ca. 15 kJ mol−1. A tiny difference in barrier height is expected considering the rather small regioselectivity that was observed in experiments as well as in simulations, however it appears impossible to resolve using free MD simulations like here.
To learn how often S32 approaches S24 or S55, the histogram in Fig. 2A was split into 2 regions, the “upper” region in which S32 is closer to S24, and the “lower” region where S32 is closer to S55. All probabilities in the “upper” region were summed up and converted to a free energy value, and the same was done for the “lower” region; details on this analysis are presented in ESI,† and Fig. S6. We found or in terms of free energy, Glower–Gupper = −0.8 kJ mol−1. This means that S32 is closer to S55 on average, therefore a reaction may occur more frequently with S55 than with S24, as stated previously in ref. 13 on basis of simpler simulations.
To see how the distances between S32 and the disulfide bond correlate with the length of that bond, histograms of the distances S32–S24 and S32–S55 were generated for different S24–S55 bond lengths observed, see Fig. S7 (ESI†). It appears that S32 is increasingly more likely to be closer to S55 with increasing S24–S55 distance. Viewed from the other side: whenever S32 is closer to S55, a longer S24–S55 bond is favored. Consequently, it may be easier for the system to stretch the bond S24–S55 further to pass to a transition state. By contrast, whenever S32 is closer to S24, a shorter bond is favored, thus a transition state is less likely to form.
In spite of the restricted time scale of the QM/MM simulations, a reaction occurred 66 times, with a preference for Cys32 attacking Cys55 (48 reactions) over Cys24 (18 reactions). The preference for Cys32 agrees with the experimental observations, and the Cys55/Cys24 ratio of 2.7 agrees is remarkably similar to the experimental ratio of 3.8. Now, the question arises why one of the reactions is favored over the other. In an attempt to answer this question, we analyzed selected structural and electrostatic parameters in the interval of 10 ps prior to the formation of the transition state in the trajectories where a reaction occurred.
Reaction | S32→S24 | S32→S55 |
---|---|---|
|S32–S24| [Å] | 3.78(0.54) | 5.48(0.53) |
|S32–S55| [Å] | 5.65(0.51) | 3.68(0.52) |
|S24–S55| [Å] | 2.14(0.08) | 2.14(0.08) |
Angle [°] | 149(19) | 143(18) |
Q(S24) [e] | −0.04(0.03) | −0.14(0.04) |
Q(S55) [e] | −0.12(0.04) | −0.02(0.04) |
Q(S32) [e] | −1.09(0.06) | −1.10(0.05) |
ESP(S24) [V] | −2.22(0.35) | −1.91(0.33) |
ESP(S55) [V] | −1.94(0.34) | −2.25(0.36) |
ESP(S32) [V] | −3.26(0.35) | −3.07(0.36) |
The angle Snuc–Sctr–Slg oscillates between 80–180° but this range narrows down to 120–170° right before the formation of TS. At small S32–S24 distances below 3.4 Å, the preferred angle lies between 150–170°, whereas the preferred angle at small S32–S55 distances shows more variance of 120–170°, see also histograms in Fig. S1 (ESI†). Thus, the S–S–S arrangement deviates further from linearity in the 10 ps prior to the disulfide exchange with S55, as compared to the same time frame preceding an exchange with S24. This observed larger flexibility supports the hypothesis that S55 is better accessible for a nucleophilic attack by S32 than S24.13
It turns out that both ΔQ and ΔESP are larger for the reaction S32→S55. This means that Sctr is, on average, a somewhat better target of an SN2 attack, and Slg is a better leaving group, in that reaction compared to S32→S24. Thus, electrostatic interactions contribute to the observed regioselectivity of the disulfide exchange reaction.
An advantage of this small, simple model is that the PMF is completely symmetric as long as no external potential is applied, and this knowledge may be used for a convenient convergence check. In the free energy surface from that simulation (Fig. S11C, ESI†), the minimum energies for bonds between S1–S2 and S1–S3 are 0 kJ mol−1, and for a bond between S2–S3 the energy is 2 kJ mol−1. The energy barriers to the three disulfide exchange reactions lie in the range of 49–52 kJ mol−1. All this illustrates the good convergence of the simulation, with a statistical error of at most 2 kJ mol−1.
Simulations were performed with an additional, external ESP of −0.50 V, −0.25 V, 0 V, +0.25 V, and +0.50 V, respectively, imposed on the atom S1; this additional potential will be denoted ESPext in the following. The simulation setup was designed to sample all three disulfide bonding patterns: S1–S2, S1–S3, and S2–S3, with the respective remaining sulfur atom in the deprotonated reduced (anionic) state. The reaction energies and height of energy barriers to disulfide shuffling are plotted in Fig. 5. The 2D representations of the PMF expressed as function of the S1–S2 and S1–S3 distances (with the S2–S3 distance integrated out) are shown in Fig. S11 (ESI†) together with exemplary molecular structures, and the numerical values of energy barriers are shown in Table S1 (ESI†).
The reaction energies in Fig. 5 (left) exhibit clear trends. First, ESPext applied to the sulfur being attacked, Sctr, has no influence on the reaction energy because the reactant and the product are identical – one of the disulfide-bonded atoms carries ESPext. The other two cases are in fact one: a reaction with ESPext on Snuc is the reverse process to a reaction with ESPext on Slg. Thus, the reaction energy of one equals the negative of the reaction energy of the other. The stabilization or destabilization of the negative charge of the thiolate (which is Snuc prior to the reaction, and Slg thereafter) by ESPext may be considered to rationalize the numerical value of reaction energies: the contribution to total energy from the interaction of that charge Q(S−) with the electrostatic environment (represented here by ESPext) is well approximated as E′ = ESPext·Q(S−). Since Q(S−) ≈ −1.1 e always, E′ is a linear function of ESPext. As an example, for S1 being Snuc and ESPext = −0.5 V, E′ = −0.5·(−1.1) eV = +0.55 eV = +53 kJ mol−1, in a good agreement with the actual observation.
The reaction energies are not quite important in the context of the current work, however. Due to the stretching force applied on I27*, the new free thiolate is immediately pulled away from the newly formed disulfide bond as soon as the first disulfide exchange has taken place. Therefore, the energy of the product and the thermodynamics of the reaction do not play any role. The crucial phenomenon will rather be the effect of ESP, or ESPext in the study of the model system, on the reaction rates.
Let us turn our attention to the heights of energy barrier in Fig. 5 (right). The barrier heights with ESPext on Snuc or on Slg change in a way that is very similar to the reaction energies: the barrier is elevated whenever the reaction energy is positive, while lower barriers are seen in cases that have negative reaction energies. This is a simple consequence of the shape of the corresponding energy landscapes as depicted in Fig. S11 (ESI†). Most interesting in the current context will be the case where S1 is Sctr because this is the kind of data that we have measured in our simulations of I27*. There is a roughly linear dependence of energy barrier on ESPext, with a slope of ca. 10 kJ mol−1 V−1 and positive ESPext giving higher barriers. This is explained easily as positive ESPext leads to a decrease of charge of Sctr, which thus becomes a worse nucleophilic target, and the other way around for negative ESPext.
It has to be pointed out that a certain ESPext applied on S1 in the model system shall have the same effect on the electronic structure of the disulfide bond as the same value of ΔESP in the simulations of I27*. Recall that the ΔESP values found for the reactions were 0.28 and 0.34 V, respectively. The effect of this difference may be compared to the difference of ESPext = 0 and 0.06 V: multiplication by the slope of the dependence of energy barrier on ESPext leads to the difference of energy barriers of ca. 0.6 kJ mol−1, which would cause a ratio of reaction rates of ca. 1.3 from Arrhenius' equation. This factor contributes to the above observed ratio of reaction rates of 2.7, while the remainder of this ratio (of ca. 2) is probably due to other effects like spatial accessibility as discussed previously.13
Next, we asked what structural factors contribute to the preferential attack. The distances and angles between the cysteine sulfur atoms in the trajectories were measured. It was found that S32 can approach S55 over a wider range of angles than S24, therefore S55 is the more easily accessible target of a nucleophilic attack. Further, S32 is located more often closer to S55 than to S24, making a nucleophilic attack on S55 more likely. All that agrees with the previous observations by Kolšek et al.13
Clearly, steric factors play a very important role in disulfide shuffling, but this may not be the complete explanation. Rather, electrostatic interactions may contribute to the reaction control. Thus, we decided to analyze the electron density of the trisulfide system as well as electrostatic interactions in the protein, to see if we find any significant effects. Note that electron density is represented by Mulliken atomic charges within DFTB.
S55 in the role of nucleophilic target carried a more positive charge than S24, and S24 carried a more negative charge as a leaving group than S55 did. This means that S55 is the better nucleophilic target, and S24 the better leaving group of the two. The charges were averaged over two separate ensembles of simulations, and thus there is no bias towards the ensemble with a larger number of simulations.
The observed difference of atomic charges may be accounted to the electrostatic potentials on the sulfur atoms caused by the molecular environment (amino acid side chains, peptide backbones and solvent), which are slightly different for each sulfur atom. Consequently, it is the electrostatic effects of the molecular environment that support the reaction S32→S55 more than S32→S24. This is an additional explanation of the outcome of the force-clamp experiments on I27*, in addition to the previous concept of regioselectivity via accessibility.
In terms of the transition state theory, the steric factors make the approach frequency and thus the pre-exponential factor higher for the reaction S32→S55. Also, the different polarization of electron density results in a lower energy barrier for the same reaction. These two effects act in the same direction, favoring the reaction S32→S55.
A possible electrostatic control of regioselectivity was demonstrated on a model system featuring a symmetric free energy landscape of the disulfide exchange reactions. As soon as an external electric potential is imposed on one of the sulfur atoms, the charges of the sulfurs change, and consequently, so do the free energies: a negative applied ESP results in a more positive charge, which makes the touched atom a better nucleophilic target but a worse leaving group. On the other hand, a positive applied ESP results in a more negative charge, making the atom a better leaving group but a worse nucleophilic target.
We provided a quantitative measure of this effect on the reaction energies and barriers. Electrostatic potential arising from the protein and water environment may polarize the disulfide bond slightly, such that the nucleophile attacks one of the sulfur atoms preferentially. Thus, electrostatics may break the symmetry of the disulfide system. This either induces regioselectivity, or contributes to the regioselectivity due to steric factors.
This model study shows how an external electric field affects the kinetics of disulfide shuffling. The magnitude of ESP applied here, in the order of tenths of volt, corresponds to the differences of potentials observed in protein systems like the I27* domain. In a protein, the “external field” arises from the protein and solvent environment – the surrounding amino acid side chains, peptide backbone as well as any water present. Such an electric field brings on a variation of energy barriers of few kJ mol−1. This modulates the reaction rates by a small factor, and it turns out that the kinetics of disulfide shuffling in proteins is affected by electrostatic effects of the close environment of the disulfide moiety.
Active sites of enzymes and other proteins feature perfectly positioned functional groups and patterns of specific interactions. The case of disulfide exchange reaction investigated here is different but still remarkably similar in the working principle: even though there is no real active site, the selectivity of the reaction is still achieved through the interactions with the environment. All of this likely matters for the disulfide exchange reactions as known in proteins like VWF, integrins and others.
Footnote |
† Electronic supplementary information (ESI) available: Supplementary Fig. S1–S15; supplementary Table S1; procedure for calculation of electrostatic potentials; metadynamics simulation of disulfide shuffling in I27* – method and results; detailed analysis of system setup; input files for the metadynamics simulation of the model system. See DOI: 10.1039/d1cp03129e |
This journal is © the Owner Societies 2021 |