Qiankun Gong,
Haomiao Zhang,
Haozhe Zhang and
Changjun Chen*
Biomolecular Physics and Modeling Group, School of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China. E-mail: cjchen@hust.edu.cn
First published on 3rd January 2020
Insulin is a significant hormone in the regulation of glucose level in the blood. Its monomers bind to each other to form dimers or hexamers through a complex process. To study the binding of the insulin dimer, we first calculate its absolute binding free energy by the steered molecular dynamics method and the confinement method based on a fictitious thermodynamic cycle. After considering some special correction terms, the final calculated binding free energy at 298 K is −8.97 ± 1.41 kcal mol−1, which is close to the experimental value of −7.2 ± 0.8 kcal mol−1. Furthermore, we discuss the important residue–residue interactions between the insulin monomers, including hydrophobic interactions, π–π interactions and hydrogen bond interactions. The analysis reveals five key residues, VlaB12, TyrB16, PheB24, PheB25, and TyrB26, for the dimerization of the insulin. We also perform MM-PBSA calculations for the wild-type dimer and some mutants and study the roles of the key residues by the change of the binding energy of the insulin dimer.
Determination of the binding affinity of a complex in an experiment is difficult because of time and money. Compared to the experiment, molecular simulation on the computer has an advantage of low cost. It becomes an important assistant tool at present. Based on the molecular simulations, some methods have been developed to predict the affinities of the complexes, such as free energy perturbation molecular dynamics (FEP/MD),12 Poisson–Boltzmann surface area (MM-PBSA),13 linear interaction energy (LIE),14 adaptively biased molecular dynamics (ABMD),15 steered molecule dynamics (SMD)16 and so on. Among them, SMD may be a promising tool in drug design,17 because it is able to produce lots of short and non-equilibrium trajectories.
Now there are two ways to compare the binding affinities of a complex by the SMD method. The first way uses the maximum force (rupture force) on the pulling collective variable to compare the affinities.18–22 For example, Mai et al. study two ligands binding to influenza virus and find that there is a strong relationship between the maximum force and the experimental free energy. The correlation coefficient is about 0.97.19 However, a recent study also indicates that there might be at least two different relationships (correlation coefficients) between the maximum force and the experimental value.23 For a specific complex, the real correlation depends on the predominant interaction type between the receptor and the ligand, like the electrostatic interaction, hydrophobic interaction or H-bonds. It makes the comparison work much complicated.
The second way of comparison is calculating the free energy profile.16 It is based on an equality proposed by Jarzynski that connects the external work and the free energy differences.24 With the equality, one can obtain the free energy profile and the related binding free energy of a complex by lots of non-equilibrium trajectories. The whole calculation is quite suitable for parallelization. Up to now, there are numerous successful applications, like two ligands binding to FKBP protein25 and numerous ligands binding to protein FKBP, trypsin and cyclin-dependent kinase 2.26 These studies present that the structural flexibility and the steered speed have large impact on the calculations of the free energies. To obtain accurate free energy, the low speed should be used in SMD simulation. Moreover, to improve the calculation accuracy, a lot of practical methods have been proposed, including the fluctuation-dissipation theorem of Brownian dynamics (BD-FDT),27 adaptive SMD,28 hybrid SMD,29,30 and the non-equilibrium friction correction method.31
A few years ago, Tyka and co-workers proposed a confinement method to calculate the free energy differences between two different stable states of a single molecule.32 Suppose these two stable states are P and Q, respectively. The free energy difference between the two states is ΔFPQ. Based on a thermodynamic cycle, the calculation of ΔFPQ is divided into three independent stages. Two of them are the calculations of the free energy differences, ΔFPP* and ΔFQQ*. They correspond to the transitions from the flexible state, P and Q, to the restrained state, P* and Q*, respectively (method: thermodynamic integration (TI)33). The last stage is the calculation of the free energy difference between the two restrained states P* and Q* (method: normal mode analysis (NMA)34). The confinement method has been shown to be practical for an isolated molecule. Recently, Perthold and Oostenbrink also construct a thermodynamic cycle to calculate the binding free energies of some protein–protein complexes with the perturbed distance restraints.35 Their free energy results are close to the experimental values.
In this work, we borrow the ideas of the above papers into the calculation of the absolute binding free energy of an insulin dimer system. Since TI and NMA are very time-consuming for large systems, we replace them by SMD. In our work, one SMD simulation is performed in a virtual binding process of the two insulin monomers. To remove the effect of the restraints from the results, we also perform two additional SMD simulations to produce the free energy differences between the restrained and unrestrained states. Just as the work in ref. 35, the three processes constitute a fictitious but complete thermodynamic cycle from the unbound state to the bound state for the insulin monomers. The final result in our work is −8.97 ± 1.41 kcal mol−1, which is close to the experimental value −7.2 ± 0.8 kcal mol−1.36
Besides the calculation of the absolute binding energy by SMD, we also investigate the binding mechanisms of the insulin dimer. A set of simulations are performed for the dimer at different distances. The results show the importance of the π–π interaction, hydrophobic interaction and the hydrogen bond interaction in the dimerization process. Finally, we perform the MM-PBSA calculations13 to study how the key residues affect the binding energies of the dimer.
![]() | ||
Fig. 1 (a) Monomeric structure from the pig insulin dimer in Protein Data Bank53 (PDB ID: 4INS54). Chain A and chain B are rendered in red color and green color, respectively. Disulfide bonds, CysA20–CysB19, CysA6–CysA11, CysA7–CysB7 from top to bottom, are plotted in the yellow balls and sticks model. (b) Structure of the pig insulin dimer from the Protein Data Bank53 (PDB ID: 4INS54). Four hydrogen bonds between the two monomers are represented by the black solid lines. The figure is produced by VMD.55 |
In 1985, concentration difference spectroscopy experiment obtained the thermodynamic quantities of porcine insulin, including the changes of enthalpy, entropy and free energy in the binding process.36 The experimental values can be used as reference but the process is too complicated to be simulated on computer, so does the calculation of the absolute binding free energy. In current work, inspired by the confinement method32 and the work by Perthold et al.,35 we also use a thermodynamic cycle to circumvent the issue. The schematic diagram of the cycle is illustrated in Fig. 2.
In the figure, the whole thermodynamic cycle consists of four independent processes. Process D at bottom is corresponding to the real binding process from the unbound flexible monomers to the dimer. Process A, B and C are the virtual processes to close the thermodynamic cycle. Among them, process A at left is a virtual relaxing process from the restrained unbound state to the flexible unbound state. In the restrained unbound state, eleven geometric restraints are applied to the system (see the details below). And in the flexible unbound state, all the restraints are all removed from the system. Process C at right is similar to process A except that the SMD simulation is only performed for the bound state. As to process B at top, it is a virtual binding process from the unbound state to the bound state with a fixed force constant in all the restraint potentials. It must be noted that the simulation of the virtual binding process B is much easier than the real binding process D. This is due to the decrease of the degrees of freedom in the system.
Based on the thermodynamic cycle, we divide the calculation of the absolute binding free energy of insulin dimer ΔFD into three parts, ΔFA, ΔFB and ΔFC. Each part represents a free energy difference in the corresponding process. All the simulations are performed by SMD.16 Finally, the absolute binding free energy of insulin dimer is obtained by the following formula:
ΔFD = −ΔFA + ΔFB + ΔFC | (1) |
![]() | (2) |
The rotational restriction correction term ΔFR is introduced to account for the effect of the restraints in the bound state, which is given in the following formula:35
![]() | (3) |
ΔF = −ΔFA + ΔFB + ΔFC + ΔFV + ΔFR | (4) |
We want to note that the reaction coordinate in the SMD simulation for the virtual binding process B is the distance between the monomers along a fixed X-axis. The free energy does not need the Jacobian correction.58
W = ΔF = FY − FX | (5) |
exp(−βΔF) = 〈exp(−βW)〉 | (6) |
To force the system to move along a fast transition path, some kind of restraint potential should be applied. The potential may have many different forms,59 such as that with the constant forces or torques. But the most widely used one is the spring potential. It has the following form:
![]() | (7) |
λ = λ0 + vt | (8) |
With such spring potential, the new Hamiltonian of the restrained system is
H(r,p;λ) = H0(r,p) + V(r;λ). | (9) |
exp{−β[F(λt) − F(λ0)]} = 〈exp(−βW0→t)〉 | (10) |
W0→t is the external work done on the system when it moves from λ0 to λt on the path. The work can be calculated by an integral on time from 0 to t:
![]() | (11) |
Substitute the restrained Hamiltonian into the formula, the external work becomes:
![]() | (12) |
![]() | (13) |
Here Δy and Δz are the deviations of the center-of-mass of the monomers to the X-axis in the y or z direction, respectively. The form of the restraint potential does not change with time. And it does not have the steered coordinate in the SMD calculation (center-of-mass distance between the monomers on the X-axis).
The rest are six restraints on six different atoms of the monomers. They are used to avoid the rotation of the monomers. As shown in Fig. 3, two red atoms (OE1 of GlnA15 in monomer I and CD1 of LeuA16 in monomer II) are fixed on the red horizontal dotted line. Two purple atoms (ND1 of HidB5 in monomer I and CG1 of ValB2 in monomer II) are fixed on the purple vertical X–Y plane with a fixed z coordinate. Two blue atoms (N of LysB29 in monomer I and CA of ThrA8 in monomer II) are fixed on the blue horizontal X–Z plane with a fixed y coordinate. All these restraints are the harmonic potentials with a force constant of 50 kcal mol−1 Å−2.
![]() | ||
Fig. 3 Six single-atom restraints in the SMD simulation for the insulin dimer. The red, purple and blue solid arrows represent the X, Y and Z axis, respectively. Two red atoms are fixed at the horizontal red dotted line, two purple atoms are fixed on the vertical X–Y plane with a fixed z coordinate and two blue atoms are fixed on the horizontal X–Z plane with a fixed y coordinate. The figure is produced by VMD.55 |
In this work, the force field is ff14SB AMBER force field61 and the time step is 2.0 fs. SHAKE algorithm62 is used to constrain the bonds involving the hydrogens. The non-bonded interaction is calculated with a cut-off of 10.0 Å. All the SMD simulations are performed by AMBER 18 (ref. 63) in the NVT ensemble at 298 K.
In the virtual binding process B in Fig. 2, the center-of-mass distance between the insulin monomers decreases from 36.5 Å to 17.5 Å with a low speed of 0.2 Å ns−1. All the restraints are kept in the simulation. The simulation time of the process lasts for 95 ns. Process A and C are also simulated by SMD but with a little difference. In these two processes, the driven coordinate is the force constant k in the restraint potential that changes from a restrained state (k = 50 kcal mol−1 Å−2) to a flexible state (k = 0 kcal mol−1 Å−2). As a comparison, a standard SMD uses the distance as the driven coordinate. The simulation time of process B and C is 20 ns. To obtain the average free energy difference, we perform 40 independent simulations for every process.
To study the residue–residue interactions in the dimerization of the insulin (hydrophobic interaction, π–π interaction and hydrogen bond interaction), we run five simulations at a center-of-mass distance of 26 Å, 24 Å, 22 Å, 20 Å and 18 Å (native state). Each simulation consists of eight independent 100 ns trajectories, whose initial structure is extracted from process B and further equilibrated for 20 ns. To fix the distance between the monomers, an umbrella restraint potential U is applied to the system in the simulation.
![]() | (14) |
We also perform MM-PBSA calculations to get the change of the binding energy between the wild-type dimer and the mutants. To get the mutants, we remove all the sidechain atoms of the selected residues except the Cβ atom. And then change their names to ‘ALA’ in the pdb file. The missing hydrogen atoms are added automatically by the tleap tool in AMBER 18.63 In all the MM-PBSA calculations, we use the three-trajectory mode and run 27 trajectories for each mutant. For each trajectory, the simulation time is 100 ns and 25 snapshots are extracted evenly from the last 50 ns. So, in total we have 225 snapshots in the MM-PBSA calculation. The calculation is performed by the MMPBSA.py script in AMBER 18.63 The dielectric constants of the solute and the solvent are 1.0 and 80.0, respectively. The nonpolar part of the solvation energy is estimated by:
Gnp,solv = 0.0072 × SASA | (15) |
The free energy profiles of the three processes are shown in Fig. 4. Panel (b) is the data of the virtual binding process B. In the process, the center-of-mass distance between the monomers decreases from 36.5 Å to 17.5 Å. And the least free energy ΔFB is −25.02 ± 0.95 kcal mol−1 at distance 18.2 Å. Panel (a) and (c) are the data of the two virtual relaxing processes A and C at the unbound state (distance 36.5 Å) and bound state (distance 18.2 Å), respectively. In these two processes, the force constants in all the restraint potentials decrease from 50.0 kcal mol−1 to 0 kcal mol−1 gradually. So, at the end of the processes, the molecule turns into a flexible structure completely. The end-to-end free energy differences of the two processes are −77.88 ± 0.89 kcal mol−1 and −61.66 ± 0.53 kcal mol−1, respectively. As expecting, all the free energy errors increase when the molecule moves along the path.
![]() | ||
Fig. 4 Free energy profiles (blue solid lines) and the errors (in red color) in the three processes of the thermodynamic cycle in Fig. 2. (a) Virtual relaxing process A. (b) Virtual binding process B. (c) Virtual relaxing process C. See the descriptions of the processes in the main text. |
In order to have the absolute binding free energy, the standard-state correction and the rotational restriction correction are both considered in our work. The standard-state correction (eqn (2)) is 5.45 kcal mol−1 and the rotational restriction correction (eqn (3)) is −5.62 kcal mol−1. Combining all these data together, the final absolute binding free energy of the insulin dimer is −8.97 ± 1.41 kcal mol−1 (eqn (4)). In experimental aspect, the pig insulin dimer has been studied by the concentration difference spectroscopy. Strazza et al. measure the equilibrium constant Ka at five uniformly-spaced temperatures from 394 K to 311 K.36 At 298 K, the corresponding binding free energy is −7.2 ± 0.8 kcal mol−1.36 It shows that our calculation result is close to the experimental data.
In a previous study, Zoete et al. calculate the absolute binding free energy of the insulin dimer by the MM-GBSA method.46 In this method, the binding free energy is constituted of an enthalpy term and an entropy term. The former includes the difference of the gas phase energy and the solvation free energy between the unbound and the bound state. And the latter contains the difference of the transitional, rotational and vibrational entropies. The MM-GBSA result is −11.91 ± 6.7 kcal mol−1,46 which is comparable to the experimental value.36 But, its enthalpy term and the entropy term are −38.65 ± 5.8 kcal mol−1 and −26.74 ± 3.6 kcal mol−1, respectively.46 The experimental enthalpy and entropy (TΔS) is −10.0 kcal mol−1 and −3.3 kcal mol−1 respectively, which are obtained approximately by van't Hoff relation. These two large quantities are greatly different to that in experiment. This may be because the MM-GBSA method includes many parameters and approximations.13,34,64,65 For the enthalpy term, the error might come from the implicit solvent model,13,66 and for the entropy term, the error might come from the harmonic approximation.34 And there are some improved methods aim to decrease the errors in the MM-GBSA or MM-PBSA calculation.67–70
Compared to MM-GBSA, SMD is quite simple. Its calculation is based on work W in eqn (6), instead of the separated enthalpy and entropy terms. But SMD do have its own parameters, such as the steered speed and the number of the trajectories. Previous studies have shown that both slowing down the steered speed and increasing the number of the trajectories can improve the accuracies of the results.25–27 However, preparation of a proper restraint is still difficult in SMD.
One more thing we want to note is that SMD completes the calculation in the explicit solvent. And MM-GBSA or MM-PBSA are not. In MM-GBSA or MM-PBSA, the snapshots are extracted from the explicit solvent, but the free energy calculation is done in implicit solvent. This may bring the underlying problem to the calculation of the absolute binding free energy. For example, determination of the reference solute dielectric constant value is not easy. It can be 1.0, 2.0 or 4.0 in different MM-PBSA calculations.71–74
Insulin is an essential protein in treating the disease. Understanding the binding mechanism of insulin dimer will be useful to design drug. Fig. 5 shows the change of the average solvent-accessible-surface area (SASA) in the binding process of the insulin monomers. The data come from five independent MD simulations at the center-of-mass (COM) distance of 26 Å, 24 Å, 22 Å, 20 Å and 18 Å (native state), respectively.
The change of the total SASA in the binding process (COM distance from 26 Å to 18 Å) mainly comes from 22 residues on the binding interface (B9–B30). We present eight residues that have the largest SASA differences in Table 1. Their SASA changes are plotted in Fig. 6. Among these residues, three are hydrophobic residues (ValB12, PheB24, PheB25), four are polar residues (SerB9, TyrB16, TyrB26, ProB28) and one is the charged residues (GluB13). According to the definition of the hydrophobic interaction, the SASA reductions of the three hydrophobic residues (ValB12, PheB24, PheB25) should favor the binding of the monomers. These residues are important to the insulin dimerization. Moreover, PheB24 and PheB25 are on the flexible β-strand, their hydrophobic interactions also favor the formation of the β-sheet between the monomers.
Residue | PheB24 | TyrB26 | GluB13 | ValB12 | TyrB16 | SerB9 | ProB28 | PheB25 |
---|---|---|---|---|---|---|---|---|
SASA | 132.3 | 129.0 | 106.7 | 105.5 | 98.0 | 93.2 | 76.0 | 69.6 |
![]() | ||
Fig. 6 The changes of the SASA of the eight residues in Table 1 from the isolated monomers (distance 26 Å) to the complex (distance 18 Å). |
π–π interaction between the aromatic amino acids is a typical non-covalent intermolecular force. Recently, Zhao et al. investigate lots of pairs of aromatic amino acids and find some statistical laws.75 In their definition, π–π interaction satisfies some geometry conditions. First, the center-of-mass distance between two aromatic rings should be less than 7.2 Å. Second, the normal–normal acute angle should be larger than 50° (T-shaped conformation) or less than 30° (parallel conformation). In this work, we use the definition to study the π–π interactions between insulin monomers.
There are four aromatic amino acids (TyrB16, PheB24, PheB25 and TyrB26) for each monomer on the binding interface. In the binding process, the percent of the π–π interactions between four strongest aromatic pairs are shown in Fig. 7(a) (TyrB16_TyrB26, PheB24_PheB24, PheB24_TyrB26 and PheB25_PheB25). From the figure, we find that the first three π–π interactions increase quickly in the binding process. Besides the hydrophobic interaction, the inter-strand hydrogen bond interaction is also important to the stabilization of the complex. Fig. 7(b) gives the change of the percent of four inter-strand hydrogen bonds at different distances between the monomers. The locations of these bonds in the complex are shown in Fig. 1. It presents that all the four inter-strand hydrogen bonds become stable at 20 Å (percent larger than 50%). They make the two flexible β-strands form into one global β-sheet.
In a word, the dimerization of the insulin is favored by the hydrophobic interaction, π–π interaction and hydrogen bond interaction. Among the 22 residues on the binding interface, five key residues ValB12, TyrB16, PheB24, PheB25, and TyrB26 are picked out according their involvement in the interactions. The locations of the key residues in the dimer are shown in Fig. 8. These findings are close to previous alanine scanning experiment.38 The experiment studies some mutants of the insulin dimer (ValB12Ala, TyrB16Ala, PheB24Ala, TyrB26Ala, AsnA21Ala, PheB25Ala, ThrB27Ala) and finds that the first four mutants exist as a monomeric insulin after the mutation. Why these residues (ValB12, TyrB16, PheB24, TyrB26) are important to the dimerization of the insulin can be well explained by the analysis of the residue–residue interactions here. Of course, the hydrophobic residue PheB25 is an exception. In the mutation experiment, PheB25 does not present to be critical to the insulin dimerization apparently. In the simulation, its SASA reduction is smaller than the other hydrophobic residues (Table 1). And its π–π interaction is weaker than the other aromatic pairs (Fig. 7(a), magenta line). So, compared to the other key residues, PheB25 may have less contribution to the binding of the monomers.
![]() | ||
Fig. 8 The positions of five key residues on the binding interface of the insulin dimer. They are ValB12 (tan), TyrB16 (green), PheB24 (orange), PheB25 (blue) and TyrB26 (red). The figure is produced by VMD.55 |
MM-GBSA or MM-PBSA are the popular approaches to study the relative binding affinity between similar complexes.76,77 Because in this case, the entropy contribution can be safely ignored. For instance, Huiyong et al. study lots of protein–ligand systems and find that there is a good correlation between the enthalpy value and the binding affinity in experiment.76 To verify the idea, we further investigate the variation of the enthalpy of the insulin dimer and its mutants by the MM-PBSA method. Seven residues AsnA21, ValB12, TyrB16, PheB24, PheB25, TyrB26 and ThrB27 are mutated to Ala one by one. The entropy term is not included in the calculation.
Table 2 gives the enthalpy of the wild-type dimer and the mutants. The changes of the structural RMSD in the simulations are also presented. Compared to the wild-type dimer, the three mutants, ValB12, TyrB16Ala, TyrB26Ala, have a much higher enthalpy. Mutations on these residues decrease the binding affinities of the dimer remarkably. The results are close to the findings in previous alanine scanning experiment38 except the mutant PheB24Ala. It presents that MM-PBSA is successful in the prediction of the relative binding affinity of the complex. As to the average structural Cα RMSD, all the mutants show larger RMSD values than the wild-type dimer. The mutations destabilize the structure of the native state.
Mutant | Enthalpy (kcal mol−1) | Dimer RMSD (Å) |
---|---|---|
Wile-type | −61.4 | 1.8 |
PheB24Ala | −62.0 | 2.2 |
PheB25Ala | −54.8 | 2.0 |
AsnA21Ala | −48.7 | 1.9 |
ThrB27Ala | −48.7 | 1.9 |
ValB12Ala | −44.4 | 2.0 |
TyrB26Ala | −37.7 | 2.2 |
TyrB16Ala | −35.7 | 2.3 |
To describe the effect of the mutation, we also calculate the secondary structures of the residues on the binding interface (B9–B30) for the wild-type dimer and all the mutants by DSSP.78 Among them, the mutant PheB24Ala shows the most difference than the wild-type dimer. The changes of the secondary structures of the wild-type dimer and the mutant PheB24Ala are shown in Fig. 9. In the figure, red color represents the α-helix and purple is the β-sheet. We find that both of the α-helix (B9–B18) and the β-sheet (B24–B26) are quite stable in the wild-type dimer. The dimerization is helpful to the stabilization of the secondary structures. But for the mutant PheB24Ala, the α-helix changes into the “turn” conformation frequently in the whole simulation time. The average percent of the α-helix (B9–B18) decrease from 83.0% (wild-type) to 72.8% (mutant). As to the individual residues in the helix, in monomer I, the percent of AlaB14 and ValB18 decrease 26.0% and 21.0%, respectively. The other residues, ValB12, GluB13, LeuB15, TyrB16 and LeuB17, decrease about 15%. And in monomer II, the residue AlaB14 decreases 19.0%, and ValB12, GluB13 and ValB18 decrease about 5%.
![]() | ||
Fig. 9 The change of the secondary structures of the wild-type dimer (WT dimer) and the mutant dimer PheB24Ala in the simulation. Different secondary structures are plotted in different colors (defined in the color bar at right). The calculation is performed by DSSP.78 |
This journal is © The Royal Society of Chemistry 2020 |