Andrew T. Church,
Zak E. Hughes and
Tiffany R. Walsh*
Institute for Frontier Materials, Deakin University, Geelong, VIC 3216, Australia. E-mail: tiffany.walsh@deakin.edu.au; Fax: +61 3 5227 1103; Tel: +61 3 5227 3116
First published on 30th July 2015
A reliable description of ion pair interactions for biological systems, particularly those involving polyatomic ions such as carboxylate and divalent ions such as Ca2+, using biomolecular force-fields is essential for making useful predictions for a range of protein functions. In particular, the interaction of divalent ions with the double carboxylate group present in γ-carboxyglutamic acid (Gla), relevant to the function of many proteins, is relatively understudied using biomolecular force-fields. Using force-field based metadynamics simulations to predict the free energy of binding between Ca2+ and the carboxylate group in liquid water, we show that a widely-used biomolecular force-field, CHARMM22*, substantially over-estimates the binding strength between Ca2+ and the side-chains of both glutamic acid (Glu) and Gla, compared with experimental data obtained for the analogous systems of aqueous calcium–acetate and calcium–malonate. To correct for this, we propose and test a range of modifications to the σ value of the heteroatomic Lennard–Jones interaction between Ca2+ and the oxygen of the carboxylate group. Our revised parameter set can recover the same three association modes of this aqueous ion pair as the standard parameter set, and yields free energies of binding for the carboxylate–Ca2+ interaction in good agreement with experimental data. The revised parameter set recovers other structural properties of the ion pair in agreement with the standard CHARMM22* parameter set.
Quantum mechanics (QM) based techniques have the potential to offer a detailed and reliable description of the PEL. First-principles molecular dynamics (FPMD) simulations have provided information about the behaviour of biologically relevant ionic species.24–29 However, the high computational expense of such methods limits the time- and length-scales that can be accessed, making them impractical for the simulation of large biomolecules. While not providing as reliable a description of the PEL as QM techniques, force-field (FF) based simulations offer an acceptable compromise between physical veracity and computational expense. For this reason they have been widely utilised for the simulation of complex biomolecules. Some of the most widely used biomolecular FFs include the CHARMM,30,31 AMBER32,33 and GROMOS34 families of force fields.
Despite their maturity, and as prompted by dual advances in experimental techniques and conformational sampling approaches, the parameters of biomolecular FFs are periodically refined to improve their performance.31,35–37 To describe LJ interactions that have not been explicitly parameterised, FFs such as CHARMM and AMBER use the Lorentz–Berthelot combining rules to generate heteroatomic LJ parameters.38,39 While the parameters derived from combining rules are able to reasonably describe many of the inter-atomic interactions within biomolecular systems, for some interactions, e.g. the interactions between different ions or the interactions between species at the hard-soft matter interface, they may break down,40–46 leading to an unphysical description of the system.
The interplay of interactions between ions, biomolecules and water is complex; several previous studies have sought ion parameters that can capture the balance between the ion–water, ion–biomolecule and ion–ion interactions appropriately.37,41,44,47–50 Properties such as the interaction strength of ions with the side-chains of charged residues, the structuring of solvation shells, and ion coordination are all thought to be important factors that contribute to the ability of a FF to recover this interplay. This interplay can sometimes be highly challenging to recover if the interactions between species are described through the use of default FF parameters and combining rules alone.27,29,37,42,45,47
One particular interaction that has been previously demonstrated to require careful attention is that between Ca2+ ions and carboxylate groups,29,42,45 such as those that appear in the acidic residues of proteins and peptides. Previous studies have shown that a number of common biomolecular FFs do not correctly capture the strength of this interaction.29,42,45 Such a deficiency in the FF will have an influence on the saturation rate of ion pairings in biomolecules and may prevent the FF from accurately modeling the functionality of the system.
The ability of the FF to capture structural details, particularly the different coordination modes (see Fig. 1) of divalent ions with carboxylates, is also critical. The contact ion pair (CIP) refers to the case where there is direct (i.e. non-solvent mediated) contact between the carboxylate oxygen atoms and the divalent ion, and the solvent-shared ion pair (SIP) refers to a solvent-separated mode of interaction. There are two distinct CIP states, the first referred to here as the biCIP state (see Fig. 1a) represents a bidentate coordination, while the monoCIP state (Fig. 1b) represents a monodentate coordination. The term SIP refers to the solvent-separated ion-pair (Fig. 1c). The ability to distinguish and correctly coordinate ions to the specific binding locations on acidic residue side-chains is thought to be crucial in systems including signal transduction pathways and catalytic processes.51–54 In the hydroxyapatite-binding sequence BMP2, previous studies have suggested that the orientation of a Glu side chain, and hence the orientation of the carboxylate group relative to the surface, plays a role in the strength of the surface–residue interaction.55 The effects that different binding modes can exert on hydration shells of Ca2+ when interacting with carboxylate groups have also been reported.56 Such properties highlight the potential importance that coordination modes may play in protein/ion binding.
A further factor to consider regarding the Ca2+–carboxylate interaction is that many hydroxyapatite-binding bio-mineralisation sequences contain the post-transitionally modified γ-carboxyglutamic acid (Gla) residue. The Gla residue contains two carboxylate groups and is thought to be critical in triggering folding and binding of these sequences.11,17,57–59 In Fig. 2 we show the difference in structure between the Glu and Gla residue side-chains. Both are negatively-charged; the Glu side-chain carries a −1e charge, while the charge on Gla is −2e. The Gla residue is also found in many other natural systems,60–64 such as conantokins, e.g. conantokin-Pr1, which adopts a helical conformation in the presence of Ca2+ or Mg2+, but is unstructured in the absence of these ions.5 Though Gla is widely found throughout nature and is central to many well-studied mineralisation sequences, the Gla residue has not been well explored in molecular simulation studies. This may in part be due to the lack of well-defined parametrisation of Gla in many common biomolecular force-fields.
Therefore, there is a clear need to test, and if necessary modify, FF parameters related to the association of carboxylate groups (such as in Glu and Gla) with divalent ions such as Ca2+. The modification of a FF such that it more accurately describes the Ca2+–carboxylate interaction should be done with care. While we wish to reproduce the features (binding strength, coordination mode, interaction with the Gla residue) described above, we also need to ensure that any changes we make do not inadvertently disrupt the behaviour of other parts of the system. The best way of meeting these criteria is specifically targeting the modification of the non-bonded interactions between Ca2+ and carboxylate side-chains.29,42
Recently, Kahlen et al.29 reported the optimisation of Ca2+–Ocarb Lennard–Jones (LJ) parameters for a range of force-fields. To test these modifications, these authors calculated potential of mean force (PMF) profiles for the aqueous calcium acetate complex, and used these to derive the Ca2+–acetate association constant, enabling direct comparison with experimental data. The FFs were optimised via an increase of the heteroatomic σ LJ parameter associated with the Ca2+–Ocarb pair. This optimisation procedure was reported for both the GROMOS34 and OPLS65 FFs; however, the CHARMM22* force-field was neither evaluated nor modified. These modifications were taken on the basis of an integrated property (the association constant), which may not admit a unique PEL as a solution; in other words, many variations of the PEL as a function of the calcium–acetate separation may also produce reasonable association constants. In particular, the delicate balance in free energy between the biCIP, monoCIP and SIP states cannot be resolved via comparison with integrated properties (such as the association constant) alone. In principle, FPMD simulations may offer a viable route to determining this free energy balance, but in practice, at present the uncertainties associated with the resulting free energy profiles computed using this approach29 prevent definitive resolution of this question.
Here, we have investigated and quantified the strength of the Ca2+–carboxylate interaction, and also the resulting local hydration structure and coordination modes. To determine the relevant PMF profiles for these interactions, we carried out metadynamics66,67 simulations of the capped Glu and Gla amino acids interacting with a calcium ion in solution using the CHARMM22* FF,30,31 and compared the resulting association constants with those experimentally obtained for both calcium acetate and calcium malonate. Our results show that the standard CHARMM22* FF parameters for the calcium–carboxylate interaction yield a substantial over-estimate of the binding free energy compared with experimental data. This over-estimation was found to be particularly pronounced for the Gla residue. We then optimised the CHARMM22* (ref. 30 and 31) Ca2+–Ocarb heteroatomic LJ parameters for the Glu residue, as well as the γ-carboxyglutamic acid (Gla) residue, to match our predicted association constants with experimentally-determined values obtained for both calcium acetate and calcium malonate. The structural data generated using this new LJ parameter resulted in close agreement with previously reported findings.
Once both systems were constructed, they were each placed in a 4 nm cubic cell and were then equilibrated. First, each system was energy minimised and subsequently subjected to a short (200 ps) simulation in the Canonical (NVT) ensemble at our target temperature of 310 K, maintained using the Nosé–Hoover thermostat.73,74 Next, we ran a short (0.5 ns) simulation in the isothermal-isobaric (NPT) ensemble75 at 1 atm pressure and 310 K, with a pressure-coupling constant of 0.4 ps. All of our production simulations were carried out at 310 K. This temperature is of interest because it corresponds to the most likely physiological scenario for probing interactions between mammalian (human) Gla-containing proteins and Ca2+ under aqueous conditions. The final frame from these simulations was used as the initial starting configuration of our metadynamics simulations (see below).
To modify the force-field in our work, we have chosen to utilise the approach outlined by Project et al.42 and utilised by Kahlen et al.,29 where slight modifications to the heteroatomic LJ Ca2+–Ocarb σ parameter were implemented.29,42 Modifications to the bespoke heteroatomic LJ parameters can be readily implemented in GROMACS to over-rule the standard Lorentz–Berthelot mixing rules, such that only the interaction between the target pair of atoms is affected. Following Kahlen et al.,29 we systematically increased the σ parameter of the heteroatomic LJ Ca2+–Ocarb, where each increase is expressed as a percentage of the unmodified value, in increments of 0.2%, scanning the range of (σ + 2.0%)–(σ + 3.0%), resulting in a total of seven different parameter values (including the unmodified value). Table S4 in the ESI† provides the explicit σ values used for each case.
We carried out seven well-tempered metadynamics76 simulations for the aqueous Glu–calcium system, one for each value of the LJ Ca2+–Ocarb σ parameter. For the Gla-calcium system, we used our metadynamics results obtained for Glu to narrow down the range of sigma values. In this instance we deemed it sufficient to consider the range (σ + 2.0)–(σ + 2.4%), in addition to the unmodified LJ parameters, giving a total of four metadynamics simulations for Gla.
We used the PLUMED77 plugin along with GROMACS to perform these calculations. The collective variable (CV) for the Glu/calcium simulations was defined as the distance between the central carbon of the carboxylate group and the calcium ion. Because the Gla residue contains two carboxylate groups, the collective variable for the aqueous Gla/calcium system was defined as the distance between the calcium ion and the central carbon of the carboxylate group that was closest to the calcium ion at each time frame. Gaussian hills were deposited every 1 ps with a hill height of 0.1 kJ mol−1 and a width of 0.025 nm. A bias factor of 10 was used throughout. We ran these simulations in the NVT ensemble, with the temperature maintained at 310 K using the Nosé–Hoover thermostat. The full range of the CV explored in our simulations was ∼32 Å. We ran our metadynamics simulations until the resulting potential of mean force (PMF) profile was showing only minimal evolution with time. This yielded a simulation time of 100–120 ns for each metadynamics simulation. All resulting PMF profiles obtained for a given system (either Glu or Gla) were set to zero at a CV separation of 13 Å.
Following our metadynamics simulations, we carried out some additional standard MD simulations for the purposes of generating specific structural data. These simulations were also carried out in the NVT ensemble at the same target temperature and for the same simulation settings as specified above.
![]() | (1) |
Next, gid(r) was integrated following eqn (2), therefore yielding a value for the association constant for the Ca2+–carboxylate interaction.
![]() | (2) |
The upper limit on the integration was taken as the distance corresponding to the separation between the ion and the ligand in the disassociated state, which is reached at the point where the ion environment is that of bulk aqueous solution. We define ‘dissociated’ as the ion pair separations that exceed those corresponding to the first local maximum beyond the local SIP minimum. The association constants can then be converted into Gibbs free energy values based on eqn (3).
ΔG = −RT![]() | (3) |
Once obtained, our predicted values for the Gibbs free energy of ion ligand adsorption can be compared against experimentally-derived values taken from the literature.
While there is a lack of published experimental studies regarding the strength of the interaction between Glu/Gla and Ca2+ under aqueous conditions, there are comparable data reported for the aqueous Ca2+–acetate and Ca2+–malonate systems.78 These molecules are sufficiently similar in structure to the side-chains of Glu and Gla to permit reasonable comparisons between our predictions and experimental data. We therefore tuned our modification of the heteroatomic σ LJ parameter associated with the Ca2+–Ocarb pair to ensure we could recover the relevant experimental binding free energies.
To investigate the structure of water around the interaction pair, we calculated the radial distribution function between the oxygen atoms of the carboxylate group and the oxygen atom in water, for structures in the biCIP configuration. We calculated the hydrogen-bond angle distribution for the same set of configurations, as well as the coordination number of water molecules around Ca2+. For the hydrogen-bond analysis, our identification of hydrogen bonds was based on the standard geometric definition of Jedlovszky et al.81 where the interaction must satisfy an O⋯O distance of less than 3.5 Å.
The resulting calcium–carboxylate binding free energy was calculated to be −12.1 kJ mol−1 at 310 K using the unmodified CHARMM22* FF, approximately twice the experimental value. Following this, we calculated the PMF profiles for this interaction for a total of six modified Ca2+–Ocarb LJ σ values, increasing this value of σ incrementally in steps of 0.2%. The absolute values of sigma are provided in Table S4 of the ESI;† we also provide all of the final PMF profiles of each metadynamics simulation in Fig. S3 of the ESI.† The resulting free energies of binding are summarised in Fig. 4a (see Table S5 in the ESI† for numerical values), revealing that the modification of σ + 2.2% yielded the best comparison with the experimental binding data. The resulting potential of mean force profile is shown in Fig. 3a), corresponding to a binding free energy of −6.1 kJ mol−1 calculated at 310 K.
![]() | ||
Fig. 4 Predicted free energy of binding for the interaction between Ca2+ and the carboxylate side chain of (a) glutamic acid (Glu), and (b) carboxyglutamic acid (Gla), for both the unmodified CHARMM22* force-field, and for different force-field modifications (see text for details). The gray-colored horizontal bar indicates the range of experimentally-determined values.78,79 |
In all cases, the dominant minimum corresponded to the biCIP structure. In general, our modification to the σ value tended to reduce the difference in well depth between the biCIP and monoCIP configurations. The free energy difference between the biCIP and monoCIP minima was ∼7.5 kJ mol−1 with the default CHARMM22* parameters, while for the optimal σ + 2.2% modification this difference reduced to ∼4.5 kJ mol−1. As mentioned in the Introduction, our comparison with an integrated value such as the binding free energy does not allow us to reach a definitive conclusion about the relative balance in free energy between the three coordination modes.
As a function of the CV value in the free energy profile, each minimum corresponds with the ensemble average of all configurations that feature the particular C⋯Ca2+ separation corresponding to the position of the minimum. The configurations associated with each of the three minima suggests that the first minimum is dominated (calculated to be ∼100%) by configurations with the biCIP co-ordination mode. Likewise, the second and third minima correspond to the monoCIP and SIP co-ordination modes, respectively (see Fig. 1).
Previously published results reported by Project et al.42 utilised regular MD simulations with the CHARMM22 force-field to investigate the aqueous Ca2+–carboxylate interaction. While these authors did not report a binding constant for this particular force-field, their findings hinted that the CHARMM22 force field may actually under-predict this interaction strength. However, the CHARMM22* force-field contains explicit differences in the partial charges of the acidic groups compared with the older CHARMM22 force-field, which may in part explain this discrepancy with our findings. This would seem to be a more likely source of discrepancy compared with the differences in sampling between our work and that of Project et al.42 Nonetheless, techniques such as metadynamics allow us to improve our degree of sampling compared with regular MD, and in turn allow us to investigate these interactions in more detail.
For calcium malonate, the experimentally-observed range of log(K) values is reported to be 1.61–2.50 in the standard state.78,79 Again, we assume that K is approximately constant in the temperature range of 298–310 K, allowing us to infer the binding free energies at 310 K. On this basis, these log(K) values correspond to free energies in the range of −9.5 to −14.8 kJ mol−1 at 310 K. Based on our findings for Glu, we narrowed our investigation of the range of modifications to the Ca2+–Ocarb LJ σ value to values adjacent to σ + 2.2%. In Fig. 3b, we show the free energy profile for the association of calcium and the Gla side-chain for the unmodified CHARMM22* FF. As seen for Glu, the free energy profile in all cases supports three distinct minima, where the coordination mode corresponding to each minimum is the same as that described for Glu. There was no evidence to support the existence of a fourth minimum corresponding to a coordination between the two carboxylate groups in Gla. The resulting calcium–carboxylate binding free energy was calculated to be −29.9 kJ mol−1 at 310 K for the unmodified CHARMM22* FF, again approximately twice the experimental value.
We considered three modifications, σ + 2.0%, σ + 2.2% and σ + 2.4%; the resulting free energies of association are summarised in Fig. 4b. While the free energy of association calculated from the profiles generated using both the σ + 2.2% and σ + 2.4% modifications fall within the range of experimental values, we suggest that the σ + 2.2% modification provides an acceptable compromise between performance and transferability. The resulting PMF profile for σ + 2.2% is shown in Fig. 3b). The free energy of association calculated using the σ + 2.2% was −12.5 kJ mol−1 at 310 K.
We investigated the details of the solvent structuring around the ion pair in the biCIP coordination mode. We calculated the radial distribution function (RDF) between the carboxylate oxygen atom (denoted herein as OM) and the water oxygen atom (denoted herein as OW), and compared our findings for both the unmodified CHARMM22* FF and our σ + 2.2% modification, as shown in Fig. 5. Two peaks in the RDF are apparent, the first and most predominant at a OM–OW separation of ∼2.8 Å, corresponding to the first solvation shell, and a second peak at ∼4.4 Å indicating the second solvation shell. These data show that our modification has imparted no appreciable difference to these RDFs, for both Glu and Gla. These RDFs compare favorably with those previously reported by Kahlen et al.29 for aqueous calcium acetate. In Fig. 6 we show the distribution of the hydrogen-bond angle between the carboxylate oxygen atom (OM) and water in the first solvation shell around the carboxylate in the biCIP configuration, calculated for both the unmodified CHARMM22* FF and our σ + 2.2% modification. Again, very little difference is apparent between the unmodified and modified profiles. These profiles broadly agree with those previously published for aqueous calcium acetate,29 particularly in terms of the positions of the peaks in these profiles. In summary, our data indicate that the proposed FF modification did not adversely impact on the local solvation structure when compared with results from the unmodified CHARMM22* FF.
Finally, we also investigated the coordination number of water around Ca2+. The coordination of waters around Ca2+ in the biCIP configuration for both Glu and Gla were found to be unaffected by the FF modification, as indicated in Fig. 7a. The number of water molecules in the first solvation shell compared very well with FPMD predictions reported by Kahlen et al.29 We also show the coordination of waters around Ca2+ in the bulk configuration for the aqueous calcium–Glu system (data for Gla are indistinguishable by eye from those of Glu and are not shown). Again, the number of water molecules in the first solvation layer compare favorably with the FPMD values reported previously.29
![]() | ||
Fig. 7 Number of water molecules around Ca2+ calculated for configurations taken from (a) the biCIP state of Glu and Gla, and (b) the bulk solvation state of Glu. |
Overall, the unmodified CHARMM22* FF appears to overbind the aqueous calcium–carboxylate ion pair compared with analogous experimental data. Our findings indicate that a targeted correction to the heteroatomic LJ σ parameter of +2.2% for the Ca2+–Ocarb pair, which can be readily implemented in GROMACS, can substantially improve the description of the binding strength of the calcium–carboxylate ion pair in solution, without adversely impacting on the structuring of the solvent around this ion pair.
Some discussion of the CV used in these metadynamics simulations is warranted, because appropriate choice of this parameter is a key determinant in the utility of the metadynamics approach.67 It should be noted that the calculation of the free-energies of Glu/Gla with Ca2+ could be performed using an alternative CV, such as that defined by the distance between Ca2+ and oxygen. However, while the choice of a different CV might affect the PMF profiles obtained, any change to the predicted association free energy is likely to be negligible. The free energy differences were calculated from the integration over two broad sets of configurations, the associated and unassociated states. Minor changes to the CV are unlikely to significantly change the ratio of the populations of these two states. To underscore this point, we re-calculated the Glu PMF using a different CV, namely the distance between Ca2+ and one of the carboxylate oxygen atoms, using the default CHARMM parameters. As expected, the PMF for this CV appears completely different (see Fig. S4 in the ESI†). However, upon integration of this PMF, the association constant, and consequently the association free energy, based on this PMF yield very similar results compared with our original simulations. Specifically, ΔGads was found to be −12.3 kJ mol−1, whereas our original CV yielded a value of −12.1 kJ mol−1. Moreover, by defining the distance between Ca2+ and the closest carboxylate carbon atom as the CV, we are consistent with previous recent studies.29,45 Our data also indicate that this CV is appropriate. A poorly-chosen CV can show hysteresis in the resulting free energy profile, due to relatively poor sampling of the energy landscape in directions orthogonal to the CV.67 In Fig. S5 of the ESI† we show the evolution of the CV with simulation time; these data do not indicate such problems with our simulations.
The CHARMM22* FF favors the biCIP coordination mode over the monoCIP mode in terms of stability, in both unmodified and modified forms of the FF. While there is some experimental evidence from 13C nuclear magnetic resonance (NMR) observations that the monoCIP mode may be preferred,82 the FPMD free energy profiles reported by Kahlen et al.29 suggest that the biCIP and monoCIP minima could be roughly equivalent in terms of free energy. However, the large error bars inherent to the free energy profiles generated using FPMD approaches means that this finding is not conclusive. We note from our profiles that our FF correction decreased the energy difference between the biCIP and monoCIP minima; for Glu this was reduced from ∼7.5 kJ mol−1 with the default CHARMM22* parameters, to ∼4.5 kJ mol−1 for the optimal σ + 2.2% FF modification. Similarly, for Gla this difference reduced from ∼14.5 kJ mol−1 in the unmodified case to ∼6 kJ mol−1 for the modified FF. However, the precise energetic ordering of the biCIP and monoCIP minima remains to be to fully resolved in future.
Footnote |
† Electronic supplementary information (ESI) available: Summary of Gla force-field, values of the modified heteroatomic Lennard–Jones σ values, potential of mean force profiles generated using our modified force-field parameters, absolute free energies of binding for the aqueous Ca2+–Glu system, potential of mean force profile using an alternative CV definition, and evolution of the original CV with simulation time for both Glu and Gla. See DOI: 10.1039/c5ra11268k |
This journal is © The Royal Society of Chemistry 2015 |