Open Access Article
Michael Falato,
Ruth Chan and
Liao Y. Chen
*
Department of Physics, University of Texas at San Antonio, San Antonio, Texas 78249, USA. E-mail: Liao.Y.Chen@gmail.com
First published on 24th January 2022
AQP7 is one of the four human aquaglyceroporins that facilitate glycerol transport across the cell membrane, a biophysical process that is essential in human physiology. Therefore, it is interesting to compute AQP7's affinity for its substrate (glycerol) with reasonable certainty to compare with the experimental data suggesting high affinity in contrast with most computational studies predicting low affinity. In this study aimed at computing the AQP7-glycerol affinity with high confidence, we implemented a direct computation of the affinity from unbiased equilibrium molecular dynamics (MD) simulations of three all-atom systems constituted with 0.16 M, 4.32 M, and 10.23 M atoms, respectively. These three sets of simulations manifested a fundamental physics law that the intrinsic fluctuations of pressure in a system are inversely proportional to the system size (the number of atoms in it). These simulations showed that the computed values of glycerol-AQP7 affinity are dependent upon the system size (the inverse affinity estimations were, respectively, 47.3 mM, 1.6 mM, and 0.92 mM for the three model systems). In this, we obtained a lower bound for the AQP7-glycerol affinity (an upper bound for the dissociation constant). Namely, the AQP7-glycerol affinity is stronger than 1087/M (the dissociation constant is less than 0.92 mM). Additionally, we conducted hyper steered MD (hSMD) simulations to map out the Gibbs free-energy profile. From the free-energy profile, we produced an independent computation of the AQP7-glycerol dissociation constant being approximately 0.18 mM.
Among the many experimental and theoretical-computational investigations of aquaglyceroporins, one fundamental question remains: does an AQGP have affinity for its substrate glycerol? In functional characterization experiments in 1994,6 Escherichia coli aquaglyceroporin GlpF was shown to facilitate unsaturable uptake of glycerol up to 200 mM into Xenopus oocytes, suggesting that GlpF has very low affinity for its substrate glycerol. In a series of functional experiments from 2008 to 2014,7–9 human aquaglyceroporins AQP7, AQP9, and AQP10 were shown to conduct saturated transport of glycerol with Michaelis constants around 10 μM, indicating that human AQGPs have high affinities for glycerol. In the crystal structures available to date (GlpF in 2000,10 Plasmodium falciparum PfAQP in 2008,11 AQP10 in 2018,12 and AQP7 in 2020 (ref. 13–15)), glycerol molecules were found inside the AQGP channel and near the channel openings on both the intracellular (IC) and the extracellular (EC) sides, showing that all four AQGPs have affinities for glycerol. If we insisted that unsaturated transport precludes high affinity, these experimental data would suggest inconsistency. However, in an in silico–in vitro study16 of glycerol uptake into human erythrocytes through AQP3,17 it was shown that an AQGP (having high affinity for its substrate glycerol) can conduct glycerol transport that is unsaturated up to 400 mM. The transport pathway for unsaturated transport through a high affinity facilitator protein was shown to involve two glycerol molecules next to each other both bound inside an AQP3 channel (one at the high affinity site and one at a low affinity site) for the transport of one glycerol molecule across the cell membrane.16 It is the substrate–substrate interactions (mostly repulsion due to steric exclusion) inside a single-file channel that make it easy for two glycerol molecules cooperatively to move one substrate molecule across the AQGP channel via the high affinity site.
On the theoretical-computational side, the predicted affinities of AQGPs were derived from the computed free-energy profiles – the PMF curves (the potential of mean force as a function of an order parameter, namely, the Gibbs free energy of the system when the chosen degrees of freedom are set to a given set of values). The predictions are dependent upon the methods of computation used in a given study. For example, the estimated values of the glycerol-GlpF affinity range from <1/M (from the PMF curve of ref. 18 and 19) to >103/M (from the PMF curve of ref. 20). Currently, the estimations of the glycerol-AQP7 affinity stand at <1/M (from the PMF curves of ref. 13 and 15) in contrast with the experimental data of ref. 7 showing high affinity ∼105/M. All these point to the need for further theoretical-computational studies of AQGP-glycerol affinities.
In this research, we aim to reach computational convergence on glycerol-AQP7 affinity. We first carried out direct computations of the glycerol-AQP7 affinity by estimating the probability pb of glycerol binding inside an AQP7 channel for a given glycerol concentration cG. The dissociation constant kD = cG(1 − pb)/pb is the inverse of the glycerol-AQP7 affinity. Running equilibrium molecular dynamics (MD) without any biases or constraints on three systems ranging from approximately 0.2 M atoms to 10 M atoms in sizes, we observed a convergence toward high AQP7-glycerol affinity. We also examined the intrinsic fluctuations of the model systems (Fig. 1). We found that the pressure fluctuations were inversely proportional to the system size as expected based on statistical thermodynamics.21 In a system (consisting of 0.2 M atoms) typical in the current literature, the root mean squared pressure fluctuations >100 bar in the simulation of a system under a constant pressure of 1.0 bar (see, e.g., NAMD User's Guide https://www.ks.uiuc.edu/Research/namd/2.14/ug/node39.html). In another word, the model system is subject to constant agitations of an artificial sonicator in inverse proportion to the system size. These agitations are expected to loosen the binding between a protein and its substrate and thus to reduce the apparent affinity (i.e., the computed value of the glycerol-AQGP affinity). Our simulations of various system sizes showed that the computed values of glycerol-AQP7 affinity were strongly dependent upon the system size and that convergence of computational studies points to strong affinity between an AQGP and its substrate instead of weak affinity observed in small simulations. Seeking an independent confirmation of strong AQGP-glycerol affinity, we also determined the glycerol-AQP7 affinity from the PMF curve that was computed from a large set of hyper steered MD (hSMD) simulations.
844 atoms, which is referred to as SysI (shown in ESI, Fig. S1†). We employed NAMD 2.13 and 3.0 (ref. 26 and 27) as the MD engines. We used CHARMM36 parameters28–30 for inter- and intra-molecular interactions. We followed the literature's standard steps to equilibrate the system.15,31–33 Then we ran unbiased MD for 2000 ns (namely, 8000 monomer·ns) with constant pressure at 1.0 bar (Nose–Hoover barostat) and constant temperature at 303.15 K (Langevin thermostat). The Langevin damping coefficient was chosen to be 1/ps. The periodic boundary conditions were applied to all three dimensions. The particle mesh Ewald (PME) was used for the long-range electrostatic interactions (grid level: 128 × 128 × 128). The time step was 2.0 fs. The cut-off for long-range interactions was set to 10 Å with a switching distance of 9 Å. The last 500 ns (2000 monomer·ns) of the trajectory was used in the computation of the glycerol-AQP7 affinity. We varied the Nose–Hoover barostat parameters and the cut-off distances to ascertain that the large pressure fluctuations are not an accidental consequence of the aforementioned choice of parameters which are typical in the literature.
We replicated SysI 26 times to obtain 27 copies of SysI. With appropriate translations of these copies, we formed SysII consisting of 27 AQP7 tetramers (illustrated in ESI, Fig. S2†). Unbiased MD was run for 15
000 monomer·ns for this large SysII with identical parameters used for SysI except that the PME was implemented on a grid of 384 × 384 × 384. The last 5000 monomer·ns were used in the computation of the glycerol-AQP7 affinity. Likewise, we replicated SysI 63 times to form SysIII consisting of 64 AQP7 tetramers (illustrated in ESI, Fig. S3†). We ran unbiased MD on SysIII (with PME grid of 512 × 512 × 512) for 15
000 monomer·ns and used the last 5000 monomer·ns in the computation of the glycerol-AQP7 affinity.
![]() | ||
| Fig. 2 AQP7 monomer channel with a glycerol molecule (large spheres colored by atoms: C, cyan; O, red; H, white) at the central binding site near the NAA/NPS motifs. The whole monomer protein is shown as cartoons colored by residue types (positively charged, blue; negatively charged, red; hydrophilic, green; hydrophobic, white). The water molecules inside and near the channel are shown in shadowy spheres colored by atoms (O, red; H, white). All molecular graphics in this paper were rendered with VMD.35 | ||
Following the standard literature (e.g.,36), one can relate the binding affinity (inverse of the dissociation constant kDi) at the i-th binding site to the PMF difference in 3 dimensions (3D) and the two partial partitions as follows:
| c0/kDi = exp[−ΔWi/RT]Zi/Z∞. | (1) |
Here ΔWi is the PMF at the i-th binding site minus the PMF in the dissociated state when glycerol is far away from the protein. R is the gas constant. T is the absolute temperature. Zi is the partial partition of glycerol in the i-th bound state which can be computed by sampling the fluctuations in 3 degrees of freedom of the glycerol center of mass and invoking the Gaussian approximation for the fluctuations in the bound state.37 Z∞ = 1/c0 is the corresponding partial partition in the dissociated state with c0 = 1 M being the standard concentration.
844 atoms (ESI, Fig. S1†). We computed the root mean squared deviation (RMSD) from the crystal structure for each of the four protein monomers, which are shown in Fig. 3. The RMSD being 2 to 2.5 Å from the crystal structure confirms the quality of the high-resolution structure of ref. 13 representing the AQP7-glycerol chemistry under equilibrium conditions. The small but significant spikes in the RMSD curves corresponds to the events of binding/dissociating of glycerol into/from the AQP7 channel, in line with the concept of induced fit in a glycerol-GlpF complex10 (and glycerol-aquaglyceroporin complexes, in general).
![]() | ||
| Fig. 3 RMSD from the crystal structure of the protein monomers during the MD run of a system with a single AQP7 tetramer (4 monomer channel proteins) for 2000 ns (i.e., 8000 monomer·ns). | ||
We also learnt from Fig. 3 that equilibrium was not reached until after 1500 ns (i.e., 6000 monomer·ns). Only during the last 500 ns, we observed multiple events of glycerol moving into and out of the AQP7 channel as illustrated in Movie 1.† Therefore, only the last 500 ns (2000 monomer·ns) of the MD trajectory should be used in the statistical analyses of the system. The earlier part of the trajectory represents a transient process toward equilibrium which is inevitably dependent upon the initial conditions of the model system. Multiple monomer·μs simulations are necessary for significant sampling of glycerol-AQP7 kinetics. This indicates that glycerol-AQP7 interactions are strong rather than weak. What interactions are responsible for the strong AQP7-glycerol affinity? First, the hydrogen bonds between glycerol and the channel lining residues of AQP7 and the hydrogen bonds between glycerol and the water molecules inside the channel. When a glycerol resides inside the channel near the NAA-NPS motifs (Fig. 2), it forms 2 hydrogen bonds with the surrounding AQP7 residues and 2 hydrogen bonds with the 2 water molecules (one on each side). More importantly, when a glycerol is away from the protein and fully surrounded by water molecules, it forms 6 hydrogen bonds with the surrounding water molecules, but it interrupts 10 water–water hydrogen bonds because it displaces 4 water molecules. Second, the van der Waals (vdW) energy between glycerol and AQP7 is estimated to be −14.8 kcal mol−1 when glycerol resides inside the protein near the NAA-NPS motifs where there is sufficient room for glycerol. When glycerol is away from the protein, the vdW energy between glycerol and water is estimated to be −4.9 kcal mol−1. All these factors combine to give rise to the strong AQP7-glycerol affinity.
Does the large discrepancy from the in vitro data mean that in silico studies cannot be quantitatively accurate at all? Where does this large discrepancy come from? Our model system (SysI) is typical of the current literature.15 We used the standard CHARMM force field parameters. We did not employ any biases in the MD simulation that can generate artifacts. However, during the NPT run for a constant pressure of 1.0 bar, the model system was actually subjected to the mechanic pressure that fluctuated between ±400 bar (Fig. 1). This pressure fluctuation is inevitable in any simulation because it is intrinsic to any system that is smaller than the thermodynamic limit. The mean square fluctuation of pressure is inversely proportional to the system volume (thus the number of atoms constituting the model system).21 In light of all this, it is only logical to build larger systems to ascertain whether or not the pressure fluctuations caused the glycerol-AQP7 affinity to appear weak.
000 monomer·ns simulations of two larger systems, SysII consisting of 4.3 M atoms and SysIII consisting of 10.2 M atoms. The pressure fluctuations of these two systems (shown in Fig. 1) were significantly smaller than SysI. The mean square fluctuations were approximately in inverse proportion to the system size (the number of atoms) as expected from statistical thermodynamics.21 Using the same criterium as for SysI, when one or more glycerol molecules are within 7.1 Å of the NAA/NPS motifs of an AQP7 monomer, that AQP7 channel is counted as being occupied. For a given time interval, SysII and SysIII have many more events of glycerol binding to and dissociating from an AQP7 channel than SysI. Naturally, with larger simulations, we have better statistics in addition to the fact that we have much smaller artifactitious fluctuations in pressure. Taking the last 5000 monomer·ns of the MD trajectories into the statistical calculations, we obtained the probability for an AQP7 channel being occupied by glycerol, pIIb = 0.969 ± 0.004 for SysII and pIIIb = 0.982 ± 0.003 for SysIII. Correspondingly, the computed values of the glycerol-AQP7 dissociation constant were kIID = 1.6 mM for SysII and kIIID = 0.92 mM for SysIII. Considering the computed value for SysI, kID = 47.3 mM, we observed the convergence toward higher affinities (lower kD values) in larger model systems. There is a strong correlation between the computed kD values and the artifactitious pressure fluctuations that are inevitable in any computational studies. Ideally, one can build a large enough system whose pressure fluctuation is much less than 1.0 bar for NPT runs under a constant pressure of 1.0 bar, which is still infeasible with today's computing power. However, our study of SysI, SysII, and SysIII together showed that the glycerol-AQP7 affinity is indeed high as one would expect for a facilitator protein with its substrate. It is emphasized here that the afore-presented computations are directly from unbiased equilibrium MD simulations. As long as the parameters are accurate for the intra- and inter-molecular interactions, the conclusion of high glycerol-AQP7 affinity should be valid, free from artifacts that may be present in biased MD simulations.
![]() | ||
| Fig. 5 (A) SysII, 27 tetramers in a large simulation. (B) SysIII, 64 tetramers in a huge simulation. The last 5000 monomer·ns of the trajectory (colored in blue) was used in the statistical calculation of the probability. More details are shown ESI, Fig. S4 and S5.† | ||
exp[−ΔW0/RT] for the central binding site. The other factors involved in the determination of the affinities are the fluctuations (shown in ESI, Fig. S7–S10†) which were computed straightforwardly from the equilibrium MD runs with the Gaussian approximation. Combining the fluctuations and the PMF well depth, we obtained the dissociation constants as follows: kD = 0.18 mM for the central binding site. This independent computation of the AQP7-glycerol affinity from hSMD simulations supports our direct computation from equilibrium MD simulations (kD < 0.92 mM).
It is interesting to note that AQP7 and GlpF are similar in channel pore radius:13 the widest part of the channel is around the NAA/NPS motifs of AQP7 (Fig. 2) and the NPA motifs of GlpF,10 respectively. The narrowest part is near the ar/R sf. The PMF curve shown in Fig. 6 clearly reflects these characteristics in similarity to the PMF of GlpF.20 At site 0, near the NAA/NPS motifs, there is sufficient room to accommodate a glycerol there and thus no conformational frustrations exist for the AQP7 residues or the glycerol. The vdW interactions between them are all attractive. At site 1, near the ar/R sf, both the glycerol and the pore residues are frustrated in their conformations for the induced fit.10 Likewise but in a lesser degree, there are conformational frustrations when glycerol passes through the IC side of the channel. All these point to the importance of vdW interactions in the glycerol-aquaglyceroporin affinity.20,38
Footnote |
| † Electronic supplementary information (ESI) available: Two movies and 10 additional figures. See DOI: 10.1039/d1ra07367b |
| This journal is © The Royal Society of Chemistry 2022 |