Carlos
Navarro-Retamal‡
a,
Anne
Bremer‡
b,
Jans
Alzate-Morales
a,
Julio
Caballero
a,
Dirk K.
Hincha
b,
Wendy
González
*a and
Anja
Thalhammer§
*b
aCenter for Bioinformatics and Molecular Simulations, Universidad de Talca, 2 Norte 685, Casilla 721, Talca, Chile. E-mail: wgonzalez@utalca.cl (Wendy González, for correspondence relating to protein modelling)
bMax-Planck-Institut für Molekulare Pflanzenphysiologie, Am Mühlenberg 1, D-14476 Potsdam, Germany
First published on 25th May 2016
The LEA (late embryogenesis abundant) proteins COR15A and COR15B from Arabidopsis thaliana are intrinsically disordered under fully hydrated conditions, but obtain α-helical structure during dehydration, which is reversible upon rehydration. To understand this unusual structural transition, both proteins were investigated by circular dichroism (CD) and molecular dynamics (MD) approaches. MD simulations showed unfolding of the proteins in water, in agreement with CD data obtained with both HIS-tagged and untagged recombinant proteins. Mainly intramolecular hydrogen bonds (H-bonds) formed by the protein backbone were replaced by H-bonds with water molecules. As COR15 proteins function in vivo as protectants in leaves partially dehydrated by freezing, unfolding was further assessed under crowded conditions. Glycerol reduced (40%) or prevented (100%) unfolding during MD simulations, in agreement with CD spectroscopy results. H-bonding analysis indicated that preferential exclusion of glycerol from the protein backbone increased stability of the folded state.
Two of the physiologically, functionally and structurally best characterized plant IDPs are Cold Regulated (COR) 15A and COR15B from the model plant Arabidopsis thaliana. Both belong to the group of late embryogenesis abundant (LEA) proteins that were first described over 30 years ago as a group of proteins that accumulate late in plant seed development.9 However, they are also abundant in vegetative plant tissues following environmental stresses such as drought, cold or high salinity.10 LEA proteins are not restricted to plants; multiple forms are expressed in desiccation-tolerant animals from at least four phyla.11 In the fully hydrated state, most LEA proteins are predominantly unstructured with a preponderance for random coil. While some LEA proteins seem to function as unstructured proteins,12 others can be induced to fold in the presence of artificial membranes13 or during dehydration.14 In the dry state, many LEA proteins fold into mainly α-helical structures (see ref. 15 for a review). This gain of structure is fully reversible upon rehydration.11
COR15A and COR15B are both cold induced nuclear encoded LEA proteins that are targeted to the chloroplast stroma via signal peptides.16–18 The respective genes share a sequence identity of 82%,19 while the amino acid sequence identity of the proteins is 77%. The mature proteins have a molecular mass of approximately 9 kDa.17,18,20 Overexpression of either COR15A or COR15B in A. thaliana significantly increases leaf freezing tolerance, while simultaneous RNAi silencing of both genes reduces freezing tolerance after cold acclimation.20,21 The physiological function of the COR15 proteins is the stabilization of chloroplast membranes, but not of soluble enzymes during freezing20 and the available data strongly suggest functional redundancy of the proteins.20
Both COR15 proteins are highly hydrophilic and predominantly unstructured in solution, but fold into amphipathic α-helices during drying or in the presence of high concentrations of low-molecular-mass crowding agents such as sucrose or glycerol.20,22 Due to their high conformational flexibility, no crystal structures are available for any LEA proteins and there are also no reported NMR spectra of COR15A or COR15B. Here, we investigated the hydration-dependent structural transitions of fully folded COR15A and COR15B using MD simulations. Quantitatively, the folded and unstructured fractions of the proteins in the dry and fully hydrated states, and in the presence of 40% and 100% glycerol were compared between the predictions from simulation and experimental data obtained by circular dichroism (CD) spectroscopy.22 In general, we found good agreement between the predicted and measured secondary structure content of the proteins.
The stereo-chemical quality of all three models was analyzed with PROCHECK. Based on Ramachandran plots, 76.2% of COR15A, 74.7% of COR15B and 87.5% of COR15A-TH model structures were situated in preferential regions, while only 2.5%, 0% and 0% were placed in disallowed regions, respectively. The 3D models were also positively evaluated by ANOLEA analysis (Fig. S2, ESI†). The models indicate that both COR15 proteins form two α-helices with an amphipathic character in the dry state with the opposing polar and non-polar faces oriented along the longitudinal protein axes facing in different directions. In the homology models, the two helices are connected by a loop consisting of 22 and 17 residues in COR15A and COR15B, respectively, while in COR15-TH both helices are connected by a shorter loop of only three residues.
In the COR15A model, the fraction of hydrophobic and hydrophilic amino acids was 33% and 67% (helix I) and 49% and 51% (helix II), respectively. In the threading model COR15A-TH the fraction of hydrophobic and hydrophilic residues were 33% and 67% (helix I), identical to helix I of the homology model, and 36% and 64% (helix II), slightly more hydrophilic than helix II of the homology model. In the COR15B model, helix I had the same distribution as in the COR15A model, while helix II was more hydrophilic, with 29% hydrophobic and 71% hydrophilic residues.
Of the 89 amino acids comprising COR15A 45 were in α-helical conformation at the beginning of the production simulation. This number was reduced to 24 ± 2.2 after 30 ns of simulation. Similarly, of the 90 amino acids of COR15B, 44 were in α-helical conformation at the beginning of the production simulation, while after 30 ns, this number was reduced to 31 ± 2.9. In the case of COR15A-TH, the number of residues in α-helical conformation decreased from 73 to 37 ± 2.4 after the 30 ns simulation.
Fig. 2 shows the unfolding of COR15 proteins in water by means of configuration snapshots at selected time frames. Fig. 3 shows the same process at single-residue resolution, representing the results of 10 replicate simulations for each protein model. Notably, the unfolding process showed a rapid phase over the first 1–2 ns of MD simulation in COR15A and COR15B that can be clearly seen in both representations. The number of unstructured amino acid residues increased more slowly with proceeding simulation time in all three models.
This is in agreement with previous MD simulations reporting that protein unfolding can occur on a picosecond timescale.5,32,33 In all models, unfolding was more pronounced in helix II than in helix I.
In general, globular proteins are stabilized by a hydrophobic core.34–36 Our models imply that such a core is not a distinct feature of COR15A and COR15B as hydrophobic and hydrophilic amino acids are involved in unfolding in a similar ratio (Fig. S3, ESI†). This indicates a tendency of disorder of non-polar residues in the presence of a polar solvent. Therefore we suggest that the lack of a hydrophobic core in the COR15 proteins reduces the stability of the proteins and thus promotes their unfolding in water.
Structural alterations of the COR15 proteins during unfolding in water were further characterized by the calculated changes in the root-mean-square deviation (RMSD) of all atoms in the polypeptide backbone during the MD simulations. Representing the temporal fluctuation of the protein backbone, RMSD is a measure of protein structural flexibility.37
During simulation of COR15A-TH, COR15A and COR15B after release of the constraints in water, RMSD continuously increased until approximately 20 ns. After that the RMSD remained stable, indicating a convergence of all replicates (Fig. 4A). In parallel the potential energy was stable (Fig. S4, ESI†) showing that the system was in an energy minimum. These findings indicate increasing fluctuations of the peptide backbone during unfolding.
Under these conditions water is partially removed from the cells due to ice formation in the intercellular spaces leading to increased intracellular solute concentrations. Accordingly, both proteins show partial folding into α-helices in 50% (v/v) glycerol or 2 M sucrose.20
To characterize this crowding-induced folding further, the dependence on glycerol concentration was experimentally assessed by CD spectroscopy for untagged COR15A (Fig. 7).
The data indicate that the α-helix content of COR15A increased linearly with increasing glycerol concentration (r = 0.959, p < 0.0001). With almost 90% α-helicity, it reached a putative folding maximum also obtained in 50% (v/v) trifluorethanol20 at about 80% (v/v) glycerol, which was the highest experimentally accessible concentration.
MD simulations of both folded COR15 proteins in 40% and 100% glycerol were carried out to monitor the influence of the hydration state on protein unfolding (Fig. 8). In the presence of 40% glycerol, unfolding of COR15A-TH, COR15A and COR15B still proceeded over time (Fig. 8A, C and E), although it was strongly decreased compared to unfolding in water (Fig. 3).
Similar to the simulations in water, RMSD continuously increased in both solvent conditions until approximately 20 ns. After that the RMSD remained stable, again indicating a convergence of all replicates (Fig. 9). Notably, the overall RMSD magnitude decreased from the simulations in water, reaching a minimum in 100% glycerol. This indicates reduced fluctuations of the peptide backbone during unfolding with increasing glycerol concentration.
To clarify whether the increased protein stability in glycerol was due to direct interaction of the solute with the protein, we analyzed the H-bonding patterns derived from MD simulations (Fig. 10).
In pure water (0% glycerol), unfolding was accompanied by an increase of protein–water (Fig. 10D) and a concomitant decrease in intra-protein H-bonds (Fig. 10A, compare also Fig. 4). Intra-protein H-bonds decreased slightly less over time in 40% glycerol (Fig. 10B) but were hardly affected in 100% glycerol (Fig. 10C) in agreement with the experimentally observed folding of COR15 proteins in increasing glycerol concentrations (Fig. 7).
The number of protein–water H-bonds, on the other hand, was much lower in 40% glycerol than in pure water (Fig. 10E). However, this was not accompanied by an increase in protein–glycerol H-bonds (Fig. 10F), indicating exclusion of the osmolyte from the protein. Only in pure glycerol, a relevant number of H-bonds between glycerol and the proteins were formed (Fig. 10G), mostly by amino acid sidechains (Fig. S6, ESI†). However, almost all protein intramolecular H-bonds were preserved during the simulation in 100% glycerol, in agreement with a maximum α-helix formation under this condition. Several publications have demonstrated the importance of salt bridges (SB) for the stability of protein secondary and tertiary structure.39–42Fig. 11 shows the stability of the 8, 13 and 13 SBs that were identified in COR15A-TH/COR15A/COR15B models in vacuum during the MD simulations. Under fully hydrated conditions, COR15A-TH retained only two SBs (GLU64-LYS67, ASP49-LYS53) with a lifetime longer than 50% of the 30 ns MD simulation. Only one of these two SBs was also retained in the COR15A model (GLU64-LYS67), and also one SB (GLU82-LYS43) showed this degree of stability in COR15B, indicating that water strongly competes with the interactions between charged amino acids. Increasing the concentration of glycerol to 40% lead to an increase in the number of SBs with a lifetime longer than 50% of the 30 ns MD simulation in COR15A-TH to four, in COR15A to two, while the number of stable SBs increased to six in COR15B. In 100% glycerol, all eight SBs of COR15A-TH, 12 out of the 13 SBs of COR15A and all 13 SBs of COR15B remained stable during the simulation, indicating that the increased stability of both proteins is accompanied by an increase in the number of stable SBs.
Only one study has previously investigated the hydration-dependent unfolding of a LEA protein using MD simulations.43 However, a 66 amino acid fragment of a 143 amino acid LEA protein from an anhydrobiotic nematode was used for modeling instead of a full-length protein. Further, unfolding was simulated in the presence of different amounts of water, but no crowding agent was present. In addition, no experimental data on the folding state of this fragment were reported, preventing an independent verification of the computational results. In the present study, we have used the full-length sequences of two LEA proteins for the simulation of unfolding kinetics both in the presence of pure water and of two concentrations of glycerol as a model crowding agent. Structural predictions from these simulations were directly compared to experimental data.
Simulation studies with IDPs are faced with unique problems. The unfolded state of an IDP can not be used as the starting point to model its folding, because ab initio modeling of the unfolded state is not possible, as it consists of a fluctuating ensemble of a large number of alternative conformations.44 The only solution is therefore to start from the folded state and model its unfolding under different conditions. However, since IDPs do not crystallize, no X-ray structures are available for LEA proteins that could be used as a template to simulate the structure of the dry proteins. Therefore, we used a comparative modeling approach, based on previous in silico predictions of the secondary structure of COR15A and B,22 indicating that both proteins form two amphipathic α-helices in a helix–loop–helix configuration. Similar to the previous MD simulation study,43 we used human apolipoprotein A–I25 as template to model the structure of the COR15 proteins in vacuo, thereby mimicking the dry state. Additionally, a threading model was built on the crystal structure of a computationally designed helix bundle as template. The results of our MD simulations were in agreement with the experimentally determined secondary structure content of the proteins in the dry state, confirming the appropriateness of both types of models. It should be pointed out that we observed quantitative differences in helix content both experimentally between the tagged and untagged COR15A and computationally between COR15A and COR15A-TH.
A 30 ns MD simulation was used to analyze the unfolding of COR15A-TH, COR15A and COR15B in water. The changes in helicity of COR15A and COR15B during these simulations were in agreement with the experimental data for both proteins, confirming the choice of the proper force field. However, these comparisons between spectroscopic and simulation data were potentially confounded by the use of an N-terminal 6xHIS-tag on the recombinant proteins.45 Therefore, we also analyzed recombinant COR15A expressed without a tag. Secondary structure content was essentially the same for the tagged and untagged protein in the fully hydrated state and slightly higher in the dry state for the untagged protein, confirming the validity of both our earlier spectroscopic data and the present MD simulation results. Reports of an impact of tags on recombinant proteins have been mainly related to an influence on disulfide-bonding patterns,46 conformational changes of DNA-binding sites47 or changes in enzyme function, which are all not relevant for the analysis of COR15 proteins. Moreover, there are also studies reporting a lack of a strong influence of HIS-tags on protein structure.48,49
A previous MD simulation study identified H-bonding interactions as the major factor for the structural stability of a LEA protein in vacuo, with binding energies of protein–water interactions between 20 and 100 kJ mol−1.43 LEA protein unfolding in water might be easily attributed to the high fraction of polar and charged amino acids, which could H-bond more readily to water molecules than to the sidechains of other amino acids. However, the unfolding of COR15A-TH, COR15A and COR15B in water was mediated by H-bonding interactions between water molecules and the protein backbone as well as with protein sidechains (Fig. S7, ESI†). The breaking of intramolecular peptide H-bonds has previously been identified as a key step in protein unfolding.50,51 Unfolding of the COR15 proteins was accompanied by a break of intramolecular H-bonds. Interestingly, the contribution of backbone H-bonds to that process was much higher than the contribution of sidechain and sidechain–backbone H-bonds.
Moreover, a vast majority of SBs was lost during the MD simulations in water, indicating that unfolding of the COR15 proteins is accompanied by a decreased lifespan of SBs between amino acids, suggesting successful competition of water molecules for binding to charged amino acid sidechains. However, with 2–20 kJ mol−1,52 the contribution of SBs to protein stability is much smaller than that of backbone H-bonding interactions. Therefore we consider them as an excellent indicator for protein flexibility52 rather than a major contributor to protein stability.
COR15A and COR15B are highly similar and therefore only small differences were observed during unfolding in water. While the sequences of helix I are identical between the two proteins,22 helix II of COR15B is more hydrophilic than helix II of COR15A. This may account for the larger decompaction of helix II in COR15B than in COR15A during the simulation. However, these differences were not reflected in the spectroscopic data making an impact on protein function unlikely. This is in agreement with the presumed functional redundancy of both proteins.20 Considering that COR15A-TH was built with a different method, it is important to note that the degree of unfolding in water was very similar to the comparative models.
Crowding conditions may impact the folding behavior of IDPs, as the folding free energy landscape of proteins is changed in a crowded environment.53 In general, macromolecular crowding due to the presence of high concentrations of other, globular proteins has only little effect on IDP structure.54,55 In addition, in some plant LEA proteins osmolyte-induced crowding was also ineffective to induce folding.12 For COR15A and COR15B, however, partial folding was observed by CD and FTIR spectroscopy in the presence of 2 M sucrose or 50% glycerol,20 indicating the propensity of these proteins to fold under mild dehydration conditions. Such osmolytes have been shown to model cellular crowding conditions reasonably well.56
Here, we have extended these earlier observations to a wide range of glycerol concentrations (up to 80% v/v) for untagged COR15A. Folding was linearly dependent on the glycerol concentration, in agreement with published observations for other proteins and osmolytes.57 The MD simulations are in agreement with these spectroscopic data. Unfolding of both COR15 proteins was strongly reduced in 40% glycerol compared to unfolding in pure water, while the folded state was completely stable in pure glycerol.
This stability was not only reflected in the increased lifetime of intra-protein salt bridges but also in the stability of intra-molecular H-bonds in pure glycerol. Interestingly, after transfer from vacuum to 40% glycerol, new protein H-bonds were predominantly established with the available water molecules, while the number of protein–glycerol H-bonds remained small throughout the simulation. This indicates that water is preferred over glycerol due to its higher polarity.58 Protein H-bonds to glycerol were more numerous in the absence of water (100% glycerol), but they were restricted to surface-exposed amino acid sidechains of the folded protein, as the number of intramolecular H-bonds was stable during the 30 ns simulation. These findings point to an exclusion of the osmolyte from the protein backbone, either due to a solvophobic effect51,59 or to steric exclusion.60 Vagenende et al.61 proposed that although glycerol molecules possess three hydroxyl groups that could interact with the protein, the molecule is too bulky to interact directly with the backbone of proteins explaining why in the presence of glycerol no unfolding occurred. Instead, in 40% glycerol, the co-solvent molecules are depleted near the protein surface (Fig. S8, ESI†) and enriched in a region about 5 to 7 Å from the proteins. This unequal distribution of water and glycerol leads to a more compact conformation and a higher stability of the COR15 proteins.
The molecular function of COR15 proteins is the stabilization of cellular membranes during freezing.20 The amphipathic helices formed by the proteins are expected to enable protein–membrane interactions through the hydrophobic face of the helices. In general, IDPs function either as entropic chains in the unstructured state, or follow a molecular recognition mechanism, where folding is triggered upon binding to a partner molecule. In the latter case, IDPs undergo either binding-induced folding, i.e. target binding is a necessary requirement for folding, or folding precedes binding, a mechanism known as conformational selection.62 However, folding of the COR15 proteins does not strictly follow either of these routes, but rather an “initial crowding-induced folding > membrane binding > enhanced folding” pathway.63 The COR15 proteins partially fold into amphipathic α-helices during freezing. Only in this partially-folded conformation that can be induced by freezing in vivo or by osmolytes in vitro (referred to in the term “crowding-induced folding”), are they able to interact with membranes. Membrane binding will then enhance protein folding further. Future efforts will be directed towards a further experimental and computational characterization of this crowding-induced folding and binding mechanism and its effect on membrane structure and stability under stress conditions.
The webserver RaptorX (http://raptorx.uchicago.edu/) was used to build the threading model of COR15A (COR15A-TH). Relative and absolute global model quality was confirmed with a p-value of 1.11 × 10−0266 and a GDT (Global Distance Test) of 39, stating a good relationship between the modeling of a residue and its associated error (in Å).67 As template we used the crystal structure of a computationally designed three-helix bundle.66 All models were optimized in vacuum with the Gromacs software version 4.6.368 using 65000 steps of energy minimization with the Conjugated Gradient algorithm and the OPLS-AA force field.24–26 The accuracy of the models was evaluated with ANOLEA69,70 and PROCHECK.71
The secondary structure content was calculated using the DSSP algorithm74 implemented in Gromacs. Root mean square deviation (RMSD) was calculated as described previously.75 The criteria for H-bond formation were a distance between the donor and acceptor atoms shorter than 3.5 Å and a bond angle higher than 120°.43 The salt bridges module of VMD version 1.9.176 was used to determine the number of salt bridges (SB) with an oxygen–nitrogen distance cut-off of 3.2 Å.
To obtain untagged COR15A, the respective cDNA was cloned into the pETite N-HIS Kan SUMO vector (Lucigen; http://www.lucigen.com). The identity of the inserts was checked by sequencing. The vector containing the COR15A gene was expressed in the Escherichia coli strain Rosetta (DE3) pLys S (Novagen, Madison, WI). As COR15A, like most other LEA proteins, remains soluble upon boiling,17 bacterial cell lysates were incubated in a water bath at 100 °C for 10 min. Precipitated proteins were removed by centrifugation (4500g, 45 min, 4 °C). The supernatant was filtered through a 0.2 μm filter and applied to a 5 ml HIS Trap HP column (GE Healthcare; http://www3.gehealthcare.com). The column was equilibrated with 20 mM sodium phosphate, 0.5 M NaCl and 20 mM imidazole (pH 7.4) and washed with increasing concentrations of imidazole. COR15A was eluted with 250 mM imidazole and subsequently dialyzed twice against cleavage buffer (20 mM Tris/HCl (pH 8.0), 150 mM NaCl, 10% glycerol) overnight at 4 °C. The SUMO-6xHIS-tag was removed by proteolytic cleavage according to manufacturer’s instructions. Protein solution was applied to a 5 ml HISTrap HP column to remove the free SUMO-6xHIS-tag and the protease from the untagged COR15A. Column washing and elution of bound protein was done as described above. Fractions were analyzed by SDS-PAGE and those containing pure COR15A protein were pooled and dialyzed against ddH2O (molecular mass cut-off of dialysis membranes 3500 Da; Spectrumlabs; http://de.spectrumlabs.com). Purified COR15A was lyophilized and stored at −20 °C.
Mature 6xHIS-tagged COR15A and COR15B were cloned, expressed and purified as described earlier.22
Footnotes |
† Electronic supplementary information (ESI) available: Table S1. Evaluation of system density with different glycerol concentrations. Fig. S1. Comparison of MD simulations of COR15A unfolding with four different force fields. Fig. S2. Evaluation of 3D models with ANOLEA tool. Fig. S3. Contribution of hydrophobic and hydrophilic amino acids to total amount of amino acids in an α-helical conformation. Fig. S4. Analysis of potential energy during 30 ns MD simulations. Fig. S5. SDS-PAGE and western blot analysis of untagged recombinant COR15A. Fig. S6. Analysis of intermolecular protein backbone–glycerol and sidechain–glycerol H-bonds during 30 ns MD simulations. Fig. S7. Analysis of intermolecular protein backbone–water and sidechain–water H-bonds during 30 ns MD simulations. Fig. S8. RDF analysis of water and glycerol molecules around COR15 models during 30 ns MD simulation. See DOI: 10.1039/c6cp02272c |
‡ These authors contributed equally. |
§ Present address: Physikalische Biochemie, Universität Potsdam, Karl-Liebknecht-Str. 24-25, D-14476 Potsdam, Germany. E-mail: anja.thalhammer@uni-potsdam.de; Tel: +49 0331-9775267 (Anja Thalhammer). |
This journal is © the Owner Societies 2016 |