Bhavesh
Khatri
a,
Puja
Majumder
a,
Jayashree
Nagesh
b,
Aravind
Penmatsa
*a and
Jayanta
Chatterjee
*a
aMolecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India. E-mail: penmatsa@iisc.ac.in; jayanta@iisc.ac.in
bSolid State and Structural Chemistry Unit, Indian Institute of Science, Bangalore, India
First published on 30th July 2020
Abundant n → π* interactions between adjacent backbone carbonyl groups, identified by statistical analysis of protein structures, are predicted to play an important role in dictating the structure of proteins. However, experimentally testing the prediction in proteins has been challenging due to the weak nature of this interaction. By amplifying the strength of the n → π* interaction via amino acid substitution and thioamide incorporation at a solvent exposed β-turn within the GB1 proteins and Pin 1 WW domain, we demonstrate that an n → π* interaction increases the structural stability of proteins by restricting the ϕ torsion angle. Our results also suggest that amino acid side-chain identity and its rotameric conformation play an important and decisive role in dictating the strength of an n → π* interaction.
Fig. 1 A protein backbone depicting a COi → COi+1 n → π* interaction with the distance (d) and angular (θ) criteria used in the crystallographic analyses. |
Contribution of the n → π* interaction towards the stability of the protein structure was initially reported in collagen mimetics.17 The enhanced thermostability of a collagen mimetic with the 4R-configured proline derivative compared to that with the 4S-configured proline derivative was attributed to the stronger n → π* interaction in the 4R-configured proline derivative with the exo-pucker of the pyrrolidine ring.18 The finding was exquisitely substantiated later by the high-resolution crystal structure of the oligoproline PPII helix, where the n → π* interaction was favored by the Cγ-exo pucker and disfavored by the Cγ-endo pucker of the pyrrolidine ring.19 Furthermore, the stability of this PPII helix in the absence of intramolecular hydrogen bonds and hydration emphasizes the role of the n → π* interaction in the structural stability of collagen.
For an idealized geometry, the n → π* interaction between amides contributes ∼0.3 kcal mol−1,20 which may seem moderate. However, given the ubiquity of carbonyl groups in a polypeptide chain, n → π* interactions could have a significant collective contribution towards the overall energetics of protein stability.2 The distribution of the n → π* interaction obtained from analyses of protein crystal structures reveals that >70% of residues in α-helices, as opposed to <5% of residues in β-sheets engage in this interaction.11,12 Furthermore, since one third of all amino acids in the random coil have torsion angles in the α-helical region,21 the n → π* interaction might have an important role in restricting the conformational ensemble of unfolded proteins.22 In this context, it is worth noting that random coils and turn regions of proteins show a high abundance of reciprocal n → π* interactions (back and forth donation between adjacent carbonyl pairs).9
The evidence of the n → π* interaction has been shown by microwave and IR spectroscopy in various small molecular systems.23–27 However, despite enormous excitement in this area, so far experimental measurements of the energy of an n → π* interaction in proteins and its practical consequence on protein structural stability have been lacking. Therefore, we sought to engineer an n → π* interaction at the β-turn within a protein to understand its influence on the protein structure and its stability.
β-Turns (Fig. 2a) are the third most important protein secondary structure representing ∼20% of all protein residues28 having an important role in protein folding.29–31 Furthermore, substituting non-proline residues with proline residues in the β-turn leads to increased stabilization of the turn32 and enhanced protein stability.33,34 The increased stability results from the decreased backbone conformational entropy of the denatured state due to the restricted rotation of the N–Cα bond, also known as the ϕ torsion angle. Since an n → π* interaction also restricts the ϕ angle of an amino acid residue,11 we speculated that engineering an n → π* interaction at the β-turn would have direct consequence on the protein stability.
Here, by using bioinformatic analysis of the β-turn in proteins, we find an interplay between the conformational flexibility of the peptide backbone and the abundance of n → π* interactions at the two central residues, i + 1 and i + 2 of the β-turn. Through subsequent X-ray crystallography and computational analysis of synthetic GB1 proteins with amino acid substitutions at the i + 2 residue of the β-turn, we show that amino acid side-chain identity and its rotameric conformation have a direct influence on the strength of an n → π* interaction. Gratifyingly, the thermal denaturation of the GB1 proteins shows a good correlation between their stability and the strength of an n → π* interaction at the β-turn. Finally, we validate this observation in the Pin 1 WW domain, wherein by amplifying the strength of an n → π* interaction at the β-turn by thioamide incorporation, we could increase the thermal stability of the thioamidated Pin 1 WW domain.
By analyzing a non-redundant subset of high-resolution (≤2.0 Å) protein crystal structures in the Protein Data Bank (PDB), we curated 500 β-turns (identified using Promotif) representing the common type-I, type-II, type-I′, and type-II′ turns. Next, using the distance and angular criteria defining an n → π* interaction (Fig. 1), we determined the abundance of n → π* interactions at the i + 1 and i + 2 residues in the β-turns. We noted that 40–80% of the residues engage in a COi → COi+1 n → π* interaction, whereas, only 3–12% of the residues are involved in the COi+1 → COi+2 n → π* interaction (Fig. 2b).
To identify the underlying cause of this behavior, we determined the torsion angles ϕ and ψ of i + 1 and i + 2 residues in all the β-turns and plotted them on the Ramachandran map. It was interesting to note the broader distribution of ϕ and ψ angles at the i + 2 residue (Fig. 2d) as opposed to the i + 1 residue (Fig. 2c). This is suggestive of restricted conformational freedom at the i + 1 residue, which is associated with the higher abundance of n → π* interactions at this site. We also calculated the difference in mean ϕ and ψ angles (Δϕ and Δψ) in the presence and absence of the n → π* interaction in the respective β-turns (Fig. 2e and f). The differences were significantly higher at the i + 2 residue in comparison to the i + 1 residue. This further indicates that the lower abundance of n → π* interactions at the i + 2 residue is associated with greater conformational flexibility of the peptide backbone.
Fig. 3 (a) Crystal structure of GB1 with –Lys–Thr– in loop L1 forming a type-I β-turn, which has been modified to a type-II′ β-turn in 1–5. The single letter code in lower case indicates D-amino acid. (b) The backbone overlay of 2 (6L9B), 3 (6L9D), 4 (6LJI), and 5 (6L91) with GB1 (2QMT). |
Alanine was chosen due to its preference for an n → π* interaction11 and high helix propensity40 (preference of an amino acid to be in α-helices). Although thermal denaturation of 1 using variable temperature circular dichroism (CD) showed unfolding cooperativity similarly to GB1 (Fig. S8 and S9†), multiple attempts to crystallize 1 remained unsuccessful. This is possibly a consequence of conformational flexibility introduced in the loop L1 by alanine substitution. Our earlier results indicated that the β-branched amino acid D-Val at the i + 1 site stabilizes a type-II′ β-turn more than D-Ala.41 Thus, we synthesized 2, with D-Val–L-Ala in loop L1 (Fig. 3a), which readily crystallized and X-ray diffraction data were collected to a maximum resolution of 1.9 Å. The structure of 2 overlaps closely with the tertiary structure of GB1 (backbone RMSD 0.39 Å) (Fig. 3b), although with a significant displacement of loop L1. Gratifyingly, the D-Val COi+1 and L-Ala COi+2 in the type-II′ β-turn engage in an n → π* interaction, where the torsion angles of L-Ala at the i + 2 site (ϕ, ψ = −59.7°, −42.2°) are remarkably close to the mean torsion angles of a right-handed α-helix (ϕ, ψ = −62°, −41°)42 (Fig. 4).
Encouraged by this result, we next incorporated serine with moderate helix propensity (3), valine (4) and threonine (5) with low helix propensity40 at the i + 2 site of the type-II′ β-turn (Fig. 3a). With the decreasing helix propensity of amino acids in the order Ala > Ser > Val, we noted an increase in both d and θ between COi+1 → COi+2, suggesting a gradual weakening of the n → π* interaction at the i + 2 residue (Fig. 4). Thus, to obtain a quantitative estimate of the n → π* interaction energy (En→π*) at the i + 2 residue in 2, 3, and 4, we resorted to NBO analysis,43 which clearly indicated a decreasing En→π* in the order 2 > 3 > 4 (Fig. 4).
Despite the low helix propensity of threonine, we were surprised to note the shortest d and θ between COi+1⋯COi+2 at the type-II′ β-turn in 5 with an En→π* of 0.46 kcal mol−1 (Fig. 4). An overlay of the type-II′ β-turns of both the β-branched amino acids valine (4) and threonine (5) revealed a clear difference in the side-chain rotamer conformation (Fig. 5a). Valine in 4 crystallized in a gauche− (g−) side-chain rotameric conformation, whereas threonine in 5 crystallized in a gauche+ (g+) conformation. From the statistical analyses of protein structures, it is known that β-branched amino acids favor the g+ side-chain conformation over g− in helices.44–47 Thus, Thr25 in the α-helix of GB1 with a g+ conformation engages in an n → π* interaction, whereas Thr11 at the i + 2 residue in loop L1 with a g− conformation lacks the n → π* interaction (Fig. 5b). Moreover, our dataset revealed that Thr with a g− conformation at the i + 2 residue in the β-turns does not engage in an n → π* interaction (Table S4†). Therefore, despite the low helix propensity, the n → π* interaction in threonine at the type-II′ β-turn in 5 is a result of the altered side-chain rotamer conformation.
The Ramachandran plot of the i + 2 residue (Fig. 5c) in the type-II′ β-turn of 2, 3, 4, and 5 revealed that, as the strength of the n → π* interaction increases, the torsion angles of an amino acid in a non-helical region in the absence of the stabilizing intramolecular hydrogen bond are gradually altered to occupy the right-handed α-helical region. Thus, proline with a high propensity to engage in an n → π* interaction11 is a strong helix initiator.48,49 Hence, our result further supports the crucial role of the n → π* interaction in helix nucleation, as hypothesized earlier.11
By employing a prolyl-based torsion balance system, Raines et al. have shown that a thioamide (CSi) engages in a stronger CSi → COi+1 n → π* interaction than amide COi.20,50 However, due to the longer CS bond length (1.71 Å)51 and larger van der Waals radius of sulfur (1.85 Å),52 thioamide substitution perturbs the local secondary structure of proteins where the amide oxygen participates in a shorter hydrogen bond.53–57 On the other hand, thioamide substitution at a site where the amide oxygen is involved in a longer hydrogen bond or is solvent exposed, leads to minimal perturbation of the secondary structure.53,55,57–59 Therefore, we chose to substitute the solvent exposed COi+1 in the type-II′ β-turn of 2 by CSi+1. The NBO analysis of the COi+1 to CSi+1 substituted type-II′ β-turn in 2, 3, 4, and 5 clearly indicated a significant enhancement in En→π*, due to the amplified CSi+1 → COi+2 n → π* interaction (Table S3†).
Thus, towards the synthesis of i + 1 thionated GB1 (D-Valt–L-Ala; 2a) (the thionated residue is denoted by superscript “t”), we obtained a clean 46-mer polypeptide up to the L-Alai+2. However, on completion of the 56-mer 2a on a solid support, following the acidolytic removal of protecting groups, the mass spectrum corresponded to a 45-mer fragment without the L-Alai+2 (Fig. S16B†). To circumvent the undesirable peptide cleavage, we coupled the tetrapeptide Fmoc-Asn(Trt)-Gly-D-Valt-L-Ala-COOH and Fmoc-Asn(Trt)-Gly-D-Val-L-Ala-COOH onto two individual 45-mer polypeptides. After acidolytic cleavage, although we obtained the 49-mer oxo-polypeptide, the thio-tetrapeptide coupling repeatedly resulted in the 45-mer fragment without the L-Alai+2 (Fig. S16C and D†). This suggests a spontaneous acid catalyzed cleavage of the peptide bond C-terminal to L-Alai+2 in thioamidated GB1, 2a.
With numerous failed attempts to synthesize 2a, we focused towards the 32-mer Pin 1 WW domain, a three stranded β-sheet protein that shows a cooperative two-state folding.60,61 The Pin 1 protein is amenable to loop modification that retains the global fold with alteration in its thermodynamic stability, making it an excellent model protein for structure-folding studies.60 We selected a Pin 1 variant with a type-I′ β-turn in loop 1 and substituted the –Asn–Gly– with D-Val–L-Ala– (6) and D-Ala–L-Ala– (7) (Fig. 6a) to adopt a type-II′ β-turn that was confirmed by characteristic NOEs at the β-turn (Fig. S29†). Subsequently, we synthesized Pin 1 variants with thioamidation at the i + 1 site (D-Valt–L-Ala; 6a and D-Alat–L-Ala; 7a). Remarkably, the acidolytic removal of the protecting groups to obtain 6a resulted in both the desired product and the N- and C-terminal fragmented peptides resulting from the nucleophilic attack of D-Val CSi+1 onto L-Ala COi+2 (Fig. 7a) as observed in 2a.
Fig. 7 (a) Acid-catalyzed cleavage of 6a yielding two N-terminal fragments (1–13), one C-terminal fragment (14–32) and the desired product (1–32). The two N-terminal fragments possibly result from the racemization of the L-Alai+2 due to keto–enol tautomerization of the thiazolone intermediate.63 The polypeptides are color coded and are shown in the HPLC chromatogram. (b) Spontaneous acid-catalyzed cleavage of the thioacylated Aib–Pro dipeptide in aqueous solution.62 The thiazolone intermediate was trapped by passing HCl gas and characterized. The crystal structure of Ph-(CS)-Aib-Pro-Aib-N(Me)Ph clearly depicting the n → π* interaction between CSi and COi+1 of Aib. |
An identical fragmentation was reported by Heimgartner et al. during the aqueous acidolytic workup of the thioacylated Aib–Pro dipeptide (Fig. 7b), towards the synthesis of Ph-(CS)-Aib-Pro-Aib-N(Me)Ph.62 However, by bubbling HCl gas through the dipeptide in THF, the thiazolone intermediate could be characterized, which results from the nucleophilic attack of Ph CSi onto Aib COi+1. To our excitement, the crystal structure of the final product Ph-(CS)-Aib-Pro-Aib-N(Me)Ph revealed the CSi → COi+1 n → π* interaction, leading to a high degree of pyramidalization, Δ = 0.059 Å at Aib COi+1, a firm indicator of the n → π* interaction.19 Thus, the directional (i + 1 → i + 2) fragmentation observed in 2a (Fig. S16†), 6a (Fig. 7a) and 7a (Fig. S17B†) is a chemical signature of the amplified n → π* interaction between CSi+1 and COi+2.
Next, we assessed the folding of Pin 1 variants 6, 6a, 7, and 7a in sodium phosphate buffer (pH 7.4). All the Pin 1 proteins showed the characteristic 227 nm maximum in the CD spectrum, indicating the presence of a folded protein with β-sheets (minimum centered around 215 nm) (Fig. S22A–S25A†). The virtually identical Hα chemical shift perturbation deduced from TOCSY and NOESY experiments indicated that a single atom substitution (O to S) at the solvent exposed COi+1 did not lead to major structural perturbation in 6a and 7a (Fig. S28†). We next performed thermal and chemical denaturation (Fig. S22–S25†) to understand the effect of the amplified n → π* interaction. The proteins showed a two-state unfolding and we were delighted to note that the CSi+1 → COi+2 n → π* interaction enhanced the stability of 6a by 0.8 kcal mol−1 and 7a by 0.3 kcal mol−1 (Fig. 6b).
The increased stability arises from the reduced conformational flexibility of the amino acid residue engaged in an n → π* interaction, a feature that is analogous to the ring constraint in proline, which restricts its conformational space compared to other amino acids and increases protein stability by reducing the entropy of the unfolded state.64,65 An n → π* interaction restricts the conformational space of an amino acid residue with the adoption of torsion angles as depicted in the Ramachandran plot of the β-turn residues (Fig. 2c and d). This would also be expected in an amplified n → π* interaction by thioamide substitution. The adoption of such torsion angles is favorable at the i + 1 and i + 2 positions of a β-turn (Fig. 2a). Furthermore, since β-branched amino acids restrict the backbone conformation more than the unbranched residues,65 the CSi+1 → COi+2 n → π* interaction stabilizes 6a more than 7a.
Thus, our results in the Pin 1 WW domain re-emphasize the role of the amino acid side-chain in tuning the n → π* interaction energy. Not only the side-chain rotamer of the amino acid involved in an n → π* interaction dictates its strength (Fig. 4), the steric interactions imposed by the amino acid side-chain of the donor carbonyl oxygen(i) (CSi+1 in this case) (Fig. 6) can also influence an n → π* interaction.
To conclusively demonstrate the influence of the n → π* interaction on protein stability, we chose to amplify this weak noncovalent interaction by thioamide substitution. Since a strong n → π* interaction induces a “kink” in the polypeptide backbone by optimizing the ϕ, ψ torsion angles suitable for orbital overlap, and thereby reducing the conformational entropy at the β-turn, the thioamide substitution increased the protein stability. It is worth noting that thio-Gly465 in the natural protein methyl-coenzyme M reductase, which is suggested to stabilize the protein secondary structure near the active site, induces a kinked conformation (ϕ, ψ = −68.5°, −47.2°) by engaging in an n → π* interaction with CO of Phe466 (Fig. S30†).70,71 With the recent advancement in ribosome mediated incorporation of thioamide into proteins and polypeptides, thioamide substitution could be potentially utilized to stabilize turns and enhance protein stability,72,73 aided by exogenous factors like salt concentration11 and solvation by water molecules69 that have been shown to influence the n → π* interaction in protein secondary structures.
Footnote |
† Electronic supplementary information (ESI) available. The coordinates for the GB1 crystal structures have been deposited in the Protein Data Bank with the following accession codes 6L9B, 6L9D, 6LJI, 6L91. For ESI and crystallographic data in CIF or other electronic format see DOI: 10.1039/d0sc03060k |
This journal is © The Royal Society of Chemistry 2020 |