Samuel D.
Melton
,
Emily A. E.
Brackhahn
,
Samuel J.
Orlin
,
Pengfei
Jin
and
David M.
Chenoweth
*
Department of Chemistry, University of Pennsylvania, 231 South 34th Street, Philadelphia, Pennsylvania 19104, USA. E-mail: dcheno@sas.upenn.edu
First published on 21st July 2020
The stability of the triple-helical structure of collagen is modulated by a delicate balance of effects including polypeptide backbone geometry, a buried hydrogen bond network, dispersive interfacial interactions, and subtle stereoelectronic effects. Although the different amino acid propensities for the Xaa and Yaa positions of collagen's repeating (Glycine–Xaa–Yaa) primary structure have been described, our understanding of the impact of incorporating aza-glycine (azGly) residues adjacent to varied Xaa and Yaa position residues has been limited to specific sequences. Here, we detail the impact of variation in the Xaa position adjacent to an azGly residue and compare these results to our study on the impact of the Yaa position. For the first time, we present a set of design rules for azGly-stabilized triple-helical collagen peptides, accounting for all canonical amino acids in the Xaa and Yaa positions adjacent to an azGly residue, and extend these rules using multiple azGly residues. To gain atomic level insight into these new rules we present two high-resolution crystal structures of collagen triple helices, with the first peptoid-containing collagen peptide structure. In conjunction with biophysical and computational data, we highlight the critical importance of preserving the triple helix geometry and protecting the hydrogen bonding network proximal to the azGly residue from solvent. Our results provide a set of design guidelines for azGly-stabilized triple-helical collagen peptides and fundamental insight into collagen structure and stability.
Aza-amino acids, characterized by substitution of the residue's Cα for N, have seen extensive use in the literature for applications including structural studies and therapeutics.21–28 Aza-amino acids and aza-peptides exhibit several advantageous qualities including resistance to enzymatic degradation.28,29 Our group has reported the use of aza-glycine (azGly, azG) to serve as a glycine mimic within the collagen triple helix (Fig. 1a).30–33 In most cases, incorporation of this aza-amino acid within a simple model system results in increased triple-helical thermal stability as assessed by thermal melting studies. The azGly residues stabilize the collagen triple helix in two primary methods: the formation of additional interstrand hydrogen bonds via the azGly residue's NH-α and the restricted conformational flexibility compared to Gly residues. Incorporation of an azGly residue adjacent to a central, variable Yaa position in CMPs provides a reliable replacement for glycine residues.33 The identity of the Yaa position can significantly affect azGly's impact on triple-helical thermal stability, ranging from high increases in the melting point of the collagen triple helix (Tm) to essentially no difference when compared to the control sequence. A sequence independent strategy for stabilizing CMPs using azGly residues positioned in triplet repeats positioned away from central substitution site provides a general tool for stabilizing triple-helical collagen peptides. Until now, our understanding of azGly's affect in collagen peptides has been limited to these specific cases. In this current work, we develop a deeper understanding by detailing the impact of neighboring Xaa residues on the ability of an azGly residue to act as a stabilizing agent (Fig. 1b). Along with our previous studies, this work culminates in the development of a new set of guidelines for the incorporation of azGly residues in collagen peptides for any given peptide sequence. This work facilitates, advances, and informs the design of novel, ultra-stable azGly-containing CMPs for applications in a diversity of fields including biomaterials, therapeutics, peptidomimetics, and beyond.
Fig. 1 (a) Structural representation of an azGly substitution within a (Gly–Hyp–Pro) triplet. (b) Structural representation of an azGly substitution adjacent to a variable Xaa position residue. |
Fig. 2 (Top) Sequence representation of CMPs 1a–23a and 1b–23b. (a) Graphical depiction showing azGly substitution sites surrounding a central Xaa position substitution. (b) Tm values of each CMP compared to 1a, the POG control sequence, and the degree of azGly thermal stabilization within each pair of CMPs with the same Xaa substitution, e.g.10a and 10b. CMPs are grouped based on physical properties, e.g. charged residues. *For all peptides, the standard deviation between three triple-helical thermal stability measurements for each entry was ≤1 °C. **Data for CMPs 1a and 1b are taken from ref. 33. |
Previously, we established that incorporating an azGly residue adjacent to an Xaa position Pro residue resulted in an increase of its triple-helical Tm value by approximately 11 °C for a central substitution.30,31 Indeed, both of the secondary amino acids typically found in collagen, proline and hydroxyproline, exhibited increased triple-helical thermal stability when an azGly residue was adjacently incorporated. The Xaa position Hyp CMP (2b) displayed a substantial increase of 13 °C in its triple-helical Tm value, slightly exceeding the stability gained when Pro is present in the Xaa position (Fig. 2). The azGly afforded increase in triple-helical thermal stability is maximized when a secondary amino acid is present in the adjacent Xaa position.
The azGly-containing CMPs that contained charged residues in the Xaa position (3b (aspartate, Asp, D), 4b (glutamate, Glu, E), 5b (lysine, Lys, K), and 6b (arginine, Arg, R)) universally exhibited decreases in their triple-helical thermal stability compared to their non-azGly-containing controls. The azGly-containing CMPs with the acidic side chains Asp and Glu exhibited marginal destabilization, with reduced Tm values of 0 to −1 °C compared to their controls. The azGly-containing CMPs with the basic side chains Lys (5b) and Arg (6b) exhibited moderate destabilization, with reduced Tm values of −4 to −2 °C, respectively, compared to their controls.
The azGly-containing CMPs that contained non-charged polar side chains in the Xaa position (7b (serine, Ser, S), 9b (cystine, Cys, C), 10b (asparagine, Asn, N), and 11b (glutamine, Gln, Q)) displayed varied degrees of reduced triple-helical Tm values compared to their controls, ranging from −1 °C in the case of Asn, −4 °C with Gln or Ser, to −6 °C with Cys. However, the threonine (Thr, T) containing CMP 8b exhibited a substantial decrease in its triple-helical Tm value of −10 °C, almost double the average decrease observed with the other polar residues.
The azGly-containing CMPs with hydrophobic residues followed a trend similar to that which was observed with charged residues, albeit with two notable exceptions. The azGly-containing CMPs that contained non-β-branched residues (12b (glycine, Gly, G), 13b (alanine, Ala, A), 15b (leucine, Leu, L), and 17b (methionine, Met, M)) all exhibited small to moderate decreases in triple-helical thermal stability compared to their control counterparts. These Tm reductions ranged from −1 °C in the case of Gly to −4 °C in the case of Met. Drawing a parallel to our results with Yaa position substitutions, the CMPs containing β-branched side chains (14b (valine, Val, V) and 16b (isoleucine, Ile, I)) exhibited the highest magnitude of decreased thermal stability with Tm value reductions ranging from −12 to −10 °C. This behavior is analogous to the result observed with the Thr-containing CMP 8b, reinforcing the conclusion that azGly residues positioned adjacent to β-branched residues are detrimental to triple-helical thermal stability. The attenuated thermal stability of β-branched residues compared to other residues indicates steric bulk around Cβ plays an important role in weakening the triple-helical assembly.
The azGly-containing CMPs that containined aromatic residues (18b (phenylalanine, Phe, F), 19b (tyrosine, Tyr, Y), 20b (tryptophan, Trp, W), and 21b (histidine, His, H)) exhibited decreased triple-helical thermal stability compared to their non-azGly-containing controls. The reductions in Tm values ranged from −1 °C in the case of Phe to −4 °C in the case of Tyr. Overall, the trend of reduced thermal stability with CMPs containing aromatic side chains was consistent with most other residues, with all CMPs exhibiting small to moderate decreases in triple-helical thermal stability.
Fig. 3 (top) Sequence representation of CMPs 6c, 14c, and 20c. (a) Graphical depiction showing azGly substitution sites surrounding a central Xaa position substitution. (b) Tm values of clamped CMPs compared to 1a, the POG control sequence, the corresponding Gly controls (CMPs 6a, 14a, and 20a), and the adjacent azGly sequence (CMPs 6b, 14b, and 20b). The sequences and data for these comparison CMPs are available in Chart 1 and Fig. S2.† (c) CD thermal melting data for Trp-containing CMPs 20a, 20b, and 20c and for Val-containing CMPs 14a, 14b, and 14c. *For all peptides, the standard deviation between three triple-helical thermal stability measurements for each entry was ≤1 °C. |
To further investigate this, we took inspiration from work done by Goodman et. al. with incorporation of the peptoid residue N-isobutylglycine (Nleu) in collagen peptides.36–39 They incorporated Nleu in the Xaa and Yaa positions of collagen-like peptides, demonstrating (Gly–Nleu–Pro) sequences exhibited increased thermal stability and faster refolding times compared to (Gly–Pro–Nleu) sequences. Using computational models, the authors explained the increased triple-helical thermal stability resulted from the Xaa position Nleu residues being better positioned to form interchain hydrophobic interactions.36,37 Additionally, the N-substitution of peptoid residues removed the ability of the backbone N to act as a hydrogen bond donor. In our Xaa model system, we incorporated the two peptoid residues N-methylglycine (NmetG) in CMPs 22a and 22b, and N-ethylglycine (NethylG) in CMPs 23a and 23b. The peptoid-substituted non-azGly-containing control CMPs (22a and 23a) surpassed most of the canonical amino acids in terms of triple-helical thermal stability, with the only exceptions being the CMPs containing Xaa substitutions of either Pro or Hyp. Excitingly, in both cases the azGly-containing CMPs 22b and 23b exhibited enhanced triple-helical thermal stability compared to their controls with increases in their Tm values of +4 °C for NmetG and +6 °C for NethylG. We suspect the slightly increased triple-helical thermal stability observed with NethylG is a result of the increased steric bulk and hydrophobicity compared to NmetG, further increasing favorable interchain hydrophobic interactions.
Fig. 4 Crystal structures of CMPs 24 (PDB: 6W46) and 25 (PDB: 6W47) solved to 1.25 Å and 1.15 Å resolution, respectively. (a) Full triple helix structure of CMP 25, with water molecules visible. (b) Close-up, head-on view of the local region around an Xaa position N-methylglycine residue (orange) with electron density maps visible. (c) Close-up, side view of the local region around the N-methylglycine residue. (d) Close-up, head-on view of the local region around an Xaa position valine residue (orange) with electron density maps visible. (e) Close-up, side view of the local region around the valine residue. (f) Full triple helix structure of CMP 24, with water molecules visible. In all views, water molecules and their electron density are colored aqua. In views (b–e), the electron density map (2mF0-DFc) is contoured at 1.5 σ for the peptide and at 1.0 σ for water molecules. In views (b–e), some of the peptide chains and water molecules are hidden for clarity. In views (d) and (e), the dashed circle indicates the water molecule capable of participating in H-bond interactions with the peptide backbone in the local region of the Xaa residue; notably, this specific water molecule is absent when the Xaa position is occupied by N-methylglycine as seen in (b) and (c). |
Analysis of these structures demonstrates the solvent accessibility of the peptide backbone is dependent on the Xaa residue (Fig. 4d and e). In the structure with the Xaa position Val residue, a void is present along the surface of the triple helix at the site of Val incorporation (Fig. 4d and e). This is a result of Val's secondary backbone amine being sterically unencumbered compared to the tertiary backbone amine of N-substituted residues like Pro, Hyp, or peptoids. The consequence of this hole is solvent exposure, clearly indicated the presence of a water molecule next to the Val residues in our structure (Fig. 4d and e). This water molecule is well-positioned to hydrogen bond with the NH group of the Val residue at approximately 3.0 Å away. Importantly, this water is also positioned in close proximity with the Cα of the Gly residue adjacent to the Xaa Val approximately 3.3 Å away. This Gly Cα corresponds to the azGly substitution site in our model system. In the NmetG structure, the void in space observed along the surface of the triple helix in the Val structure is filled by the N-methyl group of NmetG (Fig. 4b and c). This prevents the NH backbone–water interaction from occurring and better protects the Gly Cα from solvent exposure, with the nearest water being positioned approximately 3.7 Å away from the backbone NH and approximately 3.5 Å away from the Gly Cα.
Reviewing available structures of collagen triple helices in the PBD reveals similar behaviors. We analyzed ∼50 structures with varied sequences, containing a variety of residues in the Xaa position. Across all structures, it is consistently observed when the Xaa position is not Pro or Hyp the backbone N is able to participate in hydrogen bond interactions with solvent, with a water molecule positioned an average of 3.1 ± 0.2 Å away. When a Pro or Hyp is present, the backbone N is unable to participate in hydrogen bonding and the nearest water molecule is forced further away, at an average distance of 3.8 ± 0.1 Å. Similarly, when the Xaa position is not Pro or Hyp the solvent accessibility of the adjacent Gly Cα increases as well. Across all structures, if the Xaa position is either Pro or Hyp the average distance to the nearest water molecule for any given Gly Cα that is not located at either terminus is 3.9 ± 0.1 Å or 3.9 ± 0.2 Å, respectively. If there is any other residue present in the Xaa position, this average drops to 3.4 ± 0.2 Å. These observations are consistent with our structures, with the NmetG residue blocking solvent to a greater extent than the Val residue. As previously mentioned, we hypothesize that the loss or gain in triple-helical thermal stability observed with our Xaa CMP library is, in part, dependent on the ability of a specific Xaa residue to block solvent. If azGly is present in the adjacent position and the Xaa residue is unable to block solvent, azGly's ability to form productive interstrand H-bonds will be disturbed as solvent interactions occur. This behavior would explain the loss in triple-helical thermal stability observed upon azGly incorporation. Conversely, if the Xaa residue is able to block solvent, azGly's ability to form productive interstrand H-bonds will be enhanced, and triple-helical thermal stability will increase with azGly incorporation.
Beyond solvent exposure, the identity of the Xaa position residue will affect the backbone dihedral angles of nearby residues. Averaging φ angles of Gly residues from collagen structures in the PDB gives a value of −69 ± 5° while averaging ψ angles gives a value of −176 ± 5° or 173 ± 5°.32 In our structure, the Gly residues adjacent to the Xaa NmetG residue adopt conformations closely matching the PDB average Gly (φ, ψ) angles with average values of (−68°, 176°) (Fig. 5d and Table S4†). Comparatively, the Gly residues adjacent to the Val residue adopt conformations with average (φ, ψ) angles of (−61°, 167°), deviating more from the global average (Fig. 5d and Table S3†). Comparing these angles to other structures in the PDB, the Gly residues next to NmetG adopt conformations similar to those adopted by Gly residues adjacent to either Pro or Hyp residues (Fig. 5a and b). When the Xaa position is a not an N-substituted residue, then the adjacent Gly residue is slightly twisted out of this conformation, which may be critical when azGly is present in this position (Fig. 5c). This backbone perturbation could have a profound effect on the azGly residue's ability to form favorable interactions and strong H-bonds. We recently reported the potential energy surfaces for model systems corresponding to collagen-like peptides to determine the energetic barriers for rotation.40 These models consisted of either a Gly or an azGly residue adjacent to what corresponded to either an N-substituted residue or a N-unsubstituted residue, demonstrating that azGly residues are more conformationally constrained than Gly residues. Compared to the average dihedral angle values of (−69°, 173°) or (−69°, −176°) observed with Gly residues in collagen structures, azGly residues prefer dihedral angles of (−65°, 167°) when adjacent to a secondary-like amino acid and angles of (−62°, 159°) when adjacent to N-unsubstituted residues. Placing an azGly residue adjacent to an N-unsubstituted residue forces the azGly to deviate more strongly from the “ideal” collagen Gly residue dihedral angles than if the azGly residue was incorporated next to a residue like Pro, Hyp, or NmetG. The conformational bias imposed by the Xaa position residue, combined with the level of local backbone solvent exposure, points to the underlying explanation of azGly's preference for N-substituted Xaa residues.
Fig. 5 Dihedral angles of glycine residues grouped by the identity of the adjacent Xaa residue. The view area is restricted to the given region for clarity. The following PDB models were examined: 1BKV, 2DRT, 2DRX, 3A08, 3A19, 3ADM, 3P46, 4DMT, 4GYX, and 6A0C. (a) Highlighted data points for Gly residues in –(Gly–Pro)– regions (green). (b) Highlighted data points for Gly residues in –(Gly–Hyp)– regions (red). (c) Highlighted data points for Gly residues in –(Gly–Ala)–, –(Gly–Glu)–, –(Gly–Gln)–, –(Gly–Ile)–, –(Gly–Leu)–, –(Gly–Phe)–, or –(Gly–Val)– regions (blue). (d) Highlighted data points for the Gly residues in our structures, 6W46 and 6W47, in –(Gly–NmetG)– regions (purple) or –(Gly–Val)– regions (yellow). |
Fig. 6 (a) Summary of aza-glycine's effect on triple-helical thermal stability (ΔTm) within our model [H–(POG)8–NH2]-like system, incorporated adjacent to either a central Yaa33 or Xaa substitution. The ΔTm value is given as the difference between the azGly-containing CMP and the corresponding control (e.g. CMPs 10b and 10a). Residues are categorized by the observed ΔTm upon azGly incorporation. Left: Yaa position substitutions when the Xaa position is Pro. tLeu refers to the unnatural amino acid L-tert-leucine. Right: Xaa substitutions when the Yaa position is Hyp. (b) Summary of aza-glycine's effect when substituted in regions flanking a central substitution. |
The standard triplet used for small collagen mimetic peptides, (Pro–Hyp–Gly), is particularly well suited for azGly substitution. We've previously demonstrated the synergistic effect of multiple azGly residue incorporation; as the number of (Pro–Hyp–azGly) triplets increases the triple-helical thermal stability gain per azGly residue increases as well.31 This triplet, often used in the literature for small triple-helical collagen peptides, can be exploited to stabilize collagen peptides with central regions bearing varied sequences that would either exhibit small increases or even decreases in triple-helical thermal stability upon azGly incorporation. Substitution of Gly residues in (Hyp–Gly–Pro) triplets, when positioned in regions flanking a central substitution site, will provide a substantial increase in triple-helical thermal stability regardless of the central region's sequence. The viability of this approach can be observed both in this work with CMPs 6c, 14c, and 20c and our previous work.33 This strategy facilitates the preparation of incredibly stable and short collagen peptides with different binding motifs and sequences that are still capable of triple helix assembly.
Footnote |
† Electronic supplementary information (ESI) available. The detailed procedures for peptide synthesis, purification, characterization, and crystallography. See DOI: 10.1039/d0sc03003a |
This journal is © The Royal Society of Chemistry 2020 |