William M.
Dawson
*a,
Freddie J. O.
Martin
a,
Guto G.
Rhys
ab,
Kathryn L.
Shelley
ac,
R. Leo
Brady
c and
Derek N.
Woolfson
*acd
aSchool of Chemistry, University of Bristol, Cantock's Close, Bristol BS8 1TS, UK. E-mail: w.dawson@bristol.ac.uk; d.n.woolfson@bristol.ac.uk
bDepartment of Chemistry, University of Bayreuth, Universitätsstraße 30, 95447 Bayreuth, Germany
cSchool of Biochemistry, University of Bristol, Biomedical Sciences Building, University Walk, Bristol BS8 1TD, UK
dBristol BioDesign Institute, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK
First published on 13th April 2021
The rational design of linear peptides that assemble controllably and predictably in water is challenging. Short sequences must encode unique target structures and avoid alternative states. However, the non-covalent forces that stabilize and discriminate between states are weak. Nonetheless, for α-helical coiled-coil assemblies considerable progress has been made in rational de novo design. In these, sequence repeats of nominally hydrophobic (h) and polar (p) residues, hpphppp, direct the assembly of amphipathic helices into dimeric to tetrameric bundles. Expanding this pattern to hpphhph can produce larger α-helical barrels. Here, we show that pentameric to nonameric barrels are accessed by varying the residue at one of the h sites. In peptides with four L/I–K–E–I–A–x–Z repeats, decreasing the size of Z from threonine to serine to alanine to glycine gives progressively larger oligomers. X-ray crystal structures of the resulting α-helical barrels rationalize this: side chains at Z point directly into the helical interfaces, and smaller residues allow closer helix contacts and larger assemblies.
Most αHBs are Type-2 CCs based on hpphhph sequence repeats, labelled abcdefg (Fig. 1).17 Typically, αHBs have L/IxxIAxA repeats; i.e., a = Leu or Ile and d = Ile. β-Branched residues at d are particularly important for maintaining open αHBs.18 The hexameric and heptameric αHBs (CC-Hex2 and CC-Hept, systematically named CC-Type2-(SgLaId)4 and CC-Type2-(AgLaId)4) have a = Leu, d = Ile and e = alanine (Ala, A), but differ at g, which is Ala in the heptamer and the slightly larger serine (Ser, S) in the hexamer (Table 1). Another variant, CC-Pent (CC-Type2-(IgLaIdEe)4) has Ile at g, although it differs from the other examples having e = glutamic acid (Glu, E).16 A second series with all-Ile cores (a = d = Ile) has been characterized.16,18 In these, another hexamer, CC-Hex3 (CC-Type2-(SgIaId)4), follows the design rules above (Table 1);16 and a peptide with Ala at g (CC-Type2-(AgIaId)4) forms an octamer when crystallized in the presence of isopropanol.18
Fig. 1 (A and B) Type-N (A) and Type-2 (B) CC interfaces. The g position is highlighted in the Type-2 interface. (C) Side-chain structures of glycine, alanine, serine and threonine. |
Heptad repeat (abcdefg) | Systematic name | BUDE oligomer score | DPH KD (μM) | XRD αHB oligomeric state(s) | AUC | |
---|---|---|---|---|---|---|
SV | SE | |||||
a BUDE: Bristol University Docking Engine. DPH: diphenylhexatriene. XRD: X-ray diffraction. AUC: analytical ultracentrifugation. SV: sedimentation velocity. SE: sedimentation equilibrium. | ||||||
LKEIAxT | CC-Type2-(TgLaId)4 | 5 | 6.8 ± 1.3 | 5 | 4.8 | 4.8 |
IKEIAxT | CC-Type2-(TgIaId)4 | 5 | 0.84 ± 0.19 | 5 | 5.4 | 5.2 |
LKEIAxS16 | CC-Type2-(SgLaId)4 | 6 | 1.6 ± 0.2 | 6 | 5.7 | 6.5 |
IKEIAxS16 | CC-Type2-(SgIaId)4 | 6 | 3.8 ± 0.8 | 6 | 6.2 | 6.1 |
LKEIAxA16 | CC-Type2-(AgLaId)4 | 7 | 1.3 ± 0.3 | 7 | 6.9 | 7.0 |
IKEIAxA16,18 | CC-Type2-(AgIaId)4 | 6 | 2.2 ± 0.3 | 8 | 6.1 | 5.7 |
LKEIAxG | CC-Type2-(GgLaId)4 | 9 | 0.076 ± 0.0065 | 9 (& collapsed 6) | 6.5 | 6.3 |
IKEIAxG | CC-Type2-(GgIaId)4 | 8 | 0.66 ± 0.12 | 6 & 7 | 5.0 | 5.1 |
These previously described αHBs show a trend: increasing the size of side chains at g decreases the oligomer state formed. Therefore, we reasoned that a series of minimal changes, solely at the g position—i.e., at Z in L/IxxIAxZ repeats—might direct oligomeric state systematically and reliably.
Specifically, we considered the addition of a single heavy atom (C or O) to the side chain through the series glycine (Gly, G), Ala, Ser and threonine (Thr, T) (Fig. 1C). Our aim was to make a series of peptides with minimal changes to give a robust family of αHBs with tuneable oligomeric states for applications in synthetic biology and protein design.13 N.B. To maximise further uses of the peptide assemblies, we avoided larger hydrophobic residues at g, e.g. valine, as our experience is that these can lead to difficulties in purification. We also avoided cysteine because of potential issues with redox chemistry and to reserve it for introducing function in follow-on studies.11,19,20
We built and optimized parametric models for Gly@g and Thr@g in ISAMBARD.21 The sequences were modelled as parallel αHBs of oligomer state 5 to 10, and scored using BUDE22,23 (Table S3†). For the historical designs, the most-favoured states were indeed those observed experimentally,16 the exception being CC-Type2-(AgIaId)4, which predicted as a hexamer as observed in solution, but crystallized as an octamer.18 Encouragingly, the new Thr@g sequences consistently scored best as pentamers: for the a = d = Ile variant the pentameric assembly was favoured outright; while the a = Leu, d = Ile variant scored equally well as pentamer or hexamer. Conversely, both Gly variants scored more favourably as larger oligomeric states: the a = d = Ile variant as an octamer; and a = Leu, d = Ile as a nonamer. Although, there was less discrimination between models for the Gly@g sequences than for Thr@g, Fig. S1.† Thus, modelling supports the hypothesis that oligomeric states of Type-2 CCs can be tuned by side chains at g.
The Gly@g and Thr@g peptides were synthesized, purified by HPLC, and confirmed by mass spectrometry (Fig. S2–S8†). Circular dichroism (CD) spectroscopy indicated that all four peptides were α helical at low μM concentrations, Fig. 2A. CD spectra recorded at increasing temperatures showed that both Thr@g variants were hyperthermostable, Fig. 2B. Whereas, the Gly@g variant with a = Leu, d = Ile had the beginnings of a thermal unfolding curve consistent with the anticipated destabilizing effect of Gly on α-helical structures.24–26 The Gly@g variant with a = d = Ile unfolded at 38 °C and did not fully refold on cooling. Indeed, peptide sequences with a = d = Ile are more unstable than their a = Leu, d = Ile counterparts. Whilst β-branched residues like Ile clearly impart oligomer-state specificity in CC systems,3,4,18 we posit that this may be at the expense of some loss in overall thermal stability.
Next, dye-binding assays were used to assess the presence of accessible channels, Fig. 2C and S9† and Table 1. Binding of diphenylhexatriene (DPH) is a reliable indicator that CC peptides form αHBs in solution with a 1:1 correlation with X-ray crystal structures.16,18 All four new peptides bound DPH consistent with the formation of αHBs.
We determined X-ray protein crystal structures for all Gly@g and Thr@g variants. As predicted, the Thr@g peptides formed parallel pentamers (Fig. 3 and Tables S4 and S5†). Comparison with the foregoing computationally designed pentamer, CC-Type2-(IgLaIdEe)4,16 revealed similar CC parameters for the structures. Significantly, Thr@g with a = d = Ile has a wider channel of ≈9 Å than previous designs (≈7 Å), which increases the scope to functionalize this variant.
Fig. 3 X-ray crystal structures for pentameric through to nonameric αHBs. (Left to right) CC-Type2-(TgLaId)4 (red, PDB 7BAS), CC-Type2-(TgIaId)4 (dark red, PDB 7BAU), CC-Type2-(SgLaId)4 (orange, PDB 4PN9), CC-Type2-(AgLaId)4 (yellow, PDB 4PNA), CC-Type2-(AgIaId)4 (green, PDB 6G67) and CC-Type2-(GgLaId)4 (blue, PDB 7BIM).‡ |
The Gly@g peptide with a = Leu, d = Ile crystallized in two forms. Gratifyingly, one was an all-parallel nonamer, which is a new αHB with an exceptionally large channel of diameter ≈9.5–11.5 Å, Fig. 3 (Table S4†). Attempts to model solvent into density observed in the channel were inconsistent. Therefore, representative solvent molecules were included where they matched the density; though these did not make any stabilizing contacts with protein. Although there are natural nonameric protein assemblies,27,28 this is the first stand-alone, water-soluble α-helical CC of this size. The second crystal form revealed a collapsed C2-symmetric 6-helix bundle (Fig. 4A and B, Table S4†). This was surprising, as the β-branched Ile residues at d would be expected to prevent collapse.18 We posit that the small size of Gly relaxes this design rule allowing access to other nearby regions of the CC free-energy landscape.29
Fig. 4 Structures of Gly@g variants. (A and B) Orthogonal views of the collapsed hexameric-form of CC-Type2-(GgLaId)4 (PDB 7A1T). (C and D) The hexameric (C) and heptameric (D) form of CC-Type2-(GgIaId)4 (PDB 7BAT & 7BAW).‡ |
Gly@g with a = d = Ile also crystallized in two forms (Fig. 4C and D, Table S5†), However, both solved as αHBs; namely, a hexamer and a heptamer. Thus, β-branched Ile at both a and d maintains the open assembly18 even with Gly residues at g. Although a larger oligomer was predicted in silico, the calculated internal energies for Gly@g were similar for the different oligomers (Fig. S1†).
Because of the apparent structural duality with Gly@g, we examined the oligomeric states of all peptides in solution by analytical ultracentrifugation (AUC; Table 1, Fig. S10–S13†). Both sedimentation velocity (SV) and sedimentation equilibrium (SE) measurements for the Thr@g variants returned pentameric molecular weights consistent with the X-ray crystal structures. Indeed, for the pentamers through heptamers for these and previous designs, the correlation between the solution-phase and crystal-state oligomers was good, Table 1.
However, where larger oligomers (octamer18 and nonamer) were observed in crystals, smaller oligomers were consistently observed in solution, Table 1. This suggests that the solution states are the dominant species, and that the higher oligomers observed by X-ray crystallography are meta-stable. This is consistent with smaller oligomers being entropically favoured. In addition, the crystallization conditions for the octamer and nonamer contained isopropanol. This increases the hydrophobicity of the bulk solvent, which potentially supports larger, hydrophobic pores that would otherwise be energetically unfavourable.29 Nevertheless, these are legitimate states to consider as they are clearly accessible on the CC free-energy landscape.
Summarizing these data, Type-2 CC peptides with sequence repeats LppIApZ and Z = Thr, Ser, or Ala, form pentameric, hexameric, and heptameric αHBs, respectively. Adding Gly to the series accesses a nonameric open barrel, but only in the crystal state. Similarly, an IppIApA peptide forms an octamer in the crystal state.18 These X-ray crystal structures enabled us to examine the structural transitions in detail.
In all of the structures, the a and d residues contribute both to the lumens and to interactions between neighbouring helices. SOCKET30,31 analysis revealed that residues at a form knobs that fit into holes made by d′-1-g′-1-a′-d′ of a neighbouring helix.32 Thus, these knobs are complemented by interactions formed by the residues at g′-1 varied herein. The Cα-to-Cβ bond vectors of side chains at g point directly towards the adjacent helix, Fig. 5A–D. In classical CCs, this is called perpendicular packing, and it restricts how close the helices can approach.3,32,33 Thus, mutations at g might be expected to influence the quaternary structure. The changes made in the Gly→Ala→Ser→Thr series progressively add a single heavy atom to that side chain: Gly (0 heavy atoms)→Ala (1)→Ser (2)→Thr (3), Fig. 1C. It is gratifying, but still surprising, that this leads to unitary changes in oligomer state, at least for the Ala, Ser, and Thr variants. This is manifest in adjacent helix–helix distances through the series, Fig. 5E. The average distance increases from 8.0 ± 0.1 Å in the Gly@g nonamer to 10.4 ± 0.1 Å in the Thr@g pentamer. Thus, through the series, neighbouring helices are pushed apart effectively expelling helices from the assembly.
The judicious placement of Gly may prove useful in designing αHBs to unlock previously unseen architectures. However, using Gly presents challenges that must be met to allow its full exploitation. The first challenge is incorporating multistate design into αHBs, i.e. considering multiple conformations and/or assemblies that may become accessible in the CC free-energy landscape.29 This approach is being applied to other systems.34–36 It is tractable to model many possible Type-2 αHBs to direct computational design.16 However, this becomes difficult with increasing off-target states, e.g. collapsed and anti-parallel structures. The second challenge is to stabilize the larger, but clearly accessible, oligomer states in solution. One possibility would be to introduce networks of polar residues to reduce the penalty of all-hydrophobic channels.37 The hexameric and heptameric assemblies of CC-Type2-(GgIaId)4 imply that, whilst Gly@g is necessary to access larger states, it is not the only factor that dictates oligomer state of the helical assembly. De novo αHBs have proved useful in functional protein design.11–13 Reliably accessing scaffolds with significantly larger pores systematically and with minimal changes in primary sequence, would expand the scope for this and future applications.
Footnotes |
† Electronic supplementary information (ESI) available: Methods and ESI data. See DOI: 10.1039/d1sc00460c |
‡ X-ray crystal structures for CC-Type2-(TgLaId)4-W19BrPhe, CC-Type2-(TgIaId)4-W19BrPhe, CC-Type2-(GgLaId)4, CC-Type2-(GgLaId)4-W19BrPhe and CC-Type2-(GgIaId)4 are available from the Protein Data Bank. Accession codes: 7A1T, 7BAS, 7BAT, 7BAU, 7BAV, 7BAW & 7BIM. |
This journal is © The Royal Society of Chemistry 2021 |