Open Access Article
Colton D.
Payne
a,
Grishma
Vadlamani
bcd,
Fatemeh
Hajiaghaalipour
a,
Taj
Muhammad
e,
Mark F.
Fisher
bcd,
Håkan S.
Andersson
ef,
Ulf
Göransson
e,
Richard J.
Clark
a,
Charles S.
Bond
c,
Joshua S.
Mylne
bcd and
K. Johan
Rosengren
*a
aThe University of Queensland, School of Biomedical Sciences, Brisbane, QLD 4072, Australia. E-mail: j.rosengren@uq.edu.au
bCurtin University, Centre for Crop and Disease Management, School of Molecular and Life Sciences, Bentley, WA 6102, Australia
cThe University of Western Australia, School of Molecular Sciences, Crawley, WA 6009, Australia
dThe University of Western Australia, ARC Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, Crawley, WA 6009, Australia
eUppsala University, Division of Pharmacognosy, Department of Pharmaceutical Biosciences, 75124 Uppsala, Sweden
fKarolinska Institute, Department of Medical Biochemistry and Biophysics, 17177 Stockholm, Sweden
First published on 13th September 2021
Head-to-tail cyclic and disulfide-rich peptides are natural products with applications in drug design. Among these are the PawS-Derived Peptides (PDPs) produced in seeds of the daisy plant family. PDP-23 is a unique member of this class in that it is twice the typical size and adopts two β-hairpins separated by a hinge region. The β-hairpins, both stabilised by a single disulfide bond, fold together into a V-shaped tertiary structure creating a hydrophobic core. In water two PDP-23 molecules merge their hydrophobic cores to form a square prism quaternary structure. Here, we synthesised PDP-23 and its enantiomer comprising D-amino acids and achiral glycine, which allowed us to confirm these solution NMR structural data by racemic crystallography. Furthermore, we discovered the related PDP-24. NMR analysis showed that PDP-24 does not form a dimeric structure and it has poor water solubility, but in less polar solvents adopts near identical secondary and tertiary structure to PDP-23. The natural role of these peptides in plants remains enigmatic, as we did not observe any antimicrobial or insecticidal activity. However, the plasticity of these larger PDPs and their ability to change structure under different conditions make them appealing peptide drug scaffolds.
Currently, a large number of unique peptide sequence inserts have been identified in daisy albumins via transcriptomics, while 23 have been described in more detail and annotated as members of the PDP family.4,7 Most PDPs are 14–17 residues long with a cyclic backbone, which results from the formation of a peptide bond between a C-terminal Asp and an N-terminal Gly residue. Some PDPs do have an Asn residue rather than an Asp residue at the C-terminal position, and are simply cleaved from the precursor yielding an acyclic hairpin peptide.8 Until recently, it was thought that all PDPs contained a single disulfide bond and were structurally limited to a small β-sheet structure and turns. However, a new member of the PDP family was discovered that contains 28 amino acids and two disulfide bonds in a I–II/III–IV ladder-like configuration.9 NMR spectroscopy showed that this peptide, PDP-23, adopts a unique tertiary structure wherein two loops, each containing a β-sheet bridged by a single disulfide bond, fold together into a V-shaped tertiary structure enclosing a hydrophobic core, more akin to the structures of larger proteins.9 Additionally, in water PDP-23 self-associates and forms a square prism-shaped symmetrical homodimer quaternary structure by increasing the distance between the loop regions and merging the hydrophobic cores of the two monomers.9 The PDP-23 symmetrical homodimer disassociates into two well-defined monomers in less polar environments like 20% acetonitrile or membrane-mimicking micelles. In each case the β-sheet structure is retained but the hinges allow opening and closing of the ‘V’.
SFTI-1 and other cyclic plant peptides have been explored as scaffolds for grafting of molecular functionalities due to their inherent stability.8,10 PDP-23 may be a superior scaffold to other members of the family due to its unique size, structure and ability to adopt different conformations under different conditions. These features could allow for more accessible regions to graft bioactive epitopes or conjugate small molecule payloads without sacrificing stability. An analogue of rhodamine, a family of fluorescent dyes known to be rapidly effluxed from cells by the drug efflux pump P-glycoprotein,11 was successfully attached to an analogue of PDP-23. The conjugate demonstrated effective cell uptake and inhibition of this pump, thereby restoring drug sensitivity in a cancer cell line resistant to chemotherapeutics due to overexpression of P-glycoprotein.9
In this study we employ racemic crystallisation to independently determine the structure of PDP-23 using X-ray crystallography. NMR spectroscopy is notoriously difficult for homomeric proteins given the inability to distinguish between inter- and intra-molecular distance restraints. However, the crystal structure confirms the unique dimeric assembly. Furthermore, we describe the identification, chemical synthesis and NMR structure of a second example of a PDP containing two disulfide bonds, PDP-24. PDP-24 also forms a tertiary V-shaped structure in less polar environments, but despite high sequence identity to PDP-23 does not form an ordered, symmetrical homodimer in water. We also investigate some potential native functions of PDP-23 and PDP-24.
:
50 CH3CN/H2O followed by direct injection. Upon completion, phase separation using DCM was used to isolate the peptide from the DMF. The DCM phase containing the peptide was diluted with CH3CN before rotary evaporation to remove the DCM and subsequent lyophilisation. The remaining sidechain protecting groups were removed from the peptide by treatment with a solution of 96
:
2
:
1
:
1 TFA/triisopropylsilane/3,6-dioxa-1,8-octanedithiol/H2O for 2 h. The peptide was removed from the deprotection solution by precipitation with cold diethyl ether, followed by filtration and re-solvation in 50
:
50 CH3CN/H2O before lyophilisation.
:
50 CH3CN/H2O at a concentration of 0.25 mg mL−1 before the slow addition of a solution of 0.1 M iodine in 50
:
50 CH3CN/H2O until the solution was coloured orange. This solution was stirred under nitrogen in the dark for 2 h, prior to quenching with ascorbic acid. Purification was conducted via reverse-phase high performance liquid chromatography on a Prominence (Shimadzu, Rydalmere, AUS) using a solution of 90% CH3CN and 0.05% TFA at a gradient of 1% min−1. A preparative C18 column (300 Å, 10 μm, 21.20 mm i.d. × 250 mm, Phenomenex) was used for purification before and after cyclisation as well as after each disulfide bond formation step. A semi-preparative C18 column (300 Å, 5 μm, 10 mm i.d. × 250 mm, Vydac) was used to achieve final purity greater than 95% for all peptides. Purity was assessed using a C18 analytical column (300 Å, 5 μm, 2.1 mm i.d. × 150 mm, Vydac). Final product identity was confirmed by ESI-MS.
:
50 PDP-23/D-PDP-23) dissolved in water. PDP-23 crystals were cryoprotected in reservoir solution supplemented with 25% glycerol and flash-cooled in liquid nitrogen. X-ray data were collected using beamline MX2 (ANSTO)12 at the Australian Synchrotron. The X-ray data were indexed using XDS,13 followed by space group search using Pointless14,15 (CCP4 package) with parameters extended to include centrosymmetric space groups. The data were scaled and averaged using Aimless14,15 (CCP4 package). The PDP-23 structure was determined by molecular replacement using Molrep15,16 (CCP4 package) using the NMR structure of the PDP-23 dimer (PDB ID: 7L51) as a search model. The resulting solution was refined using Refmac15,17 (CCP4 package) and Coot,18 and the final model and structure factors were deposited in the Protein Data Bank under the ID 7MMY. A composite omit map for L-PDP-23 was generated using PHENIX.19
:
20) at pH ∼ 3.5. 1H 1D data as well as 1H–1H 2D Total Correlation Spectroscopy (TOCSY;20 mixing time of 80 ms), and Nuclear Overhauser Spectroscopy (NOESY;21 mixing time of 200 ms) were recorded at 298 K on a 900 MHz Bruker Avance III spectrometer equipped with a cryoprobe. TOCSY experiments were recorded with 8 scans and 512 increments and NOESY experiments were recorded with 40 scans and 512 increments, with a sweep width of 12 ppm. 1H–13C and 1H–15N Heteronuclear Single Quantum Coherence (HSQC) experiments were recorded at natural abundance. The 13C HSQC data were recorded with 128 scans and 256 increments with a sweep width of 10 ppm in the F2 dimension and 80 ppm in the F1 dimension in D2O/CD3CN (80
:
20) to minimise the overlap of residual water with the Hα–Cα resonances. The 15N experiments were recorded with 256 scans and 128 increments, with a sweep width of 10 ppm in the F2 dimension and 32 ppm in the F1 dimension. The data were processed using Topspin 4.0.4 (Bruker), with the CH3CN solvent signal at 2.031 ppm (298 K) used as reference. The data were assigned using sequential assignment strategies in the program CARA (Computer Assisted Resonance Assignment).22 Secondary structural features were determined by comparison of the secondary 1Hα shifts generated by PDP-24 to that of equivalent values generated in a random coil peptide.23 Additional TOCSY experiments were recorded at 288 K, 293 K, 298 K, 303 K and 308 K to monitor the temperature dependence of the amide protons.
000 cfu suspended in 100 μL of Tris buffer. After a 5 h incubation with the peptides, 5 μL of 20% (w/v) tryptic soy broth was added to each well and the plates were re-incubated for an additional 6–12 h (depending on the growth rate of each organism). At least two replicates per experiment and three independent experiments (biological replicates) were performed.
A racemic mixture of PDP-23 and D-PDP-23 was subjected to crystallisation screening and diffraction quality crystals were obtained. Synchrotron radiation yielded crystal diffraction to a 1.46 Å resolution. The space group for PDP-23 racemic crystals was P21/c (indexed as P121/n1) with unit cell dimensions a, b, c = 27.53, 49.01, 29.40 Å and β = 92.28°. The data are 98.8% complete to 1.46 Å with the overall merging Rsym being 7.1%. The crystallographic statistics for the structure are shown in Table 1. The crystal structure of PDP-23 was determined by molecular replacement using the PDP-23 homodimer in water (PDB code: 7L51) as the search model. Molecular replacement using a monomeric structure was unsuccessful.
| Data collection | |
| Resolution (Å) | 49.01–1.46 |
| Space group | P121/n1 |
| Total reflections | 85 984 (3128) |
| Unique reflections | 13 346 (515) |
| Multiplicity | 6.4 (6.1) |
| Completeness (%) | 98.8 (80.8) |
| I/σI | 10.6 (2.5) |
| R merge | 0.071 (0.712) |
| CC1/2 | 0.998 (0.833) |
| Unit cell dimensions | |
| a, b, c (Å) | 27.53, 49.01, 29.40 |
| α, β, γ (°) | 90.00, 92.28, 90.00 |
| Refinement | |
| R work/Rfree | 0.188/0.276 |
| No. of atoms: protein | 476 |
| water | 45 |
| B-Factors (Å2): protein | 22.3 |
| water | 35.8 |
| Ramachandran plot (%) | |
| Favoured | 100 |
| Allowed | 0 |
| Outliers | 0 |
| RMSD | |
| Bond lengths (Å) | 0.01 |
| Bond angles (°) | 1.82 |
The asymmetric unit of racemic PDP-23 crystals contains a single non-crystallographic dimer of L-/L- (or D-/D-) PDP-23 monomers. Crystal symmetry in the centrosymmetric space group P121/n1 results in a unit cell containing two L-dimers and two D-dimers, such that crystal contacts exist between L- and D-dimers (Fig. 1A). The structure confirms both the V-shaped tertiary structure and the dimeric quaternary structure originally determined using solution NMR spectroscopy. The structural similarity is high (RMSD of 0.64 Å for 53 of 56 Cα atoms) which confirms the NMR model was a suitable molecular replacement search model. The observed electron density is of high quality, as expected for a structure at <1.5 Å resolution (Fig. 1B), with all amino acids well resolved except for His5 and His6 in each subunit which have disordered sidechains. For a high-resolution structure, our reported Rwork/Rfree values (19%/28%) appear high. We note however that higher R-values are observed in centrosymmetric diffraction data than in non-centrosymmetric data for equivalent degrees of coordinate error, for statistical reasons.37 To validate the quality of our data, we generated a composite omit map and found that it is clearly resolved around individual residues in the L-PDP-23 dimer, except His5 and His6 as noted above (Fig. S1, ESI†).
![]() | ||
| Fig. 1 X-ray structure and racemic crystal packing of PDP-23. (A) Crystal packing for the racemic PDP-23 crystal structure with L-dimers (cyan) and D-dimers (green). (B) Stick diagram of an L-PDP-23 homodimer, with the individual monomers shown with green and white carbon atoms, respectively. The zoomed region depicts a maximum-likelihood weighted 2Fo − Fc electron density map (blue mesh) contoured at 1.7σ for the hydrophobic core formed by inter-facing monomers. (C) The L-enantiomer of PDP-23 shown in stick format, the backbone is coloured by atom with carbons in green or cyan to distinguish monomers. H-bonds are shown with yellow dashed lines. Two water molecules create bridged hydrogen bonds between the monomers. The dense hydrogen bond network proposed for PDP-23 by solution NMR9 is fully supported by the crystal data. | ||
Comparison of the crystal dimer and the solution NMR structure revealed noteworthy features and differences: the crystallographic dimer displayed asymmetry in the packing of its hydrophobic core. While the backbones of both chains were highly similar (RMSD 0.32 Å for 24 pairs of Cα atoms), Phe2, Trp4, Leu21 and Phe26 differed in conformation between subunits (Fig. 2A). This was in contrast to the solution NMR data and structure, which was fully symmetric, and these sidechains appeared in conformations intermediate between crystal structure extremes (Fig. 2A). In particular, the χ1 angles of both Phe2 and Leu21 differed between monomers within the crystal dimer (Fig. 2A). The sidechains of these two residues form critical contacts and establish the NOE network of the solution NMR structure of PDP-23. Leu21 has, because of its network of unambiguous intermolecular NOEs, been used as a signature to determine whether PDP-23 is in a dimeric or monomeric state when in solution. The methyl groups of the Leu21 generate NOEs to the sidechains of Trp4 and Val9, which cannot be explained by a monomeric structure. The absence of duplicate resonances in the homonuclear and heteronuclear NMR data is consistent with a fully symmetrical structure, however, structural rearrangements on a fast time scale could mean the NMR data are averaged over local fluctuations rather than reflecting a singular conformation. In the NMR structural ensemble Leu21 did adopt multiple different χ1 angles, consistent with this notion. Phe2 however only appears in a single conformation in the NMR structure, nonetheless there may be breathing motions in the homodimeric core in solution that is reduced in the crystalised form explaining the asymmetry in the X-ray structure. As we have previously reported that the homodimeric structure of PDP-23 could be separated by temperature or by the addition of a small amount of aprotic polar solvent, it is possible the homodimeric complex is somewhat dynamic. Nevertheless, the dimer interfaces of crystal and NMR structures appear similarly favourable, with buried areas of 779 Å2 and 740 Å2, respectively.38
It is possible that lattice contacts influence the conformations observed in crystal structures. A conspicuous interaction between dimers in the lattice involves Trp19. Each individual L- and D-homodimer extends the Trp19 sidechain from both its monomers. The Trp19 residue of one form of homodimer, L- or D-, then associates with a Trp19 residue from the adjacent homodimer of the same form in the lattice (Fig. 2B). These two Trp sidechains are sandwiched between two Cys14–Cys25 disulfide bonds from separate homodimers of the opposing form; L-Trp19 residues between D-Cys14–Cys25 disulfide bonds and vice versa.
The Trp19 interactions are seemingly critical to the arrangement of the crystal lattice, and also highlight a key difference between the structures determined by crystallography and solution NMR spectroscopy. In solution, instead of being extended outwards away from the homodimer, the Trp19 sidechain is packed against the adjacent Pro20 sidechain creating a cis-Pro turn. This cis-Pro conformation of the turn is clearly evident from electron density, chemical shifts and NOE patterns, and the packing of the aromatic ring against the Pro is evident from the ring current effect creating a large upfield shift of one of the 1Hβ protons of the Pro20 sidechain, a hallmark classically used to evidence this type of turn in the cyclotides.39
Overall, the consistency between the solution NMR structure and the crystal form is remarkable, given the complexity of resolving an NMR structure of a homodimer where every NOE contact can reflect either an intra or intermolecular interaction.
To confirm this sequence, we reassembled the raw reads produced by Jayasena et al.31 and identified a 744 nucleotide contig coding for PDP-24 and its adjacent seed storage albumin. We mapped the raw RNA-seq reads to this contig, concentrating on the 81-nucleotide sequence encoding PDP-24, and found coverage of 216 reads to the peptide-encoding sequence, including about 30 reads with single nucleotide mismatches at random positions, likely caused by sequencing errors (Fig. 3).
Given the intriguing features of PDP-23, we sought to determine whether the symmetrical homodimer quaternary structure is a common fold for bi-disulfide bonded PDPs. PDP-24 was thus synthesised and purified using the same protocol as PDP-23. Solution NMR spectroscopy was used to study PDP-24 and determine if it adopts the aforementioned features present in PDP-23. In contrast to PDP-23, PDP-24 dissolved poorly in water with signs of non-specific aggregation with an opaque appearance of the sample, and the 1D 1H NMR spectrum showed broad lines. However, similar to PDP-23, PDP-24 dissolved readily in 80
:
20 H2O/CD3CN and 1D 1H NMR gave a spectrum with excellent dispersion and sharp lines indicating a well-folded, structured peptide. Because of this, extensive NMR data were recorded for PDP-24 under these conditions, and assigned manually using sequential assignment strategies.40 The quality of the datasets allowed complete assignment of the backbone and sidechain resonances using TOCSY and NOESY spectra. HSQC data were recorded at natural abundance and the 13C and 15N backbone and sidechain resonances were assigned with assistance from the 1H assignments. PDP-24 contains four proline residues with Pro19 adopting a cis conformation and Pro10, Pro17, and Pro26 all adopting trans conformations based on NOE patterns and 13C chemical shifts. These conformational states are identical to what is observed in PDP-23.9 Additionally, the secondary Hα shifts, which are sensitive indicators of secondary structure, of PDP-24 are remarkably similar to those observed in PDP-23 when in either H2O/D2O or H2O/CD3CN,9 indicating PDP-24 adopts the same secondary structural features as PDP-23. In water PDP-23 contains key NOEs that indicate a symmetrical homodimer structure. These NOEs disappear in less polar solutions, or when PDP-23 is in the presence of micelles, indicating dissociation of the symmetrical homodimer.9 Notably, PDP-24 contains no NOEs suggestive of a symmetrical homodimer when in 80
:
20 H2O/CD3CN, consistent with what was observed with PDP-23. No additional spin systems suggesting conformational inhomogeneity were identified in the data, indicating the fold adopted is preferred in this environment.
To determine the 3D structure of PDP-24, structural restraints were derived from the NMR data. These included inter-proton distances based on NOE volumes, dihedral angles derived from chemical shifts using TALOS-N, as well as hydrogen bonds based on temperature coefficients and preliminary structure calculations. Initial structures were calculated using automated NOE assignment with CYANA and final structures were calculated and refined in explicit water using CNS. All structures were analysed using MolProbity to determine the quality of the structural geometry and atom packing. The best 20 structures from the 50 calculated based on MolProbity scores, low energy as well as containing no significant violations were chosen to represent the solution structures of PDP-24. The structures calculated for PDP-24 are of good stereochemical quality, with minimal clashes >0.4 Å and all residues being in favoured Ramachandran regions. The data also highlights the well-defined backbone of PDP-24, which has an RMSD of 0.85 Å. The structures generated for PDP-24 are above the 92nd percentile of all structures according to MolProbity (Table 2).
| Energies (kcal mol−1) | |
| Overall | −813.06 ± 13.18 |
| Bonds | 13.05 ± 1.48 |
| Angles | 29.03 ± 2.11 |
| Improper | 10.76 ± 1.45 |
| Dihedral | 114.00 ± 0.89 |
| van der Waals | −93.43 ± 3.43 |
| Electrostatic | −887.04 ± 10.61 |
| NOE | 0.065 ± 0.007 |
| cDih | 0.508 ± 0.206 |
| MolProbity statistics | |
| Clashes (>0.4 Å/1000 atoms) | 12.66 ± 1.65 |
| Poor rotamers | 0 |
| Ramachandran outliers (%) | 0 |
| Ramachandran favoured (%) | 100 |
| MolProbity score | 1.61 ± 0.05 |
| MolProbity score percentile | 92.05 ± 1.57 |
| Atomic RMSD (Å) | |
| Mean global backbone | 0.85 ± 0.33 |
| Mean global heavy | 1.51 ± 0.38 |
| Experimental restraints | |
| Distance restraints | |
| Short range (i–j < 2) | 282 |
| Medium range (i–j < 5) | 75 |
| Long range (i–j ≥ 5) | 130 |
| Hydrogen bond restraints | 24 (12 bonds) |
| Total | 511 |
| Dihedral angle restraints | |
| ϕ | 15 |
| ψ | 17 |
| χ1 | 5 |
| χ2 | 8 |
| Total | 45 |
| Violations from experimental restraints | |
| NOE violations exceeding 0.2 Å | 0 |
| Dihedral violations exceeding 2.0° | 0 |
PDP-24 adopts identical secondary structure to PDP-23 (Fig. 4), with an almost identical hydrogen bond network, evident from amide temperature coefficients. However, there is a key difference, this being the absence of the hydrogen bond between residues 13–10, which stabilises a type II β-turn in PDP-23. Residue 13 in PDP-23, a Thr, is not present in PDP-24, making the hinge between its two β-sheets shorter and preventing formation of a well-defined turn. The change in the hinge region appears to have a flow-on effect to the hydrophobic core, altering its composition and packing of sidechains. The ‘V’ shape is more open overall in PDP-24 and the hydrophobic core is packed to give a more twisted interface between the two β-hairpins compared to the more planar interaction in PDP-23 (Fig. 4). In PDP-23 the Val and Leu residues at positions 9 and 21, respectively, are a key part of the hydrophobic core. In PDP-24 these two residues are switched with a Leu at position 9 and Val at 20 (the equivalent positions in PDP-24); neither residue interacts strongly with the hydrophobic core in contrast to what is observed in PDP-23. This is evidenced by an absence of NOEs generated by the Leu and Val residues to hydrophobic core residues on the opposing β-hairpin. Conversely, in PDP-23 a multitude of NOEs were observed from these residues to almost all members of the hydrophobic core, both locally and across to the opposing β-hairpin. Overall the sequence changes present in PDP-24 cause the peptide to adopt a tertiary structure more akin to what is observed in the dimeric form of PDP-23 rather than the monomeric form (Fig. S2, ESI†). Based on the high sequence identity and structural similarity to the dimeric form of PDP-23 it is perhaps surprising that PDP-24 is unable to form the same quaternary square prism homodimeric structure of PDP-23. Despite the clear similarities, there are also obvious differences, particularly in the hairpin containing the substitutions H5Q, S7T and V9L. While His5 is surface exposed, both the S7T and V9L changes would cause clashes that would require structural rearrangement relative to the PDP-23 dimer. Alternatively, the shorter turn resulting from the deletion of T13 is forcing a different alignment of the hairpins, incompatible with a dimer. While the experimental data clearly shows that PDP-24 does not adopt a homodimeric structure, it is difficult to pinpoint which sequence change is primarily preventing the formation of a dimer. Whether the ability to form a dimer is functionally significant is not known. Nonetheless, identification of another scaffold adopting a tertiary V-shaped structure may be useful for protein engineering. Hydrophobicity and solvation are easily overcome by simple modifications,41–43 allowing this particular PDP to remain available for drug conjugation.
As another potential function in defense may be antimicrobial activity, PDP-23 and PDP-24 were tested against two bacterial species and one fungus in a two-step microdilution assay protocol. PDP-23 and PDP-24 showed no inhibition of growth at all tested concentrations up to 80 μM (Table S2, ESI†). The human cathelicidin LL-3734 was used as a positive control and inhibited growth at sub-micro molar concentrations.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/d1cb00155h |
| This journal is © The Royal Society of Chemistry 2021 |