Lingfei
Wang
a,
Joseph M.
Rogers‡
b,
Simon J.
Dawson
c,
Line M.
Langhorn
a,
Ryan T.
Howard
a,
Sunbum
Kwon§
a,
Céline
Douat
*a,
Hiroaki
Suga
*b and
Ivan
Huc
*ac
aDepartment Pharmazie, Ludwig-Maximilians-Universität München, Butenandtstr. 5-13, 81377 München, Germany. E-mail: celine.douat@cup.lmu.de; ivan.huc@cup.lmu.de
bDepartment of Chemistry, School of Science, The University of Tokyo, 7-3-1 Hongo, Bunkyo, Tokyo, 113-0033, Japan. E-mail: hsuga@chem.s.u-tokyo.ac.jp
cCBMN (UMR5248), Univ. Bordeaux-CNRS-INP, Institut Européen de Chimie et Biologie, 2 rue Escarpit, 33600 Pessac, France
First published on 3rd April 2025
Helical aromatic oligoamide foldamers with a cluster of five biogenic side chains at their surface were designed and synthesized. Display selection of thioether macrocyclic peptides against these targets generated low micromolar binders that are highly selective for the side-chain arrangement.
Here, we report on the successful display selection of thioether-macrocyclic peptides (teMPs) using the random non-standard peptides integrated discovery (RaPID)12 system for the selective recognition of aromatic foldamer helices having a deliberately reduced target area. Reducing the binding area on the target molecule created a challenge for peptide macrocycle display selection. Indeed, unlike for nucleic acid aptamers13 and in contrast to the success of the display selection of peptides against protein targets12,14 or other large substrates like polymers and solids,15 examples of successful display selections of peptides against small molecules are rare.16 Our results thus highlight both the potential of aromatic helical foldamers to recognize peptides and the potential of cyclic peptides to recognize a smaller target.
The folding of oligoamides of 8-amino-2-quinoline carboxylic acid QX (Fig. 1b) into stable aromatic helices has been thoroughly described.17 Their conformational stability is such that the equilibrium between the right-handed (P) and the left-handed (M) enantiomeric helical conformers of an octamer is kinetically inert in water at 25 °C.18 Methods to introduce a variety of biogenic side chains on QX monomers without impacting helix stability are also available.5b,19 We synthesized an aromatic oligoamide target potentially suitable for interaction with a peptide ligand: dodecameric sequence 1a which contains five monomers carrying biogenic side chains (hydrophobic leucine-like for QL, anionic aspartate-like for QD, cationic ornithine-like for QO, and aromatic tyrosine-like for QU), arranged in such a way that they form a cluster, i.e. a potential binding spot on one side of the helix (Fig. 1a and c). The seven other Q− residues bear tetraethylene glycol side chains to provide water solubility and also to prevent peptide interactions outside of the binding spot.20 We also synthesized sequence 2a, an analogue of 1a where the same biogenic residues are presented differently on the helix surface, to use as an alternative target and test the selectivity of peptide–foldamer interactions. Both sequences end with a 3-aminomethyl benzoic acid residue used as a spacer (S) separating the aromatic backbone from a biotin tail (Bt) intended for immobilization on streptavidin (SA). These foldamers are achiral and thus exist as a racemic mixture of P- and M-helical conformers. Both sequences were built on a solid support starting from a low-loading bromo-Wang resin and using in situ activation, as recently reported.21 Crude purity was good and final purification was achieved using RP-HPLC (see the ESI† for details).
![]() | ||
Fig. 1 (a) Top and side views of an energy-minimized model (MMFFs in Maestro)22 of the aromatic helix of 1a. The five clustered biogenic side chains are shown as solvent accessible surfaces. The main chain and tetraethylene glycol side chains are shown in green stick representation. (b) Formula of aromatic δ-amino acid residues (Q) colour coded according to their side chain: hydrophilic (black), hydrophobic (green), cationic (orange), blue (anionic), and red (aromatic). The chiral B* residue with R or S configuration, the spacer (S), and the chemical formula of the N-terminal functionalization of the foldamers (Bt or Dg) are also defined. (c) Synthesized foldamer sequences 1a–e and 2a–f. A one letter code has been used to define the side-chain or the chirality of the aromatic α-amino acids. |
Sequences 1a and 2a were individually loaded on magnetic SA-beads and two independent RaPID selections were carried out in parallel to identify teMP binders. In the first selection, 1a was the target with a counter-selection against the streptavidin on the solid support. In the second selection, 2a was the target and 1a on SA-beads was used for counter-selection. Thus, the second selection should in principle exclude teMPs that bind to areas present in both 1a and 2a like the helix cross-sections (the area decorated with Q− is also shared by the sequences but it is not considered to be a plausible binding area). Since both the P and M helices are present, selections were simultaneously performed against two targets. The peptide precursor DNA library contained a coding region of ATG-(NNK)n-TGC sequences, N is all 4 bases, K = T or G, and n = 8–15. This was transcribed to mRNA for in vitro translation, where the AUG start codon was reprogrammed with N-chloroacetylated-(L)-Trp (ClAc-W) for selection against 1a and with chloroacetylated-(D)-Trp (ClAc-w)|| for selection against 2a. The repeats of NNK mixed codons encoded random α-amino acids and the TGC codon encoded a mandatory Cys to form a thioether macrocycle with the N-terminal chloroacetyl group. Of note, TGT was also present in the random NNK codons so additional Cys residues may appear earlier in the sequence. After the mandatory Cys, that is, nearer to the C-terminus, all peptides had a GSGSGS segment to act as a spacer between the cyclic peptide and the site where the encoding mRNA is attached.
Five iterative rounds of selection were performed before sequence analysis of the cDNA library (Fig. 2, S1 and S2†). The first selection led to strong convergence with twelve out of the top twenty ranked sequences showing more than 85% similarity, and the second to less extensive convergence. In both cases, the most amplified sequence (P1 and P2 in Fig. 2) contained two Cys residues. Based on previous studies,7b,23 we considered that the smallest possible thioether macrocycle, that is, the macrocycle involving the Cys nearest to the N terminus, formed first during the spontaneous post-translational cyclization, giving rise to a lariat structure as found in other RaPID selections.24 P1 thus consisted of an -Ac-WSTC- macrocycle with a linear peptide extension, and P2 of an -Ac-wSYGWC- macrocycle with a linear peptide extension. In both cases, the linear extensions contained several other hydrophobic residues (W, Y, P or L), suggesting that binding is driven by hydrophobic effects. For subsequent KD determination, and to avoid possible issues with thiol oxidation, the C-terminal Cys was replaced with an Ala residue. Thus peptides 3a and 4a were prepared using standard solid phase peptide synthesis of a chloroacetylated precursor followed by resin cleavage and side-chain deprotection and by cyclization in a basic medium. The two teMPs were purified by semi-preparative RP-HPLC and characterized by LC-MS analysis.
At this stage, an unanticipated complication required our attention as we found that foldamers 1a and 2a slowly degraded. Upon standing in an aqueous medium, the para-hydroxybenzyl moiety of Qu underwent 1,6-elimination to give 4-mercapto-quinoline. This degradation had been overlooked in sequences made previously that contained QU.5d We investigated side-chain instability using model pentamer 5 synthesized for that purpose (Fig. 3, S3 and S4†). The rate of elimination was found to vary with pH. It was slow enough for pH values between 5 and 7 to consider that elimination had been limited during the teMP RaPID selection. Nonetheless, the original target sequences 1a and 2a were judged unsuitable for physical investigations as degradation would reduce accuracy and reproducibility. We therefore prepared new foldamer sequences 1b–1e and 2b–2f in which QU was replaced either with QF, which bears a phenylalanine-like side chain similar to that of QU but lacking its hydroxy group, or with QY which bears a tyrosine-like side chain with an ethylene connector instead of a thioether (Fig. 1b). QF and QY do not differ from QU in the same way and it was not known whether these differences would matter so both were included in this study. The synthesis of Fmoc-QY(tBu)-OH and Fmoc-QF-OH are reported in the ESI (Schemes S1 and S2†).
Peptide–foldamer interactions were first assessed using biolayer interferometry (BLI, Table 1, Fig. 4a, b and S5†). Biotinylated foldamers were independently loaded on SA-sensors and peptides were used as analytes in Tris buffer saline with Tween 20 (0.05%) and DMSO (0.1%) (TBST-D). The procedure is presented in detail in the ESI.† The observed kinetic curves showed a steady state regime, i.e. they reached a plateau value, and KD values were determined using the Langmuir equation. Peptide 3a from the first selection showed similar micromolar affinity for 1b and 1c, two analogues of the first selection target 1a, suggesting that QF and QY reproduce essential features of QU in this context. In contrast, in the same concentration range,** peptide 3a showed no binding to 2b, an analogue of the second selection target 2a. This suggests that 3a recognizes surface features unique to 1b and 1c, which excludes the aromatic helix cross-section also present in 2b, and this is despite the absence of a counter-selection step against 2b in the first selection. Similarly, peptide 4a from the second selection showed micromolar affinity for 2b and no binding to 1b and 1c. The observed selectivity thus points to the peptides targeting the small areas in which the biogenic side chains of their targets are clustered.
Immobilized foldamer | Lariat peptide analyte | |||
---|---|---|---|---|
3a | 3b | 4a | 4b | |
a NBD indicates no binding detected. b A dash indicates that this measurement was not performed. | ||||
1b | 12.8 ± 0.3 | —b | NBDa | —b |
1c | 19 ± 0.6 | —b | —b | —b |
1d (M) | 9.5 ± 1.8 | NBD | —b | —b |
1e (P) | NBDa | 10.3 ± 1.9 | —b | —b |
2b | NBDa | —b | 16.1 ± 3.0 | —b |
2c (M) | —b | —b | NBDa | 14.7 ± 2.2 |
2d (P) | —b | —b | 9.7 ± 2.9 | NBDa |
To consolidate these results, peptide–foldamer interactions were also assessed using a fluorescence assay as a second physical technique requiring no immobilization on a surface (Fig. 4d). Peptide 4c was synthesized as an analogue of 4a extended at its C-terminus using a Ser-β-Ala spacer and a fluorescein-labelled Lys. Concurrently, foldamers 2e and 2f, two analogues of 2a having no biotin and either QF or QY instead of QU, were prepared. Fluorescence was found to drop upon binding of the foldamers to the peptides. Peptide 4c bound to 2e and 2f with KD values of 4.0 μM and 3.4 μM, respectively. These values are slightly lower than those measured by BLI, and again show no significant differences between QF and QY.
Next, we assessed the diastereoselectivity of the peptide foldamer interactions. The five QO (×2), QL, QD and QY/F biogenic side chains are not only projected differently in space at the surface of 1b and 2b but, for both compounds, the side chains are also projected differently in their M and P helical conformations, both of which were present during selection (Fig. S6†). We thus synthesized four new biotinylated foldamers, 1d, 1e, 2c and 2d (Fig. 1c). In each of them, the third residue (Q−) was replaced with a chiral 2-(2-aminophenoxy)-propionic acid monomer (B* in Fig. 1b) with either (R) or (S) configuration. BR and BS have been shown to quantitatively bias the handedness of the aromatic helix to M (in 1d and 2c) and P (in 1e and 2d), respectively.25 The extent of handedness bias was confirmed by 1H NMR spectra where a single set of signals corresponding to only one (R–M or S–P) diastereomeric conformer is seen. The sign of helix handedness was controlled by circular dichroism spectroscopy. Each new foldamer was then individually immobilized on SA-sensor tips and their binding to peptides was assessed by BLI. These unveiled that the interactions are diastereoselective in the range of peptide concentrations used. teMP 3a binds to M-helical 1d and not to P-helical 1e, while teMP 4a binds to P-helical 2d and not to M-helical 2c (Table 1, Fig. 4c and S7†). It is noteworthy that the first selection has generated a binder selective for the M helix whereas the other selection yielded a binder selective for the P helix. The KD values calculated from steady-state curve fitting are comparable to those obtained during the first round of BLI measurements on P/M helix mixtures, i.e. in the low micromolar range. Finally, we further validated the diastereoselective interactions in experiments where both the foldamer handedness and the peptide chirality were inverted. For that purpose, we prepared peptides 3b and 4b, the enantiomers of 3a and 4a, respectively. We found that the binding of 4b to M-helical 2c was similar to that of 4a to P-helical 2d (Table 1 and Fig. S8†). In the case of 3b, measurements were first hampered by nonspecific binding of this D-peptide to the reference SA-sensor tips. This effect decreased sufficiently when adding 0.05% BSA to the TBST-D buffer and it could be again verified that the binding of 3b to P-helical 1e was similar to the binding of its enantiomer 3a to M-helical 1d (Table 1 and Fig. S8†).
Footnotes |
† Electronic supplementary information (ESI) available: Supplementary figures, schemes and data, detailed experimental protocols and characterisation of new compounds. See DOI: https://doi.org/10.1039/d5ob00228a |
‡ Current address: Department of Drug Design and Pharmacology, University of Copenhagen, Copenhagen 2100, Denmark. |
§ Current address: Department of Chemistry, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, Republic of Korea. |
¶ Aromatic groups bearing amine and carboxylic acid substituents. Amino acids with aryl groups in their side chains such as Phe, Tyr, Trp are also called “aromatic amino acids” but are not relevant here. |
|| Changing the configuration of the N-terminal α-amino acid often results in the selection of different sequences (see ref. 12d). To explore new sequence space when targeting 2a, we used “w” instead of “W”. |
** BLI experiments were typically performed up to teMP concentrations of 50 μM. At higher concentrations, teMP aggregation and nonspecific binding to SA-biosensors made it difficult to acquire reliable data. |
This journal is © The Royal Society of Chemistry 2025 |