Lin
Cheng‡
,
Ping
Tian‡
,
Honghong
Duan
,
Qingfang
Li
,
Xiaowen
Song
,
Anyang
Li
and
Liping
Cao
*
Key Laboratory of Synthetic and Natural Functional Molecule Chemistry of the Ministry of Education, Xi'an Key Laboratory of Functional Supramolecular Structure and Materials, College of Chemistry and Materials Science, Northwest University, Xi'an 710069, China. E-mail: chcaoliping@nwu.edu.cn
First published on 17th December 2022
Sequence-specific recognition of peptides and proteins by synthetic compounds or systems remains a huge challenge in biocompatible media. Here, we report the chiral adaptive recognition (CAR) with sequence specificity of aromatic dipeptides in a purely aqueous solution using an achiral tetraphenylethene-based octacationic cage (1) as both a molecular receptor and chiroptical sensor. 1 can selectively bind and dimerize aromatic dipeptides to form 1:2 host–guest complexes with high binding affinity (>1010 M−2), especially up to ∼1014 M−2 for TrpTrp. Given the dynamic rotational conformation of TPE units, achiral 1 can exhibit chiral adaptive responses with mirror-symmetrical circular dichroism (CD) and circularly polarized luminescence (CPL) spectra to enantiomeric dipeptides via supramolecular chirality transfer in the host–guest complexes. Furthermore, this CAR with sequence specificity of 1 can be applied for molecular recognition of TrpTrp- or PhePhe-containing tetrapeptides, polypeptides (e.g., amyloid β-peptide1–20 and somatostatin), and proteins (e.g., human insulin) with characteristic CD responses.
Tetraphenylethene (TPE) and its derivatives not only possess excellent aggregation-induced emission (AIE) properties that can show fluorescence signals, but also have chiral left-handed (M) and right-handed (P) rotational conformations that can exhibit CD and CPL signals.33–35 Based on the two direction-selective rotational conformations of the four phenyl rings in TPE, the M- and P-enantiomers of some TPE-based compounds can exhibit reliable structural chirality.36,37 Recently, our38–42 and other groups43,44 have developed some achiral TPE-based hosts including cyclophanes, molecular cages, and supramolecular organic frameworks, which can exhibit dynamic conformation chirality in host–guest recognition or self-assembly. Specifically, a water-soluble TPE-based octacationic cage (1) with two TPE faces can be an ideal candidate as both a molecular receptor and chiroptical sensor for the sequence-specific recognition of peptides in an aqueous solution. Theoretically, there are three possible conformational isomers of 1: achiral PM-1 and a pair of racemic MM-1 and PP-1, all of which have rapid and reversible conformation transformation with each other in the solution state (Fig. 1a). Among them, PM-1 is achiral and shows no CD or CPL signals. However, PP-1 exhibits negative CD and CPL signals, while MM-1 shows positive CD and CPL signals. Therefore, there are several advantages of 1 for the molecular recognition of peptides: (1) the fluorescent cage itself can exhibit optical responses without using other dyes as auxiliary chromophores. (2) The cooperation of the hydrophobic effect and electrostatic interactions arising from the large hydrophobic cavity and the eight positively charged pyridinium sites of 1 can be ideal for selectively binding to peptides with negatively charged and aromatic tryptophan (Trp), phenylalanine (Phe), or tyrosine (Tyr) residues at the C-terminal position. (3) The P- or M-rotational conformation of TPE units can offer various chiroptical responses to achieve sequence specificity based on the different binding affinities and intrinsic chiralities of various peptides via supramolecular chirality transfer in the host–guest complexes.
Fig. 1 (a) Three conformational isomers of 1 and their chiroptical responses. (b) Chiral adaptive recognition of 1 for enantiomers. X-ray structures of (c) PM-1 and (d) racemic MM-1 and PP-1. Cl− counter ions and hydrogen atoms are omitted for clarity. CCDC numbers: 2100784 for PM-1; 2100668 for racemic MM-1 and PP-1.† |
To develop a synthetic supramolecular system with multiple chiroptical responses for the sequence-specific recognition of peptides, we propose a model of host–guest recognition for enantiomers by an achiral host (e.g., 1) with the ability of chiral adaptive responses to enantiomers, which is called chiral adaptive recognition (CAR). In traditional chiral recognition systems, chiral hosts, which have different binding affinities for enantiomers, can achieve chiral discrimination for enantiomers through non-chiral detection technologies (e.g., NMR and fluorescence).45,46 In the CAR host–guest systems, however, achiral hosts with structural flexibility (e.g., switchable P/M-rotation of TPE units in this case) can be induced by enantiomeric guests into one of their chiral conformations in the chiral host–guest complexes, respectively (Fig. 1b). As a result, these achiral hosts can gain a kind of guest-induced conformation chirality in the host–guest complexation to exhibit turn-on chiroptical signals (e.g., CD and CPL), which are distinguished, real-time, and adaptive responses to the different chiral guests and their enantiomers.47,48
Here, we develop chiral adaptive recognition (CAR) with sequence specificity of aromatic dipeptides by an achiral TPE-based cage (1) in an aqueous solution. The molecular recognition of 1 for amino acids, dipeptides, tetrapeptides, polypeptides, and even proteins is systematically proposed and investigated. Firstly, 1 exhibits selective recognition for Trp with obvious adaptive CD signals over others including glycine (Gly) and 18 pairs of enantiomeric amino acids. Secondly, 1 shows sequence-specific recognition to selectively bind and dimerize aromatic dipeptides (e.g., TrpTrp) in a 1:2 stoichiometry with high binding affinity (Ka ≈ 1010–1014 M−2). Meanwhile, dipeptide enantiomers can induce the adaptive conformation chirality of 1 in these host–guest systems: D-dipeptides induce the MM-rotational chiral conformation of 1 with positive CD and CPL signals, while L-dipeptides induce the PP-rotational chiral conformation of 1 with negative CD and CPL signals. And the CD spectra of 1 with aromatic dipeptide enantiomers are mirrored, which can be used to determine the enantiopurity of the L/D-dipeptide mixtures. Given the sequence-specific recognition for TrpTrp and PhePhe, 1 can mainly bind TrpTrp and PhePhe residues in the corresponding tetrapeptide chains regardless of these dipeptide residues at the N-terminal, middle, or C-terminal positions. Furthermore, the CAR of 1 for PhePhe-containing polypeptides and proteins, such as amyloid β-peptide1–20 (Aβ1–20), somatostatin, and human insulin, is further explored to show distinguishable CD spectral fingerprints for each valuable guest.
The CD responses of 1 to other amino acids were very weak or silent (Fig. S4–S23†), while NMR titration experiments of 1 with other amino acids showed that only some amino acids could be slightly bound inside the cavity of 1via the hydrophobic effect or interacted with the cationic surface of 1via electrostatic interactions. For aromatic amino acids including Trp, Phe and Tyr, all proton resonances of the phenyl rings showed slight upshifts owing to the shielding effect from 1, indicating the hydrophobic indolyl or phenyl rings could be inside the cavity of 1 with very weak binding affinity (Fig. S24–S26†). Besides the hydrophobic effect, the electrostatic interactions between this cationic 1 and positively or negatively charged amino acids are another non-covalent force in this aqueous recognition. Specifically, after mixing them with 1 in D2O, positively charged amino acids including histidine (His), lysine (Lys), and arginine (Arg), and negatively charged glutamic acid (Glu) showed slight downfield shifts, while negatively charged aspartic acid (Asp) and a neutral cysteine (Cys) showed slight upfield shifts, indicating that these amino acids as endo- or exo-guests could interact with the positively charged pyridinium cationic surface of 1 (Fig. S27–S32†). For neutral aliphatic amino acids including glycine (Gly), alanine (Ala), leucine (Leu), isoleucine (Ile), threonine (Thr), valine (Val), proline (Pro), serine (Ser), glutamine (Gln), methionine (Met), and asparagine (Asn), they could not exhibit any effective binding behavior with 1, owing to the lack of sufficiently strong non-covalent interactions (Fig. S33–S43†). Isothermal titration calorimetry (ITC) and UV/vis experiments proved that 1 could encapsulate two Trp molecules with weak ternary association constants of Ka = (4.97 ± 0.10) × 104 M−2 (Fig. S44†). However, the association constants between 1 and other amino acids could not be calculated by NMR or ITC experiments. Therefore, 1 exhibited a CAR with high selectivity for D/L-Trp based on their relatively strong affinity, because they have more hydrophobic indole group. This result indicates that both the strong binding affinity of the host–guest complex and the intrinsic chirality of D/L-Trp are important factors that determine the intensities and negative/positive Cotton effect of the CD spectra in the host–guest complexation.
Guests | K a (M−2) |
---|---|
a The data were fitted by the one sites model. In these cases, the number of binding sites was about two, indicating that the two guests bind the host with the same binding constant. b The data were fitted by the sequential binding sites model. | |
L-TrpTrp | (1.80 ± 0.25) × 1014 |
D-TrpTrp | (1.99 ± 0.30) × 1014 |
L-PhePhe | (1.56 ± 0.11) × 1010 |
D-PhePhe | (1.72 ± 0.34) × 1010 |
L-TyrTyr | (2.36 ± 0.21) × 1011 |
D-TyrTyr | (1.39 ± 0.11) × 1011 |
L-PheTrpb | (4.46 ± 0.89) × 1012 |
L-TyrTrp | (5.34 ± 1.71) × 1012 |
L-TrpPhe | (3.55 ± 1.18) × 1013 |
L-TyrPhe | (1.98 ± 0.31) × 1011 |
L-TrpTyrb | (6.68 ± 0.74) × 1011 |
L-PheTyr | (1.10 ± 0.13) × 1012 |
D-Trp–L-Trp | (3.29 ± 0.10) × 1013 |
L-Trp–D-Trp | (3.61 ± 0.21) × 1013 |
TrpTrpGlyGly | (4.11 ± 0.46) × 1013 |
GlyTrpTrpGly | (3.91 ± 0.59) × 1013 |
GlyGlyTrpTrp | (3.36 ± 0.42) × 1013 |
PhePheGlyGly | (9.99 ± 1.07) × 108 |
GlyPhePheGly | (1.54 ± 0.15) × 108 |
GlyGlyPhePhe | (4.49 ± 0.17) × 108 |
Aβ1–20 | (2.37 ± 0.54) × 1012 |
Somatostatin | (1.18 ± 0.06) × 109 |
Human insulin | (1.15 ± 0.29) × 1013 |
Fig. 2 (a) Schematic representation of the host–guest complexation of PP-1⊃(L-TrpTrp)2. (b) 1H NMR spectra (400 MHz, 298 K, D2O) recorded for 1 (0.40 mM) with 1.0–3.0 equiv. of L-TrpTrp. (c) ITC of 1 with L-TrpTrp at 298 K in phosphate buffer (10 mM sodium phosphate, pH = 7.4). (d) Top and side views of the X-ray structure of PM-1⊃(L-PhePhe)2. The dashed lines are key CH⋯π interactions with distances of 2.70 Å (A), 2.35 Å (B) and 2.83 Å (C), electrostatic interactions with distances of 4.44 Å (D), and ion–dipole interactions with distances of 3.85 Å (E) and 4.97 Å (F). Br− counter ions and hydrogen atoms are omitted for clarity. CCDC number: 2155590† for PM-1⊃(L-PhePhe)2. |
To further understand the binding model of 1 with two dipeptide molecules, the X-ray quality crystals of 1 with L-PhePhe were obtained from their aqueous solution by slow solvent evaporation at room temperature (Fig. 2d and Table S1†). The crystallographic analysis shows that two L-PhePhe molecules are fully encapsulated inside the hydrophobic cavity of 1 through multiple noncovalent interactions along the diagonal direction of the cavity (Fig. 2d, top). And 1 can catch L-PhePhe molecules via electrostatic interaction (d(O−⋯N+) = 4.44 Å) between the carbonyl COO− and pyridinium N+ and two ion–dipole interactions (d(N⋯N+) = 3.85 and 4.97 Å) between the amido NH2 and pyridinium N+ (Fig. 2d, bottom). Therefore, the host–guest complexes of 1 and dipeptides are mainly stabilized by the cooperation of the hydrophobic effect and electrostatic/ion–dipole interactions in aqueous solution. The phenyl rings of two L-PhePhe molecules are not in the center of the cavity but at the intervals in the X-shaped molecular skeleton of 1, which indicates that the cavity of 1 is crowded for two aromatic dipeptide molecules. Therefore, the spatial proximity and steric hindrance effect between the TPE units of 1 and the phenyl rings of guests could be key factors that affect the rotational conformation of TPE units to achieve supramolecular chirality transfer in the host–guest complexes. In addition, two L-PhePhe molecules have no obvious interaction with each other. In this host–guest complex, the two TPE units of 1 adopt P- and M-rotational conformations to form PM-1⊃(L-PhePhe)2. Although the X-ray structure of PM-1⊃(L-PhePhe)2 does not show any direction selectivity on the rotational conformation of the two TPE units in the solid state, energy minimized structures of possible chiral host–guest complexes show that the selective conformation transformation from MP-1/MM-1 to PP-1 is an energy-favored process in the solution state when 1 molecule binds two L-PhePhe (Fig. S76†).
Subsequently, three pairs of aromatic dipeptide enantiomers including D/L-TrpTrp, D/L-PhePhe, and D/L-TyrTyr were selected to demonstrate the chiral adaptive responses of 1 by CD titration experiments (Fig. 3a). The CD spectra of 1 titrated with D-TrpTrp showed an obvious positive Cotton effect in the long-wavelength (310–450 nm) region, which is attributed to the M-rotational conformation of the TPE units (Fig. 3b), while the CD spectra of 1 titrated with L-TrpTrp showed negative CD signals at 310–450 nm, which corresponds to the P-rotational conformation of the TPE units (Fig. 3c). The mirror image of their CD spectra strongly confirmed that achiral 1 could exhibit CAR for D/L-TrpTrp to form a pair of enantiomeric host–guest complexes, MM-1⊃(D-TrpTrp)2 and PP-1⊃(L-TrpTrp)2, respectively. The dissymmetry factor (gabs) at 390 nm of MM-1⊃(D-TrpTrp)2 was 8.9 × 10−4, while gabs of PP-1⊃(L-TrpTrp)2 was −8.8 × 10−4. These results strongly suggest that the intrinsic chirality of D/L-TrpTrp determines the direction-selective rotational conformation transformation to generate MM-1 or PP-1, respectively. The gabs curves versus equiv. of L-TrpTrp further confirmed the 1:2 stoichiometry of PP-1⊃(L-TrpTrp)2, which is consistent with the ITC and X-ray results. Similarly, D/L-PhePhe and D/L-TyrTyr could also induce corresponding CD signals with the mirror image: one is positive for MM-1⊃(D-dipeptide)2 and the other one is negative for PP-1⊃(L-dipeptide)2, respectively (Fig. 3d and e and S77†). In addition, D/L-TrpTrp, which has a stronger binding affinity with 1, can induce a faster equilibrium based on the CD changes when compared with D/L-PhePhe and D/L-TyrTyr. This strongly suggests that the strong binding affinity of the host–guest complex is the primary factor for the chirality induction between the host and guest in the supramolecular chirality transfer process: the stronger binding between the host and guest can promote more effective chirality induction in the host–guest systems. Based on the CAR of 1 for the mixture of D/L-guests and their mirror-imaging CD spectra (Fig. 3f and S78–S80†), calibration lines with great linearity (R2 > 0.99) were obtained by plotting the CD signals of the three pairs of enantiomers at 390 nm against the enantiomeric excess (ee) values with errors of 1.22–3.68% (Fig. 3g), which can be used to determine the enantiopurity of dipeptide enantiomers. Interestingly, we also found the chiral adaptive responses of 1 are determined by the C-terminal residues of aromatic dipeptides: D-Trp–L-Trp induces the adaptive conformation chirality of PP-1, while L-Trp–D-Trp induces the adaptive conformation chirality of MM-1, which are consistent with L-TrpTrp and D-TrpTrp, respectively (Fig S81†). In these CAR host–guest systems, the association constants of achiral 1 with D/L-guests are almost same (Table 1). However, 1 can achieve chiral recognition to distinguish enantiomers and determine the enantiopurity, which mainly depend on the chiral adaptive responses with opposite CD signals to D/L-guests. The CD experiments also revealed that other L-aromatic dipeptides could also induce the conformation chirality of PP-1 to show negative CD with different intensities (Fig. S82–S87†). In addition, fluorescence titration between 1 and L-TrpTrp showed an obvious emission decrease of 1 centered at 560 nm, which could be mainly dominated by photoinduced electron transfer (PET) between electron-deficient pyridinium rings of 1 and electron-rich indole rings of L-TrpTrp (Fig. 3h).49 In contrast, the addition of L-PhePhe into the solution of 1 in phosphate buffer can induce an emission increase at around 560 nm, which could result from the restriction of intramolecular rotation (RIR) mechanism playing a dominant role when PhePhe molecules were inserted into the inner cavity of 1 (Fig. 3i).50
Given their adaptive conformation chirality and fluorescence, the CPL properties of the enantiomeric host–guest complexes further exhibited chiral adaptive responses to D/L-dipeptides. As shown in Fig. 3j, the CPL spectra recorded at 460–750 nm displayed positive or negative CPL induced by D-guests (e.g., D-PhePhe and D-TyrTyr) or L-guests (e.g., L-PhePhe and L-TyrTyr), respectively, which are consistent with CD experiments. Specifically, MM-1⊃(D-PhePhe)2 showed a positive CPL at around 560 nm (glum = 5.07 × 10−4), while PP-1⊃(L-PhePhe)2 displayed a negative CPL at around 560 nm (glum = −4.70 × 10−4). And the glum values of MM-1⊃(D-TyrTyr)2 and PP-1⊃(L-TyrTyr)2 were 6.40 × 10−4 and −4.10 × 10−4 at around 560 nm, respectively. The CPL needs a cooperative integration between supramolecular chirality and fluorescence in the supramolecular systems.35 Therefore, MM-1⊃(D-TrpTrp)2 and PP-1⊃(L-TrpTrp)2 hardly displayed CPL signals owing to their fluorescence quenching in the host–guest complexation (Fig. S88†).
1H NMR titration experiments of 1 with TrpTrp-containing tetrapeptides showed that the hydrophobic cavity of 1 preferred to encapsulate the TrpTrp residues of the peptide chains regardless of them being at the N-terminal, middle, or C-terminal positions, indicating that the hydrophobic effect is the main non-covalent force in these host–guest complexations (Fig. S90–S92†). TrpTrp-containing tetrapeptides have strong association constants (∼1013 M−2) with 1 in a 1:2 stoichiometry (Table 1). For PhePhe-containing tetrapeptides, the hydrophobic effect of PhePhe residues with 1 is much weaker than that of the TrpTrp residues. As a result, the deprotonated carboxylic groups with negative charge at the C-terminal position could offer an additional non-covalent force – electrostatic interaction – with positively charged pyridinium rings of 1, which can slightly affect the binding site between 1 and PhePhe-containing tetrapeptides. For example, the 1H NMR spectra of 1 with PhePheGlyGly showed that the proton resonances of CH2 units in the GlyGly residue and partial phenyl units in the PhePhe residue slightly shifted upfield (Δδ = −0.13 to −0.03 ppm), indicating that the PheGlyGly residue at the C-terminal position was preferentially bound inside the cavity of 1 (Fig. 4a and S93†). This is a result of the balance between the hydrophobic effect and electrostatic interaction. When the PhePhe residue is located near or at the C-terminal position in GlyPhePheGly and GlyGlyPhePhe, the positive cooperation of the hydrophobic effect and electrostatic interaction makes 1 preferentially bind with the PhePhe residue. The proton resonances of the phenyl, CH2, and CH groups of the PhePhe residue in GlyPhePheGly showed a large upfield shift (Δδ = −0.44 to −0.13 ppm) at a 1:2 ratio of host and guest, indicating that the PhePhe residue is inside the cavity of 1 (Fig. 4b and S94†). The proton resonances of phenyl, CH2, and CH groups of the PhePhe residue in GlyGlyPhePhe showed the largest upfield shifts (Δδ = −0.97 to −0.30 ppm) and the proton resonances of CH2 units in the GlyGly residue shifted downfield at a 1:2 ratio of host and guest, indicating that 1 binds the PhePhe residue (Fig. 4c and S95†). The UV/vis titration showed a slight redshift of the absorbance peak of 1 with clean isosbestic points, indicating the formation of the host–guest complexes (Fig. S96–S101†). ITC experiments further demonstrated that these tetrapeptides can be complexed with 1 in a 2:1 ratio (Fig. S102–S107†). The comparison of association constants showed that TrpTrp-containing tetrapeptides have much stronger binding affinity with 1 than PhePhe-containing tetrapeptides owing to their more hydrophobic indole groups, indicating that 1 possesses a good ability of sequence-specific recognition for TrpTrp (Table 1).
When these tetrapeptides were added to the aqueous solution of 1, the characteristic CD signals of chiral PP-1 at 310–450 nm were induced. As a result, all L-type tetrapeptides induced the P-rotational conformation chirality of 1 to generate PP-1⊃(tetrapeptide)2 with negative CD signals. The TrpTrp-containing tetrapeptides containing the N-terminal, middle, and C-terminal TrpTrp residues exhibited similar shapes and intensities in the CD spectra (Fig. 4d, S108 and S109†), because 1 could bind to the same position – TrpTrp residue – in TrpTrp-containing tetrapeptides. However, when the aromatic residues are PhePhe, the chirality induction between 1 and PhePhe-containing tetrapeptides was reduced in the order of GlyGlyPhePhe to GlyPhePheGly and PhePheGlyGly, which showed a certain position selectivity (Fig. 4e, S110 and S111†). In the case of PhePhe-containing tetrapeptides, the strongest negative CD signal of 1 was induced by GlyGlyPhePhe, while the CD signals induced by GlyPhePheGly and PhePheGlyGly were much weaker. These results further confirm that the aromatic amino acid residue at the C-terminal position is mainly a structural factor for chirality induction. The bulky chiral group of the aromatic amino acid residue at the C-terminal position can effectively affect the rotational conformation of the two TPE units of 1, owing to the spatial proximity and steric hindrance effect between the C-terminal aromatic rings and the phenyl rings of TPE units in the host–guest complexes. In addition, TrpTrp-containing tetrapeptides showed fluorescence quenching (Fig. 4f, S112 and S113†), while PhePhe-containing tetrapeptides showed fluorescence enhancement, which are consistent with their dipeptides (Fig. 4g, S114 and S115†). Therefore, achiral 1 exhibited the CAR for the TrpTrp-containing tetrapeptides with strong affinity (Ka ≈ 1013 M−2) and for the PhePhe-containing tetrapeptides with distinguished CD responses, when the TrpTrp or PhePhe residues were located at different positions in their tetrapeptide chains (Fig. 4h and i).
Footnotes |
† Electronic supplementary information (ESI) available: Experimental details including NMR, ITC, UV/vis, fluorescence, CD, CPL, and X-ray data. CCDC 2100784, 2100668 and 2155590. For ESI and crystallographic data in CIF or other electronic format see DOI: https://doi.org/10.1039/d2sc05854e |
‡ These authors contributed equally. |
This journal is © The Royal Society of Chemistry 2023 |