Open Access Article
Miao
Yu
a,
Lila
Ghamsari
b,
Jim A.
Rotolo
b,
Barry J.
Kappel
b and
Jody M.
Mason
*a
aDepartment of Biology & Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK. E-mail: j.mason@bath.ac.uk; Tel: +44 (0)1225386867
bSapience Therapeutics, Inc., 500 Mamaroneck Ave. Suite 320, Harrison, NY 10528, USA
First published on 29th January 2021
To date, most research into the inhibition of oncogenic transcriptional regulator, Activator Protein 1 (AP-1), has focused on heterodimers of cJun and cFos. However, the Fra1 homologue remains an important cancer target. Here we describe library design coupled with computational and intracellular screening as an effective methodology to derive an antagonist that is selective for Fra1 relative to Jun counterparts. To do so the isCAN computational tool was used to rapidly screen >75 million peptide library members, narrowing the library size by >99.8% to one accessible to intracellular PCA selection. The resulting 131
072-member library was predicted to contain high quality binders with both a high likelihood of target engagement, while simultaneously avoiding homodimerization and off-target interaction with Jun homologues. PCA screening was next performed to enrich those members that meet these criteria. In particular, optimization was achieved via inclusion of options designed to generate the potential for compromised intermolecular contacts in both desired and non-desired species. This is an often-overlooked prerequisite in the conflicting design requirement of libraries that must be selective for their target in the context of a range of alternative potential interactions. Here we demonstrate that specificity is achieved via a combination of both hydrophobic and electrostatic contacts as exhibited by the selected peptide (Fra1W). In vitro analysis of the desired Fra1–Fra1W interaction further validates high Fra1 affinity (917 nM) yet selective binding relative to Fra1W homodimers or affinity for cJun. The isCAN → PCA based multidisciplinary approach provides a robust screening pipeline in generating target-specific hits, as well as new insight into rational peptide design in the search for novel bZIP family inhibitors.
In particular, the coiled coil dimerization domain has been the focus of much research since it is key in driving the highly specific PPIs required for Fra1 activity. Specific details on coiled coil formation have been discussed extensively elsewhere.7–16 Here we present the derivation of a peptide antagonist that is selective for Fra1 in the presence of Jun-family members, and that is also able to resist homodimerization. The sequence is derived from a combination of in silico and intracellular library screening to target the leucine zipper region of Fra1. In particular a computational semi-rational library design approach was taken that explicitly considers all potential dimeric interactions within the system, including user defined competitors (all Jun family members). This provided a very large in silico library from which peptides were ranked to produce a reduced size, yet high quality library for in-cell screening using protein fragment complementation assay (PCA). This approach enabled the selection of library members with highest affinity and selectivity for Fra1.
582
720 member) library into a smaller, yet higher quality library (131
072 member) to identify peptides that can selectively target Fra1. In particular, by vastly reducing the library size to remove members unlikely to fulfil the desired criteria, it becomes readily accessible to our intracellular screening approach, containing many members with the predicted desired properties and therefore an increased likelihood of highly selective binders. The approach makes extensive use of the bZIP Coiled Coil Prediction Algorithm (bCIPA),17,18 which works by predicting the Tm of a dimeric coiled coil based only on primary sequence information. bCIPA has been used as the basis for in silico tools that can mimic both the Protein-fragment Complementary Assay (PCA) in identifying high affinity coiled coils, and Competitive And Negative Design Initiative (CANDI) to render them specific in the presence of competitor sequences.18 The in silico PCA (isPCA) and in silico CANDI (isCAN) equivalents (Fig. 1)20–22 can serve as useful tools in reducing much larger libraries, than are accessible experimentally, to those that are smaller and of high-quality. isCAN predicts the highest affinity peptides with the largest difference between specified target and off-target complexes (ΔTm).19 This includes the potential for both target and library homodimers, as well as multiple user-defined off-targets. In utilizing the underlying bCIPA algorithm, isCAN incorporates helical propensity, core, and electrostatic interactions to provide a quantitative estimate of the interaction affinities in the form of a Tm; the various functions within the algorithm assign scores to the peptide–peptide interactions. In addition to built-in frame alignment and prediction functions, isCAN has a number of unique built-in check points. These make use of the individual predictions relating to the library (L), target (T), and competitor (C) peptides. Owing to optimization of core and electrostatic residues found in designed libraries, many peptide members are predicted to be more stable as homodimeric complexes than as heterodimers with the target. isCAN is therefore split into two sections: the first set of calculations mirroring the PCA (isPCA section) and the second introducing the competitor peptides (isCAN). This stepwise calculation ensures that the processing time is not wasted on library members that are predicted to preferentially homodimerize or are unable to overcome the target homodimer (and are therefore not “PCA-successful”). A key concept in both is the predicted ΔTm. It is the key determinant behind the separation of successful and unsuccessful peptides in the library.20–22
![]() | ||
| Fig. 1 Overview of the isPCA and isCAN protocols. Shown are desired and numerous undesired states that can form upon combination of the library/target/competitor peptides. Complexes screened within isPCA include library–library, (negative), target–target (competitive) and library–target (desired). Under isCAN, multiple user defined competitor sequences bring about additional library–competitor (negative) and target–competitor (competitive) complexes. Within isCAN, specificity is driven by the desired predicted ΔTm value as specified by the user. Therefore, a library member is only successful if it forms the desired complex with a predicted Tm value greater than the ΔTm set by the user. Further details are discussed elsewhere.20–22 | ||
The 75
582
720 member peptide library designed to target Fra1 corresponded to five heptad repeats of a leucine zipper template (gabcdef). The initial isCAN step utilized three Jun family members (cJun, JunB and JunD) as competitors. Peptides to emerge from screening were required to possess a predicted ΔTm ≥ 25 °C. This ΔTm cut-off was chosen since it gave high confidence that the required selectivity for Fra1 over Jun family members could be achieved, while generating a library size that could be realistically represented in PCA. Peptides meeting these criteria were predicted to be capable of outcompeting all other possible complexes; i.e. to overcome potential target–target, library–library, library–competitor and target–competitor complexes (see experimental). To facilitate this, the interfacial g, e, and a positions were semi-randomized and screened in order to create a more refined and reduced-size library for entry into PCA screening, narrowing options presented at each semi-randomized position to the more specific residues predicted to be required for Fra1 binding (Fig. 2).
![]() | ||
Fig. 2 Fra1 library design and screening overview. During library design for isCAN screening peptide options were semi-randomized at all 10 e/g positions (Q/E/K) and a positions (L/I/V/A and L/I/V/A/N at a3). The library illustrates the hydrophobic options at the core positions (a/d) and the charged/polar options present at the flanking positions (e/g). In particular, positions g and e were semi-randomized with a view to generating potential attractive and repulsive options with the corresponding positions of the target. Similarly, core a positions were semi-randomized to generate aliphatic hydrophobic options. Positions c and d were fixed as A and L respectively (position b2 as Y for quantification purposes). The final PCA design arising from the isCAN step (using cJun, JunB, and JunD competitors) resulted in a library size of 131 072 members (i.e. >99.8% library size reduction), with 13 randomised positions of 2 or 4 options at each. The PCA library was derived by inspection of sequence variations within all 465 peptides predicted to display the specified ΔTm of ≥25 °C according to the isCAN software. Note that additional a position Thr residues in the PCA library were present owing to unavoidable degenerate codon options. The helical wheel diagrams were generated using DrawCoil 1.0, http://www.grigoryanlab.org/drawcoil.23 | ||
A design principle of the library was to begin with an N-terminal g position and end with a C-terminal e position to maximise the potential for attractive/repulsive electrostatics, with 15 semi-randomized positions of 3–5 options at each position placed into the library sequence at positions corresponding to core a hydrophobic and e/g electrostatic positions within the heptad repeats (Fig. 2). The isCAN peptide library was next narrowed to 485 sequences that met the requirements of the ΔTm cut-off, which when degenerated created a PCA accessible 131
072-member library (Fig. S1 and S2, ESI†). This reduced the original library size by >99.8%, providing one with 13 semi-randomised positions with 2 or 4 options at each (Fig. 2). isCAN selected exclusively K at positions g2 and e2, deeming it to be favourable for target binding while disfavouring binding to all other species. Position a3 was limited to I/N, g1 and e3 to Q/K, g3 and g4 to Q/E, e1, e4 and e5 to K/E, and a1, a2, a4 and a5 to V/I/A/T (in constructing the PCA library, Thr was unavoidably generated by degenerate codons). Accuracy and variety of the constructed library was verified by DNA sequencing as presented in ESI† (Fig. S1).
Single step PCA selection was undertaken on M9 selective agar media, followed by DNA sequencing and further rounds of competition selection in liquid M9 medium, resulting in one clean DNA sequence in the pool after ten rounds of competition selection passaging (Fig. 3, 4 and Fig. S3, ESI†). The isCAN → PCA selected sequence was named as Fra1W. Helical wheel inspections of Fra1–Fra1W heterodimers and Fra1W homodimers and cJun–Fra1W heterodimers (Fig. 4) illustrate the hydrophobic interface at the core positions (a/d) and the charged residues present at the flanking positions (e/g). Leu residues found at core d positions were preserved to maintain the leucine zipper. Fra1 and Fra1W heterodimer contains five favourable electrostatic interactions between E and K at e and g positions (blue hash). In contrast, the Fra1W homodimer contained six unfavourable electrostatic interactions via K–K alignment, with Fra1–cJun forming four unfavourable electrostatic interactions (K–K and K–R) and only one favourable electrostatic (E–K) interaction. These interactions provide a greater scope for stabilization of the target–antagonist complex over other possibilities. At the core, Fra1 residues are atypical and less favourable for hydrophobic interactions, with the a position consisting of two T and two K residues. Fra1W displays two A residues, two I residues and one V in establishing a heterodimer with Fra1.
![]() | ||
| Fig. 4 Helical wheel representations of potential interactions with Fra1W. Fra1–Fra1W heterodimeric and Fra1W homodimeric helical wheel diagrams illustrate the hydrophobic interface at the core positions (a/d) and the charged residues present at the flanking positions (e/g). Leu residues found at core d positions were preserved to maintain the leucine zipper. The Fra1–Fra1W interaction contains favourable electrostatic (blue hash) and core interactions to drive coiled coil formation. Fra1W–Fra1W and cJun–Fra1W display unfavourable electrostatic (red hash) interactions disfavouring their formation. Helical wheel diagrams were generated using DrawCoil 1.0, http://www.grigoryanlab.org/drawcoil.23 | ||
Circular dichroism (CD) spectra confirmed that all samples were broadly α-helical, with cJun and Fra1 samples only weakly populated. In particular, CD spectra showed Fra1 to exist as 21% helical (Fig. 5a) with the 208 nm signal significantly exceeding that of 222 nm, indicating low homodimeric affinity. The secondary structure content of Fra1W in isolation displayed a 60% helical signature (Fig. 5b), with that of the Fra1–Fra1W complex (Fig. 5c) exhibiting a more intense signal with greater α-helical content (75%), more than three times that of the target Fra1 in isolation. In addition, the 222/208 nm ratio was 1.06, providing further evidence for a significant increase in the helical stability of Fra1–Fra1W and evidence toward the formation of a quaternary structure.24 Moreover, it clearly demonstrated an increase from the averaged homomeric signals of Fra1 and Fra1W (Fig. 5d, red dash vs. black) at 20 °C before thermal denaturation.
In contrast, the cJun–Fra1W signal superimposed with the averaged homomeric signals (Fig. 6d, red dash vs. black line). Overall this demonstrates that incubation of Fra1 with Fra1W elicits a significant conformational change in the sample and provides compelling evidence for the formation of a coiled coil. Moreover, Fra1W selectively binds to Fra1 without interaction with cJun.
To provide further evidence for the preferential binding of Fra1W to Fra1 over cJun, and that Fra1W was able to interrupt a Fra1–cJun interaction, dimer exchange experiments were performed at equimolar concentrations of each peptide. First cJun was added to a heteromeric sample containing Fra1–Fra1W; in this case the component spectra of Fra1–Fra1W and cJun superimposed with that of the three combined, indicating that no exchange of binding partner had occurred (Fig. 8a; black observed vs. red hash average). However, when Fra1 was added to a heteromeric sample containing cJun–Fra1W, the CD signal increased from the average of the component samples (Fig. 8b; black observed vs. red hash average). Similarly, when Fra1W was added to a solution containing Fra1–cJun an increase in signal was observed, providing further evidence for the ability of Fra1W to compete off cJun from Fra1 (Fig. 8c). Reassuringly, the signals for the three combined peptides from all three experiments were found to superimpose (Fig. 8d), demonstrating that both samples had equilibrated to generate the same helical signature. These experiments provide firm evidence that Fra1W preferentially binds Fra1, interrupts a Fra1–cJun interaction, and that cJun is unable to interrupt the heterodimeric Fra1–Fra1W complex.
![]() | ||
| Fig. 9 SEC profiles indicate that Fra1W binds Fra1 but does not homodimerize or bind cJun. Shown are size exclusion chromatography profiles for (a) noninteracting and (b) interacting peptides. A peak at approximately 19 min for the Fra1–Fra1W mixture (b – black trace) represents a dimeric sample whilst cJun, Fra1, Fra1W and cJun–Fra1W generate a peak at approximately 20 min, indicating monomeric samples. These experiments, undertaken at a total peptide concentration of 20 μM, provide additional evidence for selectivity of the Fra1–Fra1W interaction. Arrows show previously characterised controls with elution times for a 32mer Fos monomeric peptide (20 min) and a 37mer cJun–FosW heterodimer (18.5 min).25 | ||
![]() | ||
| Fig. 10 Isothermal titration calorimetry analysis of the Fra1–Fra1W interaction. On the fitted data plot, the solid line represents the fit to the data based on the binding of a ligand to a macromolecule using the MicroCal (GE Healthcare) Origin software.26 See Materials and methods for further details. | ||
072 sequences to be accessible to the intracellular PCA, resulting in a highly selective nM affinity antagonist of Fra1. Of note within this process is that Fra1W was ranked 13th out of 485 peptides (top 3%) to emerge from the isCAN screen. Moreover, since residue variations within the 485 peptides degenerated to create a library of 131
072 for the PCA scan, the peptide identified is within the top 0.01% of all predicted sequences to enter the PCA step, giving considerable confidence in the system.
An important part of the screening approach is the inclusion of options that generate both favourable and compromised intermolecular contacts within the desired species. This overlooked prerequisite acts to define PPIs which are stable, while blocking the formation of otherwise energetically accessible alternatives. Formation of the Fra1–Fra1W heterodimer results from five favourable E–K electrostatic interactions between e and g residues. The formation of a Fra1W homodimer is destabilized via six unfavourable electrostatic K–K pairs. These provide greater scope for stabilization of antagonist–target heterodimeric complexes, destabilize antagonist homodimers, and enhance overall interaction specificity.25,30–32 Residues within the Fra1 core are atypical and less favourable for hydrophobic interaction; however, Fra1W displays a hydrophobic core, consisting of I, V, as well as smaller A sidechains, to establish favourable interactions with Fra1. Reassuringly, in combination with the electrostatic component, the permutation of these aliphatic hydrophobics within Fra1W does not disproportionately favour interaction as a homodimer or with cJun. Rather, it is shown that specificity is achieved using a combination of both hydrophobics and electrostatics. Heterodimerisation of Fra1W with cJun/JunB/JunD is prevented via four K/R repulsions. In vitro analysis via a combination of CD, SEC and ITC validates the isCAN → PCA approach and demonstrates that Fra1W is able to specifically interact with Fra1 while avoiding homodimerization or interactions with off-target cJun. Luciferase gene reporter experiments, in which Fra1W is fused to an NLS–Tat cell penetrating peptide, additionally demonstrates that the peptide can enter A549 lung cancer cells, impacting upon AP-1 Fra1 transcriptional activity in a dose dependent manner. Heterodimers of Fra1–Fra1W are established to be dimeric in nature and cannot be disrupted by the addition of cJun. Interaction with other Jun homologues is unlikely since (i) JunB and JunD were explicitly considered during the isCAN step and (ii) all Jun family members share the same a/d core (a major driver in coiled coil stability) with only two e/g residue differences between cJun and Jun B (Fig. S5 – g3 Q-to-E and e3 A-to-S, ESI;† JunD is identical at the interface). We are therefore confident that Fra1W does not interact with any Jun homologue.
Although the ability to select between Fos members, was not an aim of this study, we believe that it is unlikely to present an issue. In particular; (i) individual AP-1 members are known to be driven by unique temporal and tissue-specific patterns, impacting upon different target genes.36 Therefore, a key feature in targeting the correct Fos homologue is not only the ability to impart selectivity, but the ability to deliver the peptide to the particular cell in which the target protein homologue is overexpressed. This has been achieved for example by targeting cancer cells that overexpress specific populations of cell surface receptors that are highly expressed in specific cancers, thereby serving as physiological targets for therapeutic delivery. Indeed, receptor-mediated drug delivery presents an emerging opportunity to enhance therapeutic efficiency by accumulating the drug within the tissue of interest where the target resides, thereby reducing undesired, off-target effects;33 (ii) since Fra1 must heterodimerise with a Jun family member to become active, selectivity for Fra1 in the context of the Jun protein family remains key in blocking Fra1 driven transcriptional activity; (iii) as we have shown previously,17 native heterodimers between Jun/Fos family members are also non-specific according to affinities between the various combinations, again suggesting that unique temporal and tissue-specific expression patterns are the major drivers in determining the precise AP-1 composition; (iv) lastly, the Fos family exists with minimal differences between members, with cFos/FosB/Fra1/Fra2 all sharing an identical a/d core. There are only three minor e/g residue changes from Fra1 (Fig. S5 – e3 Q-to-E for FosB, e4 Q-to-L for cFos, and g5 Q-to-E for cFos/FosB/Fra2, ESI†). Overall, these changes are very modest, and therefore Fra1W is not expected to be capable of discerning between Fos family members.
Future structural biology approaches may be employed to provide further insight into the mechanism of Fra1–Fra1W interaction, while introduction of macrocyclic structures into the Fra1W framework may provide the ability to further strengthen binding activity. Addition of cell penetrating peptide motifs and/or other moieties to facilitate receptor-mediated drug delivery may be required to impart further selectivity in delivering peptide derivatives to their intracellular target. In conclusion, the isCAN → PCA library-based multidisciplinary approach harbours significant potential in the search for potent and selective PPI inhibitors, with the potential to deliver further insight into rational peptide-based drug design towards use in the clinic.
582
720 member peptide library was designed by introducing semi-randomised residue options at positions corresponding to key interfacial positions within each heptad repeat of a coiled-coil motif (gabcdef). The library was next screened using isCAN software based on the bCIPA algorithm, which has been described in detail elsewhere.20–22,25,34 Briefly, the software provides a qualitative rank of affinity by estimating the thermal melting point (Tm) of every potential dimeric interaction within the system. In this library, every g and e position within the coiled coil, which are critical in forming electrostatic contacts within a coiled-coil sequence19 were semi-randomized to generate Q/E/K options, with a view to generating both potential attractive and repulsive options with the corresponding positions of the target (Fig. 2). Similarly, all a positions corresponding to the core region within a coiled-coil sequence (a1, a2, a4, a5) were semi-randomized to generate L/I/V/A options. The a3 position was semi-randomized to additionally generate an Asn option (L/I/V/A/N). All c and d positions were fixed as A and L, respectively, to impart helicity and further core hydrophobicity that is characteristic of the parallel dimeric coiled-coil motif.20 Using the in silico CANDI (isCAN) software, all peptide library members were next computationally screened for predicted affinities in the form of a Tm. During isCAN the stability of every member was considered as (i) a homodimer, (ii) with the Fra1 target, and (iii) as a potential heterodimer with off-target competitors cJun, JunB and JunD, as well as (iv) the stability of any potential target–target complex, or (v) target–competitor complexes. To distinguish between desired (library–target) and non-desired interactions (target–target, target–competitor, library–library, library–competitor interactions), a lowest acceptable predicted ΔTm was defined as 25 °C (Fig. 1). isCAN is split into two sections: the first set of calculations mirroring the PCA (isPCA) and the second introducing the competitor peptides (isCAN). This stepwise calculation ensures that processing time is not wasted on library members that are predicted to preferentially homodimerize or are unable to overcome the target homodimer (and are therefore not “PCA-successful”). A key concept in both is the predicted differences in Tm (ΔTm). It is the key determinant behind the separation of successful and unsuccessful peptides in the library. PCA-successful library members then have their desired state Tm compared with library member-competitor Tm values and the “CANDI-successful” library members are finally exported.20,22 The library subset remaining was next screened in order to reduce residue options >500 fold to create a small yet high quality library that was accessible to intracellular PCA screening (Fig. 3).
072, meaning that 99.8% of all PCA library members were removed at this stage owing to their inability to effectively bind Fra1 or to rescue cell growth.
:
1 MeCN/H2O mixture and freeze-dried. Purification was achieved via reverse-phase high-performance liquid chromatography (RP-HPLC) using a Phenomenex Jupiter Proteo (C18) reverse-phase column (4 μm, 90 Å, 21.2 mm inner diameter × 250 mm length). The following eluents were used: 0.1% TFA in H2O (a) and 0.1% TFA in ACN (b). The peptide was eluted by applying a linear gradient (at 15 mL min−1) of 20 to 50% of 0.1% TFA in ACN (b) over 30 min. The fractions that were collected were examined by electrospray MS, and those found to contain exclusively the desired product were pooled and lyophilized. Analysis of the purified final product by RP-HPLC indicated a purity of >95% (Fig. S4, ESI†).
| q(i) = ((nΔHVP)/2)[1 + (L/nP) + (Kd/nP)] − [[1 + (L/nP) + (Kd + nP)]2 − (4L/nP)]1/2 |
:
1 stoichiometry) to extract the various thermodynamic parameters,26 namely the apparent equilibrium constant (KD) and the enthalpy change (ΔH) associated with heterodimerisation. The free energy change (ΔGbind) upon ligand binding can be calculated from the relationship:26ΔGbind = −RT ln KD |
000 cells per wells of a white opaque 96-well plate, and transfected with either the reporter vector or the negative control vector using TransIT-LT1 transfection reagent (Mirus Bio, WI, USA). Twenty-four hours post transfection, the culture medium was exchanged with low FBS assay medium and the cells were treated with given concentrations of Fra1W diluted in the assay medium. The AP-1 transcriptional activity was stimulated with 10 nM PMA 18 hours after peptide treatment. Six hours after PMA stimulation, the cells were lysed and the Firefly luciferase activity was measured using the Firefly luciferase reagent (BPS Bioscience, San Diego, CA, USA).
000 cells per wells of a black opaque 96-well plate. The cells were treated exactly as described in the Luciferase gene reporter experiment section. Twenty-four hours post peptide treatment, the cell viabilities were measured using CellTiter Blue (Promega Inc.) according to the manufacturer's instruction.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/d1cb00012h |
| This journal is © The Royal Society of Chemistry 2021 |