Jason
Stahlecker
a,
Theresa
Klett
a,
Martin
Schwer
a,
Simon
Jaag
b,
Marcel
Dammann
a,
Larissa N.
Ernst
a,
Michael B.
Braun
c,
Markus O.
Zimmermann
a,
Markus
Kramer
d,
Michael
Lämmerhofer
b,
Thilo
Stehle
c,
Murray
Coles
e and
Frank M.
Boeckler
*af
aLab for Molecular Design & Pharm. Biophysics, Institute of Pharmaceutical Sciences, Department of Pharmacy and Biochemistry, Eberhard Karls Universität Tübingen, Auf der Morgenstelle 8, 72076 Tübingen, Germany. E-mail: frank.boeckler@uni-tuebingen.de
bInstitute of Pharmaceutical Sciences, Pharmaceutical (Bio-)Analysis, University of Tübingen, Auf der Morgenstelle 8, 72076, Tübingen, Germany
cInterfaculty Institute of Biochemistry, Eberhard Karls Universität Tübingen, Auf der Morgenstelle 34, 72076 Tübingen, Germany
dInstitute of Organic Chemistry, Eberhard Karls Universität Tübingen, Auf der Morgenstelle 18, 72076 Tübingen, Germany
eDepartment of Protein Evolution, Max Planck Institute for Biology Tübingen, Max-Planck-Ring 5, 72076 Tübingen, Germany
fInterfaculty Institute for Biomedical Informatics (IBMI), Eberhard Karls Universität Tübingen, Sand 14, 72076 Tübingen, Germany
First published on 20th October 2022
The cellular tumor antigen p53 is a key component in cell cycle control. The mutation Y220C heavily destabilizes the protein thermally but yields a druggable crevice. We have screened the diversity-optimized halogen-enriched fragment library against T-p53C-Y220C with STD-NMR and DSF to identify hits, which we validated by 1H,15N-HSQC NMR. We could identify four hits binding in the Y220C cleft, one hit binding covalently and four hits binding to an uncharacterized binding site. Compound 1151 could be crystallized showing a flip of C220 and thus opening subsite 3. Additionally, 4482 was identified to alkylate cysteines. Data shows that the diversity-optimized HEFLib leads to multiple diverse hits. The identified scaffolds can be used to further optimize interactions with T-p53C-Y220C and increase thermal stability.
The low thermal stability makes handling and thus research of p53 and/or the DNA binding domain (p53C, residues 94–312) challenging. A quadruple mutant (M133L/V203A/N239Y/N268D) was designed, increasing thermal stability by about 5.6 °C.9 This thermally stabilized mutant, commonly called “T-p53”, has become a standard for biophysical experiments.
The mutation Y220C, located in the core domain, is one of the well-known hotspots.10 The substitution of tyrosine to cysteine thermally destabilizes the core domain by about 8 °C.11 However, this mutation yields a small, hydrophobic cleft. Early work in targeting this Y220C cavity has yielded a carbazole derivate, PK083 that stabilizes T-p53C-Y220C by 0.8 °C (125 μM cmpd. conc.).12 Trifluorination at the N-ethyl anchor of PK083 improved the stabilization to 1.2 °C (125 μM cmpd. conc.) by benefiting from multipolar interactions and fluorine–sulfur contacts.13 This compound and other selected are depicted in Fig. 1.
![]() | ||
| Fig. 1 An exemplary selection of published T-p53C-Y220C binders.12–17 Compounds PK083 and PK784 occupy the central cavity and subsite 1, while PK5196, PK7088 and MB710 additionally bind subsite 2. The pyrrole group of PK784 and MB710 also enables them to engage to the deep subsite 3. PK11000 showed covalent addition to multiple cysteines. | ||
Quantum chemical analysis of the cavity showed that Leu145 can be addressed by halobenzene moieties, which initially led to the design of a halogen-enriched fragment library.14 The 2,4-diiodophenol derivative PK784 was identified and crystallized, showing that a halogen bond with Leu145 as postulated is formed. Also, the compound increases the melting temperature by 0.55 °C (250 μM cmpd. conc.). Optimization of this scaffold led to PK5196, stabilizing T-p53C-Y220C by 3.61 °C (250 μM cmpd. conc.) using an acetylene linker to extend into subsite 2, illustrated in Fig. 2.14
![]() | ||
| Fig. 2 The Y220C mutation leads to a druggable cleft. (A) Overall view of T-p53C-Y220C bound to PK784 (4AGL14). The location of the mutation is highlighted, leading to a druggable cleft. (B) PK784 engages in a halogen bond with L145. The other iodine points towards subsite 2 and can be used as vector for ligand growing. (C) Surface representation of Y220 and surrounding amino acids in T-p53C (1UOL18). (D) The mutation opens the central cavity, connecting subsite 1 and subsite 2 (4AGL14). (E) The lower part of the cleft is not targeted as C220 points towards the compound. (F) Upon binding of an electron rich group, e.g. pyrrole in PK7242, the cysteine is displaced enlarging subsite 3 and enabling targeting (3ZME15). | ||
Later efforts identified PK7088 from a fragment library as a Y220C cleft binder, bearing a different scaffold. Interestingly, the pyrrole sidechain of this compound points into a deep internal cavity (subsite 3, Fig. 2) formed by a flip of C220, leading to a stabilization of 1 °C (350 μM cmpd. conc.).15
Based on this discovery, the aminobenzothiazole derivative MB710 was synthesized, linking the diiodophenol scaffold to the pyrrolo sidechain. This structure yields a thermal stabilization by 2 °C (250 μM cmpd. conc.).16
These promising results were the basis for the design of a diversity-optimized halogen-enriched fragment library (HEFLib).19 In this generalized HEFLibs approach, aiming for a collection of chemical probes for investigating halogen bonding by fragment-based drug discovery,20 library design and selection principles became independent from the initial target-focused approach.14 From kinase drug discovery, we learned that molecular design of halogen bonds at an advanced state of the lead optimization process can be quite challenging and does not necessarily reach its full potential to improve ligand affinities.21 Thus, we concluded that focusing on the earliest stages of the drug discovery process by generalizing our HEFLib's strategy can also provide the chance to establish unconventional binding modes based on specific halogen bonding motifs and allows to harness the chemotypes of such hits as novel lead structures with great impact on patentability.19,20 As the design was led by diversity and limited by availability, only 14 of the 191 compounds contain an iodine. This also means that compounds with strongly tuning groups were not specifically selected. The compounds based on PK784 on the other hand contain an iodine and a positive charge, leading to strong tuning effects.22
In principle, there is a multitude of suitable electron-rich interaction partners for accepting a halogen bond in a typical binding site:23 the backbone carbonyl,24,25 the peptide bond,26 the sulfur of methionine27 or cysteine, the nitrogen of histidine,28 the oxygen of the hydroxyl group of serine, threonine or tyrosine, the carboxamides of asparagine and glutamine, as well as the carboxylates of aspartate and glutamate,29 and the π-systems of phenylalanine, tyrosine, tryptophan, or histidine.
Based on this abundance of interaction partners and the different construction of the generalized HEFLib, we deemed it an interesting experiment to revisit the challenging binding site of T-p53C-Y220C with such an unbiased library. It is noteworthy that this diversity-optimized HEFLib has been thoroughly tested and characterized, resulting in many different hits on various targets.30–32
Another approach for stabilization was identified by covalent modification of cysteines other than C220.17 A 2-sulfonylpyrimidine was identified that covalently modifies C182 and C277, increasing the Tm up to about 2.5 °C without losing affinity towards DNA. This effect is independent of the Y220C mutation, which makes it a great candidate for a general T-p53C stabilization approach.
As the thermal destabilization of the Y220C mutation is of greatest concern, we performed differential scanning fluorimetry (DSF) as one of our primary screening techniques. Additionally, we used saturation transfer difference (STD) NMR for hit identification. Comparing target binding to its functional results by these two techniques can yield valuable insights for the drug discovery process. All hits found by these independent techniques were validated by 1H,15N-HSQC. Thus, binding modes independent of the Y220C induced cleft were detectable during primary screening as well and were classified through the validation process.
:
1 mM (1
:
50), for DSF 6 μM
:
1 mM (1
:
125) and HSQC 65 μM
:
2 mM (1
:
30) was used
| Compound | ΔTm [°C] | STD | HSQC | t 1/2 GSH [h] |
|---|---|---|---|---|
| 0116 | −0.05 | Yes | Yes | >100 |
| 0403 | 0.05 | Yes | No | — |
| 0404 | −0.10 | Yes | Yes | >100 |
| 0459 | −0.05 | Yes | Yes | >100 |
| 0522 | 0.30 | Yes | No | — |
| 0660 | 0.10 | Yes | No | — |
| 1151 | 0.80 | NA | Yes | >100 |
| 1218 | −0.20 | Yes | Yes | >100 |
| 1223 | 2.0–2.55 | Yes | Yes | 0.81 |
| 1243 | −0.25 | Yes | Yes | >100 |
| 1246 | 0.15 | Yes | No | — |
| 4482 | 0.5–1.75 | No | Yes | 2.8 |
| 7394 | −0.15 | Yes | Yes | 61 |
| 7405 | 0.20 | Yes | Yes | >100 |
The DSF results of the triplicate measurement of the STD and DSF hits during the screen are shown in Fig. 3. The compounds that were considered hits are all significant (p < 0.05) compared to the reference.
Based on the 1H,15N-HSQC data, hits could be categorized into three groups. By comparing the peak shifts with the 1H,15N-HSQC spectra of crystallographically confirmed Y220C cleft binders,12,14 one group was identified as the Y220C cleft binding group. Another group was confirmed to bind covalently to T-p53C-Y220C. The third set contained significant peak shifts but does not suggest Y220C cleft binding. The chemical structure and corresponding group are depicted in Fig. 4.
Of the proposed cleft-binding compounds, only compound 1151 stabilized the protein by more than 0.5 °C. The 1H,15N-HSQC spectrum (Fig. 5A) shows multiple peak shifts indicating cleft binding. The DSF curves with addition of 1151 (ratio 1
:
125), shown in Fig. 5B, are shifted by 0.8 °C.
Compounds 0459 and 1151 both contain a pyrazole scaffold with a halogen in position 4. A structure was solved of T-p53C-Y220C bound to 1151 (PDB: 8A92). Data collection and refinement statistics are displayed in Table 2. The structure, especially in chain A, indicates multiple binding poses, which could be identified by the anomalous signal of the bromine (Fig. 6). For chain B the occupancy was low and only one pose was built. The poses show that the CF3 group points towards the cleft and the bromine pointing towards L145 or the subsite 2. The electron-rich CF3 group displaces C220, opening subsite 3. This binding pose is similar to the ones containing a pyrrole ring, e.g. MB710 (Fig. 1 and 2). Even though the compound has a low KD value (>1 mM, based on non-fittable ITC data) it stabilizes T-p53C-Y220C by about 0.8 °C. It is likely that the binding is driven by hydrophobic interactions of the CF3 group to the cleft. No electron density could be observed for the amine.
| Compound 1151 (PDB: 8A92) | |
|---|---|
| Wavelength [Å] | 0.92 |
| Space group | P212121 |
| Cell dimensions | |
| a, b, c [Å] | 65.1, 74.1, 105.2 |
| α, β, γ [°] | 90, 90, 90 |
| Resolution range [Å] | 50–1.37 (1.45–1.37) |
| Redundancy | 13.2 (13.3) |
| Completeness [%] | 100 (100) |
| Mean I/σ(I) | 14.4 (1.0) |
| R-meas [%] | 11.7 (256.8) |
| CC1/2 [%] | 99.9 (49.7) |
| Wilson B [Å2] | 22.8 |
| Resolution included [Å] | 43.68–1.37 |
| R work/Rfree [%] | 15.32/18.32 |
| Bond RMSD [Å] | 0.008 |
| Angle RMSD [°] | 0.99 |
| Ramachandran [%] (favored/allowed/outliers) | 98.98/1.02/0 |
| Rotamer outliers [%] | 1.43 |
| All-atom Clashscore | 1.44 |
| Average B factor [Å2] | |
| Overall | 22.34 |
| Protein | 20.03 |
| Ligand | 41.4 |
| Water | 35.58 |
In one pose the bromine points towards the backbone oxygen of L145 with a distance of 3.0 Å and an angle of 153.4°. Based on this pose we calculated the adduct-formation energy of the compound and N-methylacetamide as a backbone model. In order to estimate the effect of the CF3-group we calculated the Vmax at an isodensity level of 0.02 au and the adduct-formation energy (ΔE) of 1151 and closely related scaffolds at the MP2/TZVPP level of theory. The results are displayed in Fig. 7. In addition, this bromine accepts an orthogonal hydrogen bond from the backbone nitrogen of T230.
The CF3 group is the main contributor for the tuning of the system, increasing the Vmax from 0.1597 au to 0.1737 au, comparable with iodobenzene (0.182 au).34 As the amine only has a small effect on tuning, the vector could be used for fragment growing. Another approach could be converting the amine to a tertiary amine, adding a positive charge and once more strongly tune the system.22
:
125 up to three molecules of 4482 were bound to T-p53C-Y220C. Among them, the species with two attached molecules showed highest intensity. Unmodified protein was not detectable. In contrast, the MS spectrum of the triple mutant C124/182/277S, shows only the species with one bound fragment. Unmodified protein could be identified. This indicates that covalent binding of 4482 to the now absent cysteines occurs. Using DSF measurements, a time-dependent stabilization could be identified (Fig. 8). The ΔTm of T-p53C-Y220C increased from 0.9 °C after 4 h incubation up to 1.75 °C after a 24 h incubation period.
In comparison, PK11000 resulted in a larger stabilization of T-p53C-Y220C up to a maximum ΔTm of about 2.5 °C. In contrast to 4482, a maximum number of two modified cysteines was evident in the ESI-MS spectra of PK11000. This highlights the selectivity of PK11000 for specific p53 cysteines.17
Interestingly, Bauer et al.17 could not detect any stabilization by the compound 2-chloro-5-(fluoromethyl)-pyrimidine. We measured significant stabilization with 4482, the trifluoro analog of the compound. This emphasizes the need of the CF3 moiety as a strong electron-withdrawing group (EWG) for covalent binding. In general, compound 4482 contains a similar scaffold to the known covalent modifier PK11000. Both are pyrimidines with leaving groups in position 2 and additional EWGs to enhance SNAr reactivity. These structural similarities should explain their comparable behavior towards T-p53C and T-p53C-Y220C. Moreover, it demonstrates the potential of their common scaffold as a covalent, Y220C-cleft independent, rescue for the oncogenic p53-Y220C mutant.
000 underlines the importance of continuous drug discovery for this target.35–38 The first confirmation of halogen bonding in T-p53C-Y220C led to the development of the diversity-optimized HEFLib. A total of ten hits from this diversity-optimized library could be confirmed by 1H,15N-HSQC. The development of a Vmax-optimized HEFLib could lead to an improvement of the hit rate.30 The diversity-optimized HEFLib did not yield another low Millimolar binding compound, but identified a multitude of diverse scaffolds binding to the protein. These scaffolds can now be further exploited by optimizing target binding and more importantly stabilization.
The HEFLib additionally produced more unconventional binding modes, by containing a covalent binder. This modification is independent of the Y220C mutation and could potentially lead to general purpose stabilizers with much broader applicability in the rescue of destabilized p53 cancer mutants. Another aim could be modifying these scaffolds to reestablish or modulate DNA binding in hot-spot mutations which loose their function based on altered DNA contacts.
Many of the identified and confirmed hits did not interact in the Y220C mutation-induced cavity as we had anticipated. For these hits further studies are needed to reveal their binding site and its possible implication for altering or rescuing p53 function. It should be noted that besides loss-of-function mutations, there are also oncogenic gain-of-function mutations known for p53.39–41 Thus, new binding modes could provide avenues for therapeutic intervention for both aspects of altered p53 function. Of course, tractability and druggability need to be shown for such a new binding site.42
The compound 1151 could be further characterized and used as a spy molecule for FAXS NMR.43,44 This could further facilitate screening experiments for the identification of novel fragments binding to the Y220C-induced cleft.
For labelled expression, M9 minimal medium45 was used supplemented with 1 g L−1 of 15NH4Cl prior to inoculation.
For mutagenesis, a pET24a-HLT vector with the thermostable p53 core domain (94–312) served as template. The Q5 site-directed mutagenesis kit (New England BioLabs) was used to produce the triple cysteine mutant C124/182/277S.
All constructs used, are displayed in Table 3.
| Construct | Sequence (N′–C′) |
|---|---|
| T-p53C-Y220C (biophysical screens) | GGSSS SVPSQ KTYQG SYGFR LGFLH SGTAK |
| SVTCT YSPAL NKLFC QLAKT CPVQL WVDST | |
| PPPGT RVRAM AIYKQ SQHMT EVVRR CPHHE | |
| RCSDS DGLAP PQHLI RVEGN LRAEY LDDRN | |
| TFRHS VVVPC EPPEV GSDCT TIHYN YMCYS | |
| SCMGG MNRRP ILTII TLEDS SGNLL GRDSF | |
| EVRVC ACPGR DRRTE EENLR KKGEP HHELP | |
| PGSTK RALPN NT | |
| T-p53C-Y220C (crystallization) | SSSVP SQKTY QGSYG FRLGF LHSGT AKSVT |
| CTYSP ALNKL FCQLA KTCPV QLWVD STPPP | |
| GTRVR AMAIY KQSQH MTEVV RRCPH HERCS | |
| DSDGL APPQH LIRVE GNLRA EYLDD RNTFR | |
| HSVVV PCEPP EVGSD CTTIH YNYMC YSSCM | |
| GGMNR RPILT IITLE DSSGN LLGRD SFEVR | |
| VCACP GRDRR TEEEN LRKKG EPHHE LPPGS | |
| TKRAL PNNT | |
| T-p53C-Y220C (C124/182/277S) | GGSSS SVPSQ KTYQG SYGFR LGFLH SGTAK |
| SVTST YSPAL NKLFC QLAKT CPVQL WVDST | |
| PPPGT RVRAM AIYKQ SQHMT EVVRR CPHHE | |
| RSSDS DGLAP PQHLI RVEGN LRAEY LDDRN | |
| TFRHS VVVPC EPPEV GSDCT TIHYN YMCYS | |
| SCMGG MNRRP ILTII TLEDS SGNLL GRDSF | |
| EVRVC ASPGR DRRTE EENLR KKGEP HHELP | |
| PGSTK RALPN NT |
In total, 1024 data points were collected in the direct dimension (1H) and 128 in the indirect dimension (15N). Data processing was performed using Bruker Topspin 4.1 and analysis using NMRFAM-SPARKY 3.19.46 Peak shifts were considered significant, if
![]() | (1) |
Reaction conditions were PBS buffer pH 7.4, 10% acetonitrile, 100 μM ketoprofen as an internal standard, 250 μM fragment, and 5 mM GSH excess at 37 °C. Measurement times were after 0, 1, 2, 4, 8, 12, and 24 h. For very reactive fragments (t1/2 < 5 h) analysis was performed every 20 min. The mixture was analyzed by HPLC with UV-detection. The reaction of the compounds was detected by measuring the decreasing area under the curve (AUC) of the fragment relative to the internal standard. The declining AUC was fitted to pseudo-first order kinetics and t1/2 was calculated using the following equation:
![]() | (2) |
Measurements were performed as duplicates with GSH. In addition, single measurements without GSH in PBS buffer were carried out for each fragment to check for hydrolytic degradation.
:
125. The mixture was incubated for 4 h at 20 °C while shaking.17
The UHPLC-system consisted of an Agilent (Waldbronn, Germany) 1290 Infinity binary pump (G4220A) and a thermostated column compartment (G1316C). Between the column and the ion source a Valco EHMA diverter valve (2-pos/6-port) from VICI (Schenkon, Switzerland) was installed for online de-salting of the samples. Mobile phase A was water + 0.1% (v/v) formic acid and mobile phase B was ACN + 0.1% (v/v) formic acid. The flow rate was set to 0.5 mL min−1 with the following gradient: 0–2 min: 5% B (de-salting of sample, flow to waste), 2–10 min: 5–80% B, 10–12 min: 80% B, 12–17 min: 5% B. The column temperature was set to 50 °C. A TripleTOF 5600+ mass spectrometer from Sciex (Darmstadt, Germany) was used with a Duospray ion source (ESI interface) in positive ionization mode. The following MS instrument parameters were used: curtain gas (CUR): 35 psi, nebulizing gas (GS1): 50 psi, heater gas (GS2): 40 psi, ion spray voltage floating: 5100 V, source temperature: 550 °C, collision energy (CE): 30 V, declustering potential (DP): 220 V. The mass range in TOF MS mode was set from 500 to 5000 m/z with an accumulation time of 500 ms. The IntactProteinMode script from Sciex was used to optimize advanced MS settings for protein analysis. Data acquisition was performed with Analyst TF 1.8.1 software (Sciex). Data analysis was performed using PeakView software 2.2.0 (Sciex) using the BioToolKit (2.2.0.) for deconvolution of the mass spectra.
:
1 with reservoir solution (100 mM HEPES (pH = 7.15), 19% PEG4000 and 10 mM DTT) using the sitting drop vapor diffusion technique and by performing streak seeding. Crystals grew within a few days. For soaking, crystals were transferred in a 50 mM or saturated compound solution in a cryo-protectant buffer (reservoir solution and an additional 20% glycerol) over night.
Data sets were obtained at the Swiss Light Source (SLS) (Villigen, Switzerland) at the X06DA (PXIII) beamline, using a Pilatus-2 M-F detector.
Data processing and reduction was performed using XDS.50 To obtain initial phases by molecular replacement, 4AGL was used as a search model for PHASER included in the CCP4 suite.14,51,52 Structure and phase improvement was performed using multiple cycles of manual model building in Coot and structure refinement using PHENIX.53,54 The anomalous difference map was generated using CAD and fft, both from the CCP4 suite.55–57 Ligand restraints were generated using AceDRG.58,59
| This journal is © The Royal Society of Chemistry 2022 |