Samuel D.
Griggs
abc,
Amalia-Sofia
Piticari
c,
Samuel
Liver
c,
Chris
Arter
c,
Sonja
Sievers
de,
Stephen P.
Marsden
*a and
Adam
Nelson
*abc
aSchool of Chemistry, University of Leeds, Leeds, LS2 9JT, UK. E-mail: s.p.marsden@leeds.ac.uk; a.s.nelson@leeds.ac.uk
bAstbury Centre for Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK
cRosalind Franklin Institute, Harwell Campus, Didcot, Oxfordshire OX11 0QX, UK
dMax-Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, Dortmund, 44227, Germany
eCompound Management and Screening Center, Otto-Hahn-Strasse 11, Dortmund, 44227, Germany
First published on 5th February 2025
An array of reactions of diazo substrates with many possible outcomes was executed, and the biological relevance of the resulting products assessed in the cell painting assay. Reactions that had yielded bioactive products were scaled-up, and the products structurally elucidated. By bypassing the need to characterise all reaction products, this phenotype-directed approach enabled efficient discovery of functionally-distinctive molecules based on novel, structurally-diverse scaffolds.
In this paper, we describe a phenotype-directed approach for the identification of novel biologically-relevant chemical space (Fig. 1). Like activity-directed synthesis,7 which has enabled discovery of diverse small molecules with specific biological functions, the approach is both structure-blind and function-driven. Initially, arrays of reactions with many possible outcomes are executed. The reaction products, whose structure are unknown at this stage, are then directly screened in the cell painting assay8 which enables an assessment of biological relevance. Finally, the structures of (only) identified bioactive products are structurally elucidated. It was envisaged that the approach would enable discovery of diverse molecular scaffolds that were both synthetically-accessible and biologically-relevant.
We designed an array of 288 reactions in which all combinations of six diazo substrates, eight co-substrates and six reaction conditions were explored (Fig. 2). We chose to harness reactions of α-diazo esters because many different outcomes are possible (e.g. C–H, N–H and O–H insertion, cyclopropanation, ylid formation/reaction), and may be varied by changing the catalyst/conditions.9 The selected α-diazo esters were the commercially-available α-unsubstituted substrate S1, the α-vinyl substrate S2 (which can undergo cycloaddition with alkenes10a) and the diverse α-(het)arylated substrates S3–S6. The selected co-substrates C1–C8 all contain at least one functional group that is precedented9 to react with Rh carbenoids (e.g. alkene; potentially reactive C–H, N–H and O–H bond). The reaction conditions were selected on the basis of the potential for alternative reaction outcomes. [Ru(bpz)3][PF6]2 and Mes-Acr-BF4 are precedented photocatalysts for reactions of α-diazo esters,10a,b and photocatalysis can result in complementary outcomes to dirhodium catalysis.10c Rh2piv4 and Rh2pfb4 were selected as electronically- and sterically-distinct catalysts for metal carbenoid chemistry.11
![]() | ||
Fig. 2 Reaction array overview. The α-diazo substrates (black), co-substrates (blue) and reaction conditions are shown. |
The reactions were performed in vials in 96-well format on a 200 μl scale (with 20 μmol limiting substrate). The reactions were assembled from stock solutions (in 1:
1 MeCN–TFE or CH2Cl2) using micropipettes. Each reaction involved a limiting co-substrate (C1–8, final concentration: 100 mM), an α-diazo ester (S1–6, final concentration: 150 mM) and a catalyst [(Ru(bpz)3][PF6]2, final concentration: 2 mM; Mes-Acr-BF4, final concentration: 5 mM; Rh2piv4, final concentration: 2 mM; or Rh2pfb4, final concentration: 2 mM). The reactions containing the photocatalysts catalysts were irradiated with two 40 W Kessil A160WE Tuna Blue lamps. After 16 h, the outcome of the reactions was determined by analytical UPLC/MS with, additionally, evaporative light-scattering detection12 to enable determination of the approximate yield of each product (Fig. 3, panel A). To focus on intermolecular reaction products, we identified reactions that had yielded >7% of a unique product with molecular weight higher than the diazo substrate. For each of these reactions, the bulk of the crude reaction mixture was purified by mass-directed HPLC (see ESI†).
![]() | ||
Fig. 3 Estimated yield of reaction array products determined by evaporative light-scattering detection. The conditions (A)–(F) are outlined in Fig. 2. |
The purified, but uncharacterised, products were screened in the cell painting assay which interrogates many biological pathways simultaneously, and enables an assessment of biological relevance.8 In the assay, six dyes are used to stain different cellular compartments, and high-content imaging and automated image analysis enables determination of 579 features that correspond to specific morphological properties/characteristics of the cell. The biological profile of compounds may be described in terms of morphological fingerprints which capture changes in the features relative to DMSO control. The bioactivity of compounds, screened at 10, 30 and 50 μM, was assessed in terms of an induction value: the percentage of significantly changed features (magnitude of median absolute deviation >3) relative to DMSO control. The reactions that had yielded bioactive products (induction value >5%) were scaled up to enable structural elucidation and validation of their biological activity (Table 1). The reactions were performed on 0.1–0.4 mmol scale, and the resulting products were purified by flash column chromatography or mass-directed HPLC. In total, nine bioactive products (1–3, 4a–b and 5–8) were successfully purified and structurally elucidated using NMR spectroscopy and (for 4a and 8) X-ray crystallography (ESI) (Fig. 4, panel A).
Substrates (methoda) | Product (yield) | Inductionb /% (30 μM) | (biosimilarity) |
---|---|---|---|
a See Fig. 2 for methods.
b Percentage of significantly changed features in the cell painting assay (magnitude of median absolute deviation >3) relative to DMSO control.
c Reference compound with highest biosimilarity.
d At 50 μM.
e No reference compound with >80% biosimilarity.
f Yield over 3 steps after Boc removal and conversion into the corresponding Ts derivative.
g 50![]() ![]() |
|||
S4, C2 (F) | 1 (86%) | 73 | WNK463 (88%) |
S5, C2 (E) | 2 (99%) | 6d | Nicergoline (83%d) |
S4, C8 (B) | 3 (25%) | 14 | |
S3, C2 (D) | 4a (5%f) | 14 | Englerin A (83%) |
4b (27%f) | 13d | Aripiprazole (87%d) | |
S4, C7 (B) | 5 (26%) | 35 | I-CBP112 (87%) |
S3, C1 (F) | 6 (44%) | 19 | |
S6, C1 (F) | 7 (10%) | 10 | Caroverine (84%) |
S4, C3 (E) | 8 (25%) | 6 | PF-CBP1 (81%) |
![]() | ||
Fig. 4 Structures and activity of bioactive products. Panel A: products isolated from scaled-up reactions. Panel B: hierarchical tree showing the diversity of, and relationship between, bioactive products scaffolds. Black circles indicate the product (outer ring) and iteratively simplified (inner rings) frameworks. Panel C: dimension reduction analysis (PCA) of cell painting data (30 μM); explained variance: PC1 (A, 77%), PC2 (B, 8%), PC3 (C, 6%). The bioactive products (red) are shown with some compounds (blue) displaying >5% induction. Panel D: hierarchically-clustered (see ref. 8d) heat map of the morphological fingerprints of the bioactive products (30 μM). |
The use of both metal- and photoredox-catalysed reactions of diazo compounds enabled formation of bioactive products via diverse reaction classes: enamide cyclopropanation (→1, 2 and 8); insertion into an amide N–H bond (→3); insertion into allylic(→7) C–H bonds; insertion into an alcohol O–H bond (→5); and dimerization to yield alkenes (→6).13 The fused tetracycles 4a and 4b were formed via a remarkable formal (3+2) cycloaddition between the α-benzofuran-3-yl α-diazo ester S3 and the enamide C2; this transformation is reminiscent of a photocatalysed cyclopentene annulation of α-vinyl α-diazo esters.14 Notably, 4a and 4b were separable regioisomers; however, their high (83%) biological similarity at 50 μM, a concentration at which they both have induction values >5%, suggests that they may have related mechanisms, or interconvert, in cells. The represented frameworks have significant structural diversity at all levels of hierarchy (Fig. 4, panel B).15 The scaffolds (without α atoms) of 1, 4a and 4b are novel, whilst that of 8 is found in just one patented compound.16
The morphological fingerprints of the bioactive products (Fig. 4, panels C and D) were compared to previously-identified functional clusters.8c Remarkably, only one compound, 1, was >70% similar to any defined functional cluster (79% similarity to tubulin functional cluster at 30 μM). Furthermore, two of the compounds (3 and 6) did not have >80% biosimilarity to any of ∼2300 reference compounds that have been previously evaluated in the cell painting assay.
In conclusion, our phenotype-directed approach enabled discovery of novel, structurally- and functionally-distinctive molecules. Critical to success was underpinning chemistry that enabled exploration of diverse chemical space, and an assay enabling broad assessment of biological relevance. Compounds with a range of distinct phenotypes were discovered using an array of 288 reactions that involved just 14 distinct starting materials.
We thank EPSRC (EP/W002914/1; EP/V011367/1; EP/N025652/1) for funding; and Dr Axel Pahl for discussions. Research at the Max Planck Institute of Molecular Physiology was supported by the Max Planck Society; and was co-funded via “Netzwerke 2021”, an initiative of the Ministry of Culture and Science of the State of Northrhine Westphalia. The Compound Management and Screening Center (COMAS) is acknowledged for cell painting measurements.
Footnote |
† Electronic supplementary information (ESI) available. CCDC 2410507–2410509. For ESI and crystallographic data in CIF or other electronic format see DOI: https://doi.org/10.1039/d4cc06605g |
This journal is © The Royal Society of Chemistry 2025 |