Gregor S.
Cremosnik†
a,
Jie
Liu†
ab and
Herbert
Waldmann
*ab
aDepartment of Chemical Biology, Max-Planck-Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany. E-mail: herbert.waldmann@mpi-dortmund.mpg.de
bFaculty of Chemistry and Chemical Biology, TU Dortmund University, 44227 Dortmund, Germany
First published on 6th October 2020
Covering: up to 2020
Natural products (NPs) provide inspiration for the design of biologically active compounds and libraries. In this review, we cover several experimental and in silico approaches, which have been used to simplify NPs and guide NP-based library design. Earlier approaches, like the structural classification of natural products (SCONP) and biology-oriented synthesis (BIOS), focus on the identification of activity determining scaffolds and the synthesis of corresponding compound collections. More recently, NP fragments identified by means of cheminformatic analysis of the Dictionary of Natural Products (DNP) have been combined in unprecedented fashions to yield pseudo natural products (pseudo NPs), which show biological activities unrelated to the guiding NPs. Each approach was also the source of chemical innovation, in which synthetic methods were established for the rapid assembly of NP-inspired compounds and libraries.
Associated with the synthetic challenges involved in NP total synthesis, methods and approaches have also focused on reducing structural complexity of NPs while keeping biological relevance. Such design approaches may arrive at NP-inspired compounds which still represent the kind of bioactivity encoded by the guiding NPs, or they may yield classes of NP-inspired compounds that display novel bioactivities.
Here we review two concepts for the design and synthesis of novel NP-inspired compound classes. In Biology Oriented Synthesis (BIOS), NP scaffolds are simplified in an iterative process arriving at series of structurally simplified, NP-inspired compound classes. Extension of the logic underlying BIOS and combining it with fragment-based compound development led to the concept of pseudo natural products (pseudo-NPs) in which NP fragments are combined in arrangements not available via biosynthesis.16,17
To this end the CRC Press Dictionary of Natural Products (DNP, 190939 compounds at the time) was employed as an initial dataset.18In silico fragmentation of the NPs and subsequent building of a scaffold tree based on substructure identification enabled a visual representation of scaffold relationships (Fig. 1). In the scaffold tree, more complex scaffolds are arranged on the outside and become simpler towards the centre.18
Fig. 1 Graphical star-like representation of the NP scaffold tree. For clarity of the graphic illustration, only scaffolds are shown that represent cumulatively at least 0.2% of the NP population in the DNP. Reprinted from ref. 18, Copyright (2005) National Academy of Sciences. |
The SCONP concept was further validated using protein structure similarity clustering (PSSC) as an independent guiding principle (Scheme 1A) for the discovery of a selective 11β-hydroxysteroid dehydrogenase type 1 (11βHSD1) inhibitor.18 PSSC builds structure similarity clusters of proteins with similar ligand-sensing domains in which cluster members are expected to have similar ligands. 11βHSDs were found in the same structure similarity cluster as the Cdc25A phosphatase and acetylcholinesterase. Through fragmentation according to SCONP, dysidiolide, a CdC25A inhibitor, could be related to glycyrrhetinic acid (GA), a naturally occurring unselective ligand for 11βHSD1 and 11βHSD2. Based on their common octahydronaphthalene scaffold 1, a library of 162 compounds was prepared of which 28 were confirmed as selective inhibitors of 11βHSD1 over 11βHSD2 (compounds 1a–1c, Scheme 1B).
Scheme 1 (A) Brachiation of glycyrrhetinic acid and dysidiolide to identify a common fragment. (B) Examples of 11βHSD1 selective inhibitors synthesised. |
To harness the power of the SCONP approach and facilitate the building of scaffold trees, the open source software Scaffold Hunter was developed.19,20 The software uses complex data from e.g. biological screening sets as input and extracts chemically meaningful scaffolds sharing biological activity. From those it builds a scaffold tree linking related scaffolds but it also generates virtual scaffolds, where links between tested molecules are missing. The virtual scaffolds represent unexplored chemical space expected to share the bioactivity of the input and can serve as starting points for novel synthesis efforts.19 Prospective exploration using Scaffold Hunter was demonstrated with data from a pyruvate kinase screen available on PubChem (Scheme 2A).19 The program built a scaffold tree containing 35868 scaffolds (for example fragments 2–4) from the 51415 unique molecules used in the assay. Subsequently, novel virtual scaffolds (fragments 5–7) were predicted to share the biological activity of the tested molecules. A library of 107 compounds embodying these virtual scaffolds was obtained and their half-maximal inhibitory concentrations (IC50) were determined. Nine novel pyruvate kinase inhibitors or activators with IC50 values between 1 and 10 μM were identified (selected examples 9 and 10, Scheme 2B). This corresponds to a hit rate of nearly 10% which is substantially higher than the average hit rate in PubChem high-throughput screening campaigns (0.34%).
The SCONP tree provides a systematisation to aid compound library design. In combination with a second independent criterion, SCONP can provide readily accessible NP scaffolds retaining affinity to proteins and biological activity. Even though the generated libraries were comparably small, high hit rates were obtained.19,21,22 It should be noted that with increasing deviation from the parent scaffold, e.g. to simpler scaffolds, biological similarity will decline.
Scheme 3 (A) Flowchart of the performed cheminformatic analysis; (B) the guiding rules for fragmentation; (C) principle for the discovery of novel p38α MAP kinase binders. |
This example highlights the potential of the generated clusters for development into bioactive molecules and to guide optimisation attempts. The fragment clusters retain many attributes of NPs and cover chemical space outside commercial libraries. Although the obtained fragments and cluster centres are structurally less complex than their guiding NPs, their synthesis will often require considerable effort and the development of novel methods.
In silico fragment generation from NPs with annotated biological activity or from high-throughput screening data is a powerful way to explore novel chemical space. The obtained fragments are often readily accessible, allowing for the creation of large libraries. It is important to note that fragments usually inherit the biological activity of the parent compound although they may vary in selectivity towards similar targets.20 Alternatively, fragments and structural clusters can be used as guiding structures for the rapid assessment of biological activity and subsequent development of more potent compounds.
The 1,3-dipolar cycloaddition is a powerful method to introduce pyrrolidine moieties into scaffolds.27–29 The development of regio-, diastereo- and enantioselective 1,3-dipolar cycloadditions makes it a versatile tool for synthesis.27 Ningalin B and lamellarin D are two classes of marine natural products exhibiting interesting biological activities including antitumor activity, HIV-1 integrase inhibition and multidrug resistance reversal activity.30 The synthesis of the common pyrrolocoumarine core-scaffold was achieved through intramolecular 1,3-dipolar cycloadditions from alkynes 13 (Scheme 5).31 Cell-based screening of a collection of 43 compounds revealed Pyrcoumin as an inhibitor of canonical Wnt signaling.32 Further target identification and validation showed that it targets the nucleotide pyrophosphatase dCTPP1.33
Scheme 5 One pot synthesis of pyrcoumin. IC50 values determined by Wnt-responsive reporter gene assays in HEK293-Fz cells. |
Pyrrolizidines are characteristic for a large group of alkaloids which exhibit insecticidal activity.34 A dual cyclization strategy was developed to access a pyrrolizidine-like compound collection (Scheme 6). Enantioselective Cu-catalysed 1,3-dipolar cycloadditions between azomethine ylides (14) and N-methylmaleimides (15) delivered bicyclic pyrrolidines 16 with excellent diastereoselectivity and enantioselectivity (Scheme 6).35 Subsequent intramolecular lactamisation provided the tricyclic pyrrolizidine-alkaloid analogues 17 in high yields. Of the 119 compounds, four compounds inhibited parasite growth.
1,3-Dipolar cycloadditions have also been used to access bridged structures with high stereocomplexity. In a recent example, cyclic azomethine ylides were generated from α-diazo ketones 18 by means of rhodium catalysis. The cycloaddition between the transient cyclic azomethine ylides and the oxindole-derived alkenes 19 delivered spirotropanyl oxindole scaffolds 20 with good to excellent diastereo- and enantioselectivity (Scheme 7).36 Utilizing bimetallic relay catalysis, 24 compounds were synthesised enabling a BIOS of spirotropanyl oxindole alkaloids similar to alstonisine and chitosenine.
Apart from the synthesis of NP scaffolds by novel methodologies, established multistep syntheses of NP fragments have been used to access simplified scaffolds of NPs, resembling a function-oriented synthesis (FOS)37,38 approach but with library synthesis in mind. Withanolides are a family of at least 300 steroids exhibiting diverse bioactivities, including potent anti-inflammatory effects or modulation of the mTOR and Wnt pathways.39 Comparison of the structural features of withanolides showed that many retained the trans-hydrindane unit (C/D rings) linked to an α,β-unsaturated lactone (Scheme 8). The A/B rings of the steroid scaffold on the other hand showed variability. This observation led to the assumption that the retained structural features of the C/D ring may be responsible for common biological activity along with the modification of the dehydro-δ-lactone moiety.40 A compound collection was assembled based on these observations using a multi-step sequence and late-stage derivatization.40 Prins reaction of olefin 21 with paraformaldehyde afforded alcohol 22 as a single stereoisomer. Diastereoselective hydrogenation of the double bond introduced another stereocenter. Swern oxidation followed by a diastereoselective Brown allylation gave the homoallylic alcohol 23. After removing the acetal protecting group, an appendage was incorporated into the scaffold through an esterification reaction. Ring closing metathesis furnished the dehydro-δ-lactone moiety. This core scaffold (24) was further derivatised to obtain a diverse library (examples 25 and 26). Biological evaluation of the compound class revealed novel modulators of the hedgehog signalling pathway targeting the Smoothened (Smo) protein.41 Notably, the specific bioactivity was modulated by the appendages on the C ring. Withanolides have not been identified as hedgehog pathway inhibitors before.
Scheme 8 Biology-oriented synthesis of withanolid-inspired compound collection. Ki value for smoothened binding. |
BIOS provides a strategy to access biologically active compounds based on natural products through truncations and appendage modifications. This has also been exploited to assemble libraries fit for drug discovery.42,43 Natural products were truncated to their core scaffold while keeping functionalities intact, which could be used for derivatisation and growing of the scaffold.42 The obtained libraries were used in a rescaffolding approach for the discovery of novel inhibitors. Interactions of known inhibitors were analysed based on crystal structures and provided the basis for scaffold selection. In the example of the caspase-1 inhibitor PGE-3935199, it was found important that keeping the biaryl- and lactone side chains intact is crucial (Scheme 9A).42 Truncation between C10 and C11 as well as C20 and N23 enabled screening for a replacement of the 5,8-fused bicyclic core of PGE-3935199. A screening through the assembled virtual library identified 16 NP scaffolds fitting the descriptors. Out of these the bridged 2-azabicyclo[2.2.2]octane (2-ABO), a core scaffold of several marine NPs, offered a compelling solution to the problem as docking studies predicted an inhibitor in the nM affinity range. Synthesis and biological testing of CD10847 confirmed these predictions and an IC50 17 nM was obtained in cell-based assays.
Scheme 9 Rescaffolding of known inhibitors with BIOS inspired scaffolds: (A) rescaffolding of a caspase-1 inhibitor; (B) BIOS inspired CypD inhibitors. |
The same library was also employed in the discovery of Cyclophilin D (CypD) inhibitors through high throughput screening combined with X-ray crystallography.43 Tetrahydroquinoline derivative 27 was identified to bind into the S2 pocket (Scheme 9B). Rescaffolding of a known inhibitor led to compound 28 with significant increase in potency and affinity to the protein. In the same campaign succinimide 29 was found to bind into the S1′ pocket of the target protein. The two scaffolds were subsequently combined (30) with a fitting linker but no further improvement in potency was observed.
Thus, the chemical combination of two biosynthetically unrelated NP fragments would yield novel compound classes still endowed with biological relevance and would cover novel chemical and potentially biological space (Scheme 10).16,49 Such hybrid compounds may be regarded as pseudo natural products (pseudo NPs).16 Similar to diversity oriented synthesis (DOS)50 libraries, the fragment based design principle can lead to compound collections with high chemical diversity. However, pseudo NPs are biologically prevalidated by the choice of fragments directly derived from natural products, while DOS libraries mostly are built with chemical and structural considerations as predominating arguments.16 Compounds synthesised according to DOS can be considered NP-like as they encompass many attributes of NPs (e.g. high Fsp3 content) but their biological relevance is initially unclear.16 Pseudo NP library synthesis also does not necessarily follow the build/couple/pair strategy,51 which was very successfully applied in DOS. Inspiration for suitable fragments to combine was found in the fragment clusters from the cheminformatic analysis of the DNP.23 The identified 2000 cluster centres (see above), most of which are mono- or bicyclic scaffolds, offer a large variety of basic fragments for recombination. To increase complexity, NPs or their derivatives can serve as suitable platforms for combination with simpler fragments. Different connection types and positions lead to topologically diverse structures that may also differ in biological activities. According to the number of shared atoms between the fragments, recombined structures can be divided into four main groups, including monopodal connections (0 atom), spirocycles (1 atom), fused rings (2 atoms) and bridged scaffolds (≥3 atoms). Determination of the NP-likeness score52 of different fragment combinations showed that pseudo NPs are distinctly different from NPs and BIOS compounds but overlap with the space covered by drugs.16,49,53,54
Differing from BIOS, where the guiding NPs can suggest possible biological targets, pseudo NPs are expected to exhibit different biological activity than their individual fragments. To biologically characterize these compound collections, target-agnostic screening methods can be applied.55–58 The “Cell Painting” assay has shown great potential in characterizing pseudo NPs. Through staining of different cellular compartments and structures, phenotypic changes induced upon compound treatment are observed by microscopy. The obtained morphological profiles (fingerprints) can be compared to reference compounds with known biological activity to arrive at a target hypothesis, which has to be further confirmed in more specific assays. The outlined strategy has led to the discovery of several novel biologically active compound classes with different molecular topologies, which will be discussed below. The focus is set on the employed chemistry as well as the biological characterisation, and is categorised by fragment recombination patterns.
Tropane and indole are two common fragments found in NPs. The recombination of these two fragments afford highly sp3-rich indotropane pseudo NPs (32, Scheme 12).60 Their enantioselective synthesis was achieved via 1,3-dipolar cycloaddition of imine 33 and nitroalkenes 34. This scaffold can also be seen as a bridged recombination of tryptoline and pyrrolidine. Phenotypic screening of the obtained collection resulted in the discovery of myokinasib, which induces striking changes in mouse L fibroblast morphology indicating an involvement in cytokinesis.53 Kinase screening and target validation revealed that myokinasib is a mixed-type myosin light chain kinase 1 (MLCK1) inhibitor. This chemotype was unprecedented for kinase inhibitors.
Scheme 12 Enantioselective synthesis of indotropanes (32). IC50 data represents the ability to inhibit MLCK1. |
Scheme 13 Asymmetric fusion of pyrrolidine moiety into diverse NP fragments. Synthesis of the pseudo NP classes pyrroloquinolinone (A), pyrrotropanes (B) and pyrroiridoids (C). |
While the aim of the pseudo NP strategy is to combine fragments in a manner not yet observed in nature, it is possible that certain scaffolds may exist in nature but may not have been isolated from natural sources yet. In 2015, pyrroloquinolinones were synthesized as one of the first examples of pseudo NPs by combining pyrrolidine and quinolinone fragments (36, Scheme 14).61 More recently, this scaffold has been isolated from Streptomyces albogriseolus MGR072.64 While pyrroloquinolinones 36 cannot be considered pseudo NPs anymore, they would still fit the BIOS principle. More examples like this will emerge with the discovery of novel NPs. Such discoveries highlight the strength of the pseudo NP principle in providing biologically active compounds.
Indole is frequently present in naturally occurring bioactive molecules. Owing to the practicability of indole synthesis methodologies,65 indole fragments can be readily combined with other fragments to afford large indole-based pseudo NP libraries. For example, sp3-rich morphan fragments 41 were subjected to Fischer-indole synthesis to efficiently gain access to an indomorphan collection (42, Scheme 15).66 Biological screening of this 63-membered library resulted in the discovery of glupin, another potent GLUT-1 and GLUT-3 inhibitor. In further studies, glupin was systematically truncated to validate the necessity of the recombination of these two fragments.
Scheme 15 Fusion of indole moiety with morphan fragment. IC50 values determined for the inhibition of 2DG uptake in MDA-MB-231 cells. |
The pseudo NP principle can be extended to more than two fragments. For instance, pyridone-, dihydrofuran- and pyran fragments were combined to form pyrano-furo-pyridones (43, PFP). Fusion of pyridines 44 and functionalized pyrans 45 through Tsuji–Trost reactions or Michael additions afforded the desired tricyclic cores as different regioisomers (43a–d, Scheme 16).54 Biological analysis of the PFPs by means of the cell painting assay showed high biosimilarities with aumitin, a known inhibitor of mitochondrial respiration.67 Further investigations validated this hypothesis and proved that PFPs were a novel chemotype for the inhibition of mitochondrial respiration by targeting mitochondrial complex I.
Since there is a vast number of possible fragment combinations, connection types and regioisomers, it is anticipated that the concept of pseudo NPs will inspire the design and synthesis of numerous compound classes with novel biological activity. A systematic analysis of these molecules is therefore highly desirable. The pseudo NP concept benefits from unbiased phenotypic screening techniques, like cell painting. They may provide target hypotheses and mode of action for novel scaffolds unprecedented in nature. The generated target hypotheses could be further confirmed in specific cell-based assays.
Footnote |
† These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2020 |