Daniel J.
Foley
a,
Richard G.
Doveston
a,
Ian
Churcher
b,
Adam
Nelson
*ac and
Stephen P.
Marsden
*a
aSchool of Chemistry, University of Leeds, Leeds, LS2 9JT, UK. E-mail: s.p.marsden@leeds.ac.uk; a.s.nelson@leeds.ac.uk
bGlaxoSmithKline Medicines Research Centre, Stevenage, SG1 2NY, UK
cAstbury Centre for Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK
First published on 11th May 2015
A powerful strategy for the efficient lead-oriented synthesis of novel molecular scaffolds is demonstrated. Twenty two scaffolds were prepared from just four α-amino acid-derived building blocks and a toolkit of six connective reactions. Importantly, each individual scaffold has the ability to specifically target lead-like chemical space.
Making diverse compound libraries is not necessarily difficult per se, but without careful planning comes at the expense of synthetic efficiency and, until recently,8a has lacked a lead-oriented focus. In order to realise efficient lead-oriented synthesis, our approach is to exploit small building blocks that contain multiple chemically-orthogonal functional groups; systematic exploitation of a minimal toolkit of reactions for the pairwise coupling of the functional groups present would rapidly deliver diverse scaffolds that, on decoration of the newly formed or residual functional groups, should target broad regions of lead-like chemical space.
To validate our strategy, we chose readily available9 α-allyl α-amino acid derivatives 1 as the substrates, the amine functionality of which would then be armed with a functional group (blue) to yield a trifunctional intermediate 2, tuning the precursor for cyclisation with either the adjacent ester (green; → 3) or alkene (red; → 4) (Scheme 1). Variation of the resulting scaffold should be possible by exploiting acyclic or cyclic amino acid derivatives (e.g.1a–d) as substrates; by varying the appended functionality (blue); and by varying the cyclisation reaction.
Firstly, allylation of commercial N-Boc-protected amino acid esters furnished the trifunctional building blocks 1.9 Additional intermediates bearing alternative N-substituents were prepared to tune the precursors 1 for cyclisation (Scheme 2, Panel A). Removal of the Boc-protecting group was followed by formation of acyl ureas 6 (treatment with ethyl isocyanatoformate) or by alkylation to give protected 1,2-diamines 9, o-bromobenzylamines 12 and allylamines 15. We then used these substrates to synthesise cyclic molecular scaffolds, by exploiting the use of a toolkit of three broad reaction types, all targeting medicinally relevant motifs.
![]() | ||
Scheme 2 Synthesis of diverse lead-like molecular scaffolds. Coloured boxes correspond to different cyclisation reactions, new bonds/groups are shown in red. Panel A: synthesis of scaffolds from 1a. Panel B: extension to alternative scaffolds using building blocks 1b–d. Standard methods: A: TFA, CH2Cl2; B: (i) I2, THF–H2O; (ii) NaN3, DMF; C: (EtO2C)NCO, CH2Cl2; D: (i) I2, Li[Al(OtBu)4], THF, −5 °C; (ii) NaN3, DMF; E: NaOMe, 85![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
1,2-Amino alcohols, 1,2-diamines and their functionalised derivatives are prevalent in many bioactive molecules.10 We therefore explored introduction of these motifs by pairwise coupling of the π-nucleophilic allyl group with the nucleophilic N-acyl substituents of 1 and 6 (carbamate11 or urea12 respectively) upon activation with iodine; subsequent treatment with NaN3 gave cyclic carbamates 5a–5d and the biologically relevant13 ureas 7a–c.
Pairwise coupling of the electrophilic carboxylate ester with nitrogen-centred nucleophiles led to hydantoins 8a,8c–d (a motif known for anticonvulsant biological activity, e.g. phenytoin) from acyl ureas 6 and lactams 10a–b/diketopiperazine 11d from protected amines 9. Finally, ring closure between the N-substituent and the C2-allyl group could be facilitated through transition metal catalysis: Pd-catalysed Heck cyclisations of substrates 12 gave the azepanes 13a–d and the azocane 14c. Similarly, Ru-catalysed ring-closing metathesis allowed the synthesis of the indolizidine alkaloid-like tetrahydropyridines 16a and the congeners 16b–d. In this way we prepared 22 distinct scaffolds from the four building blocks 1a–d in a total of just 49 synthetic operations‡ using a toolkit consisting of only six reaction methodologies and one N-capping event.
To assess the novelty of these scaffolds, substructure searches against the ZINC and CAS databases were performed. A total of 37 variants of the scaffolds (with protecting groups removed and allowing for modifications available through standard functional group interconversions; see ESI† for full details) were screened against the ZINC database of 9039
756 commercially available compounds. This returned only 28 hits, all of which were derived from just two of the scaffolds (10a/16a). A further analysis of the Murcko assemblies of all scaffolds (with alpha-attachments) against a random 5% of the ZINC database again only returned hits (4) against the same two scaffolds 10a/16a. Thus, it is clear that the scaffolds display a very high degree of novelty relative to commercial collections. Finally, the scaffolds were also assessed for skeletal diversity: using an hierarchical approach14,15 it was found that each scaffold comprised a unique molecular framework at the GNB level.§ Thus, the scaffolds are not simple derivatives of each other, but represent a skeletally diverse collection.
The potential of the scaffolds to provide access to lead-like screening compounds was demonstrated by enumeration of a virtual library of 1110 compounds from the deprotected scaffolds and 80 typical medicinal chemistry capping groups (see ESI†). Assessment of the lead-likeness of the virtual library (Fig. 1, Panel A) showed that 66% of compounds survived filtering by molecular size (14 ≤ heavy atom count ≤ 26; μ = 22.8, σ = 3.57), lipophilicity (−1 < Alog
P < 3; μ = 0.38, σ = 1.38) and structural filters (see ESI†). By comparison, just 23% of the ∼9 M compounds from the ZINC database15 survived this filtering process, with most compounds lying well outside lead-like chemical space (heavy atoms: μ = 25.9, σ = 5.4; A
log
P: μ = 1.7, σ = 2.9). Remarkably, using the same set of capping groups, each of the 22 scaffolds allowed significant lead-like chemical space to be targeted. In addition to this in silico study, we have shown practically that decoration of exemplar scaffolds is possible (ESI†).
The shape diversity of the virtual library was compared with that of 90911 randomly-selected compounds from the ZINC database (Fig. 1, Panel B). For each compound, the two normalised principal moments of inertia values were determined for a low energy conformation.16 The virtual library was significantly more three-dimensional (Fsp3: μ = 0.57) than the commercially-available compounds (Fsp3: μ = 0.33).
In this study we have shown that careful selection of small, polyfunctional substrates facilitates an efficient approach to the synthesis of molecular scaffolds that are novel, diverse, and can specifically target lead-like chemical space. This was achieved using a toolkit of just six cyclisation methodologies, leading to 22 scaffolds in only 49 synthetic operations. In particular, the efficient and diverse coverage of 3D space from such a small set of reagents is notable. This general approach should be applicable to many classes of polyfunctional substrate, enabling the more efficient exploration of lead-like chemical space.
We thank EPSRC (EP/J00894X) and GSK for funding.
Footnotes |
† Electronic supplementary information (ESI) available: Experimental procedures, compound characterisation data, copies of 1H and 13C NMR spectra, and details of the virtual library enumeration/analysis. See DOI: 10.1039/c5cc03002a |
‡ Defined as a process conducted in a single reaction vessel. |
§ See the scaffold tree in ESI† for further detail. |
This journal is © The Royal Society of Chemistry 2015 |