Richard G.
Doveston
a,
Paolo
Tosatti
a,
Mark
Dow
a,
Daniel J.
Foley
a,
Ho Yin
Li
a,
Amanda J.
Campbell
b,
David
House
b,
Ian
Churcher
b,
Stephen P.
Marsden
*a and
Adam
Nelson
*ac
aSchool of Chemistry, University of Leeds, Leeds, LS2 9JT, UK. E-mail: s.p.marsden@leeds.ac.uk; a.s.nelson@leeds.ac.uk
bGlaxoSmithKline Medicines Research Centre, Stevenage, SG1 2NY, UK
cAstbury Centre for Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK
First published on 18th November 2014
Controlling the properties of lead molecules is critical in drug discovery, but sourcing large numbers of lead-like compounds for screening collections is a major challenge. A unified synthetic approach is described that enabled the synthesis of 52 diverse lead-like molecular scaffolds from a minimal set of 13 precursors. The divergent approach exploited a suite of robust, functional group-tolerant transformations. Crucially, after derivatisation, these scaffolds would target significant lead-like chemical space, and complement commercially-available compounds.
The identification of high quality lead compounds is a key challenge in early stage drug discovery. Lead optimisation tends to increase both molecular weight and lipophilicity, as well as molecular complexity.13–16 It is advisable, therefore, to control the properties of lead compounds to facilitate the emergence of clinical candidates with desirable drug-like characteristics. As a result, approaches have been developed to define chemical space that is populated by molecules with good lead-like properties typically by considering factors such as lipophilicity (e.g.17 −1 < clogP < 3), molecular size (e.g.17 14 ≤ heavy atoms ≤ 26) and removal of undesirable sub-structures.17
High-throughput screening provides a significant source of starting points for drug discovery.14,18 Control of the molecular properties of screening compounds can both aid subsequent development into high-quality lead molecules, and increase the efficiency of the exploration of chemical space.19 However, sourcing large numbers of diverse compounds with appropriate molecular properties has been identified as a major challenge.17 In a recent analysis, just 2.6% of 4.9 M commercially-available compounds were found to survive filtering by molecular weight (200 ≤ MR ≤ 360), lipophilicity (−1 < clogP < 3) and various structural features known to be undesirable in drug candidates.17 In addition, the ability of emerging synthetic methods to deliver lead-like compounds has been very poorly demonstrated17 despite significant development of diversity-oriented synthetic approaches.20–22 The problem is in fact exacerbated when diversity is also considered because chemists have historically explored chemical space unsystematically and in an uneven manner;23 indeed, metrics have been developed to capture the scaffold diversity of screening collections.24 The realisation of lead-oriented synthesis has recently been framed as a major academic and practical challenge17 which has prompted the development of approaches25,26 to the synthesis of specific classes of lead-like molecules.27–30
We have established a programme focussed on developing general synthetic approaches to diverse and novel lead-like molecular scaffolds i.e. scaffolds with the potential, following diversification, to yield large numbers of compounds with lead-like molecular properties. In this case, our approach to exemplifying the strategy exploited a single connective reaction in combination with just six distinct cyclisations (Scheme 1). Ir-catalysed amination31–36 was selected as the connective reaction because we had previously retooled it for compatibility with polar, sp3-rich, functionalised substrates35 that would have particular value for targeting lead-like chemical space. Thus, reaction between allylic carbonates 1 and amines 2 would provide cyclisation precursors 3. Importantly, the building blocks would be armed with functional groups to enable subsequent cyclisation – either once (e.g. → 4 or 5) or twice (e.g. → 6) – to yield product scaffolds. In addition, more complex scaffolds might be accessible by exploitation of a third building block (e.g. → 7–9). Remaining functionality would then be available for late-stage scaffold decoration. Our aim was to prepare, in a synthetically concise and efficient manner, a wide range of diverse and novel molecular scaffolds that, following decoration, would target broad regions of lead-like chemical space and thus demonstrate the potential of the strategy to underpin early-stage drug discovery.
![]() | ||
Scheme 3 Selected syntheses of lead-like scaffolds prepared using a single cyclisation step (indicated by colour: A, yellow, Pd-catalysed aminoarylation; B, purple, iodocyclisation/displacement; C, pink, reaction with CDI; D, green, reaction with α-halo acetyl halide; E, light blue, ring-closing metathesis; F, peach, lactamisation). Typical methods (see ESI† for full details): A: Aryl bromide (1.2 eq.), 5 mol% Pd(OAc)2, 10 mol% DPE-Phos, Cs2CO3 (2.5 eq.), 1,4-dioxane, 105 °C; B: (i) NsCl (1.2 eq.), NEt3 (2.0 eq.), DMAP (0.1 eq.), rt, then TBAF (1.2 eq.), AcOH (1.2 eq.), THF, rt; (ii) NIS (1.5 eq.), MeCN, 65 °C; (iii) ArSH (1.5 eq.), DBU (2.5 eq.), MeCN, rt; (iv) mCPBA (4.0 eq.), CH2Cl2, rt; (v) PhSH (1.2 eq.), DBU (1.5 eq.), MeCN, rt; C1: CH2Cl2–TFA, 0 °C → rt, then CDI (1.5 eq.), DBU (4.0 eq.), THF, 50 °C; C2: CDI (4.5 eq.), DMF, 110 °C; C3: CDI (1.5 eq.), DBU (2.5 eq.), THF, 50 °C; D1: Chloroacetyl chloride (1.5 eq.), NEt3 (5.0 eq.), CH2Cl2, 0 °C → rt, then NaH (2.0 eq.), NaI (1.0 eq.), THF, rt; D2: (i) TMSCl (1.1 eq.), NEt3 (3.0 eq.),CH2Cl2, 0 °C → rt, then bromoacetyl bromide (1.5 eq.), then 20% AcOH (aq.), rt; (ii) 35% NaOH (aq.) (5.0 eq.), Bu4NSO4 (0.5 eq.), CH2Cl2, 0 °C → rt; E1: 5 mol% Grubbs II, CH2Cl2, reflux; F1: CH2Cl2–TFA, 0 °C → rt, then K2CO3 (6.0 eq.), CH2Cl2, H2O, rt; F2: CH2Cl2–TFA, 0 °C → rt, then NaOtBu (2.0 eq.), THF, reflux; TBDPS = tert-butyldiphenylsilyl; Ns = 2- or 4-nitrobenzenesulfonyl; DMAP = 4-dimethylaminopyridine; TBAF = tetra-n-butylammonium fluoride; DBU = 1,8-diazabicycloundec-7-ene; mCPBA = m-chloroperoxybenzoic acid; DPE-Phos = bis-[2-(diphenylphosphino)phenyl]ether; TFA = trifluoroacetic acid; CDI = carbonyl diimidazole. |
![]() | ||
Scheme 4 Selected syntheses of lead-like scaffolds prepared using a second cyclisation step (including telescoped procedures; shown in blue). Methods are consistent with those in Scheme 3 (indicated by colour: A, yellow, Pd-catalysed aminoarylation; B, purple, iodocyclisation/displacement; C, pink, reaction with CDI; D, green, reaction with α-halo acetyl halide; E, light blue, ring-closing metathesis; F, peach, lactamisation). Additional methods (see ESI† for full details): D3/E2: (i) Bromoacetyl bromide (1.1 eq.), DIPEA (1.2 eq.), CH2Cl2, 0 °C → rt; (ii) 5 mol% Grubbs II, CH2Cl2, 45 °C; (iii) NaH (2.0 eq.), THF, rt; F3: 10% Pd/C (20 mol% Pd), ethylenediamine (1.0 eq.), MeOH, rt, then Cs2CO3 (10.0 eq.), DMF, 110 °C; TBDPS = tert-butyldiphenylsilyl. |
For example, Pd-catalysed aminoarylation (Method A),37,38 if necessary after protection of the secondary amine, enabled conversion of the cyclisation precursors 11, 14 and 15 into the pyrrolidines 24–28. In each case, the reaction proceeded efficiently and with high diastereoselectivity.§ Iodocylisation39 (Method B) enabled the cyclisation precursor 11 to be converted, after protecting group manipulation, into the corresponding morpholine (dr 56:
44); subsequent treatment with 4-methyl-4H-1,2,4-triazole-3-thiol and DBU gave separable sulfides which were oxidised and then deprotected to give the morpholine 29. Methods A and B both exploit a potentially variable building block – a (het)aryl bromide or a thiol respectively – and may therefore enable variation of the specific scaffold prepared. The application of alternative electron-deficient (het)aryl bromides in an initial cyclisation by aminoarylation was exemplified in the synthesis of nine scaffolds, five of which are shown in Scheme 3.
Two cyclisation reactions (Methods C and D) exploited simple bifunctional reagents that enabled the formation of a range of five-, six- and seven-membered ring systems. Thus, reaction with CDI (Methods C), if necessary after a telescoped protecting group removal, gave either cyclic ureas (e.g.30 and 31) or oxazolidinones (e.g.32). Similarly, reaction with either chloroacetyl chloride or bromoacetyl bromide, followed by alkylation, (Methods D) enabled the synthesis of ketopiperazines (e.g.33) and ketomorpholines (e.g.34). In addition, 2-keto-1,4-diazepane formation was also possible from the homologous substrates, although generally telescoped with a second cyclisation step (see below). Ring-closing metathesis (Methods E) was often used as a single cyclisation to form tetrahydropyridines (e.g.35) or dihydropyrroles (e.g.36) but could also be telescoped with a second cyclisation step (see below). Finally, lactamisation (Methods F) enabled the formation of either six- (e.g.37) or seven- (e.g.38) membered lactams.
In many cases, a second cyclisation step was possible using one of the complementary cyclisation reactions (Scheme 4). The robustness of Pd-catalysed aminoarylation (Method A) enabled the formation of pyrrolidines from a wide range of substrates. For example, the substrates 31–34 were converted into the pyrrolidines 39–43 with consistently high diastereoselectivity. Again, the ability to vary the scaffold prepared through careful choice of (het)aryl bromide was widely demonstrated. Method C also had utility as a second cyclisation: Boc deprotection of 29, followed by reaction with CDI, enabled the synthesis of the bicyclic scaffold 44.
As already noted, telescoped procedures incorporating a second cyclisation step were often used to great effect (Scheme 4; structures shown in blue). For example, ring-closing metathesis (Methods E) followed by urea formation (Methods C) enabled the synthesis of bicyclic compounds 45 and 46 directly from acyclic cyclisation precursors (19 and 17). Alternatively, telescoped diketopiperazine or 2-keto-1,4-diazepane formation (Methods D) and ring-closing metathesis procedures enabled the facile synthesis of compounds 47–49.
Lactamisation (Method F) had particular value for substrates prepared by aminoarylation with 2-methoxycarbonylphenyl bromide in the first cyclisation. For example, deprotection of 24, 26 and 27 triggered cyclisation to yield either six- (e.g.50, 52 or 53) or seven- (e.g.51) membered lactams. In the case of pyrrolidine 26, the selective formation of two distinct scaffolds (50 and 51) was possible using orthogonal deprotection reactions.
First, the lead-likeness of the members of the virtual library was assessed (Fig. 1, Panel A). Compounds were successively filtered by molecular size (14 ≤ number of heavy atoms ≤ 26), lipophilicity (−1 ≤ AlogP ≤ 3) and undesirable structural features (see ESI† for specific structural filters). About 59% of the compounds in the virtual library had lead-like molecular properties, and the majority of the outlying compounds only narrowly failed the molecular property filters (heavy atoms: μ = 23.8, σ = 4.0; Alog
P: μ = 0.3, σ = 1.3). By comparison, with these specific filters, just 23% of ∼9 M commercially-available compounds from the entire ZINC database40 were lead-like, with the majority of compounds lying well outside lead-like chemical space (heavy atoms: μ = 25.9, σ = 5.4; Alog
P: μ = 1.7, σ = 2.9) (Fig. 1, Panel B).¶41 Remarkably, we also showed that, with this set of capping groups, each one of the 52 scaffolds allowed significant regions within lead-like chemical space to be targeted (see ESI†). Our unified synthetic approach thus specifically targeted lead-like chemical space.
![]() | ||
Fig. 1 Analysis of the molecular properties of a virtual library of 19![]() ![]() ![]() |
Second, we determined the fraction of sp3-hybridised carbon atoms (Fsp3) in the virtual compounds (Fig. 1, Panel C). It has previously been shown that Fsp3 correlates strongly with success because compounds in the discovery phase (mean Fsp3: 0.36) have lower Fsp3 than marketed drugs (mean Fsp3: 0.47).11 It has thus been stated that accessing more three-dimensional lead compounds is a desirable goal.11,17 The mean Fsp3 of the virtual compounds (0.58) compared very favourably with that of the random sample of compounds from the ZINC database (0.33). Thus, our synthetic approach can yield compounds with significantly greater sp3 character than most commercially-available compounds, thereby expanding the range of molecular architectures available within lead-like chemical space and offering more flexibility in lead optimisation.
Third, the novelty and diversity of the 52 scaffolds was assessed. A substructure search was performed in which the ZINC database (9046
036 compounds) was interrogated with each of the deprotected scaffolds. In 43 cases (82%), the deprotected scaffold was not found as a substructure in any compound in the database. Even in the remaining 9 cases, the deprotected compound was not known in the CAS registry. The diversity of, and relationship between, the scaffolds was assessed using an hierarchical analysis (see ESI†).42 It was found that 42 frameworks were represented at the graph-node-bond level, which were related hierarchically to 13 “parent” frameworks. There is significant scaffold diversity at each level of hierarchy, meaning that the scaffolds are not simply closely related derivatives.
In total, 52 diverse molecular scaffolds were prepared from just thirteen different cyclisation precursors. Initially, pairs of building blocks were combined using a single connective reaction – Ir-catalysed allylic amination – before a divergent synthetic approach was used to convert these cyclisation precursors into 52 molecular scaffolds. This approach exploited a toolkit of just six cyclisation reactions, and required on average just 3.0 operations** for the synthesis of the scaffolds from the constituent building blocks. Furthermore, the unified and modular nature of the strategy means that it has the potential to deliver many additional scaffolds through expansion of the range of building blocks used (e.g. by use of homologated, and stereo- or regioisomerically substituted variants).
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c4ob02287d |
‡ It has been suggested that new approaches may be required to translate such molecules into marketed drugs (see ref. 5). |
§ Substrates bearing a remote o-nitrosulfonyl-protected amine did not undergo clean aminoarylation (see ESI†). |
¶ Similar analyses of the molecular properties of commercially-available compounds have previously been reported. With similar molecular property filters and different structural filters, it was concluded that just 2.6% of 4.9 M compounds from various suppliers were lead-like (see ref. 7). In an analysis of compound collections from 29 suppliers using less constrained filters (200 < MR < 460, −4 < clog![]() |
|| The properties of these compounds are representative of the entire ZINC database of ∼9 M commercially-available compounds (see ESI†). |
** For the purposes of this analysis, a synthetic operation is defined as a process conducted in a single reaction vessel. |
†† An approach to identify such highly robust reactions has recently been reported (ref. 43). |
This journal is © The Royal Society of Chemistry 2015 |