Michael J.
Waring
*,
Craig
Johnstone
,
Darren
McKerrecher
,
Kurt G.
Pike
and
Graeme
Robb
Cardiovascular and Gastrointestinal Innovative Medicines Unit, AstraZeneca R&D, Mereside, Alderley Park, Macclesfield, Cheshire, SK10 4TG, UK. E-mail: mike.j.waring@astrazeneca.com; Tel: +44 (0) 1625 230942
First published on 28th June 2011
Small molecule activators of the glucokinase enzyme have the potential to deliver a level of glycaemic control that is superior to current oral agents and hence have great promise as new therapies for Type 2 Diabetes. As such, attempts to discover glucokinase activators suitable for clinical development have been the focus of many major pharmaceutical research programmes. Here we show how property-based matrix optimisation has been used to improve the multi-parameter technical profile of the AstraZeneca series of glucokinase activators culminating in the discovery of the development candidate AZD1092.
Glucokinase (GK, also known as hexokinase IV or D) is expressed predominantly in the liver, pancreas, brain and gut where its function is to catalyze the conversion of COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundglucose to COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundglucose-6-phosphate. In both the liver, where the action of GK promotes glycogen synthesis, and the pancreas, where the action of GK results in COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundglucose-sensitive insulin release,3GK is the rate limiting enzyme in COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundglucose utilisation.4Activation of GK is therefore expected to improve glycaemic control by modulating hepatic COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundglucose balance and decreasing the threshold for insulin secretion.5–7
Several groups have described small molecule glucokinase activators (GKAs) which act by binding to an allosteric site.8 We have described previously the identification and subsequent optimization of our own series of pyridine-5-carboxylic acid containing GKAs.9–11 In the preceding paper we described the modifications of this series that led to replacement of the pyridine-5-carboxylic acid motif, necessitated by the need to overcome testicular toxicology. This led us to neutral compounds such as GKA70 and GKA71 (Table 1).
GKA70 | GKA71 | |
---|---|---|
|
|
|
GK EC50/μM | 0.06 | 0.11 |
LogD | 3.2 | 2.9 |
Solubility/μM | 20 | 129 |
Caco-2 A to B Papp/nm s−1 | 540 | 290 |
hERG IC50/μM | 6.9 | 7.5 |
Rat Cl/mL min−1 kg−1 | 11 | 5.6 |
Rat Vss/L kg−1 | 0.3 | 0.2 |
Rat F (%) | 28 | 71 |
Dog Cl/mL min−1 kg−1 | 7.2 | 0.5 |
Dog Vss/L kg−1 | 0.8 | 0.3 |
Dog F (%) | 11 | 56 |
Rat OGTT AUC reduction | 29% at 10 mg kg−1 | 27% at 3 mg kg−1 |
Substructure transformationa,b | ΔpEC50 | ΔlogD | ΔpEC50-logD | ΔhERG pIC50 | ||
---|---|---|---|---|---|---|
a Raw data are shown in supporting information. b These key structural changes are outlined here to illustrate the principle, clearly as part of this campaign, many other changes, conservative and more diverse, were explored and analysed in this way with differing degrees of success. | ||||||
![]() |
R1 |
![]() |
−0.1 (n = 27) | −0.6 (n = 27) | 0.5 (n = 21) | −0.5 (n = 7) |
R2 |
![]() |
0.1 (n = 33) | −0.6 (n = 32) | 0.7 (n = 21) | −0.1 (n = 22) | |
R4 |
![]() |
0.3 (n = 20) | 0.2 (n = 23) | 0.1 (n = 16) | −0.3 (n = 19) |
In a broad sense, this series shows an overall dependence on lipophilicity for many secondary properties. Solubility appears to be negatively correlated with logD and permeability (as measured by Caco-2 A to B flux) correlates positively (Fig. 1). In addition, despite the subtleties in structural changes outlined above, hERG potency also correlates positively with logD within this series. Notably, it is apparent within this plot that despite the relatively diminished difference on hERG activity per lipophilicity unit observed between thiadiazoles and pyrazoles in a matched pair sense, for a given lipophilicity the sulfur containing R1 heterocyclic (COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundthiazole and COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundthiadiazole) sub-series tends to be more hERG active than the pyrazoles.
![]() | ||
Fig. 1 (a) x-y plots of solubility, Caco-2 A to B flux and hERG pIC50 against logD. Sulfur containing R1 heterocycles (thiazoles and thiadiazoles) are shown in red, pyrazoles are in green. Other R1 hetercycles are shown in grey. Points marked with circles are outside of the dynamic range of the y-data. The horizontal red dotted line marks the desired threshold value and the vertical green dotted line the likely logD limit required to achieve that threshold. (b) Logistic plots showing how probabilities of achieving solubility >100 μM and Caco-2 A to B flux >50 nm s−1 and hERG pIC50 < 4.5 change with logD. Sulfur containing R1 heterocycles (thiazoles and thiadiazoles) are shown in red, pyrazoles are in green. Other R1 hetercycles are shown in grey. Points marked with circles are outside of the dynamic range of the data. Points show the logD for each compound (x-coordinate) and are jittered in the y-direction to illustrate the populations of the two categories. The blue curve indicates the probability, the horizontal red line marks the probability of achieving the desired value of each parameter defined above at the selected logD limit. |
From these data, it is apparent that the chances of identifying a compound possessing the levels of solubility and permeability we deemed desirable for a drug candidate (defined here as 100 μM solubility and 50 nm s−1 for Caco-2 flux) can be significantly improved by targeting a logD range that is between 1.8 and 3. Achieving a hERG pIC50 below 4.5 would likely require a logD below 2.5 for an R1 pyrazole containing compound and is likely to be difficult to achieve at all for an R1 thiazole/COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundthiadiazole without impinging on regions of logD for which permeability would be compromised. Hence, it becomes necessary that new compound synthesis is targeted at a narrow range of logD between 1.8 and 2.5. From a design perspective, this is very challenging since many structural changes, for instance some of those outlined (Table 2) bring about a change in logD comparable in magnitude to the size of this window. This range is also comparable to the error of even the best computational algorithms for lipophilicity calculation.15
In order to address this, an entire matrix of virtual compounds consisting of four dimensions of substructural change that were either known, or postulated by analogy, to fulfil the requirements of glucokinase potency was enumerated computationally (Fig. 2). Crucially, the lipophilicity changes in these substructural inputs were minimised so as to allow the greatest number of possible combinations that target the optimum logD range. For each compound, lipophilicity was calculated using AstraZeneca's in house AZlogD algorithm.16 This method uses in house compounds, including those measured within the series in question, as the training set for the model and, as such, provided the most accurate means of predicting logD for these compounds, having a prediction error of 0.39 (see supporting information). Compounds with calculated logD values focused on the 1.8 to 2.5 range were then selected for synthesis.
![]() | ||
Fig. 2 Representative substructures chosen for the enumerated matrix. |
Having completed this selection, 70 compounds were made and profiled. As a measure of the success of this approach, each compound was assessed against three secondary screening parameters, notably solubility (>100 μM), hERG pIC50 (<4.5 μM) and rat oral AUC (nAUC > 0.5 μM h kg−1). Of the 70 compounds, 19 of them were found to surpass desirable values in these tests, representing a success rate of 27%. By comparison, of 112 representative previous compounds assessed in these assays from the same series, only 8 had been found to fulfil the same criteria (7%) (Fig. 3).
![]() | ||
Fig. 3 Numbers of compounds passing solubility (>100 μM), hERG pIC50 (<4.5) and rat oral AUC (nAUC >0.5 μM h kg−1) tests for compounds within the matrix design compared to a representative set of previous compounds from the series. |
We believe that these findings vindicate the approach taken but they also left us contemplating why, in a chemical series having reached a reasonable level of maturity in terms of understanding of structure–activity and structure-property relationships, attrition rates were still so high. A partial explanation lies in the calculation of logD. Despite best efforts to achieve accurate calculation, some of the synthesis targets had measured logD values outside of the desirable range and these tended to show greater attrition. Nevertheless, significant attrition was still observed for those in the optimum range, highlighting the complex effects of multiple, small structural changes against multiple parameters. This means that, without screening, it is very difficult to predict the best overall compound within a narrow region of chemical space and for identification of an optimised clinical candidate, such an exercise as outlined above is likely to be necessary at the end of an optimisation campaign to ensure the best compound has been identified.
Through extensive profiling of the compounds identified by the above exercise, most notably in rat and dog pharmacokinetic studies, high fat-fed female Zucker rat oral COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundglucose tolerance tests and solid state characterisation, GKA80 was identified as the most promising overall candidate for development within this set (Table 3). Other compounds were considered to be slightly inferior in one or more aspects for example compounds 1 and 2 showed increased clearance in the rat relative to GKA80 such that these compounds were considered less likely to achieve a suitable pharmacokinetic profile in humans.
GKA80/AZD1092 | 1 | 2 | |
---|---|---|---|
|
|
|
|
GK EC50/μM | 0.03 | 0.04 | 0.05 |
LogD | 1.8 | 1.7 | 1.8 |
Solubility/μM | 350 | >1000 | 730 |
Caco-2 A to B Papp/nm s−1 | 18 | 17 | 23 |
hERG IC50/μM | 70 | >100 | >95 |
Rat Cl/mL min−1 kg−1 | 14 | 31 | 34 |
Rat Vss/L kg−1 | 1.8 | 1.9 | 1.1 |
Rat F (%) | 30 | 78 | 27 |
Dog Cl/mL min−1 kg−1 | 14 | Not tested | Not tested |
Dog Vss/L kg−1 | 1.3 | Not tested | Not tested |
Dog F (%) | 51 | Not tested | Not tested |
Rat OGTT activity | 24% at 3 mg kg−1 | Not tested | Not tested |
GKA80 was selected for further development and was given the internal identifier AZD1092. AZD1092 exhibited dose dependent reduction of COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundglucose excursion in oral COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundglucose tolerance tests in high fat fed female Zucker rats17 and COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundglucose lowering efficacy in free feeding COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundglucose profiles in male Zucker rats (Fig. 4). The compound showed no major concerns in preclinical toxicology testing and hence was selected as a development candidate. Further results of our ongoing investigations in this area will be communicated in due course.
![]() | ||
Fig. 4
COMPOUND LINKS Read more about this on ChemSpider Download mol file of compoundGlucose lowering efficacy of GKA80/AZD1092. (a) Oral COMPOUND LINKS Read more about this on ChemSpider Download mol file of compoundglucose tolerance test in high fat fed female Zucker rats – vehicle control is shown in grey, 1 mg kg-1 in red, 3 mg kg-1 in green, 10 mg kg-1 in blue. (b) Free feeding COMPOUND LINKS Read more about this on ChemSpider Download mol file of compoundglucose profiles in male Zucker rats – vehicle control is shown in grey, 10 mg kg-1 in blue. |
Footnotes |
† Electronic supplementary information (ESI) available: References to experimental protocols, matched pair plots and raw data for those summarised in Table 2, a list of SMILES strings for the 70 compounds made within the matrix, distributions of their associated data and synthetic details for the synthesis of GKA80/AZD1092. See DOI: 10.1039/c1md00092f |
‡ Statistical analyses were carried out using SAS JMP (http://www.jmp.com). |
This journal is © The Royal Society of Chemistry 2011 |