Dávid
Bajusz
,
György G.
Ferenczy
and
György M.
Keserű
*
Medicinal Chemistry Research Group, Research Centre for Natural Sciences, Hungarian Academy of Sciences, Magyar tudósok körútja 2, Budapest 1117, Hungary. E-mail: keseru.gyorgy@ttk.mta.hu
First published on 2nd September 2015
A property-based desirability scoring scheme has been developed for kinase-focused library design and ligand-based pre-screening of large compound sets. The property distributions of known kinase inhibitors from the ChEMBL Kinase Sarfari database were investigated and used for a desirability function-based score. The scoring scheme is easily interpretable as it accounts for six molecular properties: topological polar surface area and the number of rotatable bonds, hydrogen bond donors, aromatic rings, nitrogen atoms and oxygen atoms. The performance of the Kinase Desirability Score (KiDS) is evaluated on both public and proprietary experimental screening data.
Since kinases have one well-defined function and share their endogenous ligand (ATP), their ATP-binding pockets are very well-conserved across the whole kinome. Thus, medicinal chemists face a great challenge in designing kinase inhibitors with sufficient selectivity towards the given target to avoid unwanted side effects. Even though the field has seen the advent of type II inhibitors in the 2000s,4 the majority of reported kinase inhibitors are still type I ligands. (Type II inhibitors bind to the inactive or “DFG-out” conformation of kinases as opposed to type I inhibitors which bind to the ATP-binding pocket in an active or “DFG-in” conformation.) Moreover, as our understanding of the mechanism of action of type II inhibitors improves, it is becoming clearer that this class of compounds is not inherently more selective than ATP-site inhibitors.5 Thus, the predominant approach towards kinase inhibitor design is still the small-molecule targeting of the ATP-site, even more so as the majority of available structural and biochemical data refer to type I inhibitors.
Virtual screening has been proven to be a useful approach in the hit discovery of kinase targets.6,7 However, due to the significant increase of the commercially and/or synthetically available drug-like (and lead-like) chemical space, structure-based screening methods are facing capacity challenges. As a solution, less accurate but quicker filters can be applied prior to the actual virtual screening (e.g. docking) to derive a more focused dataset of manageable size.
Various approaches have been applied previously to assemble kinase-focused compound libraries (virtual and physical as well), including substructure-based methods8–13 and similarity-based methods.14–16 Most recently, Singh and coworkers explored the possibility of characterizing kinase-like ligands based on physicochemical descriptors.17 With the increasing amount of publicly available inhibitor activity data,18 this approach becomes an attractive opportunity, since substructure- and similarity-based methods inherently retrieve molecules that are structurally similar to the reference compound(s), limiting the ability to identify inhibitors with novel scaffolds. In contrast, property-based methods do not have this limitation. The Kinase-Like Score (KLS) introduced by Singh and coworkers characterizes kinase-like ligand space on a statistical basis: it considers nine descriptors and scores them according to a formula that assumes a normal distribution.
A suitable MPO (multi-parameter optimization19) method for compound profile optimization is the desirability function.20,21 The essence of the underlying concept is that for each descriptor, a tailor-made scoring function is introduced, which reflects the “desirability” of the various possible values of that descriptor (e.g. how prevalent that descriptor value is among reference compounds). Desirability functions usually take values between 0 and 1, and generally either a sum or a product of the individual scores is calculated at the end of the process to produce the overall desirability score. Recent examples of studies that involve desirability function-based optimizations include Cruz-Monteagudo and coworkers' paper on global QSAR studies,22 Avram and coworkers' article on the characterization of pesticide-like compounds,23 and a GPCR-focused library design implementation by our group.24
In this paper, we present a desirability function-based scoring scheme (Kinase Desirability Score or KiDS) using topological descriptors to screen kinase-like ligands. Based on this study, KiDS can be applied as a pre-filter for kinase-like ligands in virtual screening campaigns, or alternatively, it might support the design of kinase-focused libraries.
![]() | ||
Fig. 1 Workflow representation of the calculation of KiDS. The last step corresponds to the application of KiDS as a filtering criterion. |
The desirability functions as introduced by Harrington20 and Derringer21 were defined for a number of molecular descriptors as custom-made functions that assign a value between 0 and 1 (desirability score) to each possible descriptor value. Generally, the assigned desirability scores were higher as the prevalence of the given descriptor value was higher among actives and lower among non-actives. (For a more detailed description, see the “Results” section and Fig. S1–S6.†) The additive approach was taken to summarize the separate desirabilities based on the descriptors, i.e. the overall Kinase Desirability Score was defined as the sum of the desirability scores obtained for the descriptors independently.
EFx% = (TPRx%)/x![]() | (1) |
EFx% = (Nact,x%/Nx%)/(Nact/N) | (2) |
The ROC curve is the plot of %(true positives) vs. %(false positives) for the ranked list of objects (here, molecules). The straight diagonal line is a reference that corresponds to a random classification. AUC is the area under the ROC curve which is calculated numerically. 95% confidence intervals are reported for both the AUC values and the enrichment factors as elaborated by Nicholls.36
Median (act.) | IQR (act.) | Other (act.) | |
---|---|---|---|
a No descriptors were selected where the medians of the kinase actives and random molecules coincide. b In cases where a value is outside the interquartile range (IQR) for both sets, a score of 0.2 is assigned when the given value is visibly more common among kinase actives than random molecules (see Fig. S1–S6). | |||
Median (rand.) | —a | 0.5 | 0 |
IQR (rand.) | 1 | 0.5 | 0 |
Other (rand.) | 1 | 1 | 0, 0.2b |
From the distributions of these descriptors among kinase-like and random molecules, the following general observations can be drawn: among kinase-like compounds, less oxygen atoms and rotatable bonds, higher polar surface area, and more aromatic rings, nitrogen atoms and hydrogen bond donors are preferred than what can be observed for random compounds. These differences are reflected in the definitions of the desirability functions of KiDS.
![]() | ||
Fig. 2 Evaluation of the Kinase Desirability Score. (A) ROC curves of the evaluation of the Training and Test sets with KiDS. In addition to the AUC values being close to 0.8, the initial slopes are quite high, which corresponds to good early enrichment factors (as reported in Table 2). A negligible deterioration of the results is observable for the Test sets (relative to the Training set), which suggests that the predictive power of the scoring method is sufficiently high, and thus it can be used for prospective applications. A ROC curve acquired for the Training set with the application of the Kinase-Like Score (KLS) of Singh et al.17 is provided as a reference (thick black line). (B) Additional validation has been carried out with a different set of non-actives. The actives from the Training and Test sets were mixed with 20![]() |
Dataset | Active | Random compounds | EF0.5%a | EF1%a | EF2%a | EF5%a | AUCa | |||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
KiDS | KLSb | KiDS | KLSb | KiDS | KLSb | KiDS | KLSb | KiDS | KLSb | |||
a 1.96σ values (corresponding to 95% confidence intervals) are given in parentheses.36 b Performance parameters obtained for the same datasets with the KLS score of Singh et al. are provided as a reference.17 | ||||||||||||
Training | 2500 | 22![]() |
23.2 (1.9 × 10−2) | 1.90 (5.1 × 10−3) | 14.2 (9.9 × 10−3) | 1.79 (3.5 × 10−3) | 10.6 (5.6 × 10−3) | 1.78 (2.4 × 10−3) | 7.1 (2.5 × 10−3) | 1.44 (1.3 × 10−3) | 0.786 (9.6 × 10−3) | 0.544 (0.012) |
Test 1 | 1923 | 18![]() |
22.6 (2.4 × 10−2) | 1.87 (6.5 × 10−3) | 14.0 (1.3 × 10−2) | 1.40 (3.9 × 10−3) | 10.9 (7.2 × 10−3) | 1.46 (2.8 × 10−3) | 6.9 (3.2 × 10−3) | 1.33 (1.7 × 10−3) | 0.778 (0.011) | 0.537 (0.014) |
Test 2 | 730 | 6300 | 18.9 (6.1 × 10−2) | 3.78 (2.6 × 10−2) | 14.8 (3.6 × 10−2) | 3.15 (1.7 × 10−2) | 9.9 (1.9 × 10−2) | 2.81 (1.1 × 10−2) | 6.2 (8.6 × 10−3) | 1.81 (5.3 × 10−3) | 0.757 (0.019) | 0.532 (0.023) |
External validation has been carried out on Test sets 1 and 2, and clearly the deterioration of the results (with respect to the Training set) is negligible, confirming the robustness of the scoring method. An additional external validation was carried out to verify the robustness of the Kinase Desirability Score: the random compounds from Mcule (in the Training and both Test sets) were exchanged to a set of 20000 random lead-like compounds from ZINC to assess whether the scoring method is dependent on the starting dataset (Fig. 2B). The deterioration of the performance parameters was negligible, suggesting that the performance of KiDS does not depend significantly on the source of the examined database. (Enrichment factors and AUC values are reported in Tables S3 and S4†). The active
:
non-active ratio on the other hand influences this performance as shown in the next section.
KiDS also outperforms the Kinase-Like Score (KLS) of Singh et al.17 (presented in Fig. 2 as a reference), justifying its use for the mentioned purposes. An explanation for the improved performance of KiDS relative to the Kinase-Like Score (KLS)17 is that while KLS accounts only for the property distributions of kinase actives, KiDS considers the differences between kinase actives and random, commercially available compounds. The same can be specified as the reason for KLS being sensitive to the source of random compounds, while KiDS is not (Fig. 2B). In this context, it is worth noting that the ability to distinguish and characterize different compound databases was a key requirement during the development of KLS. While the primary purpose of KLS was to examine compound databases, KiDS was developed with the intention of providing a general tool for property-based pre-screening for structure-based virtual screens and as such, it provides a better alternative for this task than KLS.
#a | AIDb | Target | Activity threshold (μM)c | Confirmed active | Inactive | KiDS AUCd | KLS AUCde |
---|---|---|---|---|---|---|---|
a Panel identifier in Fig. 3. b PubChem Bioassay IDs (where applicable). GR: Gedeon Richter Plc. proprietary HTS dataset. c IC50 value, below which a molecule is considered a confirmed active. d 1.96σ values (corresponding to 95% confidence intervals) are given in parentheses.36 e AUC values obtained for the same datasets with the KLS score of Singh et al. are provided as a reference.17 f 70% inhibition at an HTS screening concentration of 10 μM (as a confirmation, single-point inhibition measurements were carried out at 10 μM in duplicate). | |||||||
A | GR | Undisclosed kinase target | 70%f | 28 | 7480 | 0.574 | 0.397 |
(0.110)f | (0.116) | ||||||
B | 524 (screening) | Protein kinase A (PKA) | 60 | 40 | 22![]() |
0.700 | 0.557 |
548 (confirmatory) | (0.075) | (0.086) | |||||
C | 604 (screening) | Rho-associated protein kinase 2 (ROCK2) | 10 | 35 | 20![]() |
0.682 | 0.603 |
644 (confirmatory) | (0.080) | (0.083) | |||||
D | 619 (screening) | Polo-like kinase 1 (PLK1) | 50 | 14 | 30![]() |
0.791 | 0.523 |
785 (confirmatory) | (0.102) | (0.131) |
![]() | ||
Fig. 3 External validation of KiDS on proprietary (A) and publicly available (B–D) datasets of HTS campaigns (see Table 3 for details). The ROC curves suggest the applicability of KiDS as a pre-screening step in HTS campaigns to reduce the necessary instrumentation (and thus, the effective cost) for finding hit compounds. |
It is apparent from the results that the scoring of the screened datasets with KiDS is effective in selecting a subset enriched with kinase ligands. For example, the experimental testing of the top half of the HTS set published as AID 604 in the PubChem Bioassay (Fig. 3C) would result in identifying 80% of the actives that are found during the testing of the whole dataset. A similar result is obtained for AID 524 while KiDS gave somewhat inferior results for the Gedeon Richter's HTS (60% confirmed actives in the top scored 50%) and performed better for AID 619 where over 90% of actives are identified in the top scored 50% set. (Clearly, the performance is worse than for the Training and Test sets presented earlier, but that can be attributed to the much lower active:
inactive ratios of the PubChem Bioassay HTS sets.) Moreover, KiDS proved to be superior to KLS in each case. These results support the fact that the application of KiDS as a pre-filtering step can reduce the effective cost of finding active molecules in a kinase-directed high-throughput screening.
![]() | ||
Fig. 4 Plot of KiDS vs. average number of kinases hit for the EMD Millipore Kinase Screening dataset in ChEMBL.25 For each point (X, Y), Y is equal to the number of kinases hit averaged over the compounds possessing a KiDS score less than or equal to X. A significant linear correlation can be observed between the KiDS score and kinase promiscuity, with R2 = 0.838. |
In this study we introduced a molecular property-based scoring scheme, the Kinase Desirability Score (KiDS). The scoring scheme involves custom desirability functions based on six molecular descriptors: topological polar surface area (TPSA) and the number of rotatable bonds (rotB), nitrogen atoms (NN), oxygen atoms (NO), aromatic rings (Arom) and hydrogen bond donors (HBD). Scores between 0 and 1 are assigned to each of the descriptors and summed up to give the Kinase Desirability Score. KiDS is flexible in the sense that it does not impose very strict constraints regarding either of the involved molecular properties. Therefore, it allows for the identification of structurally novel kinase inhibitors.
KiDS was developed and tested using a dataset of known kinase inhibitors (ChEMBL) and random compounds from commercial compound databases (Mcule and ZINC), and its performance was assessed with early enrichment factors, ROC curves and AUC values on Training and independent Test sets. External validation also involved testing its performance on proprietary and public HTS datasets as well as full matrix screening data. In the latter case, a significant correlation between the KiDS score and kinase promiscuity could be observed.
The good and consistent performance parameters suggest that KiDS is useful as a pre-screening step in virtual screening workflows and for kinase-focused library design, as well. It also presents a more efficient alternative for these tasks than the previously suggested Kinase-Like Score (KLS). In HTS campaigns, a KiDS-based pre-screening can reduce the effective cost of finding hit compounds.
AUC | Area under the (ROC) curve |
EF | Enrichment factor |
GPCR | G-protein coupled receptor |
HBD | Number of hydrogen bond donors |
IC50 | Half maximal inhibitory concentration |
IQR | Interquartile range |
KiDS | Kinase Desirability Score |
log![]() | Logarithm of the n-octanol/water partition coefficient |
MPO | Multi-parameter optimization |
QSAR | Quantitative structure–activity relationship |
ROC | Receiver operating characteristic |
rotB | Rotatable bond count |
TPSA | Topological polar surface area |
Footnote |
† Electronic supplementary information (ESI) available: Desirability functions of the descriptors applied in KiDS, enrichment factors for the performance evaluation of KiDS, and results of the external validation of KiDS. See DOI: 10.1039/c5md00253b |
This journal is © The Royal Society of Chemistry 2015 |