Poongavanam
Vasanthanathan
a,
Jeroen
Lastdrager
b,
Chris
Oostenbrink
bc,
Jan N. M.
Commandeur
b,
Nico P. E.
Vermeulen
b,
Flemming S.
Jørgensen
a and
Lars
Olsen
*a
aFaculty of Pharmaceutical Sciences, Department of Medicinal Chemistry, University of Copenhagen, Universitetsparken 2, 2100, Copenhagen, Denmark. E-mail: lo@farma.ku.dk; Fax: +35-33-60-40
bLeiden–Amsterdam Center for Drug Research, Section of Molecular Toxicology, Department of Chemistry and Pharmacochemistry, Vrije Universiteit, De Boelelaan 1083, 1081 HV, Amsterdam, The Netherlands
cInstitute of Molecular Modeling and Simulation, University of Natural Resources and Life Sciences, Muthgasse 18, A-1190, Vienna, Austria
First published on 19th July 2011
The metabolic enzyme cytochrome P450 1A2 (CYP1A2) attracts much attention, not only because of its metabolism of drug compounds, but also due to its ability to convert procarcinogens to carcinogens. From a virtual screening, 41 compounds were selected and tested experimentally for inhibition of CYP1A2. Among these compounds, 16 inhibited the CYP1A2 activity by more than 50% at a concentration of 0.3 μM. The three most potent inhibitors have IC50 values of 20 nM. These inhibitors contain new scaffolds and may serve as starting points for further lead optimization. Thus, the applied virtual screening methods are useful for considering CYP1A2 inhibition, either to identify inhibitors of CYP1A2, e.g. for cancer therapy, or to identify undesirable inhibitory effects of the enzyme.
Although the number of drug candidate failures due to poor pharmacokinetic properties and toxicity have been reduced over the past years,13 there is still a need for improvement. One key to this could be a better understanding and prediction of CYP metabolism. This may be facilitated by the recent advances in the development of in silico methods to predict CYP metabolism,14–17e.g. predicting sites of metabolism15,18–22 or binding to the enzyme.23–28 The reasons for these advances are most likely the increasing amount of experimental CYP metabolism data, e.g. binding affinities from high-throughput screening assays in the PubChem project,29 and crystal structures,30–39 which are crucial for developing and validating the quality of the in silico models.
Ligand- and structure-based in silico approaches are extensively utilized to identify new ligands with affinity for a wide range of protein targets. CYPs, on the other hand, are often considered to be anti-targets rather than targets, and therefore, the in silico methods may be used to identify and remove compounds with affinity for the CYP enzyme. However, in the particular case of the CYP1A2 enzyme, it may also be considered a traditional target because of its potential involvement in cancer.11,12
We have previously used ligand-based and structure-based approaches to classify whether or not a compound is a CYP1A2 inhibitor. In a previous ligand-based study, a variety of machine learning methods were developed, the best one having an accuracy of 76% on an external test set containing about 7000 compounds.27 Using a structure-based docking approach, it was shown that about 61% of the inhibitors and 68% of the non-inhibitors could be classified correctly.40
Since the ligand-based and structure-based approaches are fundamentally different, we have in the present study used both virtual screening approaches to identify CYP1A2 ligands from a large compound library. The best ranked compounds from both searches have been selected for subsequent in vitro testing. Finally, the quality of the two types of in silico models is compared on the basis of the experimental results.
997 ZINC entries of 16
338 compounds were screened. The difference in the number of entries and unique compounds is due to the fact that some compounds have several possible protonation states and chiral atoms. For compounds with more than one entry, the one with the best score, either random forest probability from the ligand-based screening or ChemScore from the structure-based screening, was kept.
997 entries in the database were docked into the CYP1A2 active site using the ChemScore scoring function41,42 with the same settings as in our previous work.40 For a few compounds, the docking failed, resulting in a total of 16
330 compounds that were successfully docked. The obtained score values were in the range of −38.54 to 48.42. As shown earlier, compounds with a score being larger than 22.2 are more likely to be inhibitors.40 This corresponds to 60% of the database being predicted as being likely inhibitors of CYP1A2. However, it was also shown that most of the inhibitors being incorrectly classified as non-inhibitors are found in the score range of about 10–22.2.40 Thus, a few compounds in that range were also selected, in addition to those with scores larger than 22.2. In total, a set of 50 compounds was selected for experimental testing, of which 22 were available from the vendor (compounds T–AO, see Fig. S1†).
![]() | ||
| Fig. 1 Random forest probabilities, ChemScore values, and percent inhibition of CYP1A2 for the 41 tested compounds (A–AO) at a concentration of 0.3 μM. | ||
The IC50 values of the compounds that inhibited the CYP1A2 activity most at a concentration of 0.3 μM were determined (see Table 1 and Fig. 2). These had IC50 values in the range of 20–200 nM. The three best compounds, AD, AJ, and AM, had IC50 values around 20 nM. This is the same as the well-known CYP1A2 inhibitor, α-naphthoflavone (αNF). With a ChemScore value of 46.9, αNF was found among the top 10 scoring compounds in the docking experiment, and was included in the experiments as the reference value.
![]() | ||
| Fig. 2 The most potent inhibitors at a concentration of 0.3 μM. Compounds D, H, J, and Q were selected from the ligand-based screening, and compounds AB, AD, AH, AJ, and AM were selected from the structure-based screening. | ||
![]() | ||
| Fig. 3 The active site of CYP1A2 with α-naphthoflavone (αNF) in the active site as observed in the X-ray structure, characterized with molecular interaction fields. Shown are isosurfaces of the DRY probe (brown surface at −1 kcal mol−1), C3 probe (grey mesh at −2 kcal mol−1), and oxygen acceptor probe (red surface energy level at −5 kcal mol−1). Docking poses of compounds AD, AJ, and AM are also shown. Ligands are shown with cyan C atoms. The COMPOUND LINKS Read more about this on ChemSpider Download mol file of compoundwater molecule was not included in the dockings. | ||
The docked conformations of the best inhibitors, i.e. compounds AD, AJ, and AM, all have similar interactions with the phenylalanine cluster (Fig. 3). Moreover, compound AJ also has a carbonyl oxygen in the same position as αNF, although the COMPOUND LINKS
Read more about this on ChemSpider
Download mol file of compoundwater molecule was not included in the docking.40 Some of the other inhibitors have chemical groups that seem to interact directly with the heme group, e.g. the nitro group in compound AM and the 1,2,4-triazole ring in compound Q. The latter type of interaction between an aromatic nitrogen and the heme group is well known and has been observed previously for inhibitors of other human CYPs, e.g.CYP2A6 and CYP3A4.37,43
997 entries in ZINC was performed with already known CYP1A2 substrates projected onto this (see Fig. 4). The figure shows that the known CYP1A2 ligands primarily are located in the upper-right part of the score plot, reflecting that they are hydrophobic in nature (cf. the loadings plot, Fig. 4). The compounds selected from the ligand-based virtual screening seem to be more similar to the known CYP1A2 ligands than those obtained from the structure-based method. This is probably due to the fact that the ligand-based model was trained using known CYP1A2 ligands, and therefore identifies similar compounds. No such training is done with the structure-based method.
![]() | ||
Fig. 4 Score plot (top) and loadings plot (bottom) from principal component analysis (two first of seven principal components (PCs) shown (R2 = 0.67, Q2 = 0.40)) of the 19 997 entries extracted from the ZINC database (in grey). The experimentally tested compounds selected by the ligand-based random forest method (19 compounds) and the structure-based docking method (22 compounds) are colored black and green, respectively. Other known CYP1A2 inhibitors/non-inhibitors from PubChem are shown with open red/blue triangles, respectively. The tested compounds with best IC50 values are shown. | ||
Fig. 5 shows the frequency distribution of the ChemScore values for the entries that are predicted as inhibitors and non-inhibitors of CYP1A2 with the random forest method. The majority of entries predicted as inhibitors with ChemScore are also predicted as inhibitors with the random forest method (8294 out of 11
821 corresponding to 70%, see the caption of Fig. 5). Out of the 8085 entries predicted as non-inhibitors with ChemScore, 4096 were also predicted as non-inhibitors with the random forest method (51% agreement between the two virtual screening methods).
![]() | ||
| Fig. 5 The distribution of ChemScore values of the entries in the database that are predicted as inhibitors (“RF Inh”, in red) or non-inhibitors (“RF Non-Inh”, in green) with the random forest method. An entry was predicted as an inhibitor with the random forest method if the probability was ≥50%. The line at a value of 22.2 in ChemScore indicates the threshold used for considering a compound as an inhibitor with ChemScore. In total, 4096 and 8294 entries are classified as non-inhibitors and inhibitors by both methods, respectively; 3527 entries are predicted as inhibitors by ChemScore and non-inhibitors by the random forest method; 3989 entries are predicted as non-inhibitors by ChemScore and inhibitors by the random forest method. | ||
We have identified 16 compounds that inhibit the activity of the enzyme with more than 50% at a concentration of 0.3 μM (cf.Tables 1 and S1†). Of these, the majority (13 compounds) are predicted as inhibitors with both random forest and ChemScore. Considering only the ability of ChemScore to predict the experimental outcome (inhibition of the activity of CYP1A2 by more than 50% at a concentration of 0.3 μM), it correctly predicts that 15 compounds are inhibitors out of a total of 27 that were predicted as inhibitors (56%, Table 2). With the random forest method, a slightly larger number of compounds (32 compounds) are predicted as inhibitors but fewer (14 compounds, 44%) are verified as being inhibitors. Notably, if using 50% inhibition at a concentration of 3 μM as threshold for inhibition/non-inhibition, there is almost no difference between the random forest and ChemScore methods. This could be due to the fact that the random forest model has been trained on a set of compounds where an active compound is defined as inhibiting CYP1A2 with an affinity better than 40 μM, i.e. not a particularly high affinity.
| Number of compounds | Predicted with | Verified experimentally | Correctly predicted with | ||||
|---|---|---|---|---|---|---|---|
| Random forest | ChemScore | Both methods | IC50 < 0.3 μM | Random forest | ChemScore | Both methods | |
| Inhibitor | 32 | 27 | 23 | 16 | 14 | 15 | 13 |
| Non-inhibitor | 9 | 14 | 5 | 25 | 7 | 13 | 5 |
| Total | 41 | 41 | 28 | 41 | 21 | 28 | 18 |
The three best inhibitors having IC50 values of about 20 nM were predicted as inhibitors of CYP1A2 with both methods, having very high ChemScore values of more than 40, whereas the random forest probabilities are in the range of 60–80% (Fig. 2 and Table S1†). Thus, with ChemScore it seems to be possible to identify high affinity inhibitors. With the random forest method, on the other hand, it has previously been shown that a more certain classification of inhibitors with affinities lower than 40 μM was obtained using probabilities of 100%, but as shown here, this does not necessarily lead to high affinity inhibitors. That is, our experience from this study shows that in future studies aiming at identifying CYP1A2 ligands, the selection of potential CYP1A2 ligands could be done a bit differently. It could be sufficient to apply docking and select the compounds with best scores to identify potent inhibitors, or if applying the random forest model, to select predicted inhibitors with lower probabilities than 100%.
As mentioned previously, it is also of interest to consider CYP1A2 as a non-target, i.e. to identify the non-inhibitors correctly. ChemScore correctly predicts 13 out of a total of 14 predicted non-inhibitors and 25 experimentally verified non-inhibitors (52%) (see Table 2). Again, this is slightly worse with the random forest method that only predicts 7 of the non-inhibitors correctly. Thus, the restrictions induced by the enzyme preventing binding seem to be better included when explicitly considering the enzyme rather than only the descriptor space.
Although the random forest method seems to have a slightly worse ability to predict the inhibitors and non-inhibitors than ChemScore for this set of 41 compounds, it still has surprisingly good performance. Since only 39 two-dimensional physicochemical properties of each compound need to be calculated to use the random forest-based virtual screening method, it could be useful for quickly searching large compound databases.
338 compounds was screened for CYP1A2 inhibitors and 41 compounds were selected for subsequent experimental testing. Among these compounds, 16 inhibited the CYP1A2 activity by more than 50% at a concentration of 0.3 μM. The 9 most active compounds bind with an IC50 value of less than 250 nM to CYP1A2 and the three best inhibitors have affinities in the same range as α-naphthoflavone (IC50 ≈ 20 nM). A comparison of the outcomes from the two virtual screening methods shows that the structure-based method identifies more inhibitors, which are furthermore more potent as well. Moreover, the prediction rate of the non-inhibitors is better for the structure-based method. Thus, the structure-based method seems well-suited either when considering CYP1A2 as a classical drug target or a non-target. This is relevant for identification of inhibitors of CYP1A2, e.g. for cancer therapy or chemo-prevention, or to avoid other, undesirable, inhibitory effects of the enzyme, e.g. in drug–drug interactions.
Footnote |
| † Electronic supplementary information (ESI) available: Experimental section, Tables S1 and S2, and Fig. S1*. See DOI: 10.1039/c1md00087j |
| This journal is © The Royal Society of Chemistry 2011 |