J. Jesús Navejaab and
José L. Medina-Franco*a
aFacultad de Química, Departamento de Farmacia, Universidad Nacional Autónoma de México, Avenida Universidad 3000, México, D.F. 04510, México. E-mail: medinajl@unam.mx; jose.medina.franco@gmail.com; Tel: +52-55-5622-3899 ext. 44458
bFacultad de Medicina, PECEM, Universidad Nacional Autónoma de México, Avenida Universidad 3000, México, D.F. 04510, México
First published on 21st July 2015
The interest in developing inhibitors of DNA methyltransferases (IDNMT) as modifiers of epigenetic features for the treatment of several chronic diseases is rapidly increasing. Herein, we present insights of a chemoinformatic characterization of IDNMT focused on the analysis of the chemical space and structure–activity relationships (SAR) using activity landscape modeling (ALM). Analysis of the chemical space revealed two main groups of compounds whose chemical structures are associated with either cofactor analogs or non-nucleoside compounds. The ALM showed that non-nucleoside compounds have a continuous SAR while cofactor analogs have a rough SAR with several deep activity cliffs. Molecular modeling helped to explain the structural basis of the activity cliffs. The significance of the results is threefold: (1) the combined analysis of chemical space with activity landscape gave rise to a novel ‘activity landscape sweeping’ strategy that enabled a better structure-based interpretation of the SAR; (2) it is feasible – and advisable – to develop predictive models for non-nucleoside IDNMT studied in this work, and (3) structure-based interpretation of the SAR gave clear insights into the molecular mechanism of inhibition of novel IDNMT suggesting specific strategies to optimize the activity of leads compounds.
DNA methylation is a major epigenetic change that regulates gene expression in the genome of organisms that range from viruses to humans.3 DNA methylation is regulated by the family of enzymes DNA methyltransferases (DNMTs). DNMTs are responsible for the covalent addition of a methyl group from the cofactor S-adenosyl-L-methionine (SAM or AdoMet) (Fig. 1) to the carbon atom 5 of cytosine, preferably within CpG dinucleotides. Also, as a product of the methylation mechanism, S-adenosyl-L-homocysteine (SAH) is generated.4 In mammals, four DNMT enzymes have been identified: DNMT1 (the most abundant, it is a maintenance methyltransferase that acts on hemimethylated DNA); DNMT3A and DNMT3B (de novo methyltransferases that are capable of generating new methylation patterns in DNA), and DNMT3L that is associated with DNMT3A and DNMT3B, enhancing their activity.
![]() | ||
Fig. 1 Structures of representative IDNMT1. The relative position in chemical space of selected compounds: 4 SAM analogues (N) and 5 non-nucleoside (NN) compounds is shown in Fig. 2. |
The structure of DNMTs can be organized into a C-terminal catalytic domain and an N-terminal regulatory domain. The catalytic domain of all DNMTs shares a common structure called “AdoMet (SAM)-dependent Mtase fold”. The N-terminal domain is involved in distinguishing hemi- and unmethylated DNA. There are several three-dimensional (3D) structures of different domains of DNMTs, including the catalytic one.5
The role of DNMTs in carcinogenesis has been subject of intense research during the last ten years. Currently, there are two inhibitors of DNMT (IDNMT) in clinical use: 5-azacytidine and decitabine (Fig. 1) both approved by the United States Food and Drug Administration – FDA – for the treatment of myelodysplastic syndrome (MDS).6 However, these two drugs are cytosine analogues that are incorporated into DNA, which implies they are unspecific and have high toxicity due to their mutagenic effects that may occur in somatic cells. Many therapies involving IDNMT are under investigation, mainly as sensitizers to therapy, since epigenetic changes may be involved in rapid adaptation of cancerous cells to therapy. In addition to cancer, DNMTs are attractive targets for the treatment of other chronic and degenerative diseases such as Alzheimer's and psychiatric conditions. Also, DNA methylation has been involved in autoimmune diseases and inherited disorders.7
The low specificity and high toxicity of 5-azacytidine and decitabine has prompted the search for novel and specific IDNMTs. Currently there is a relatively large number of IDNMT and/or DNA demethylating compounds that have been obtained from different sources such as natural products, synthetic compounds, drugs approved for therapeutic indications other than cancer and high-throughput screening (HTS). As part of these efforts, computational analyses have been successfully implemented to model IDNMT and to identify novel inhibitors.8
Over the past few years, the structure and activity of compounds tested as IDNMT have been collected in public repositories such as ChEMBL.9 The increasing amount of structure–activity data of IDNMT opens up the possibility to conduct systematic structure–activity relationships (SAR) studies, such as quantitative SAR (QSAR). Nevertheless, it has been recognized that typical QSAR studies usually assume that compounds with similar structures have similar activity i.e., a ‘smooth’ SARs. It is well known that compounds with high structural similarity but low activity similarity i.e., ‘activity cliffs’,10 reduce the predictive ability of QSAR models.11,12 Therefore, the early detection of activity cliffs is a convenient step before attempting to develop models such as QSAR.13 Similarly, it is advisable to conduct detailed descriptive analysis to understand the SAR before developing predictive models.14 Thus far, limited studies have been reported to navigate and describe the SAR of a large set of IDNMT in a systematic manner.
In this work, we report a chemoinformatic-based characterization of the SAR of a dataset of 280 compounds tested as IDNMT1 and deposited in ChEMBL. The analysis had three specific aims: (a) characterization of the structural diversity and distribution in chemical space of the data set; (b) descriptive SAR analysis using the concept of activity landscape modeling and (c) structure-based interpretation of the activity cliffs. To the best of our knowledge this work represents one of the first activity landscape studies of IDNMT1. Indeed, it has been recently recognized that activity landscape modeling (ALM) is a convenient approach to explore systematically the SAR of screening data sets focused on epigenetic targets.15 The characterization of the chemical space distinguished two major types of chemical structures with different activity landscape. As part of the first aim it was developed a novel ‘activity landscape sweeping’ approach, that is, a dissection of the global activity landscape (global SAR) into smaller but more structural interpretable local landscapes (local SARs). The structure-based interpretation of the SAR of the activity cliffs gave key insights into the molecular mechanism of inhibition of active molecules. This analysis also prompted for structural modifications to lead compounds to continue developing IDNMT as potential epi-drugs or epi-probes.
![]() | ||
Fig. 2 Visual representation of the chemical space of the 280 compounds in the data set. The visualization was obtained by principal component analysis of the similarity matrices computed with ECFP. The percentage of variance explained by each PC is indicated in the corresponding axis. Data points are colored by the pIC50 values in a continuous scale. Two main clusters (A: circles, B: triangles) are readily distinguished. Nine selected compounds are identified as SAM analogues (N) and non-nucleosides (NN) compounds. The chemical structures are shown in Fig. 1. |
Fig. 2 shows two major clusters in chemical space herein labeled as cluster A (45 compounds) and cluster B (235 compounds), respectively. Both groups of compounds have active and inactive molecules e.g., red and gray points. Furthermore, the active compounds in each cluster are not further grouped suggesting that they are structurally diverse.
Visual inspection of all compounds in each cluster revealed that all the chemical structures in cluster A have a purine ring in their structure and are structurally related to the co-factor SAM. In contrast, molecules in cluster B are non-nucleoside. Representative structures from each cluster are depicted in Fig. 1 and are mapped into the visual representation of the chemical space of Fig. 2. The visual representation of the chemical space in Fig. 2 also suggested that molecules in cluster B (non-nucleoside) are structurally more diverse than the molecules in cluster A. Not surprisingly, the distribution of the similarity values (Fig. S1 in the ESI‡) confirmed that the non-nucleoside set has a higher structural diversity than the SAM-related compounds. This is because no further distinction is made on the type of chemical structures. In contrast, all compounds in cluster A are chemically related to SAM.
It is possible to further divide the non-nucleosides in smaller sub-sets chemically related. For instance, K-means clustering shows that 3–6 subgroups would provide an efficient clustering in terms of number of clusters and within group's sums of squares (see the ESI‡ for a detailed explanation on K-means methodology followed). However, clustering in two groups already diminished by more than 40% of the within groups sums of squares (see Fig. S4‡). Herein, we analyze the activity landscape of two clusters to discuss local SAR as general as possible. Undoubtedly, additional studies can be extended to analyze smaller clusters and provide information of more local SARs.
Equivalent clusters A and B were identified in the PCA of the combined ECFPs and MACCS keys similarity matrices using the fusion approaches detailed in the Methods section (Fig. S5 in the ESI‡). Interestingly, MACCS keys alone did not lead to the identification of the two clusters (Fig. S5a‡); this can be attributed to the low resolution of this fingerprint.30
![]() | ||
Fig. 3 Density SAS maps of the global and local activity landscapes. The 2D plots are colored by the frequency of data points in the coordinates given. Dashed lines divide the maps into the four quadrants labeled I–IV. The dotted line further divides the activity cliff quadrant (IV) in two regions (IVa and IVb) to distinguish shallow and deep cliffs (see text for details). (a) IDNMT1 SAS map for the entire set with 280 compounds (39![]() ![]() |
Quadrant | Region | Entire dataseta | SAM analogues (cluster A)b | Non-nucleosides (cluster B)c |
---|---|---|---|---|
a 280 compounds.b 45 compounds in cluster A of Fig. 2.c 235 compounds in cluster B of Fig. 2. | ||||
I | Uncertainty | 1571 (4.02%) | 2 (0.20%) | 1066 (3.88%) |
II | Similarity cliff (scaffold hop) | 36![]() |
46 (4.64%) | 26![]() |
III | Smooth SAR | 939 (2.40%) | 614 (62.02%) | 325 (1.18%) |
IVa | Deep activity cliffs | 64 (0.16%) | 49 (4.95%) | 15 (0.05%) |
IVb | Shallow activity cliffs | 309 (0.79%) | 279 (28.18%) | 30 (0.11%) |
Total | 39![]() |
990 (100%) | 27![]() |
Fig. 3a and Table 1 indicate that, overall, IDNMT1 have a heterogeneous SAR with data points in the continuous and discontinuous regions of the SAR (zones III and IV).15 Noteworthy, the scaffold hop, more recently called ‘similarity cliffs’38 region has the highest density of data points (92.6%). This indicates that there are quite different chemical structures with similar activity. Note however that both compounds in the pair may be either active or inactive. Fig. 3a and Table 1 also shows the presence of shallow and deep activity cliffs with a relatively small fraction of the entire data set (0.79 and 0.16%, respectively). The overall low frequency of activity cliffs is in agreement with the low frequency of activity cliffs observed for data sets for other molecular targets.26–28,30
The high density of data points in the similarity cliff region of the SAS maps and the two main clusters of compounds distinguished in the chemical space analysis, prompted us to conduct analysis of local activity landscapes of clusters A and B. As discussed in the next section, the chemical structures of compounds in each cluster, plus the knowledge of the mechanism of DNA methylation, led to an interpretable SAR.
The lower fraction of similarity cliffs for SAM-related analogues (4.6%) vs. the fraction of similarity cliffs for the non-nucleoside analogues (94.8%) is in agreement with the type of structures and molecular diversity in each cluster. Indeed, the visual representation of the chemical space (Fig. 2) and distribution of ECFP/Tanimoto similarity values for the compounds in each cluster (Fig. S1‡) yield consistent results. Similarly, the higher percentage of compounds in the smooth SAR region (III) for SAM analogues (62%) as compared to the percentage of compounds for non-nucleoside analogues (1.2%) (Table 1) is in line with the structural diversity of the chemical structures of each type of compounds.
As mentioned above, the distribution of data points in the similarity cliff and smooth regions of the SAS maps are expected from the type of chemical structures. But surprisingly, for SAM related analogues there is a larger fraction of deep and shallow activity cliffs as compared to the fraction of cliffs in the entire data set (4.9% and 28.2% vs. 0.16% and 0.79%, respectively; Table 1). In sharp contrast, the fraction of activity cliffs for the non-nucleosides is lower (0.05% and 0.11%, respectively, Table 1). These results indicate that SAM related analogues may be enriched with activity cliff generators.31 The next sections discuss the activity landscapes of each set of compounds, i.e., local activity landscapes. A brief analysis of the activity landscape of SAM-related compound is mentioned first followed by a more extensive discussion of the landscape of the non-nucleosides. We elaborated more on the non-nucleosides since they are currently more attractive as IDNMT1.39
As discussed in the literature, activity cliffs are rich in SAR information since they point to specific structural changes that have a large impact in the biological activity. In an activity landscape study based on structural fingerprints, the interpretability of the activity cliffs is a key component.37 In other words, the SAR of the activity cliffs should be easily translated in terms of specific structural changes. In the local activity landscape of non-nucleoside molecules we identified two major types of compounds with high ECFP/Tanimoto similarity whose chemical structures are structurally related, namely: compounds identified by HTS and structures related to SGI-1027.40 All pairs of compounds from HTS are shallow cliffs and are shown in Fig. 4 and S11 of the ESI.‡ From the 30 shallow activity cliffs found in the SAS map for non-nucleoside compounds, 16 (53%) compounds were found to be from HTS assays (Fig. 4 and Table 2). A considerable number of screenings and confirmatory assays were performed for these compounds, as found in PubChem.
![]() | ||
Fig. 4 Structures of activity cliffs of non-nucleoside compounds identified by high-throughput screening. Table 2 summarizes the potency difference and structure similarity for each compound pair associated with an arrow. |
Compound pair | Activity of most active compound in the pair (pIC50) | ΔpIC50 | ECFP/Tanimoto |
---|---|---|---|
CHEMBL115145, CHEMBL1503050 | 5.17 | 1.14 | 0.28 |
CHEMBL1302528, CHEMBL1377441 | 5.34 | 1.34 | 0.3 |
CHEMBL1302528, CHEMBL1443718 | 5.17 | 1.17 | 0.3 |
CHEMBL1302528, CHEMBL1558192 | 5.37 | 1.37 | 0.28 |
CHEMBL1302528, CHEMBL256098 | 5.04 | 1.03 | 0.3 |
CHEMBL1303509, CHEMBL1332402 | 5.99 | 1.04 | 0.27 |
CHEMBL1328733, CHEMBL1332506 | 6.09 | 1.1 | 0.27 |
CHEMBL1328733, CHEMBL1411673 | 6.09 | 1.18 | 0.37 |
CHEMBL1379120, CHEMBL592316 | 5.91 | 1.9 | 0.28 |
CHEMBL1403497, CHEMBL2063048 | 5.8 | 1.14 | 0.28 |
CHEMBL1564869, CHEMBL3109084 | 4.7 | 1.29 | 0.39 |
CHEMBL1607517, CHEMBL1704614 | 5.87 | 1.49 | 0.36 |
CHEMBL1607517, CHEMBL1988862 | 5.99 | 1.6 | 0.46 |
CHEMBL1916517, CHEMBL1916672 | 3.82 | 1.03 | 0.55 |
CHEMBL1978925, CHEMBL1990599 | 5.27 | 1.26 | 0.32 |
CHEMBL1983083, CHEMBL1990599 | 5.07 | 1.07 | 0.38 |
In the activity landscape of non-nucleoside molecules the deepest activity cliffs as well as the most relevant in medicinal chemistry were the structures related to the quinolone-based inhibitor SGI-1027. This compound is one of the most promising IDNMT1 that has been recently subject of a medicinal chemistry optimization program (vide infra). Therefore, in the next section we describe studies focused on the interpretation at a molecular level of activity cliffs related to SGI-1027.
![]() | ||
Fig. 5 Chemical structures of non-nucleoside activity cliffs related to regioisomers of SGI-1027. Table 3 summarizes the potency difference and structure similarity for each compound in this figure and the lead molecule CHEMBL3126646. |
In order to describe the analogues of the lead compound, Valente et al. considered that SGI-1027 is composed of four fragments (4-aminoquinoline + 4-aminobenzoic acid + 1,4-phenylenediamine + 2,4-diamino-6-methylpyrimidine) linked in sequence with para/para orientation.41 The most active compound in this series was CHEMBL3126646 which can be regarded as the meta/meta regioisomer of SGI-1027 (CHEMBL2336409).41 Table 3 summarizes the deep activity cliffs that form CHEMBL3126646. It must be noted that this compound is the most important activity cliff generator in the database i.e., it is the most prevalent compound within the activity cliff region of the SAS map.31 The deepest activity cliffs of the meta/meta regioisomer are formed with ortho regioisomers such as CHEMBL3126644, 3126647, 3126648, 3126649 with potency differences of two or more logarithmic units (Fig. 5 and Table 3).
Compound | ΔpIC50 | ECFP/Tanimoto |
---|---|---|
CHEMBL3126647 | 3.16 | 0.75 |
ortho/ortho SGI-1027 regioisomer | 3.16 | 0.53 |
CHEMBL3126654 | 3.16 | 0.38 |
CHEMBL3126649 | 2.51 | 0.69 |
CHEMBL3126644 | 2.46 | 0.60 |
CHEMBL3126653 | 2.37 | 0.42 |
CHEMBL3126648 | 2.01 | 0.57 |
Valente et al. reported docking models of CHEMBL3126646 with crystallographic structures of DNMT1. It was concluded from that studies that this molecule could interact with the CXXC auto-inhibitory domain of DNMT1 and be close to SAM but without making interactions with the cofactor or competing with any of the interactions that SAM makes.41 However, no structure-based explanation of the large potency difference of the significantly less active SGI-1027 analogues (e.g., ortho regioisomers) was explored. A structure-based interpretation of the activity cliffs that form the most active compound is elaborated in the next section.
Results of the PLIFs for the activity cliff generator CHEMBL557902 plus 11 related (paired) compounds are shown in Fig. 6. The chemical structures are shown in Fig. S8 of the ESI.‡ The data matrix in Fig. 6a summarizes the protein–ligand contacts between the best two poses of 12 docked molecules and DNMT1. In this matrix, the rows represent the docked poses of the 12 molecules. The columns are the fingerprint bits indicating the amino acid residues that make at least one contact with one of the compounds. A black cell in the matrix indicates that a contact is present between the intersecting compound and amino acid residue i.e., fingerprint bit turned ‘on’. In contrast, a white cell means that there is no contact i.e., fingerprint bit turned ‘off”. Fig. 6 revealed that interactions with Gly1223 (backbone hydrogen bond donor), Glu1266 (ionic attraction) and Arg1312 (both side chain hydrogen bond acceptor and ionic attraction) were found in the active SAM-analogue (CHEMBL557902) but not in the compounds with much lower pIC50 values. Similar analyses were performed with the three remaining activity cliff generators related to SAM (Fig. S12–S14‡). It was concluded that that the loss of a hydrogen bond donor that could interact with Asp1190 is generating cliffs for CHEMBL557902, CHEMBL560106, and CHEMBL559283.
![]() | ||
Fig. 6 Summary of protein–ligand interaction fingerprint (PLIFs) analysis of the activity cliff generator CHEMBL552309 (compound N3 in Fig. 1) and 11 SAM-analogues that form activity cliffs with this compound (the chemical structures of the 11 molecules are shown in Fig. S8 of the ESI‡). For each compound the best two docking poses are represented. (a) Data matrix summarizing the protein–ligand contacts between the best two poses of 12 docked molecules and DNMT1. In this matrix, the rows represent the docked poses. The columns are the fingerprint bits indicating the amino acid residues that make at least one contact with one of the compounds. A black cell in the matrix indicates that a contact is present between the intersecting compound and amino acid residue i.e., fingerprint bit turned ‘on’. In contrast, a white cell means that there is no contact i.e., fingerprint bit turned ‘off”. (b) The statistically more significant PLIFs. A darker color means that the interaction is more associated to the active compound. |
There is no co-crystallized structure available for the most active compound CHEMBL3126646 with DNMT1 (this is the case for every non-nucleoside IDNMT1). Therefore, its precise binding region is unknown. In order to explore the putative binding zone, before docking all activity cliff forming compounds, CHEMBL3126646 was docked with DNMT1 as detailed in the Methods section. Results were compared with the experimental biochemical results and docking studies recently published for this molecule. Fig. 7 shows the optimized docking model. In this model, CHEMBL3126646 is close to but does not occupy the binding region of the co-cofactor (as predicted for other type of IDNMT1 (ref. 43 and 44)). Remarkably, a potential hydrogen bond interaction was found between the carbonyl oxygen of CHEMBL3126646 and the O′2 oxygen atom of the co-crystal SAH. The molecule is able to make hydrogen bond contacts with the backbone of Ala647, and π–π interactions (T-shape) with the side chain of Phe648 of the CXXC domain. In addition, CHEMBL3126646 makes hydrophobic interactions with the side chains of Met696, Glu698, and Ala699 of the CXXC domain (see Fig. 7 and S15 in the ESI‡ for a 3D and 2D ligand interactions representation, respectively). The possibility of this inhibitor or making ‘sandwich’ interactions with both the CXXC domain and the co-factor in DNMT1 is in agreement with the docking study reported by Valente et al.41 Therefore, it is plausible that CHEMBL3126646 inhibits DNMT1 by a mechanism we previously proposed for SGI-1027 i.e., stabilization of the autoinhibitory linker.45 This hypothesis is further supported by the experimental evidence that CHEMBL3126646 seems to do not compete with the co-factor.
The binding mode for the most active compound proposed herein also explains the activity cliffs to a large extent. The most pronounced e.g., deepest activity cliffs with compound CHEMBL3126646 (Table 3) are regioisomers with at least one ortho substitution: CHEMBL3126644, 3126647, 3126648, 3126649 and ortho/ortho regioisomer (compound 9, as numbered in the Valente et al.41 paper). In agreement with Valente et al.,41 the shape of the ortho regioisomers may not adopt the extended conformation required to stabilize inhibitory linker domain. Flexible alignment of SGI-1027 analogues with the most active compound CHEMBL3126646 (Fig. 8) clearly shows the very different shape of the more active meta/meta and other non-ortho regioisomers (Fig. 8A) as compared to the inactive ortho regioisomers of SGI-1027 (Fig. 8B). Docking of the ortho containing compounds with DNMT1 (data not shown) showed the loss of the interaction with the co-factor also highlighting this key interaction of CHEMBL3126646.
![]() | ||
Fig. 8 Flexible alignment of regioisomers of SGI-1027 (chemical structures are shown in Fig. 5) with the best docked pose of CHEMBL3126646 (balls and sticks and carbon atoms in green). (a) Non-ortho regioisomers (carbon atoms in blue). (b) ortho regioisomers [CHEMBL3126644, CHEMBL3126647, CHEMBL3126649 and ortho/ortho SGI-1027 analog (not registered in ChEMBL) (carbon atoms in red) and CHEMBL3126648 (ortho/para) (carbon atoms in yellow). Note the alignment different in the red molecules and the different orientation of the carbonyl oxygen in the both the red and the yellow molecules, which is not the case in (a). |
Preliminary regular and pharmacophore-constrained docking studies of the eight compounds related to CHEMBL3126646 (Fig. 5) were conducted with a crystallographic structure of DNMT1. The docking poses were post-processed with PLIFs as detailed in the Methods section. Results are summarized in Fig. S16a.‡ In order to explore the protein–ligand contacts that may differentiate ‘active’ from ‘inactive’ compounds, the significance analysis implemented in MOE was performed. For this analysis we considered as “active” a compound with pIC50 > 5. Fig. S16b‡ shows that there are not statistically significant differences that might distinguish active from inactive molecules. This reflects the fact that ortho regioisomers are not unable of stretching to the required extent, but the energy necessary to do so is higher, mainly due to their intermolecular interactions. Further computational analyses are required to test this hypothesis (see below section of Future directions).
The structural interpretation of the activity cliffs indicated that SAM-related analogues contain several pharmacophoric interactions that are substantial to determining its potency. Therefore, even small changes in its structure may produce deep activity cliffs. Hence, SAM-analogues may not be suitable for classical predictive approaches that assume linear relationships.
Structure-based analysis of the most relevant non-nucleoside activity cliff generator, a regioisomer of SGI-1027 developed recently, supported the hypothesis that this type of molecules may act through a stabilization of the auto-inhibitory linker domain of DNMT1. Results of the docking model are in agreement with the SAR of the deepest activity cliffs involving CHEMBL3126646. Results are also in agreement with the biochemical analysis showing that CHEMBL3126646 is not a competitive inhibitor of the co-factor.
During the course of this work we concluded that density SAS maps are convenient graphical representations that enhance the interpretation of the SAS maps. It was also highlighted the convenience of performing ‘activity landscape sweeping’ before the analysis of the activity landscape of a data set. The activity landscape sweeping presented in this work led to the exploration of local activity landscapes that provided interpretable SAR results and provided insights for the structure-based optimization of lead compounds as IDNMT1.
3D | Three-dimensional |
ALM | Activity landscape modeling |
DNMT | DNA methyltransferases |
ECFP | Extended connectivity fingerprints |
HTS | High-throughput screening |
IDNMT | Inhibitor of DNA methyltransferase |
MDS | Myelodysplastic syndrome |
MOE | Molecular operating environment |
PCA | Principal component analysis |
PDB | Protein data bank |
PLIF | Protein ligand interaction fingerprint |
QSAR | Quantitative structure–activity relationships |
RMSD | Root-mean-square deviation |
SAH | S-Adenosyl-L-homocysteine |
SAM | S-Adenosyl-L-methionine |
SAR | Structure–activity relationships |
SAS maps | Structure–activity-similarity maps |
Footnotes |
† This work is dedicated to the memory of Dr Andoni Garritz Ruiz. |
‡ Electronic supplementary information (ESI) available. See DOI: 10.1039/c5ra12339a |
This journal is © The Royal Society of Chemistry 2015 |