Wiktoria
Jedwabny
a,
Szymon
Kłossowski
b,
Trupta
Purohit
b,
Tomasz
Cierpicki
b,
Jolanta
Grembecka
*b and
Edyta
Dyguda-Kazimierowicz
*a
aDepartment of Chemistry, Wrocław University of Science and Technology, Wyb. Wyspiańskiego 27, 50-370 Wrocław, Poland. E-mail: Edyta.Dyguda@pwr.edu.pl; Tel: +48 71 320 3200
bDepartment of Pathology, University of Michigan, 1150 W. Medical Center Dr, MSRBI, Rm 4510D, Ann Arbor, MI 48109, USA. E-mail: Jolantag@umich.edu; Fax: +734 763 8764; Tel: +734 615 9319
First published on 12th September 2017
Development and binding affinity predictions of inhibitors targeting protein–protein interactions (PPI) still represent a major challenge in drug discovery efforts. This work reports application of a predictive non-empirical model of inhibitory activity for PPI inhibitors, exemplified here for small molecules targeting the menin–mixed lineage leukemia (MLL) interaction. Systematic ab initio analysis of menin–inhibitor complexes was performed, revealing the physical nature of these interactions. Notably, the non-empirical protein–ligand interaction energy comprising electrostatic multipole and approximate dispersion terms (E(10)El,MTP + EDas) produced a remarkable correlation with experimentally measured inhibitory activities and enabled accurate activity prediction for new menin–MLL inhibitors. Importantly, this relatively simple and computationally affordable non-empirical interaction energy model outperformed binding affinity predictions derived from commonly used empirical scoring functions. This study demonstrates high relevance of the non-empirical model we developed for binding affinity prediction of inhibitors targeting protein–protein interactions that are difficult to predict using empirical scoring functions.
Our recently identified inhibitors disrupting the oncogenic activity of MLL fusion proteins by blocking the protein–protein interaction between menin and MLL belong to the thienopyrimidine class of compounds.6,7 These compounds directly bind to menin at MLL binding sites with low nanomolar binding affinities. We have also demonstrated that substantial improvement in inhibitory activity resulted from incorporation of fluorine atoms into an ethyl group on the thienopyrimidine scaffold.7 In particular, the MI-2-2 compound bearing a trifluoroethyl substituent exhibited an over 20-fold increase in the binding affinity compared to its non-fluorinated counterpart.6,7 Indeed, compounds with fluorine atoms are estimated to constitute about 20–25% of currently known drugs, as fluorine substituents affect both binding affinity and physicochemical properties of small molecules.12–14 While a rational approach for designing favorable fluorine interactions with proteins has recently been proposed,10 complementary methods providing insight into the physical nature of protein–ligand interactions would possibly augment these predictions. Furthermore, development of potent small molecule inhibitors targeting protein–protein interactions still constitutes a major challenge in the drug discovery field,15–17 and therefore identification of computational methods for effective prediction of binding affinities for PPI inhibitors represents a very important task and should attract a lot of attention in the drug discovery community.
Currently available methods used to estimate inhibitory activity are mostly based on empirical scoring functions or force fields, yielding widely different predictions.18,19 Due to significant computational cost, more accurate quantum chemical estimates are impractical in high throughput drug design and their applicability to protein–ligand interactions has been rather limited.20,21 However, alternative non-empirical models could be systematically derived by partitioning the ab initio energy of the corresponding inhibitor–receptor interactions into well defined components using state-of-the-art perturbation or variational theories.22 Analytical formulas for separate binding energy terms can be the subject of subsequent approximations leading to affordable non-empirical scoring models of general applicability. We have previously shown that the electrostatic term alone was sufficient to rank polar or charged inhibitors of phenylalanine ammonia-lyase (PAL),22 leucine aminopeptidase (LAP)22 and urokinase (uPA).23 Further results characterizing essentially nonpolar inhibitors of fatty acid amide hydrolase (FAAH)24 indicated that supplementing the electrostatic multipole term with the long-range dispersion contribution was required to afford reasonable correlation with experimental binding affinity data.
In contrast to empirical scoring functions derived from limited training sets, unified non-empirical models based on long-range multipole electrostatic and dispersion interactions might be more universal and less arbitrary. Assessment of the performance of non-empirical interaction energy analysis is especially important in the context of inhibitors targeting protein–protein interactions. Compared to widely studied enzyme–inhibitor interactions, experimental and computational characterization of small molecule PPI inhibition faces numerous challenges.25 Unlike buried binding cavities present in enzymes, binding sites at protein–protein interfaces are relatively flat and solvent exposed.26 Although several empirical and semi-empirical approaches have been successfully applied to rank selected PPI inhibitors,27–30 empirical scoring methods and knowledge-based potentials have not been trained to work with ligands that bind to PPI interfaces.31,32 Therefore, models of inhibitory activity derived from ab initio quantum mechanical methods might then be particularly useful to describe the activity of PPI inhibitors more accurately as no parameterization is required here and the applicability of these methods might be more general.
This work aims to develop systematic non-empirical models of inhibitory activity for small molecule inhibitors blocking the protein–protein interaction between menin and MLL. The choice of the protein system and compounds was made to address the accuracy of description of PPI inhibition by small molecules using computational approaches. As already mentioned, we successfully applied this non-empirical model to FAAH inhibitors with the interaction energy being expressed by the long-range electrostatic and dispersion terms.24 To determine whether such a model can be applied to PPI inhibitors represents a major focus of this work.
The theoretical model of inhibitory activity reported here involves a representative model of the MLL binding site on menin (Fig. 1) and a set of menin–MLL inhibitors that we have already characterized experimentally6,7,10 (Table 1). To limit the computational cost required for reference ab initio calculations of binding energy, only a subset of menin–MLL inhibitors reported in ref. 6, 7 and 10 was selected. Compound selection was made to accommodate the menin–MLL inhibitors with distinct substituents on the thienopyrimidine scaffold and a range of activity broad enough for the resulting model to be reliable. This system was used to develop a non-empirical model for activity prediction of menin–MLL inhibitors. Predictive capabilities of this approach were tested against new thienopyrimidine inhibitors of the menin–MLL interaction, developed independently to examine the influence of varying substituents to this scaffold on their inhibitory activity. Remarkably, nearly quantitative agreement was achieved between theoretically assessed and experimentally measured IC50 values. Competitive computational cost and more favorable performance of the non-empirical method applied here over those of commonly used empirical scoring functions indicate that this method can be successfully applied to rank newly designed inhibitors targeting protein–protein interactions.
Fig. 1 Representative model of a menin binding site with an MI-2-2 inhibitor bound. The model was derived from the structure of the menin–MI-2-2 complex (4GQ4 in PDB). |
Inhibitor | Structure | IC50 [μM] |
---|---|---|
a Inhibitory activity values are taken from ref. 7. b Inhibitory activity values are taken from ref. 10. c Inhibitory activity values are taken from ref. 6. | ||
MI-2-2 | 0.046 | |
MI-859 | 0.065 | |
MI-319 | 0.082 | |
MI-2-3 | 0.092 | |
MI-836 | 0.260 | |
MI-2 | 0.45 | |
MI-273 | 0.674 | |
MI-20 | 0.75 | |
MI-2-4 | 0.765 | |
MI-326 | 0.779 | |
MI-19 | 1.200 | |
MI-333 | 1.653 | |
MI-12 | 14 | |
MI-16 | 46 | |
MI-4 | 52 | |
MI-10 | 58 | |
MI-11 | 87 | |
MI-6 | 193 |
(1) |
The subsequent contributions to the MP2 binding energy are characterized by increasing computational cost, as indicated by O(X) scaling, where N and A stand for the number of atomic orbitals and atoms, respectively (see eqn (1)).
The E(10)EL,MTP term in eqn (1) represents the long-range electrostatic multipole binding energy calculated from atomic multipole expansion.40 This term includes interactions of Mulliken atomic charges in addition to interactions of atomic dipoles, quadrupoles, hexadecapoles, etc., representing the anisotropy of atomic charge distribution.
The short-range penetration term E(10)EL,PEN is defined as E(10)EL,PEN = E(10)EL − E(10)EL,MTP, where E(10)EL represents the first-order electrostatic energy. The first-order repulsive exchange term E(10)EX is calculated from the first-order Heitler–London energy, E(10):E(10)EX = E(10) − E(10)EL. The classical induction and charge transfer terms are represented by the higher order delocalization energy, E(R0)DEL, which is obtained as the difference between the counterpoise-corrected self-consistent field (SCF) variational energy, ESCF, and the first-order Heitler-London energy, E(10). The correlation term, accounting for dispersion and exchange–dispersion interactions as well as the intramolecular correlation contribution, is defined as: E(2)CORR = EMP2 − ESCF. As emphasized in eqn (1), all the subsequent corrections to EMP2 interaction energy could be categorized into long- and short-range interactions varying with the intermolecular distance, R, as R−n(n ∈ ) and exp−γR(γ > 0), respectively. In the above equations, the zero value of the second superscript represents uncorrelated interaction energy contributions, and the E(2)CORR term denotes the inter- and intra-molecular correlation contributions.
Within the HVPT decomposition scheme, the dispersion component is included in the computationally demanding E(2)CORR correlation energy term, scaling with the fifth power of molecular size expressed by the number of atomic orbitals, N. The recently derived atom–atom potential function, EDas, fitted to benchmark values of dispersion interactions41,42 has been a far more affordable alternative to these costly calculations, as they scale with the square number of atoms O(A2), in contrast to ab initio calculations scaling at least with the fifth power of the number of orbitals, O(N5). Following the remarkable performance of the EDas function to describe noncovalent interactions in hydrogen-bonded complexes,43 we applied this approximate dispersion term in our model for inhibitory activity prediction.
Binding energy calculations involved a menin binding site represented by 15 amino acid residues (Fig. 1) selected by their closest proximity to the ligand, namely within approximately 4 Å from MI-2-2 (see Table S1, ESI,† for the distances between MI-2-2 and menin residues). Negatively charged Asp180 and positively charged His199 residues constituted an ionic pair, and such a pair was included in further calculations (i.e., as a neutral Asp180–His199 dimer formed by counter-charged residues). There were no counter-charged amino acid residues in close proximity to the remaining negatively charged residues (Glu179, Asp285 and Glu363). The dangling bonds arising from cutting the residues out of the protein structure were saturated by hydrogen atoms.
The menin–inhibitor binding energy was calculated as the sum of the above defined intermolecular energy components obtained for each inhibitor–amino acid residue pair with a modified34 version of GAMESS51 program using the 6-31G(d)52–54 basis set. Counterpoise correction was applied to eliminate the basis set superposition error.55
The performance of particular scoring methods was evaluated by means of the Pearson's correlation coefficient calculated with respect to experimentally determined inhibitory activity6,7,10 (see Table 1). Scoring functions, for which the higher score indicates the greater binding potency, were assigned to the opposite of the calculated value of the correlation coefficient to enable direct comparison with the results of non-empirical binding energy calculations, assigning lower interaction energy values to more potent inhibitors. The results were additionally compared in terms of the success rate of prediction, Npred (the predictivity). Following nonparametric statistics, the predictivity refers to the percentage of concordant pairs among all possible pairs of complexes within a given set. A concordant pair denotes two complexes with computationally evaluated relative stability being of the same sign as the relative experimental binding affinity (see ref. 70 for details).
Inhibitor | E (10)EL,MTP | E (10)EL | E (10) | E SCF | E MP2 | E (10)EL,MTP + EDas |
---|---|---|---|---|---|---|
a In units of kcal mol−1. b Percentage of successful predictions [%]. c Correlation coefficient for the correlation of the energy obtained at a given level of theory and the experimental inhibitory activity expressed as pIC50. d Standard error of estimate, in units of kcal mol−1. | ||||||
MI-2-2 | −26.0 | −48.1 | 20.3 | 2.2 | −37.6 | −100.2 |
MI-859 | −25.0 | −48.8 | 22.9 | 5.4 | −35.3 | −98.0 |
MI-319 | −25.6 | −47.0 | 19.5 | 2.6 | −35.5 | −97.7 |
MI-2-3 | −26.1 | −47.4 | 19.6 | 2.7 | −35.7 | −99.2 |
MI-836 | −25.9 | −49.6 | 20.4 | 2.8 | −37.8 | −96.8 |
MI-2 | −23.3 | −45.9 | 21.9 | 3.4 | −36.8 | −94.7 |
MI-273 | −22.1 | −49.3 | 31.9 | 14.5 | −28.0 | −101.8 |
MI-20 | −21.6 | −44.1 | 23.9 | 5.7 | −35.7 | −95.5 |
MI-2-4 | −19.9 | −48.3 | 33.4 | 17.2 | −26.8 | −92.3 |
MI-326 | −26.2 | −47.2 | 17.9 | 1.1 | −36.7 | −97.4 |
MI-19 | −22.1 | −44.1 | 22.8 | 5.0 | −34.8 | −92.3 |
MI-333 | −26.8 | −48.2 | 17.9 | 1.0 | −36.9 | −97.4 |
MI-12 | −25.7 | −43.9 | 11.7 | −3.5 | −34.6 | −80.9 |
MI-16 | −19.3 | −41.7 | 22.4 | 6.4 | −30.3 | −80.9 |
MI-4 | −23.4 | −41.6 | 14.0 | −1.0 | −32.4 | −78.5 |
MI-10 | −20.9 | −41.7 | 23.3 | 7.0 | −30.9 | −88.4 |
MI-11 | −20.9 | −41.0 | 21.0 | 5.2 | −30.8 | −86.0 |
MI-6 | −19.3 | −41.0 | 22.1 | 6.2 | −30.2 | −80.5 |
N pred | 69.3 | 81.7 | 49.0 | 55.6 | 69.9 | 81.1 |
R | −0.63 | −0.87 | 0.17 | 0.03 | −0.55 | −0.87 |
SE | 2.1 | 1.6 | 5.2 | 5.1 | 2.9 | 3.9 |
For all menin–inhibitor complexes included in this study, Tyr276 serves as a hydrogen bond donor interacting with the nitrogen atom of the thienopyrimidine ring in the menin–MLL inhibitors. Compared to the remaining residues, Tyr276 significantly contributes to the total interaction energy of all inhibitors (Table S2, ESI†). The interaction energy values associated with essentially all Tyr276–inhibitor pairs fluctuate around −15 and −6 kcal mol−1 for E(10)EL and EMP2 energy, respectively, indicating the presence of strong hydrogen bonding in all menin–inhibitor complexes studied here. In addition, Asn282, Met278, and Glu179 also appear to have a pronounced impact on inhibitor binding affinities due to their substantial contribution to the total binding energy (Table S2, ESI†). The involvement of the remaining menin residues in inhibitor binding seems to be less significant.
Fig. 2 Menin–inhibitor binding energies at different levels of theory as a function of inhibitory activity. |
Very strong correlation was obtained for E(10)EL energy, resulting in a remarkably high correlation coefficient R, equal to −0.87 (Fig. 2). Furthermore, the electrostatic multipole term E(10)EL,MTP also demonstrates correlation with the inhibitory activity of menin–MLL inhibitors (R = −0.63; Fig. 2). These results indicate that the electrostatic interaction energy appears to be sufficient to represent the differences in the menin–inhibitor binding affinities within this class of compounds. This is likely related to the nature of the protein binding site, as the inhibitor binding pocket on menin features a number of polar amino acid residues. Indeed, only four out of 15 residues included in these calculations possess non-polar side chains, while the remaining 11 amino acid residues bear polar or charged side chains (e.g., Glu179, Asp285, Glu363, Asp180, His199, Ser178, and Asn282; see Fig. 1). Accordingly, the dominant contribution of the electrostatic effects in the interaction energy is reflected by its good correlation with the binding affinity characterizing these compounds.
In contrast, the remaining components of the interaction energy, i.e., E(10) and ESCF energy terms, do not exhibit satisfactory correlation with the inhibitory activity of the thienopyrimidine class of menin–MLL inhibitors (Table 2). The lack of correlation between the inhibitory activity and E(10) interaction energy presumably arises from including the short-range interaction energy component, which is very sensitive to any inaccuracies that might occur in the modelled protein–ligand complexes. The repulsive exchange term depends exponentially on the intermolecular distance and shortening of the intermonomer separation has a large impact on the corresponding binding energy.23,72 Recent systematic analysis of 66 biomolecular complexes72 indicated that considering only the long-range electrostatic term (without taking into account the repulsive exchange term) leads to much better estimates of relative ligand binding affinities. More importantly, this phenomenon is valid even for shortened intermolecular distances resulting from force field inadequacy73 or the basis set superposition error.74 Interestingly, it has been shown that augmenting the multipole electrostatic term with EDas dispersion approximation can further improve predictions of relative stabilities within a particular set of protein–ligand complexes.24
Indeed, despite the prevalence of electrostatic interactions in menin–inhibitor complexes, the E(10)EL,MTP + EDas model, in which both electrostatic and dispersion interactions are incorporated, is characterised by the strong correlation with the inhibitory activity, reflected by the high value of the correlation coefficient (R = −0.87; Fig. 2). Therefore, it seems that the interactions of predominantly electrostatic nature could be accurately and inexpensively described if the dispersion effects are incorporated in addition to the electrostatic term. Moreover, the significantly improved predictivity of E(10)EL,MTP + EDas compared to that of EMP2 energy is probably due to the absence of the repulsive exchange term, E(10)EX, which is included at the MP2 level of theory. As mentioned above, this particular term is very sensitive to the presence of intermolecular short contacts commonly occurring in modelled structures of protein–ligand complexes.23,72 In fact, the preparation of the menin–inhibitor complexes applied herein involved a simple optimization procedure employing rigid receptor representation. More sophisticated refinement protocols were avoided to verify the applicability of the proposed approach to quick assessment of the structural modifications in, e.g., the lead optimization stage. Possible unresolved steric clashes have probably resulted in the overly magnified repulsive exchange term, E(10)EX, and the loss of correlation at the higher levels of theory (up to the EMP2 energy), which include this particular interaction energy contribution.
In general, dispersion interactions are not absolutely required to properly describe the inhibitor binding forces within systems with dominant electrostatic interactions, such as menin–inhibitor complexes. However, a simple electrostatic model of interaction energy is clearly insufficient in the case of nonpolar hydrophobic systems.24 The performance of the E(10)EL,MTP + EDas model proposed herein appears to be satisfactory regardless of the physical nature of the inhibitory activity, and thus such a model could be used universally in the inhibitor design process without an unnecessary increase in the computational cost.
Differences in ligand binding affinities might arise from the interactions of inhibitors with a subset of amino acid residues in the binding site. The significant contribution to the total binding energy does not necessarily indicate that a given amino acid residue is responsible for the observed inhibitory activity differences, as such interaction could be similar for all tested inhibitors. Residues that contribute the most to the relative inhibitory activity might be identified by monitoring the changes in correlation coefficients upon removal of particular residues from the binding site model (Table S3, ESI†). Accordingly, the interaction of inhibitors with Glu179, Asp285, Tyr276, and Tyr323 residues seems to be the most important in terms of the differences in the binding potencies. Upon removal of these residues from the model of the menin binding site, the values of the corresponding correlation coefficients become substantially worse (Table S3, ESI†). Therefore, the interaction with Glu179, Asp285, Tyr276, and Tyr323 residues appears to influence the relative binding affinities and should be carefully studied while designing novel menin–MLL inhibitors.
Scoring function | R | N pred |
---|---|---|
a Correlation coefficient between the calculated binding affinity estimate and the experimental inhibitory activity expressed as pIC50. b Percentage of successful predictions [%]. | ||
E (10)EL,MTP + EDas | −0.87 | 81.1 |
LigScore1 | −0.81 | 75.2 |
Jain | −0.80 | 77.8 |
E binding (Discovery Studio 3.5) | −0.79 | 74.5 |
PLP2 | −0.79 | 80.4 |
PLP1 | −0.74 | 77.8 |
PMF04 | −0.65 | 73.2 |
Ludi2 | −0.62 | 72.6 |
LigScore2 | −0.43 | 69.9 |
Ludi1 | −0.40 | 58.8 |
Ludi3 | −0.23 | 54.3 |
PMF | +0.24 | 41.2 |
Goldscore | −0.64 | 69.9 |
ASP | −0.62 | 70.6 |
Chemscore | −0.28 | 60.1 |
Binding affinity (AutoDock Vina) | −0.67 | 73.2 |
While plausible predictions were obtained with selected empirical scoring approaches, the E(10)EL,MTP + EDas model of inhibitory activity appears to outperform the classical scoring functions presented herein, resulting in remarkable agreement with the experimental data characterized by R = −0.87 and Npred = 81.1%. The computational cost of the E(10)EL,MTP + EDas function is as attainable as that of empirical scoring models.24 Unlike the latter, no parameterization employing empirical data was involved to develop the E(10)EL,MTP + EDas model. Successful prediction of the relative binding affinities of menin–MLL inhibitors suggests that small molecule inhibitors that block protein–protein interactions are within the range of applicability of this particular non-empirical scoring method.
(2) |
To examine the applicability of this method to predict the activity of new menin–MLL inhibitors, seven novel compounds with varying substituents on the thienopyrimidine scaffold of MI-2-2 were selected. The development, synthesis and characterization of these compounds were performed as a part of the structure–activity studies directed at optimization of the lead compound. The structures and inhibitory activities of seven novel menin–MLL inhibitors are presented in Table 4. With the exception of the least potent compound 7, the range of inhibitory activity characterizing these novel menin–MLL inhibitors is similar to the reference set of compounds listed in Table 1. The highest potency was obtained for compounds 1 and 2 (IC50 = 193 nM and 650 nM, respectively), while other compounds showed inhibitory activity in the micromolar range (Table 4).
The predictive capabilities of eqn (2) were then tested against the set of novel menin–MLL inhibitors (Table 4), and the calculated interaction energy for the new compounds was used to predict their theoretical IC50 values. The systematic analysis of the interaction energy values calculated at the subsequent levels of theory for these seven menin–MLL complexes is provided in Table S4, ESI.† As observed in the case of the initial set of 18 menin–MLL inhibitors (Table 2), the EMP2 binding energy is characterized by negative values, ranging from −34.5 to −30.5 kcal mol−1 (Table S4, ESI†). The interactions formed by these new menin–MLL inhibitors are essentially electrostatic in nature, which is reflected by E(10)EL binding energy values exceeding −35 kcal mol−1. Accounting for the repulsive E(10)EX term, possibly overestimated due to shortened intermonomer distances,23 results in positive values of both E(10) and the subsequent ESCF binding energy terms. The delocalization and correlation contributions are both required to restore the stabilizing nature of the interaction energy obtained at the reference MP2 level of theory. The correlation between the calculated EMP2 binding energy and the experimental binding potency is reflected by the correlation coefficient R = −0.65 (Table S4, ESI†). Noticeably, the R values of −0.83 and −0.84 characterize the electrostatic term, E(10)EL, and the E(10)EL,MTP + EDas energy values, respectively, confirming the prevalence of electrostatic interactions at the inhibitor–protein interface and the adequacy of the E(10)EL,MTP + EDas function for the description of menin–MLL inhibition.
The performance of empirical scoring functions already considered for the training set of menin–MLL inhibitors (Table 3) was also tested on the set of seven inhibitors analyzed herein. The predictive capabilities of these empirical models are presented in Table S5, ESI.† Similarly to the results obtained in the training set of 18 inhibitors (Table 3), only two of the empirical scoring approaches considered here yielded predictions characterized with R correlation coefficients exceeding or close to 0.8 (Table S5, ESI†). The best estimates of the experimental inhibitory activity of seven menin–MLL inhibitors were obtained with the LigScore2 scoring function implemented in Discovery Studio (R = −0.84 and Npred = 66.7%; Table S5, ESI†). Most of the empirical scoring models tested herein exhibited either weak or no agreement with the experimental data. The fact that all but one of the inhibitors within the test set are fluorine-rich compounds might contribute to the poor performance of the empirical functions when applied to this particular set. Unlike most of the empirical approaches, the non-empirical E(10)EL,MTP + EDas model also appears to exhibit satisfactory predictive capabilities over the test set (R = −0.84 and Npred = 66.7%; Table S4, ESI†).
As already noted, the E(10)EL,MTP + EDas energy was employed for the prediction of binding potency of 7 new menin–MLL inhibitors according to eqn (2). The comparison of the calculated and experimental pIC50 values for new menin–MLL inhibitors is provided in Fig. 3, resulting in a very good correlation described by R = −0.84. As shown in Fig. 3, the inhibitory activity of all these compounds reported here is predicted with nearly quantitative accuracy. Compounds 1–5 are relatively similar in terms of their binding potency (Table 4). When applied to the inhibition constant, an order of magnitude change is associated with approximately 1.3 kcal mol−1 increment in the free energy of binding.75 Therefore, it would not be reasonable to expect that such small differences in the pIC50 values might be very accurately captured by any computational method, as they exceed the accuracy of most quantum chemical calculations. Accordingly, the performance of the E(10)EL,MTP + EDas scoring function in predicting the inhibitory activity of novel menin–MLL inhibitors is adequate, with the binding potency of these compounds reproduced with nearly quantitative agreement. Considering that six out of seven inhibitors discussed here are fluorine-rich compounds, these results also suggest that the E(10)EL,MTP + EDas approach is capable of capturing the changes in the binding properties resulting from fluorine substitutions. Overall, very good agreement between the experimental and theoretical data presented herein demonstrates the usefulness and applicability of the non-empirical interaction energy calculations for successful predictions of inhibitory activity of menin–MLL inhibitors and possibly other inhibitors targeting protein–protein interactions.
As revealed by the analysis of the interaction energy, binding of menin–MLL inhibitors is essentially electrostatic in nature. The non-empirical E(10)EL,MTP + EDas model has already been proven to be applicable to mostly nonpolar enzyme–inhibitor complexes.24 This contribution indicates the relevance of the E(10)EL,MTP + EDas model to polar or charged complexes. The range of applicability of this particular model for binding affinity prediction seems to encompass inhibitors of protein–protein interactions and ligands with multiple fluorine substitutions. Finally, the successful prediction of inhibitory activity of new menin–MLL inhibitors validates the usefulness of the E(10)EL,MTP + EDas model in the structure-based development of novel inhibitors, including inhibitors of protein–protein interactions, for which majority of empirical scoring functions provide rather poor predictions of ligand binding affinity. As demonstrated by nearly quantitative agreement between the predicted and experimental binding potency within the validation set of inhibitors, the E(10)EL,MTP + EDas model allows for the inexpensive and reliable assessment of the influence of structural changes made to the parent scaffold on the inhibitory activity of the resulting compounds. Further computational efforts will be directed at determination of the applicability of the E(10)EL,MTP + EDas model for testing of prospective PPI inhibitors characterized with more extensive structural differences.
Footnote |
† Electronic Supplementary Information (ESI) available: Detailed characterization of the compound collection; structural characteristics of the menin–MI-2-2 complex; and additional computational results. See DOI: 10.1039/c7md00170c |
This journal is © The Royal Society of Chemistry 2017 |