Duc Toan
Truong
ab,
Kiet
Ho
c,
Chinh Tam
Thai
d,
Dung
Do Thi Mai
d,
Nguyen Minh
Tam
e,
Viet Bac T.
Phung
f and
Minh Tho
Nguyen
*ab
aLaboratory for Chemical Computation and Modeling, Institute for Computational Science and Artificial Intelligence, Van Lang University, Ho Chi Minh City, Vietnam. E-mail: toan.truongduc@vlu.edu.vn; minhtho.nguyen@vlu.edu.vn
bFaculty of Applied Technology, School of Technology, Van Lang University, Ho Chi Minh City, Vietnam
cInstitute for Computational Science and Technology (ICST), Ho Chi Minh City, Vietnam
dHanoi University of Pharmacy, 13-15 Le Thanh Tong, Hoan Kiem, Ha Noi, Vietnam
eFaculty of Basic Sciences, University of Phan Thiet, 225 Nguyen Thong, Phan Thiet City, Binh Thuan Province, Vietnam
fCenter for Environmental Intelligence and College of Engineering and Computer Science, Vin University, Gia Lam District, Hanoi, Vietnam
First published on 9th June 2025
The major side effects of in-use drugs, such as gefitinib, erlotinib and osimetinib, have led to inherent limitations and considerable concerns regarding the use of tyrosine kinase inhibitors (TKIs) in cancer treatment. Natural compounds can effectively inhibit the expression of epidermal growth factor receptor (EGFR) tyrosine kinase protein; therefore, they are considered not only a functional product but also a substitute in cancer therapy. Based on the experimental findings of the bioactivities of natural compounds extracted from Ganoderma lucidum belonging to the family Ganodermataceae, which are commonly known as lingzhi and have been used since ancient times in Asian traditional medicine, we performed a theoretical study on the anti-tumor abilities of these colossolactone derivatives. Our work aims to understand the molecular interactions between a lactone compound and an EGFR intracellular protein, a common target in the development of TKI cancer drugs. A series of 16 colossolactone derivatives were placed in either the ATP-competition region or the allosteric active and inactive sites to explore their binding mechanism and rank their binding affinities. The latter was determined using steered molecular dynamics simulations and the umbrella sampling method. Of the 16 natural derivatives, colossolactone H was found to bind strongly to the allosteric pocket of EGFR-TKI and did not compete with the first-generation TKIs, which prefer to interact with the ATP region of the EGFR active state. The binding affinity of this lactone was 16 kcal mol−1. Our calculated results offer a rational explanation for previous experimental (in vivo) tests and promote the use of colossolactone H as a natural compound, providing an efficient synergistic drug combination for various cancer treatments.
In 2002, Stamos et al.9 reported the first structural information on the interactions between erlotinib and the EGFR-TK domain. Their finding suggested that erlotinib can take on an active conformation of the receptor without the activation of the phosphorylation loop.9 This provided an insight into the binding mode of erlotinib and the mechanism of activation by receptor dimerization. Two years later, Wood et al.10 reported a second interactive structure of the EGFR-TK domain with the lapatinib drug, suggesting that it seemingly represents an authentic inactive form of the enzyme.10 The inactive state is caused by an outward rotation of the C-helix, which interrupts the salt-bridge connecting the active site of Lys745 and Glu762 in the C-helix. This rotation also stabilizes the structure with a helical turn, which is part of the activation loop.11 The activation of the EGFR kinase can arise from either a ligand-induced dimerization or an oncogenic mutation. When the ligand binds to the extracellular region of the receptor, the intracellular kinase domain undergoes asymmetric dimerization in which the C-lobe of one molecule binds to the N-lobe of the other. Such an interaction essentially pushes the C-helix inward, thereby kinetically establishing an active state. Another way to activate EGFR kinase is through oncogenic mutations. The N-terminal segment of the activation loop forms a narrow helix around Leu858, which stabilizes the inactive state of the EGFR-TK domain. Thus, the L858R mutation releases the C-helix, leading to the formation of the active state of the EGFR-TK domain.12,13
Clinical studies conducted over the past two decades have demonstrated the efficacy of targeted EGFR therapy using tyrosine kinase inhibitors (TKIs). It effectively inhibits tumor growth and increases the survival rates among patients.14 For now, four well-known generations of EGFR-TKIs have been developed for the treatment of non-small cell lung cancer (NSCLC) in which the EGFR acts as a primary driver gene.15 The first generation of EGFR-TKIs includes the Gefitinib drug16–18 (which was approved by the US Food and Drug Administration in 2003), Erlotinib,19 Lapatinib,20 and Icotinib.21 The second generation of EGFR-TKIs possesses a side chain that can form an irreversible bond with Cys797,22 such as the Afatinib23 and Dacomitinib24 drugs. The third generation was created using a new aminopyrimidine framework that can selectively act on mutations and irreversibly bind to the TK domain.15 Prominent representatives of this generation include the Osimertinib,25 Rociletinib,26 Olmutinib27 and Avitinib28,29 drugs. Extensive investigations are currently conducted into the fourth-generation medications whose main purpose is to address various resistant mutations, such as the C797S, T790M and L858R.30 These drugs function as EGFR allosteric inhibitors (EAI) instead of inhibiting the ATP competition of the EGFR-TK domain, such as in the case of earlier generations.31,32 In 2019, the EAI-045 was reported as the first EAI.33
However, natural products have long been a plentiful resource in the pharmaceutical industry, especially for drug innovation and upgrading. Several previous reports have pointed out that some natural derivatives can inhibit strong cancer mutations following the activation of the EGFR signaling pathways.34–36 In the context that synthetic compounds can cause side effects and drug resistance in the EGFR-TK domain, the discovery and development of EGFR-TKI compounds derived from natural sources are of great importance, especially for developing countries possessing a rich inventory of natural products.
Medicinal mushroom species belonging to the genus Ganoderma in the family Ganodermataceae have been used since ancient times in Asian traditional medicine, such as Ganoderma lucidum, commonly known as lingzhi. The health benefits of chemical derivatives from Ganoderma species were investigated, of which the lanostane triterpenoids emerged as a significant metabolite category.37 Within the Ganoderma family, the Ganoderma colossum, which stands out as a distinctive member, is characterized by yellow basidiocarps on its surface38 and possesses unique biological properties, such as antibacterial,39 anti-cancer,40 and anti-HIV41 activities. Earlier investigations conducted by Isaka and colleagues42,43 revealed that Ganoderma colossum is a rich source of distinctive triterpenoids containing a diversity of structures and physiological activities. More interestingly, the fruiting bodies of this species have been known to produce a unique lanostane-type triterpene, which is characterized by the presence of six-membered, even seven-membered, lactone rings, named colossolactones.44 The chemical structures of colossolactones are characterized by a steroidal framework, which can include variations, such as oxa-A-homo-steroidal or oxa-A,B-dihomo-steroidal structures. Until now, the structures of a series of 16 colossolactons in Ganoderma colossum have been revealed, including colossolactones A–H,44,45 colossolactones I–VIII41,46 and the newest one, namely the colossolactone J.43 The chemical structures and some related information of these 16 lactones are presented in Table 1. These compounds are further diversified through modifications in functional groups, including dihydroxy alkenyl and lactone residues, which significantly enhance their biological activities. In what follows, for the sake of simplification, the term lactone stands for colossolactone.
The lactones V, VI and E (cf.Table 1) were established to exhibit inhibitory activities against the HIV-1 protease with IC50 values of 9.0, >100 and 8.0 μg mL−1, respectively.41 Although lactone E and 23-hydroxy-colossolactone E perform activities against the Bacillus subtilis and Pseudomonas syringae diseases, the B does not show any antibacterial activity.39 The E shows an antimalarial activity against Plasmodium falciparum with IC50 values of 10.0 μM,47 while the IV exerts anticancer activities against breast (MCF-7), cervix (HeLa), colorectal (HCT-116) and liver (HepG2) cancer cells with IC50 values ranging from 4.9 to 64.2 μM, and with an R-fraction of less than 47.5%.48
The lactone structures shown in Table 1 are renowned for their distinctive ability to enhance efficacy when combined with established chemotherapeutic agents. The presence of lactone G strengthens the anticancer effects of 5-fluorouracil (5-FU) and gemcitabine (GCB) in colorectal cancer cells. The combination of G with the latter drugs results in a synergistic effect, improving the cell cycle arrest and apoptosis in cancer cells.49 Beyond the treatment of colorectal cancer, lactones (G–H) have been found to reinforce the capacities of other anticancer agents, such as the gefitinib and sorafenib drugs, in the fight against non-small cell lung cancer and hepatocellular carcinoma, respectively.45,50 These findings demonstrate the possibilities of implementing these natural lactone compounds as complementary agents in various cancer therapies. They also emphasize the need for a better understanding of their chemical mechanisms and clinical activities.
Over the past decade, advances in computational methods have allowed us to quantitatively explore the interactions between small compounds and the EGFR intracellular domain51,52 and thereby obtain appropriate information, leading to relevant treatment. Numerous promising compounds were identified in various experimental studies, and their molecular level interaction dynamics were revealed through computational simulations. For instance, Nasser et al.53 discovered a novel class of pyrimidine-5-carbonitrile derivatives that inhibit the TK domain of EGFR wild-type and EGFR T790M mutations. Truong et al.54 combined umbrella sampling and steered molecular dynamics simulations to explain the EGFR inhibition mechanism of new imidazole[1,5-a]pyridine derivatives. Different teams55–57 used docking computations and molecular dynamics simulations to model the activities of new quinazoline derivatives targeted at EGFR.
In another approach, chemical derivatives isolated from natural sources were identified by computations as effective EGFR-TKIs, such as natural flavonoids, that can inhibit EGFR-TK.58 In addition, machine learning models were also applied along with docking and molecular dynamics simulations to the traditional Chinese medicine database to determine the potential candidates for EGFR-TKIs.59 Colossolactone compounds were also considered in previous computational studies by Rangsinth et al.60 and Jana et al.61 that targeted the SARS-CoV-2 main protease. Notably, assay experiments have shown that these derivatives can effectively reduce the size of cancerous tumors when they are used in combination with the conventional drug gefitinib.45 However, the atomic-level inhibition mechanism related to the properties of lactone compounds has not yet been elucidated.
In this context, motivated by the reasons mentioned above, we set out to carry out a theoretical investigation on the molecular interactions between all available colossolactone compounds, identified to our best knowledge (cf.Table 1), with the EGFR intracellular protein using computational approaches. In this study, we aim to leverage computational analyses, for the first time, on the binding capacities of a series of 16 colossolactone derivatives targeting three specific regions of the EGFR, namely, (a) the ATP-binding region of the EGFR active state, (b) the ATP-binding region of the EGFR inactive state, and (c) the allosteric region of the EGFR inactive state. In each of the 48 ligand–protein complexes considered, we follow a computational strategy in carrying out the following steps: (i) generation of initial conformation by molecular docking simulations, (ii) sampling of ligand–protein conformations by molecular dynamics (MD) simulations, (iii) evaluation of binding affinities by the steered molecular dynamic simulations, and finally (iv) computation of the absolute binding free energy using the umbrella sampling MD method.
The computed results help us open an avenue for utilizing lactone derivatives as potential scaffolds in the development of new drugs (targeting allosteric sites). More impressively, of the 16 natural derivatives displayed in Table 1, lactone H binds strongly to the allosteric pocket and does not compete with tyrosine kinase inhibitors (TKIs), which prefer only the ATP region of an EGFR active state. Overall, we offer a rational explanation for the results of previous experimental (in vivo) testing and promote the colossolactone H as a natural compound, leading to an effective use in synergistic drug combinations.45,49,50
The 3D conformations of the EGFR-TK domain are downloaded from the RCSB Protein Data Bank (RCSB PDB) under the codes 2GS6 and 2GS7, representing the active and inactive states, respectively. The missing residues are constructed using the MODELLER server66,67 after the removal of all water molecules. The geometric structures of the 16 lactone compounds considered are taken from the PubChem database,68 namely lactones I, II, III, IV, V, VI, VII, VIII, A, B, C, D, E, F, G and H, and all their corresponding 2D structures, as shown in detail in Table 1. The restricted molecular electrostatic potential69 (RESP) method is utilized to fit the atomic net charges in each compound, based on density functional theory (DFT) calculations performed using Gaussian 16 (G16) software,70 using the hybrid B3LYP functional in conjunction with the 6-31+G(d,p) basis set. AMBER Tools71 with an antechamber module is then used to convert the G16 results, including the geometries of the compounds considered, into Gromacs input files. All frontier orbitals (HOMO and LUMO) of the ligands are plotted, as illustrated in Fig. S1 of the ESI.†
Three-dimensional structures of EGFR in both active and inactive states, along with the binding positions of colossolactone H, are illustrated in Fig. 1 to highlight the conformational differences and potential interaction sites between the ligand and the protein.
To visualize the outcome data of the MD simulations method, the number of hydrogen bonds, numcount and interaction energy between the ligand and receptor were computed as performed in our previous studies.54 The free energy surface (FES) is crucial information for understanding the thermodynamics and kinetic mechanism of a bio-macromolecular complex. In this study, all FES maps are constructed by two components: the ligand's solvent accessible surface areas (SASA) and the number of contacts between the ligand and protein. These quantities are computed under the support of Gromacs packages: gmx hbond, gmx gmx sham, gmx sasa, and gmx mindist.
The docking energy of the lactones considered is provided in Tables 2–4 based on an analysis of the initial docked conformations. The best conformation with the lowest docking energy is subsequently extracted and further analyzed. When the EGFR is located in its active state, I, III, VII and F hold interaction energies below −9.0 kcal mol−1. Next, lactone VI has a docking energy smaller than −6.6 kcal mol−1. Despite having the largest number of hydrogen bonds, the docking energy of A (E ∼ −7.6 kcal mol−1) is larger than that of the compounds that do not form any hydrogen bond with the receptor, such as III (E = −9.4 kcal mol−1), C (E = −8.1 kcal mol−1), D (E = −8.7 kcal mol−1) and E (E = −8.1 kcal mol−1). This clearly demonstrates that no relationship between the number of hydrogen bonds and the docking energy can be established, and the same conclusion is reached in the case of non-bonding contacts.
ATP binding site (active state) | |||||||||
---|---|---|---|---|---|---|---|---|---|
N 0 | Index | Docking energy (kcal mol−1) | Mindist to T790 (nm) | Number of contact | Number of hydrogen bonds | Coulomb potential (kcal mol−1) | VDW potential (kcal mol−1) | SASA (nm2) | Interaction energy (kcal mol−1) |
Note: computed docking energies and averaged numerical data from the last 20 ns MD simulations: minimum distances between the ligands and local residue T790–Mindist (nm), number of contacts between the ligands and protein, number of hydrogen bonds formed between the ligands and protein, electrostatic potential energies, Coulomb potential (kcal mol−1), Lennard-Jones potential energies, VDW potential (kcal mol−1), solvent accessible surface areas (SASA, nm2), and interaction energy (kcal mol−1). The average value of the minimum distance between the ligand and T790 was calculated when the ligand fluctuated in the ATP region of the EGFR mutant. All 16 derivatives were stable in the EGFR binding site after 100 ns MD simulations. | |||||||||
1 | I | −9.1 | 0.23 | 2496 | 0.1 | −2.51 | −36.3 | 7.06 | −38.8 |
2 | II | −8.7 | 0.22 | 2508 | 0.52 | −1.71 | −38.1 | 7.17 | −39.8 |
3 | III | −9.4 | 0.22 | 2506 | 0.37 | −3.77 | −38.3 | 7.39 | −42.1 |
4 | IV | −8.5 | 0.31 | 2481 | 0.73 | −4.53 | −36.9 | 7.43 | −41.4 |
5 | V | −7.4 | 0.44 | 2477 | 0.74 | −3.06 | −37.5 | 9.28 | −40.6 |
6 | VI | −6.6 | 0.3 | 2449 | 1.21 | −5.11 | −32.2 | 9.37 | −37.3 |
7 | VII | −9.2 | 0.28 | 2478 | 0.57 | −8.47 | −37.1 | 8.52 | −45.6 |
8 | VIII | −7.3 | 0.24 | 2507 | 0.44 | −1.6 | −42.3 | 7.98 | −43.9 |
9 | A | −7.6 | 0.24 | 2533 | 0.35 | −8.06 | −42.3 | 7.9 | −50.4 |
10 | B | −8.1 | 0.24 | 2509 | 0.1 | −3.71 | −39.1 | 7.73 | −42.8 |
11 | C | −8.1 | 0.61 | 2427 | 0.47 | −1.2 | −28.5 | 7.78 | −29.7 |
12 | D | −8.7 | 0.26 | 2471 | 0.08 | −6.88 | −34 | 7.34 | −40.9 |
13 | E | −8.1 | 0.26 | 2516 | 0.05 | −1.83 | −43.1 | 7.9 | −44.9 |
14 | F | −9.2 | 0.48 | 2473 | 2.28 | −18.2 | −36.7 | 7.8 | −54.9 |
15 | G | −8.2 | 0.25 | 2483 | 0.07 | −4.31 | −37.8 | 7.87 | −42.1 |
16 | H | −8.5 | 0.24 | 2531 | 0.53 | −2.89 | −45.5 | 8 | −48.4 |
ATP binding site (inactive state) | |||||||||
---|---|---|---|---|---|---|---|---|---|
N 0 | Index | Docking energy (kcal mol−1) | Mindist to T790 (nm) | Number of contact | Number of hydrogen bonds | Coulomb potential (kcal mol−1) | VDW potential (kcal mol−1) | SASA (nm2) | Interaction energy (kcal mol−1) |
Note: computed docking energies and averaged numerical data from the last 20 ns MD simulations: minimum distances between the ligands and local residue T790–Mindist (nm), number of contacts between the ligands and protein, number of hydrogen bonds between the ligands and protein, electrostatic potential energies, Coulomb potential (kcal mol−1), Lennard-Jones potential energies, VDW potential (kcal mol−1), solvent accessible surface areas (SASA, nm2), and interaction energy (kcal mol−1). The average value of the minimum distance between the ligand and T790 was calculated when the ligand fluctuated in the ATP binding region of the inactive EGFR mutant. All 16 derivatives were stable in the EGFR binding site after 100 ns MD simulations. | |||||||||
1 | I | −10.2 | 0.25 | 2833 | 1.63 | −43.08 | −43.1 | 7.06 | −86.2 |
2 | II | −10.1 | 0.25 | 2422 | 0.04 | −36.76 | −36.8 | 7.04 | −73.5 |
3 | III | −10.2 | 0.23 | 3396 | 0.01 | −51.85 | −51.8 | 7.23 | −103.7 |
4 | IV | −9.2 | 1.00 | 1671 | 1.64 | −28.74 | −28.7 | 7.25 | −57.5 |
5 | V | −7.7 | 0.95 | 1876 | 1.04 | −34.62 | −34.6 | 8.12 | −69.3 |
6 | VI | −6.8 | 0.77 | 2055 | 0.95 | −34.95 | −35.0 | 7.90 | −69.9 |
7 | VII | −7.5 | 0.30 | 2571 | 1.23 | −40.56 | −40.6 | 8.26 | −81.1 |
8 | VIII | −9.0 | 0.24 | 2995 | 1.31 | −45.44 | −45.4 | 7.70 | −90.9 |
9 | A | −8.2 | 0.85 | 1894 | 1.39 | −29.42 | −29.4 | 7.34 | −58.8 |
10 | B | −9.7 | 0.67 | 2548 | 1.77 | −40.00 | −40.0 | 7.60 | −80.0 |
11 | C | −8.0 | 0.23 | 3118 | 0.00 | −49.03 | −49.0 | 7.66 | −98.1 |
12 | D | −10.2 | 0.24 | 2324 | 1.26 | −37.44 | −37.4 | 7.24 | −74.9 |
13 | E | −10.4 | 1.25 | 944 | 0.61 | −17.85 | −17.8 | 7.63 | −35.7 |
14 | F | −9.0 | 0.58 | 2255 | 0.62 | −40.42 | −40.4 | 7.91 | −80.8 |
15 | G | −10.2 | 0.80 | 1712 | 0.25 | −28.87 | −28.9 | 7.79 | −57.7 |
16 | H | −9.9 | 0.38 | 2286 | 1.15 | −36.79 | −36.8 | 7.88 | −73.6 |
Allosteric pocket (inactive state) | |||||||||
---|---|---|---|---|---|---|---|---|---|
N 0 | Index | Docking energy (kcal mol−1) | Mindist to T790 (nm) | Number of contact | Number of hydrogen bonds | Coulomb potential (kcal mol−1) | VDW potential (kcal mol−1) | SASA (nm2) | Interaction energy (kcal mol−1) |
Note: computed docking energies and averaged numerical data from the last 20 ns MD simulations: minimum distances between the ligands and local residue K745–Mindist (nm), number of contacts between ligands and protein, number of hydrogen bonds between ligands and protein, electrostatic potential energies, Coulomb potential (kcal mol−1), Lennard-Jones potential energies, VDW potential (kcal mol−1), solvent accessible surface areas (SASA, nm2), and interaction energy (kcal mol−1). The average value of the minimum distance between the ligand and T790 was calculated when the ligand fluctuated in the allosteric pocket of the EGFR mutant. All 16 derivatives considered were stable in the EGFR allosteric area after 100 ns MD simulations. | |||||||||
1 | I | −9.0 | 0.28 | 3282 | 0.83 | −7.47 | −50.9 | 7.13 | −58.4 |
2 | II | −8.3 | 0.24 | 3147 | 0.94 | −10 | −46.9 | 7.2 | −56.9 |
3 | III | −8.9 | 0.33 | 3196 | 0.72 | −6.88 | −51.2 | 7.38 | −58.1 |
4 | IV | −8.9 | 0.27 | 3138 | 0.35 | −4.55 | −49 | 7.32 | −53.6 |
5 | V | −6.9 | 0.71 | 2452 | 1.25 | −8.23 | −39.3 | 9.12 | −47.5 |
6 | VI | −7.5 | 0.59 | 2475 | 0.5 | −7.1 | −39.6 | 9.21 | −46.7 |
7 | VII | −7.3 | 0.81 | 2359 | 0.1 | −3.75 | −37.7 | 8.26 | −41.5 |
8 | VIII | −9.0 | 0.24 | 3061 | 0.11 | −7.11 | −50.2 | 7.77 | −57.3 |
9 | A | −7.9 | 0.56 | 3025 | 2.15 | −18.5 | −43.5 | 7.68 | −62.0 |
10 | B | −8.1 | 0.25 | 3234 | 0.71 | −7.35 | −50.2 | 7.82 | −57.6 |
11 | C | −8.4 | 0.29 | 3393 | 0.92 | −11.3 | −53.8 | 7.71 | −65.1 |
12 | D | −9.8 | 0.25 | 3060 | 0.51 | −5.26 | −48.7 | 7.24 | −54.0 |
13 | E | −9.1 | 0.22 | 3069 | 0.18 | −6.03 | −49.5 | 7.8 | −55.5 |
14 | F | −8.0 | 0.22 | 3109 | 0.87 | −8.3 | −50.8 | 7.92 | −59.1 |
15 | G | −8.6 | 0.23 | 3201 | 0.37 | −7.9 | −52 | 7.84 | −59.9 |
16 | H | −8.4 | 0.23 | 3270 | 0.004 | −5.31 | −53.5 | 7.88 | −58.8 |
All derivatives appear to be stabilized within the receptor binding site in view of the small average distances to the local residues, as illustrated in Table S3 (ESI†). At the ATP binding site in the active state, the number of contacts between lactones and the EGFR-TK domain amounts to over 2000, and the largest averaged number of hydrogen bonds is about 3. With EGFR in the inactive state, except for E, which involves only 944 contacts, each of the remaining lactones induces more than 1600 contacts. Regarding the number of hydrogen bonds formed, C did not form any. However, several lactones exhibit more than 1 hydrogen bond, such as colossolactones I, IV, V, VII, VIII, A, B, D and H.
Meanwhile, the number of contacts initiated by those ligands and residues in the allosteric region of the receptor goes mostly over 3000 contacts, but the number of hydrogen bonds remains not larger than 1, except for V and A, which have averages of 1.25 and 2.15 bonds, respectively. In part, the larger number of contacts suggests that the ligand is very hydrophobic. Tables 2–4 also list the SASA values of lactones within the complexes, ranging from 7000 to 9000 (nm2) at both binding sites.
The Coulombic interaction energy and Lennard-Jones energy are two well-known parameters that can be used to quantitatively evaluate the interactions between chemical objects, and they can be extracted from MD simulations. Basically, the more negative they are, the stronger the interactions between the objects, or in other words, the more they prefer to pair with each other. The last two columns of Tables 2–4 show the average values of Coulombic interaction energy and Lennard-Jones energy between the EGFR-TK domain and the lactones at the two binding sites, and all of them are actually negative values for both the ATP and allosteric sites. When the interactions of ligands occur at the ATP binding sites (both for active and inactive states) and the allosteric site, the correlation coefficients between the number of contacts and Lennard-Jones energy are up to 0.93, 0.98 and 0.94, respectively.
However, when paying attention to the minimum distances (cf. mindists) of lactones with the key residues in binding sites, we find that no ligand has a mindist value to any critical residue greater than 1.0 nm, except for the distance of 1.04 nm between lactone C and residue Leu859. Information on the distances for the series of 16 lactones at the ATP binding site is shown in Table S1 (ESI,† active state) and Table S2 (ESI,† inactive state). In particular, the mindist values of residues Gly721, Lys745, Thr790, Gln791, Leu792, Met793, Cys797 and Arg841 for the lactones are all smaller than 0.5 nm. Calculated results indicate the proximity of the residue Ala722 to lactones IV, V, VI, and VII, with mindist <0.44 nm. During the simulation, C consistently maintains a distance greater than 0.5 nm from the key residues. This implies that C may not be capable of inhibiting this protein by competing with ATP. In the inactive state, as shown in Table S2 (ESI†), most lactones maintain proximity to key residues, with mindist values being predominantly smaller than 0.5 nm. Exceptions arise for residue Leu858, where most of the lactones show distances exceeding 1.0 nm. Lactones I, III, VII and D consistently maintain tight interactions with residues Thr790, Leu792, Met793, Cys797 and Arg841, all with mindist values smaller than 0.3 nm. However, E exhibits particularly large distances to residues Thr790, Gln791, Leu792, Met793 and Leu858, ranging from 0.9 nm to over 1.5 nm, which suggests a weaker inhibitory potential in this state.
As illustrated in Table S3 (ESI†), the minimum distance between the lactones and some key residues in the allosteric region can be observed. Lys745, Ile759, Glu762 and Leu788 emerge as the preferred residues of all 16 lactones due to their mindist values of around 0.3 nm. On closer inspection, the mindist parameters of V, VI, VII and A reveal estimated values exceeding 0.5 nm. Some of the examples include the distances between lactone V and Cys775, lactone V and Thr790, and lactone V and Thr854 of 0.87, 0.72 and 0.69 nm, respectively. Both VI and A are 0.7 nm away from Cys775. Remarkably, VII always maintains quite large distances from some key residues, such as Ala763 (0.7 nm), Cys775 (1.11 nm), Leu777 (0.74 nm), Thr790 and Phe856 (0.81 nm) and Thr854 (0.82 nm).
Meanwhile, in the allosteric site, some lactones remain almost unchanged or have the contact number increased, including IV, B, C, D, E, F and G. However, the number of hydrogen bonds formed by lactone derivatives and EGFR in representative structures is also recorded. In comparison to the corresponding docking values, while the hydrogen bond in the case of ligands located at the ATP binding site is rather arbitrary, that in the allosteric region is not significantly changed. Several preferred residues induce hydrogen bond formation with ligands, such as Lys745, Met793, Arg803, and Thr854 in the ATP binding site, and Lys745, Asp855, Phe856 and Leu858 in the allosteric site.
Ligand | ATP binding region (active state) | ATP binding region (inactive state) | Allosteric binding pocket (inactive state) | |||||||
---|---|---|---|---|---|---|---|---|---|---|
N 0 | Index | F max (pN) | W pull (kcal mol−1) | ΔG‡unbind (kcal mol−1) | F max (pN) | W pull (kcal mol−1) | ΔG‡unbind (kcal mol−1) | F max (pN) | W pull (kcal mol−1) | ΔG‡unbind (kcal mol−1) |
1 | I | 883 | 73.9 | 56.5 | 673.1 | 58.6 | 27.4 | 494.6 | 40.9 | 19.5 |
2 | II | 850 | 70.1 | 51.3 | 405.8 | 25.9 | 14.7 | 523.9 | 46.3 | 19.8 |
3 | III | 910 | 82.6 | 58.4 | 873.2 | 79 | 55.6 | 476.0 | 42.7 | 16.1 |
4 | IV | 670 | 68.5 | 33.9 | 403.4 | 31.6 | 11.5 | 443.0 | 37.7 | 14.5 |
5 | V | 744 | 77.5 | 27.4 | 561.2 | 46.9 | 20.9 | 435.2 | 36.0 | 13.8 |
6 | VI | 992 | 113.0 | 69.1 | 437.2 | 31.5 | 15 | 487.0 | 46.8 | 15.5 |
7 | VII | 694 | 72.1 | 29.1 | 540.8 | 48.9 | 20.9 | 444.2 | 41.4 | 13.3 |
8 | VIII | 1130 | 119.0 | 89.9 | 531.5 | 45.9 | 20 | 754.0 | 77.4 | 41.3 |
9 | A | 1000 | 95.1 | 72.5 | 569 | 48.7 | 22.4 | 386.5 | 28.8 | 10.9 |
10 | B | 1070 | 101.0 | 82.1 | 587 | 42 | 25.9 | 523.1 | 42.4 | 22.0 |
11 | C | 626 | 41.6 | 28.5 | 710.3 | 63.6 | 35.8 | 478.7 | 39.3 | 18.4 |
12 | D | 932 | 88.6 | 60.2 | 455.7 | 36.6 | 16.6 | 472.1 | 41.3 | 16.0 |
13 | E | 1090 | 103.0 | 83.3 | 388.5 | 28.6 | 10.6 | 819.8 | 78.3 | 46.8 |
14 | F | 1190 | 120.0 | 99.0 | 500.9 | 38 | 19.1 | 729.2 | 72.9 | 36.4 |
15 | G | 596 | 47.1 | 25.8 | 421 | 31 | 13.6 | 798.1 | 73.6 | 47.9 |
16 | H | 509 | 44.7 | 16.9 | 985.5 | 92.6 | 68.3 | 917.6 | 86.8 | 61.2 |
The highest value of rupture force (cf.Table 5), in the case of interaction with the ATP binding site (active state) of the EGFR-TK domain, belongs to lactone F, with the value of Fmax = 1190 pN. In addition, some lactones with great rupture force values emerge, including VIII (Fmax = 1130 pN), E (Fmax = 1090 pN), B (Fmax = 1070 pN), and A (Fmax = 1000 pN). At the bottom of the ranking list are lactone G with a rupture force of 596 pN and lactone H with a rupture force of 509 pN.
We also calculate the pulling work, Wpull, based on the force–displacement profile. The top two pulling work values are associated with F (120 kcal mol−1) and VIII (119 kcal mol−1). Three other high values include VI (113 kcal mol−1), E (103 kcal mol−1) and B (101 kcal mol−1). Lactones G, H and C rank at the bottom of the list with the pulling work values of 47, 45 and 42 kcal mol−1, respectively. Simulated results point out that some pairs of Wpull values, such as F and VII (120 and 119 kcal mol−1), B and E (101 and 103 kcal mol−1), II and VII (70 and 72 kcal mol−1), and lactones I and VII (74 and 72 kcal mol−1), are almost indistinguishable from each other owing to the small differences when accounting for the expected error margins.
The unbinding barrier, ΔG‡unbind, calculated from Jarzynski's equation with the extension of Hummer and Szabo83 allow us to determine the phase transition point from binding to unbinding of lactones. A large range of this energy barrier value is found, ranging from 17 to 99 kcal mol−1. Similar to the case of pulling work, some unbinding barrier values overlap with each other; for example, I and III have values of 56.5 and 58.4 kcal mol−1, respectively; V, VII, and C have values of 27.4, 29.1, and 28.5 kcal mol−1, respectively; and B and E have values of 82.1 and 83.3 kcal mol−1, respectively. This overlap turns out to be a disadvantage when comparing dynamics and selecting objects. However, in the context of the present study, the overlap of the pulling work or the unbinding barrier can be interpreted because of the behavior of these compounds, which are quite similar to each other when interacting with the ATP binding region of the EGFR-TK domain.
When analyzing the results of SMD simulations for the ATP binding site (inactive state), a lower range of rupture force Fmax values are observed, spanning from 388.5 pN (lactone E) to 985.5 pN (lactone H). Notably, lactone H exhibits the highest Fmax value, thus highlighting its strong binding affinity in the inactive conformation of the ATP binding site. In terms of pulling work Wpull, its values range from 26 (II) to 93 kcal mol−1 (H). Similarly, H is characterized by the highest Wpull value, further reinforcing its stability in this binding state.
The unbinding barriers ΔG‡unbind follow a similar trend, with H leading the ranking at 68 kcal mol−1, while E has the lowest value of 11 kcal mol−1. These results suggest that lactone H enjoys the strongest interaction with the inactive ATP binding site, while other lactones, such as E and II, achieve weaker binding. The variations and overlaps among some unbinding barriers indicate similar binding behaviors for certain pairs of ligands in this conformation.
In parallel, 50 independent SMD trajectories are applied, and some dynamic parameters of 16 lactones are considered in the interaction with the allosteric binding site of the EGFR-TK domain are computed. The corresponding results are listed in Table 5. In general, the calculated values are smaller than those in the cases at the ATP binding site. The Fmax value varies from 386.5 to 917.6 pN. Surprisingly, H occupies a top position in the ranking of rupture force. Additionally, the first position in the pulling work and unbinding barrier involves H, with 87 and 61 kcal mol−1, respectively. This implies that lactone H can be regarded as an effective candidate for an allosteric inhibition of the EGFR-TK domain, instead of competition with ATP at the ATP binding site.
Following H, lactones E, G, VIII and F emerge with rather large rupture force values of 820, 798, 754 and 729 kcal mol−1, respectively. The remaining values are substantially smaller, ranging from 300 to 600 kcal mol−1. The rupture force of A is found to be the smallest value. The range of pulling work and unbinding barriers varies from 29 to 87 kcal mol−1 and from 11 to 61 kcal mol−1, respectively. Again, we encounter an overlap between some pairs of values when considering the expected error margins. This demonstrates that some lactone pairs are almost identical in their interactions with EGFR at the active sites.
The SMD simulation method is effective only when comparing the binding affinities in cases where multiple ligands consistently bind to the same region. Here, the system includes two ATP regions and one allosteric region. Hence, this makes the SMD approach useless when comparing the behaviour of a ligand, such as lactone H, when binding to different regions of EGFR. Fortunately, this inherent limitation can be overcome using the umbrella sampling (US) method when an equilibrium dissociation process can be generated. To generate reliable data, however, the US method consumes a great quantity of computational resources, as well as real simulation time, making it difficult to carry out US computations for all systems considered. Therefore, we focus only on exploring the equilibrium free energy of lactone H in both the ATP regions (active and inactive state) and the allosteric region. More than 120 windows with 10 ns each are simulated to ensure overlap between two adjacent windows in such a way that we can view the histograms plotted in Fig. 3. Free energy profiles are also constructed.
The US method is known to be an appropriate method for calculating the free energy of non-covalent ligand–protein binding. The free energy difference between the bound state when the ligand is located in the ATP binding region, and the unbound state when the ligand is rejected far away from the EGFR can be related to the strength of the ligand–protein binding affinity. In the case of lactone H binding in the ATP binding site at the active state of EGFR, the free energy profile shows a small value of −5 kcal mol−1. This value is approximately 3 times as small as the value of −16 kcal mol−1 in the case of this derivative binding in the allosteric pocket. The inspiring value of −16 kcal mol−1 obtained using the PMF method points out the ability of H to inhibit the EGFR allosteric pocket. In addition, the low binding free energy value of −5 kcal mol−1, corresponding to that of H in the ATP region, allows us to predict that lactone H is not involved in the binding mode to the EGFR active structure. Consequently, it does not compete with the ATP molecule or other TKIs (such as gefitinib or osimertinib) in the ATP region of the EGFR active state.
In the last case, when H binds to the ATP region of the inactive state, a quite good binding free energy of −13 kcal mol−1 is computed. This numerical value emphasizes that lactone H prefers to interact with the EGFR inactive conformation rather than with its active counterpart.
The largest number of contacts belongs to residue Leu788, with 165 and 120 contacts with the G2 and G3 fragments, respectively (cf.Fig. 4). Remarkably, in the top 5 largest numbers of contacts, the amino acid Leucine appears three times. Except for Leu788, Leu747 and Leu777 situated in the top 5, Leu 760, Leu778, Leu844, Leu858 and Leu861 are also involved, contributing up to 982 contacts to the total of 3270 contacts (accounting for up to 30%) between lactone H and EGFR.
Despite having a high contact frequency, the eight Leucine amino acids interact with H through a weak electrostatic interaction with an energy of only −0.9 kcal mol−1. These data show that lactone H is stabilized in the allosteric region of EGFR under the dominance of the van der Waals interaction, whose energy amounts to −14 kcal mol−1, in the case that all eight Leucine amino acids are added together. Besides, Ile759, Met766 and Asp855 are three residues possessing the strongest VDW interactions, with calculated potential values of −4.2, −4.3 and −4.5 kcal mol−1, respectively. For insight into the three separated structures of lactone H, the G1, G2 and G3 fragments (Fig. 4) contribute the VDW potential energies of −14, −19 and −19 kcal mol−1, respectively, which are at least 4 times higher than the relevant amounts of Coulomb potential energies with the values of −2.4, −0.1 and −4.1 kcal mol−1, respectively. Although not all Coulomb interaction pairs are considered, four pairs are found, G1-Ile759, G3-Met766, G3-Leu777 and G2-Asp855, which contribute some marked results of −2.8, −2.9, −3.2 and −2.4 kcal mol−1, respectively, to the interactions.
The umbrella sampling MD, which is a reliable simulation method, emphasizes the high binding free energies of lactone H. Based on these absolute binding free energies, we inspiringly predict that colossolactone H extracted from lingzhi, a well-known Asian natural product, can be used as a good treatment drug because it strongly binds to the inactive conformation of EGFR, either in the ATP or the allosteric region. This leads to the recommended implementation of a synergistic drug combination in cancer treatment involving colossolactone H and gefitinib, which is a well-established drug. This result opens an avenue for the use of colossolactone derivatives as effective scaffolds to develop new anti-cancer drugs.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5cp00817d |
This journal is © the Owner Societies 2025 |