Open Access Article
Ahmet Buğra Ortaakarsu
*a,
Michel Hosnyb,
Mansour Sobehc and
Mohamed A. O. Abdelfattahd
aDepartment of Chemistry, Faculty of Science, Gazi University, Turkey. E-mail: bugra@ortaakarsu.com
bAmerican University of the Middle East, Kuwait
cAgroBioSciences Program, College of Agriculture and Environmental Sciences, University Mohammed VI Polytechnic, Ben-Guerir 43150, Morocco
dCollege of Engineering and Technology, American University of the Middle East, Kuwait
First published on 3rd November 2025
Non-alcoholic fatty liver disease (NAFLD) is a prevalent metabolic disorder with limited therapeutic options. Thyroid receptor β (THR-β) agonists have been showing promise for controlling NAFLD via improving hepatic lipid metabolism. This study utilized different in silico tools to screen 47
199 natural compounds from the ZINC15 database to identify potential THR-β agonists. Molecular docking, molecular dynamics simulations, and advanced analyses such as PCA, TICA-FES, and MSM revealed that 4-O-caffeoylquinic acid (compounds 2) and dihydroxydehydrodiconiferyl alcohol (compound 18) are the most promising hits. Both demonstrated high binding affinity and stable agonist interactions with key THR-β residues such as Arg316 and Arg320 that stabilize the ligand binding pocket and support the agonist potential, comparable to the reference agonists resmetirom and {3,5-dichloro-4-[4-hydroxy-3-(propan-2-yl)phenoxy]phenyl}acetic acid. Long-term MD simulations confirmed their stability, and MM/GBSA calculations supported robust thermodynamic profiles. Moreover, the two hits displayed superior selectivity for THR-β over THR-α and favorable pharmacokinetic profiles with minimal toxicity alerts. These findings support compounds 2 and 18 as strong candidates for NAFLD therapy, warranting further experimental validation.
Primarily, NAFLD results from a calorie-rich diet, high intake of fructose and animal protein, lack of physical activity, and a sedentary life style in general. Obesity and type 2 diabetes mellitus are major risk factors for the development of steatosis, which is the first stage in the NAFLD spectrum. Steatosis is characterized by accumulation of fat in 5% or more of hepatocytes, when there is no obvious etiology such as drug or alcohol intake. It can then progress to a more serious condition known as nonalcoholic steatohepatitis (NASH) that is associated with lobular inflammation, hepatocyte damage, along with variable degrees of fibrosis and cirrhosis in liver. Although NASH is generally a nonprogressive disease, it might develop into liver failure and even hepatocellular carcinoma in some patients.3,4
Several research studies have demonstrated that disorders of significant signaling pathways play a crucial role in the pathogenesis of fatty liver disease such as sirtuin 1 (SIRT1), adenosine 5′-monophosphate-activated protein kinase (AMPK), toll-like receptor 4 (TLR4), peroxisome proliferator-activated receptor (PPAR), nuclear factor kappa-B (NF-κB), and nuclear factor erythroid 2-related factor 2 (Nrf2).5–8
The beneficial effects of thyroid hormones, which are key regulators of metabolism in humans, are mediated by thyroid receptors with two isoforms: THR-α and THR-β. The former is prevailing in the brain, heart, and bones, whereas the latter is predominant in the liver, brain, kidneys, and pituitary gland.8,9 While the THR-α isoform plays a crucial role in neuronal development, TRβ is proposed to have a role in the re-myelination processes. Moreover, THR-β is responsible for the thyroid hormones' beneficial effects on the lipid metabolism in the liver.10,11 The structural differences between THR-α and THR-β are shown in Fig. 1.
![]() | ||
| Fig. 1 Structural differences between THR-α and THR-β. The red illuminated area is the centre of action of the active zone centre. | ||
Thus, considerable efforts in the last two decades were devoted to developing selective agonists towards THR-β in order to be effective in managing metabolic and/or neuronal disorders without exerting serious adverse effects on the bone and heart. For many years, the two molecules sobetirome and eprotirome were extensively investigated. They showed great promise for treating lipid metabolism diseases but failed in the clinical trials because of their unwanted adverse effects.12 Resmetirom (MGL-3196) was the first FDA approved oral THR-β agonist in March 2024 to treat noncirrhotic NASH with moderate or advanced hepatic fibrosis. The drug targets the primary underlying causes of the metabolic disorder associated with NASH, and it should be administered concomitant with diet management and exercise.13
On the other hand, various studies revealed the beneficial effects of some plant-based phytochemicals and bioactive components in reducing the severe burden in patients with NAFLD. Among these, curcumin, cryptochlorogenic acid, epigallocatechin-3-gallate, caffeic acid, trigonelline, caffeine, 4-p-coumaroylquinic acid, salidroside, allicin, resveratrol, and silybin are extensively investigated and reported.14 The potential underlying mechanisms related to the mode of action of these compounds included inhibition of oxidative stress and inflammation in liver, downregulation of fatty acid synthase expression, increasing the abundance of Lactobacillus in gut, hindering apoptosis and autophagy, reducing TNF-α production, and upregulating some pathways such as Nrf2, PPAR-α, AMPK, and fatty acid β-oxidation pathways.15–17 To date, there are no comprehensive studies investigating the agonist potential of plant-based phytochemicals towards the THR-β receptor.
In the drug discovery process nowadays, where in situations where efficacy is prioritised on efficacy, diversity, and speed, various computer-aided approaches have enabled the virtual screening of large libraries of compounds, reduced experimental costs, and provided faster and more accurate calculations for parameters such as the estimated binding affinity of a drug to its target and various physico-chemical properties of newly designed drug candidates. These advances have all contributed to a rapid and more efficient drug design process.18
In this work, we aimed to evaluate the binding affinity and agonist potential of natural compounds in the ZINC15 database towards the THR-β receptor. For this purpose, four rounds of molecular docking were adopted, followed by short- and long-term molecular dynamics studies. The top performing compounds were subjected to thorough analysis of protein dynamics, including principle component analysis (PCA), Time-Lagged Independent Component Analysis-Free Energy Surface (TICA-FES), and Markov State Model (MSM) analyses, and their drug likeness, ADME, and toxicity profiles were evaluated. Furthermore, the selectivity of the top performing compounds towards the beta isoform of the receptor was investigated through molecular docking and molecular dynamics studies done on the alpha isoform.
199 compounds, within the natural compound class in the ZINC15 (ref. 26) database. The codes of the ligands utilized in the study are provided in the SI 1. Furthermore, resmetirom27 and the cognate ligand bound to THR-β (PDB ID: 1NAX), namely {3,5-dichloro-4-[4-hydroxy-3-(propan-2-yl)phenoxy]phenyl}acetic acid, were employed in the in silico studies as reference agonists. The ionisation states of all compounds whose 3D structures were downloaded were estimated for pH 7.4 using the Epik28 embedded software in the LigPrep29 module. The ten most stable conformations were generated by considering tautomerisation states and subjected to the OPLS4 (ref. 30) force field.The detailed analyses included PCA, in which the multidimensional and complex datasets comprising the MD data were employed. By reducing the system into two dimensions and constructing energy surfaces, this approach elucidates the conformational shifts within the protein structure and the fundamental configurations restricted by the energy barriers.41 In addition, TICA-FES analysis was conducted with lag values of 10 (5 ns) and 100 (50 ns) to detect the short-term and minor conformational alterations induced by ligands in protein structures. Conversely, a lag value of 500 (250 ns) was employed to identify events such as protein folding, which are likely to occur within the protein structure in the long-term dynamics.42,43 Finally, MSM analysis was employed to determine the time-dependent transition probabilities of the changes in protein conformation and to detect the changes in the energy landscape. Frame intervals containing related conformations, with the same energy profiles, were clustered using the K-means algorithm. The energies of other ensembles were calculated based on the energies of the frame series that had the lowest energy.44–46 The Python scripts used to perform these analyses are available on Zenodo (https://doi.org/10.5281/zenodo.17194142) and GitHub (https://github.com/cannabinoid13/MDScripts). Version updates were made to all scripts under a single version number, and the first version was used in this article. The libraries used in the scripts included NumPy47 v2.0, MDAnalysis48 v2.7.0, PyEMMA49 v2.5.7, Matplotlib50 v3.9.0, and deeptime51 v0.4.4.
P) and water solubility (log
S), total polar surface area (TPSA), number of H-bond acceptors and donors, number of rotatable bonds as well as the fraction of sp3 carbon atoms (Csp3). SwissADME also offers useful predictions about the absorption and bioavailability of drug candidates following oral administration, potential of metabolism by various cytochrome P450 enzymes, in addition to the medicinal chemistry parameters, and Brenk and pan-assay interference compounds (PAINS) alerts, which help predict the presence of unstable and/or toxic fragments within the chemical structure.
199 natural compounds according to their binding affinities towards the THR-β receptor. This was achieved by gradually increasing the sensitivity of the molecular docking protocol over four rounds, where the number of compounds was reduced to 78. Among these, 22 top performing compounds were identified based on their docking scores and the similarity of their binding modes to those of the reference agonists, considering the afforded amino acid interactions. Table 1 lists the docking scores (kcal mol−1) of the identified top ranking compounds; however, the data (docking scores and amino acid interactions) of all the 78 compounds are given in Tables S1 and S2 (the nomenclature and molecular shapes of the compound ZINC15 codes) in SI 2. Moreover, the 2D interaction maps of the docking poses retained by some of the top compounds are presented in Fig. 1. Docking results revealed that the primary interactions afforded by the reference agonists were due to the presence of some electron dense groups such as carboxylic acid and nitrile groups, which enabled the interactions with the amino acid residues of the target protein. The dominant H-bond and salt bridge interactions occurred via Arg316 and Arg320 in addition to various hydrophobic contacts within the agonist binding site of the receptor. These two arginine residues harbour structural features responsible for the selectivity of the receptor.54 These features provide conformational constraint in ensuring the stability of the agonist conformation in the receptor by substituting the amino acid side chain in the alcohol units present in resmetirom and the cognate ligand. THR-β is sensitive to the presence of a carbon chain, which enhances the polar interaction with arginine residues in the pocket and acts as a skeleton in which substituents lock in an active upright conformation relative to units such as the terminal phenyl ring.55
The docking score of the resmetirom compound is compatible with the cognate ligand's molecular structure in terms of the presence of common groups, and the presence of the halogen structure of resmetirom in the cognate ligand in the scores obtained clearly explains the close docking scores. The cause and effect explanation of this situation is clearly seen from the protein–ligand binding modes illustrated in Fig. 2.
![]() | ||
| Fig. 2 2D binding poses of the compound and reference agonists that localised with the highest affinity following induced fit docking to the THR-β receptor. | ||
The final docking round, which employed induced fit docking, revealed that the majority of the compounds that interacted with the arginine residues in the agonist binding site of the THR-β receptor were negatively charged compounds at physiological pH conditions. Compounds 1, 7, 11, 14, and 18 are of particular interest as they afforded similar binding modes to the reference agonists, which is presumably due to the presence of the negatively charged carboxylate moiety and other hydroxyl groups in the regions extending to residues Arg316 and Arg320. Nevertheless, compound 2, which was previously reported to have a strong potential for treatment of fatty liver disease, exhibited the highest degree of similarity to the reference agonists. This compound afforded interactions with Gly332, Thr329, Arg316, Arg320, Arg382 and Ala279 residues via water bridges and numerous direct hydrogen bonds, as illustrated in Fig. 1. Moreover, it was able to access the Arg316 and Arg320 residues by virtue of its carboxylate moiety and the nearby hydroxyl group while forming a hydrogen bond with Gly344 via the two hydroxyl groups at its other end.
Noteworthily, the chiral compounds 1 and 18 showed “R” configuration of the chiral centers having the hydroxyl groups that extended to the arginine residues, which represents a distinguishing feature in the emergence of strong interactions.
In general, the studied compounds showed low RMSD values, indicating appropriate binding affinity. While the overall RMSD average of the compounds fluctuated in the range of 2–3 Å, some distinguished compounds showed relatively low RMSD values, compared to others, over the whole simulation time. These include compounds 1 (mean: 2.28 Å), 2 (mean: 2.28 Å), 5 (mean: 2.23 Å), 7 (mean: 2.23 Å), and 10 (mean: 2.28 Å). On the other hand, compound 22 showed the highest deviations in its RMSD values, suggesting relatively lower binding affinity to the receptor.
Examining the fluctuations in RMSF plots obtained following the binding of ligands to a certain protein provides valuable insights into the strength of their binding affinity towards that protein. Ligands showing low fluctuations with the amino acid residues in the active site are characterized by tight binding and strong affinity to the protein.56 In this study, RMSF plots of the 10 ns MD simulations were thoroughly analyzed to identify the ligands that showed low fluctuations with the residues in the agonist site and high fluctuations in other regions of the protein structure. These ligands are expected to have strong agonist potential. As illustrated in Fig. 4, the red dashed lines correspond to the amino acid residues reported to interact with the reference agonists. The indexes of these residues are 67, 68, 71, 74, 105, 122, 123, 136, 138, 145, 227, 234, and 247. These indexes are located as residues 275, 276, 279, 282, 313, 330, 331, 344, 346, 353, 435, 442, and 455 in the protein structure, respectively.31 Accordingly, ligands with strong agonist potential should show minimal fluctuations with the amino acids in the region extending between indexes 65 and 150 and also in the region between indexes 200 and 250, while increased fluctuations should be observed elsewhere. Compounds 1, 2, 7, 11, 14, and 18 showed this fluctuation pattern, in addition to their relatively low RMSD values, which indicate their strong THR-β agonist potential. Among these compounds, compound 2 is a caffeic acid derivative. Considering the favourable effects of caffeic acid reported in the literature in the treatment of non-alcoholic fatty liver disease, this result is quite consistent.57 Another prominent compound, compound 18, is a propanetriol (glycerol) derivative. The fact that there are reports in the literature that propanetriol derivatives also have a positive effect on non-alcoholic fatty liver disease shows that the information obtained from short-term molecular dynamics simulations is consistent.58,59
Following the long-term MD simulations, compounds 2, 11, 14, and 18 exhibited mean RMSD values in the range of 2.91–3.12 Å, which was comparable to 2.79 Å and 3.02 Å for the cognate ligand and resmetirom, respectively. Compounds 2 and 18 exhibited the closest resemblance to the RMSD graphs of the reference agonists in terms of both average and trend. However, compounds 1 and 7 exhibited a deviation greater than 3.5 Å in the RMSD data and persisted in this condition for a period extended approximately to 200 ns. This could indicate that the protein–ligand complexes of these two compounds were unstable over the long term, as evident from their RMSD values, which reached up to approximately 4 Å towards the end of the simulations (Fig. 5a). This could also point out the relatively weak binding affinity of the two compounds and suggest that they were unlikely able to elicit an agonist signal upon binding to the receptor.
![]() | ||
| Fig. 5 RMSD data (a), interactions over 30% of the simulation time (b), and RMSF data (c) of the top ranking compounds and the reference agonists following the long-term (500 ns) MD simulations. | ||
Analysis of the RMSF data obtained from the long-term MD simulations (Fig. 5c) revealed that compounds 2 and 18 exhibited distinctive fluctuations in the region far from the agonist binding site, around residue index 40 and highly reduced fluctuations in the regions of the agonist site, suggesting their high agonist potential to the receptor. Although the fluctuations observed by compound 1 were higher than those of other compounds in this range, this fluctuation pattern is sufficiently high to render the natural functioning of the protein structure uncertain. In view of the distinct RMSD and RMSF data sets of compound 1, it might elicit specific effects, the nature of which is currently unclear. The fluctuations of compound 11 were significantly higher than the other compounds around the residue index 200. This information, when interpreted alone, indicates the appreciable agonist potential of the compound.
Compound 7 did not afford notable interactions with the crucial amino acid residues in the agonist site of the THR-β receptor in the region between indexes 200 and 250, as evidenced by the significant fluctuations it exhibited in this region. Although compound 14 had regular, satisfying, and plateauing RMSD data, it showed high fluctuation in the region of the agonist site residues, similar to the pattern observed with compound 7.
Regarding the binding modes of the ligands employed during the long-term (500 ns) MD simulations, Fig. 5b illustrates the interactions between the six drug candidates and the amino acid residues in the agonist site of the target receptor, compared to the interactions afforded by the reference agonists over the simulation time.
In the agonist site, the reference agonists are engaged in crucial hydrogen bonding with Arg316 and His435 residues in addition to hydrophobic contacts with Ile276 and Leu330. Noteworthily, the nitrile group in resmetirom and the isopropyl group in the cognate ligand displayed a significant contribution to the drug–protein interactions.
Compound 2 formed both direct and indirect hydrogen bonds (through water bridges) with Arg316 that persisted for 76% and 36% of the simulation time, respectively, and also formed hydrogen bonding interaction with Gly344, similar to resmetirom. Compound 18 afforded hydrogen bonding interaction with Arg320 for 70% of the simulation time. However, no interactions with Arg316 were detected for more than 30% of the simulation time. Nevertheless, the compound showed hydrogen bonding interaction with Asn331, which was also afforded by the cognate ligand.
Compound 14 did not demonstrate any of the common interactions afforded by the reference agonists during the simulations. Compound 7 formed water bridges with Arg316, yet it did not exhibit direct interactions. In the same sense, the interactions afforded by compounds 1 and 11 were not as effective, and their complexes with the protein did not show as high stability as that observed by other compounds such as 2 and 18 during the long-term MD simulations.
Conclusively, compounds 2 and 18 exhibited low RMSD values that were stable at the plateau levels, showed reduced fluctuations in the crucial regions of the receptor's agonist site with low RMSF values, and demonstrated similar binding modes observed by the reference agonists. Moreover, the ligand–protein complexes of the two compounds remained stable throughout the whole simulation time that indicated strong and robust adhesion to the residues in the agonist binding site. Moreover, compounds 2 and 18 were found to have a potential to enhance the signal carrier effect of the receptor by increasing the fluctuations in the regions away from the agonist binding site in a similar behavior to the reference agonists. Compound 2 has been reported as anti-diabetic, although its anti-inflammatory properties are predominantly mentioned in the literature.60,61 Compound 18 is an anticancer compound with cytotoxic effects and is known to be effective in bone health and bone metabolism in very similar forms.62,63
The free energy surface plots obtained following the PCA of the ligand–protein complexes showed dark blue regions, representing low energy, surrounded by yellow areas that reflect relatively higher energy barriers. The energy landscapes of the analyzed complexes were remarkably similar. Specifically, compound 2 and resmetirom showed very comparable energy landscapes, and similarly compound 18 and the cognate ligand had very close energy surfaces. This is an evidence that the movement of the protein structure throughout the simulations was very similar for all systems. Noteworthily, the deep energy valleys appeared in the plots represent a pathway for conformational transitions, which is highly favorable for the generation of agonist signals.
At a lag value of 10, resmetirom and the cognate ligand exhibited remarkably similar deep low energy valleys, whereas compounds 2 and 18 demonstrated a high degree of similarity to the energy landscape of the reference compounds. In all the systems, the protein was able to transition between different conformations by concentrating the deep energy valleys in specific regions. This points out that the studied compounds help stabilize certain conformations of the protein, making these states more likely to occur. However, when the lag value was set to 100, the energy profile similarity between resmetirom and the cognate ligand was undetectable, while the energy landscapes of the cognate ligand and the two hit compounds were still demonstrating a high degree of similarity. This observation underscores the potential for a pronounced agonist effect at this particular lag value. At a lag value of 500, the deep energy valleys exhibited highly analogous energy landscapes for all compounds, confirming similar agonist effects of the hit compounds to those of resmetirom and the cognate ligand over an extended timeframe (Fig. 7). This analysis at varying lag values provides substantial evidence that the parallel agonist properties of both the hit compounds and the reference agonists towards THR-β are effective, particularly during the time interval when the lag value is 100, i.e., around 50 ns.
![]() | ||
| Fig. 8 MM/GBSA plots showing the change in free energies of the ligand–protein complexes of the reference agonists and compounds 2 and 18 during the long-term (500 ns) MD simulations. | ||
The free energy data demonstrated that all the complexes exhibited low energy states, thereby confirming their stability at all stages of the long-term MD simulations. The complex of compound 2 exhibited a slightly higher energy state than the others, reaching a maximum value of −65 kcal mol−1. Subsequently, the free energy demonstrated a downward trend, declining below the threshold of −80 kcal mol−1. Noteworthily, the other three complexes manifested comparable and proximate free energy values.
Docking results showed that only compounds 2, 11, and 14 were able to dock to THR-α. However, they exhibited minimal interactions with the amino acid residues in the agonist site of the receptor (Fig. S5). Interestingly, compound 2 utilized water molecules as a bridge for the majority of its hydrogen bonding interactions with the protein. While water molecules contribute to the interaction energies, they must be meticulously examined in the context of static interactions due to their inherent mobility in dynamic processes. Compound 18 was not able to dock to THR-α, indicating high selectivity potential towards the β isoform of the receptor. The high degree of selectivity observed is attributable to the presence of the Ser277 residue in the α isoform at the ligand binding sites, despite the absence of the Asn331 residue in the β isoform. This amino acid residue difference has resulted in alterations to the hydrogen bond interaction pattern, thereby creating conditions conducive to selectivity. Furthermore, α-helices, free ends and helical structure differences, as demonstrated in Fig. 1, significantly distinguish the protein dynamics between the two structures. These helical structure differences are primarily observed in regions near the backbone residues within the binding site in the α isoform, where folding and length variations are evident.
RMSD and RMSF data of compound 11 showed minimal deviation values, suggesting robust stability of the ligand–protein complex and hence an appreciable affinity towards THR-α. Noteworthily, the RMSF plot of compound 11 displayed a notable indication of a partial agonist effect. This was manifested as an additional fluctuation in the highly fluctuating region around residue index 50. The RMSD and RMSF data of compounds 2 and 14 exhibited elevated deviation values, suggesting very low affinity and a minimal impact on the protein structure (Fig. S5).
The PCA data of compound 11 demonstrated the greatest similarity to the cognate ligand's effects on the protein dynamics. The free energy surface of this compound contained two distinct low-energy regions, analogous to those observed in the cognate ligand's surface. Compound 2, on the other hand, exhibited inconsistencies in its probability density representation of the agonist effect, with low-energy regions distributed across a wide range. Compound 14 appeared to restrict protein dynamics, exhibiting a single low-energy region (Fig. S5).
When all the data are considered collectively, it becomes evident that compounds 2, 14, and 18 evinced minimal interest in the alpha isoform. Compound 11, however, partly demonstrated compatible and consistent protein dynamics data with those of the cognate ligand, which is known to be an agonist. The RMSD and RMSF data of this compound could suggest a significant partial agonistic potential. Compound 14 exerted a constrained effect on the protein dynamics and thus could have the potential to demonstrate a direct antagonist effect towards the THR-α isoform. Compound 2 did not exhibit any evidence of agonistic effect and displayed markedly distinct protein dynamics. Compound 18, which did not dock to THR-α, could have a significant preference to the beta isoform with high selectivity and considerable agonistic potential.
| Compound number | Molecular weight (g mol−1) | log P |
H-bond acceptors | H-bond donors | TPSA (Å2) | Rotatable bonds | Csp3 | log S |
GI absorption | BBB permeant | Cytochrome substrate | Toxicity alert |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | 354.31 | −0.37 | 9 | 6 | 164.7 | 5 | 0.38 | −1.62 | Low | No | No | PAINS alert |
| Brenk alert | ||||||||||||
| 18 | 392.40 | 0.84 | 8 | 5 | 128.8 | 7 | 0.4 | −2.13 | High | No | No | No alerts |
| Cognate ligand | 355.21 | 2.82 | 4 | 2 | 66.8 | 5 | 0.24 | −5.21 | High | No | CYP1A2, CYP2C9, and CYP2C19 | No alerts |
| Resmetirom | 435.2 | 2.22 | 7 | 2 | 146.5 | 4 | 0.18 | −4.42 | Low | No | No | No alerts |
Lipinski's rule of five states that compounds with molecular weights less than 500, partition coefficients (log
P) below 5, fewer than 5 hydrogen bond donors, and fewer than 10 hydrogen bond acceptors are more likely to show good oral bioavailability.64,65 Veber's rule considers two additional criteria, which are a total polar surface area (TPSA) of less than 140 Å2 and less than 10 rotatable bonds.66 The fraction of the sp3-hybridized carbon atoms relative to the total count of carbon (Csp3) is another component that influences drug likeness. It reflects on the complexity of molecules and indicates their carbon saturation. Studies showed that compounds with zero Csp3 tend to have low absorption and bioavailability.67 In the same manner, water solubility (log
S) significantly influences drug absorption across biological membranes. Drug candidates with log
S values of −4 or higher are expected to have good oral bioavailability68
Compound 2 showed one violation to each of the Lipinski and Veber criteria, low absorption, and exhibited PAINS and Brenk alerts by virtue of its catechol and α,β-unsaturated carbonyl moieties, respectively. Indeed, drug candidates with catechol moieties demonstrate PAINS alert as they often nonspecifically bind enzymes and receptors, either via metal chelation or covalent bonding, other than the target one leading to false positive assay results. Moreover, catechols are possibly oxidized to quinones, generating reactive oxygen species that lead to oxidative stress and cause damage to DNA, lipids, and proteins. For instance, nordihydroguaiaretic acid that was used to manage asthma conditions was recalled due to liver and kidney toxicity linked to its catechol moiety. However, the catechol derivatives dopamine and epinephrine are safely used therapeutically, in a wide range, since they are very well managed by the anti-oxidant defense mechanisms of the body as long as the dosage limits and the route of administration (IV, injection) are properly considered.69
The α,β-unsaturated carbonyls are electrophilic groups that react with nucleophiles, such as the SH of cysteine residues in proteins, via the Michael addition reaction. This could impair protein functions and trigger geno- and cytotoxicity at high levels. The reported neurotoxicity and oxidative damage of acrolein and crotonaldehyde are mainly linked to their α,β-unsaturated carbonyl groups. That said, the presence of this group is still beneficial for the action of some drugs, such as dimethyl fumarate that is used in psoriasis and multiple sclerosis. The drug utilizes its α,β-unsaturated carbonyl to react with the cysteine residues of cellular proteins, forming adducts that influence signaling pathways involved in inflammation and oxidative stress such as the Nrf2 pathway.70 Moreover, curcumin is a natural α,β-unsaturated chalcone that is used safely as a potent antioxidant and anti-inflammatory agent71
In contrast, compound 18 and the cognate ligand did not show any violation to either Lipinski or Veber drug likeness criteria, showed good absorption with no predicted interactions with the cytochromes or toxicity alerts. However, resmetirom showed a similar profile with low absorption and one Veber violation (Table 2).
Our comprehensive in silico study has pointed out that compounds 2 and 18 are promising hits to be considered as potential candidates for NAFLD management. Interestingly, 4-O-caffeoylquinic acid (compound 2) has been reported to display a wide array of pharmacological activities and strong therapeutic potential against several ailments. It was able to improve memory and cognition functions and prevent neuronal loss in mouse models of Alzheimer's disease, most likely due to its support of mitochondrial health alongside its powerful antioxidant activity.64 This compound also demonstrated potent anti-diabetic activity in vitro by inhibiting the enzyme α-glucosidase, thereby altering carbohydrate absorption and by inhibiting protein tyrosine phosphatase 1B, a negative regulator in insulin signalling.72,73 In the HepG2 lipid-accumulation model, 4-O-caffeoylquinic acid lowered the triglyceride levels effectively, surpassing simvastatin in vitro, which suggested a promising potential in treating dyslipidemia and fatty liver74
The lignan derivative, dihydroxydehydrodiconiferyl alcohol (compound 18), on the other hand, demonstrated a therapeutic value in bone disorders such as osteoporosis as it showed agonistic effect on estrogen receptors, promoting osteoblast differentiation. It also exhibited strong anti-inflammatory activity through inactivating the NF-Kb signaling pathway and altered adipogenesis and lipid accumulation in some in vitro studies, indicating its promising potential against obesity and dyslipidemia related diseases.75,76
199 ZINC15 natural products, 500 ns all-atom molecular dynamics simulations, MM/GBSA free-energy estimates, and advanced protein-dynamics analyses (PCA, TICA-FES, and MSM)—we identified 4-O-caffeoylquinic acid (compound 2) and dihydroxydehydrodiconiferyl alcohol (compound 18) as compelling thyroid hormone receptor-β (THR-β) agonist candidates for the management of non-alcoholic fatty liver disease (NAFLD). Both ligands established persistent electrostatic and hydrogen-bonding interactions with the key residues Arg316 and Arg320, with their carboxylate and hydroxyl moieties complementing the positively charged guanidinium groups and thereby reinforcing an active-like receptor conformation. Consistent with these contacts, MD trajectories reveal microstate transition rates and collective motions that closely mirror those induced by the reference agonist resmetirom, indicating a comparable capacity to trigger THR-β signal transduction. In silico ADMET profiling further predicts favorable oral bioavailability, metabolic stability, and safety margins for both compounds. Collectively, these findings position compounds 2 and 18 as tractable chemical leads for THR-β-targeted NAFLD therapy and underscore the power of integrative computational pipelines to accelerate the discovery of mechanism-informed therapeutics. Subsequent biochemical and in vivo studies are now warranted to validate their efficacy and safety profiles.
Custom code & workflow: the scripts and workflow specifications used for database mining, structure preparation, docking/MD analyses, post-processing, and figure generation are archived on Zenodo and mirrored on GitHub. Please cite the exact archived version used in this article and refer readers to the most up-to-date version via the Zenodo record:
• Version used in this article (archived, immutable): https://doi.org/10.5281/zenodo.17194142 (MDScripts Molecular Dynamics Simulation Analysis Package (MDSAP), v1.0; license: MIT).
• Most recent version: the latest citable version is accessible via the “Versions” panel of the same Zenodo record; the live development repository is available at https://github.com/cannabinoid13/MDScripts?utm_source=chatgpt.com.
Availability during peer review and at publication: referees had access to the above code and data during peer review. The Zenodo record listed here is public and provides a persistent identifier for the archived version used; readers can retrieve any newer versions via the Zenodo “Versions” view.
Natural compounds from the ZINC15 database are publicly accessible at http://zinc15.docking.org. Any additional data related to this article are available from the corresponding author upon reasonable request.
All relevant data used in this study are provided in the supplementary information (SI). Supplementary information: details of molecular modelling, docking, pharmacophore, and ADME/toxicity analyses, including raw and processed data. See DOI: https://doi.org/10.1039/d5dd00146c.
Beyoğlu, Computational drug repurposing effort for identifying novel hits for the treatment of diseases such as endometriosis, uterine fibroids, and prostate cancer, Turk. J. Chem., 2024, 48, 402–421 CrossRef PubMed.| This journal is © The Royal Society of Chemistry 2025 |