Open Access Article
Anju Choorakottayil Pushkaran and
Alya A. Arabi
*
Department of Biochemistry and Molecular Biology, College of Medicine and Health Sciences, United Arab Emirates University, P.O. Box: 15551, Al Ain, United Arab Emirates. E-mail: alya.arabi@dal.ca; alya.arabi@uaeu.ac.ae
First published on 13th March 2026
Ensemble molecular dynamics (MD) simulations have been widely used to improve binding energy predictions in protein-ligand systems, but their application to DNA-ligand systems is limited. Following previous studies on DNA-doxorubicin and DNA-proflavine systems, the present study extends the ensemble MD simulations for a total of 16 intercalators spanning different molecular sizes, intercalating scaffolds, flexibilities, and charges. For each DNA-intercalator complex, 25 independent 10 ns MD replicas were simulated using randomized initial velocities. The uncorrected Molecular Mechanics/Poisson-Boltzmann Surface Area (MM/PBSA) energies significantly overestimated the binding energies and had a poor correlation with the corresponding experimental values (with a correlation coefficient R2 of 0.45). Including entropy and deformation energy correction terms to the MM/PBSA energies reduced the discrepancy between experimental and predicted binding energies of the cationic intercalators to within ca. 2 kcal mol−1 (with an R2 of 0.70). Two additional monocationic intercalators were included for validation. Their predicted binding energies were within 1.2 kcal mol−1 of the corresponding experimental values. Bootstrap analyses showed that around 10 independent replicas are generally sufficient to reduce the uncertainty in binding energy predictions for cationic intercalators to below 1 kcal mol−1, although up to 20 replicas could be needed for more flexible ligands that have relatively high degrees of freedom. Overall, MM/PBSA energies estimated from an ensemble of short MD simulations, and corrected for entropy and deformation, are reliable and computationally efficient for ranking and predicting the binding affinity of the cationic intercalators in this study. This method provides a valuable alternative when experimental binding measurements are impractical or costly.
High-resolution structures of DNA-intercalator complexes have shown that the intercalation process causes helix distortion.10 The sugar-phosphate backbone is locally unwound, extended, and the base pairs flanking the intercalator are separated and tilted.10–12 Therefore, it is important to account for the deformation energy in DNA-intercalator systems. Furthermore, the intercalation typically yields a large favorable enthalpy change due to π–π stacking and electrostatic interactions, which is counterbalanced by a substantial loss of entropy.13 Chaires and co-workers reported that the binding energies of intercalation (on the order of −5 kcal mol−1 to −10 kcal mol−1) are a result of a balance between favorable binding enthalpies and unfavorable entropy penalties.14–17 Several other studies have also highlighted the importance of accounting for entropy, in addition to enthalpy, when evaluating the binding energies of intercalators binding to DNA.18–20 Therefore, in DNA-intercalator complexes, both the entropy and deformation correction terms should be added to the predicted MM/PBSA binding energies to reproduce experimentally determined binding energies.
Another major issue with MM/PBSA energies is the sensitivity to limited sampling. Because of the stochastic nature of MD sampling, MM/PBSA binding energies estimated from single MD trajectories are often not reproducible.9,21 Extending the simulation time to 100 ns may result in a stable complex, however, if restarted with a different starting velocity, it could sample to a different trajectory resulting in a different MM/PBSA energy. It has been reported that ensemble MD simulations, where multiple independent MD simulations are performed using different starting velocities and/or geometries, overcome this challenge of insufficient sampling.9,21–26 Several studies have shown that an ensemble of shorter simulations often samples broader conformations and yields more reproducible energy estimates than a single long trajectory.9,21,27 Perez et al. quantified this by comparing the results from multiple simulations to a single run of equal total simulation time.28 Performing independent MD simulations helps to average over MD noise and present the error in the predictions in terms of standard deviations.22 In another study, the ensemble MD simulation method was reported to accurately reproduce the experimental ranking of ligands, with a Spearman rank coefficient of >0.8,29 although the absolute energies deviated significantly from the experimental values.
A recent study explored both limitations of the energy correction terms and the sensitivity to limited sampling in two DNA-intercalator complexes, DNA-doxorubicin and DNA-proflavine.20 It was shown that by averaging results over 25 independent MD simulations of 10 ns each, and by including the entropy and deformation terms, the predicted binding energies were within ca. 1.0 kcal mol−1 of the experimental binding energies for both complexes. The study also demonstrated that single simulations, whether long (100 ns) or short (10 ns), often produced binding energies that were off from experimental values (by up to 5 kcal mol−1). However, averaging results across an ensemble of 25 replicas (short 10 ns or long 100 ns simulations) brought the predicted binding energies much closer to the experimental values, within 1 kcal mol−1. The study further reported that the average binding energies predicted from ensembles of short vs. long simulations resulted in differences of no greater than 1 kcal mol−1.20 However, this discussed study is limited to only two complexes. Validation on a wider and diverse set of intercalators is important to confirm the reliability of the ensemble MD-based MM/PBSA method for DNA-intercalator complexes. Such validation is important for guiding the design and development of novel DNA intercalators for anticancer, antimicrobial, and diagnostic applications, especially for systems where experimental binding data are unavailable or difficult to obtain.
In the present study, the reliability of the ensemble MD-based MM/PBSA method is evaluated for a total of 18 DNA-intercalator complexes involving a diverse set of intercalators with varying molecular sizes (approximately 180 Da to 540 Da), intercalating scaffolds (with two to three fused rings in the scaffold), flexibility (low to high degrees of freedom), and charges (neutral, monocationic, and dicationic). The performance is assessed for both predicting absolute binding energies and ranking relative energies of DNA-intercalator complexes. Two intercalators, 9-aminoacridine (9-AA)1+ and amonafide1+, were used for further validation purposes. MM/PBSA energy decomposition was performed to partition the total binding energy into individual interaction components. Lastly, the minimum number of replicas required to obtain reliable binding energy estimates is analyzed. To our knowledge, this is the first study to evaluate ensemble-based DNA-intercalator binding energies across such a diverse set of intercalators.
Docking of the intercalators into the intercalation site of the DNA molecule was carried out using ‘FRED’ in OpenEye 4.2.1.0.32,33 The ‘Make Receptor’ utility32,33 was used to generate a docking grid box covering the full DNA structure including its intercalation site. The dimensions of the box are 41.8 Å × 27.9 Å × 19.5 Å, and its volume is 22
898 Å3. Chemgauss4 docking scores are reported, and the binding poses were analyzed using VIDA 4.4.0.434 and PyMOL.35
For the MD simulations, 18 different DNA-intercalator complexes were prepared using different programs of AMBERTools38 in the AMBER 22 software package.39 Using the ‘antechamber’ program,40 AM1-BCC atomic charges41 and atom types were assigned for all the intercalator structures. Further, the DNA and intercalator structures were parameterized using ‘tleap’.40 The amber leaprc.DNA.OL15 nucleic acid force field42 and the GAFF2 force field43 were used for the DNA and intercalators, respectively. The DNA-intercalator complexes were then neutralized by adding sodium counterions and lastly solvated by adding a box of TIP3P water molecules.
Previous studies on DNA-intercalator complexes have shown that average binding energies predicted from an ensemble of short simulations (25 replicas × 10 ns each) differ by no more than 1 kcal mol−1 from those obtained using an ensemble of longer simulations (25 replicas × 100 ns each), with deviations from experimental values being within 1 kcal mol−1.20 Accordingly, in this study, binding energies were estimated using ensemble simulations of 25 replicas, 10 ns each, i.e. the total cumulative time, per complex, is 250 ns. Each replica starts with a random starting velocity.
A series of six energy minimization cycles was performed for each of the systems. Initially, a restraint force constant of 500 kcal mol−1 was applied, and the restraint force constants were gradually reduced by half in each minimization cycle. In the final cycle, the full system undergoes energy minimization without any restraints, allowing the solute and solvent molecules to move freely. Further, the DNA-intercalator systems were gradually heated from 0 K to 300 K for 100 ps. The temperature was controlled by the NVT ensemble. After heating, the DNA-intercalator systems underwent a series of equilibration simulations for a total of 2 ns. During the initial cycles, the heavy atoms in the systems were restrained, and the restraint forces were gradually reduced by half in each cycle. The systems were then equilibrated without any restraint forces for 1.2 ns. Finally, a production run was performed for each of the equilibrated DNA-intercalator systems for 10 ns at a constant pressure and temperature (NPT ensemble). The trajectory data was collected every 20 ps during the production run, for a total of 500 frames. Additionally, to estimate the deformation energy, an ensemble MD simulation was performed on the DNA structure after removing the bound intercalator, using the same protocol.
System stability during the MD simulations was assessed by analyzing, using the ‘cpptraj’ module of AMBERTools,44 the RMSD of the heavy atoms in DNA with respect to the starting geometry for each of the replicas.
| ΔGMM/PBSA = GDNA‐ligand − (GDNA + Gligand), | (1) |
| G = Egas + Gsolvation, | (2) |
For an absolute binding energy prediction, entropy and deformation energy correction terms were added to the MM/PBSA energies of the DNA-intercalator complexes. Entropy (TΔS) was computed using the normal mode program of MMPBSA.py. Given that the normal mode analysis is computationally expensive and it produces unreliable results if a small number of frames are used,8 20 equally placed frames were selected from each replica. The deformation energy is the difference in energy between the DNA in its bound form (DNA-intercalator complex) and its unbound (ligand-free DNA) form. The corrected binding energy, ΔGcorrected, is given by:
| ΔGcorrected = ΔGMM/PBSA − TΔS + deformation energy | (3) |
000 bootstrap samples were generated by randomly selecting ‘n’ binding energy values, with replacement, from the original dataset. The mean binding energies and standard deviations were calculated for each of these bootstrap samples. The estimated standard deviations were plotted as a function of the number of replicas to identify the minimum number needed to reach a standard deviation of maximum 1 kcal mol−1.
The intercalators used in this study vary in the size of the intercalator scaffold, charges, and degrees of freedom. This provides distinct modes of π–π stacking, electrostatic interactions, and DNA structural change upon intercalation. Docking of these intercalators was performed at the intercalation site, within two neighboring GC base pairs. Fig. 2 shows the superimposed binding poses of all intercalators at the intercalation site, along with the docking scores of all the DNA-intercalator complexes.
![]() | ||
| Fig. 2 Superimposed top-ranked docked poses of 16 intercalators within the DNA binding site, along with their docking scores. | ||
As shown in Fig. 2, all 16 intercalators are bound into the same intercalation site. Positively charged intercalators with large and planar polycyclic aromatic rings had high docking scores. For example, coralyne1+ had the highest docking score of −13.2, followed by doxorubicin1+ (−12.7). Two out of three neutral ligands, thalidomide (−8.3) and NB506 (−7.5), had the least favorable scores. The docking scores were plotted against their corresponding experimental binding energies (listed in Table 1), as depicted in Fig. 3.
![]() | ||
| Fig. 3 Docking scores of 16 DNA-intercalator complexes versus their corresponding experimental binding energies, ΔGExp. | ||
| Intercalator | ΔGMM/PBSA (kcal mol−1) | Entropy (kcal mol−1) | Deformation energy (kcal mol−1) | ΔGcorrected (kcal mol−1) | ΔGExp (kcal mol−1) | Ref. of ΔGExp (kcal mol−1) |
|---|---|---|---|---|---|---|
| a The energies are taken from ref. 20. | ||||||
| Dicationic intercalators | ||||||
| Quinacrine2+ | −51.7 ± 3.8 | −21.6 ± 1.7 | 19.7 ± 3.6 | −10.4 ± 3.7 | −9.1 | 46 |
| Mitoxantrone2+ | −57.4 ± 6.7 | −25.2 ± 2.8 | 20.8 ± 5.1 | −11.6 ± 3.6 | −8.4 | 47 |
![]() |
||||||
| Monocationic intercalators | ||||||
| Coralyne1+ | −42.3 ± 0.9 | −16.3 ± 0.7 | 15.2 ± 4.3 | −10.9 ± 4.7 | −10.6 | 48 |
| Sanguinarine1+ | −39.9 ± 1.3 | −15.7 ± 1.2 | 15.9 ± 2.2 | −8.9 ± 2.2 | −9.5 | 49 |
| Chelerythrine1+ | −40.1 ± 0.9 | −16.0 ± 0.6 | 13.0 ± 2.7 | −10.4 ± 2.9 | −8.6 | 50 |
| Idarubicin1+ | −42.7 ± 4.9 | −21.2 ± 0.8 | 14.7 ± 2.1 | −6.8 ± 4.4 | −7.9 | 51 |
| Doxorubicin1+ a | −50.5 ± 2.6 | −23.5 ± 1.8 | 18.0 ± 1.4 | −8.9 ± 1.6 | −9.9 | 15 |
| Ellipticine1+ | −39.9 ± 1.3 | −15.4 ± 0.5 | 15.9 ± 2.2 | −8.6 ± 2.5 | −7.5 | 52 |
| Thionine1+ | −37.2 ± 2.4 | −15.7 ± 0.9 | 14.9 ± 2.2 | −6.6 ± 3.0 | −6.8 | 53 |
| Ethidium1+ | −37.9 ± 0.7 | −16.9 ± 0.6 | 14.5 ± 2.3 | −6.6 ± 2.2 | −6.7 | 14 |
| Berberine1+ | −37.2 ± 0.8 | −15.7 ± 0.6 | 15.5 ± 3.0 | −6.1 ± 3.0 | −6.6 | 54 |
| Acridine1+ | −31.3 ± 0.7 | −12.4 ± 0.5 | 12.9 ± 2.1 | −6.1 ± 2.1 | −6.4 | 55 |
| Proflavine1+a | −33.5 ± 2.4 | −15.9 ± 1.1 | 12.0 ± 2.3 | −5.6 ± 1.4 | −5.9 | 56 |
![]() |
||||||
| Neutral intercalators | ||||||
| Adriamycinone | −22.9 ± 1.4 | −16.6 ± 0.9 | 15.9 ± 2.8 | 9.6 ± 3.6 | −6.3 | 15 |
| NB506 | −28.8 ± 5.5 | −20.8 ± 1.6 | 22.2 ± 4.3 | 14.2 ± 6.6 | −6.7 | 57 |
| Thalidomide | −21.6 ± 2.6 | −14.4 ± 1.0 | 12.6 ± 4.0 | 5.4 ± 3.6 | −3.3 | 58 |
Docking scores showed a weak correlation (R2 = 0.38) with experimental binding energies (see Fig. 3). This result is not unexpected given the approximate nature of docking scores. The largest deviations from the trend line are observed for the neutral intercalators, adriamycinone, thalidomide, and NB506. Adriamycinone had a good docking score of −12.0 but had a relatively weaker experimental binding energy of −6.3 kcal mol−1. While docking yielded reasonable binding poses and relative fits, they did not account for DNA conformational flexibility. Therefore, in this study, the docking poses are used as starting structures for subsequent ensemble MD simulations, which allow the complexes to relax in explicit solvent and provide the basis for the MM/PBSA binding energy calculations.
In general, the ΔGMM/PBSA increases in magnitude from neutral to monocationic to dicationic intercalators. Consistent with previous findings,20 despite using ensemble MD simulations, none of the DNA-intercalator systems reproduced the experimental binding energies when the correction terms, for entropy and deformation, were excluded. For the DNA-intercalator complexes studied, the predicted ensemble average MM/PBSA binding energies are in the range of −57.4 ± 6.7 kcal mol−1 (mitoxantrone2+) to −21.6 ± 2.6 kcal mol−1 (thalidomide), while the experimental values are in the range of −10.6 kcal mol−1 (coralyne1+) to −3.3 kcal mol−1 (thalidomide). The uncorrected MM/PBSA binding energies are overestimated as they solely account for the highly favorable enthalpic terms driven by the π-π stacking of aromatic rings, van der Waals interactions, and favorable electrostatics between the ligand (most of which are positively charged) and the anionic DNA backbone. Experimental thermodynamic studies have also shown that the DNA intercalation process is characterized by a large favorable binding enthalpy, counterbalanced by a large unfavorable entropy.14–17 It is further observed that the uncorrected MM/PBSA energies have high standard deviations (see Table 1), which reflect high variations in energies among replicas. For example, the percent standard deviations in the binding energies of DNA complexed with NB506, thalidomide, mitoxantrone2+, idarubicin1+ are 19%, 12%, 12%, and 11%, respectively. For the remaining intercalators, this percentage is between 2% and 7%. Overall, the predicted ensemble-averaged MM/PBSA binding energies failed to reproduce the experimental ranking of the compounds (R2 = 0.45, see Fig. 4).
![]() | ||
| Fig. 4 The uncorrected MM/PBSA (ΔGMM/PBSA) energies, with the standard deviation bars, of 16 intercalators bound to DNA versus their corresponding experimental binding energies (ΔGExp). | ||
In terms of entropy, the ligand may need to adopt a less favorable conformation to fit in the intercalation site. The entropy term accounts for the loss of degrees of freedom by the ligand and the receptor upon binding. Table 1 lists the estimated entropies for the 16 DNA-intercalator systems. They are largely independent of the intercalator charge. Compared to the uncorrected MM/PBSA energies and the deformation energies, the entropy across the replicas showed minimal fluctuations, as indicated by the smaller standard deviations (see Table 1). The largest ensemble average entropy (TΔS) was −25.2 ± 2.8 kcal mol−1 (for mitoxantrone2+), and the lowest was −12.4 ± 0.5 kcal mol−1 (for acridine1+). Large, flexible intercalators with many rotatable bonds, such as mitoxantrone2+, tend to incur substantial entropic penalties upon DNA binding due to the restriction of their conformational freedom. Mitoxantrone2+, which contains two long flexible side chains, had the highest entropic penalty. This was followed by penalties in the range of 21–24 kcal mol−1 for quinacrine2+, which contains a long flexible side chain, and for idarubicin1+, doxorubicin1+, and NB506, each of which contains a substituted heterocyclic ring capable of rotation. Ethidium1+ also contains a rotatable ring, but since it is a less flexible benzene ring, the entropic penalty is more moderate (16.9 ± 0.6 kcal mol−1). Whereas smaller and relatively rigid molecules, such as acridine, bind mainly via π–π stacking and have a smaller entropic penalty. The addition of entropy to the MM/PBSA binding energy further improves the predicted MM/PBSA energy.
Among the monocationic intercalators, the entropy of most of them was within a narrow range of −12 to −16 kcal mol−1, except for idarubicin1+ and doxorubicin1+, which had relatively large entropy values of −21.2 ± 0.8 kcal mol−1 and −23.5 ± 1.8 kcal mol−1, respectively. Similarly, the range of deformation energies was also −12 to −16 kcal mol−1, except for doxorubicin1+, which had a higher deformation energy of 18.0 ± 1.4 kcal mol−1. However, it is also noted that the uncorrected MM/PBSA (i.e. enthalpy) values of these two intercalators happen to be large enough to compensate for these exceptionally high energy correction terms, in such a way the corrected MM/PBSA binding energies of these two intercalators remain comparable to experimental values.
It is reported in the literature that MM/PBSA energies estimated from ensemble MD simulations are reasonable for reproducing the ranking of the compounds based on the binding energy, but not for predicting absolute binding energies.29 However, in the present study, after adding the energy correction terms, the method reproduced not only the ranking of intercalators but also the experimental binding energies. For example, after adding the entropy, TΔS (−15.7 ± 0.9 kcal mol−1) and the deformation (14.9 ± 2.2 kcal mol−1) energy terms, the corrected MM/PBSA energy (averaged over 25 replicas) of the DNA-thionine1+ complex was −6.6 ± 3.0 kcal mol−1. This value aligns well with the experimental binding energy of −6.8 kcal mol−1 (see Table 1). For mitoxantrone2+, the average corrected MM/PBSA energy (−11.6 kcal mol−1) shows an over-binding by ca. 3.0 kcal mol−1 compared to the experimental value (−8.4 kcal mol−1). However, this average is reported with a standard deviation of ±3.6 kcal mol−1, i.e. the alignment between the computed and the experimental values is still very good. Overall, the corrected MM/PBSA energies for all cationic intercalators, except for mitoxantrone2+, differed from the experimental values by no more than 2 kcal mol−1 (see Table 1). However, the neutral intercalators (adriamycinone, NB506, and thalidomide) were clear outliers, a trend that was already evident in the molecular docking results. These intercalators had repulsive values, with positive corrected MM/PBSA energies of 9.6 ± 3.6 kcal mol−1, 14.2 ± 6.6 kcal mol−1, and 5.4 ± 3.6 kcal mol−1, respectively (see Table 1). Among these, NB506 was the worst, with an average corrected MM/PBSA energy of 14.2 ± 6.6 kcal mol−1, which is much lower than the experimental value of −6.7 kcal mol−1. Even when considering the individual binding energies from independent simulation trajectories, none of the replicas reproduced the experimentally determined binding energy for any of these three intercalators. Unlike cationic intercalators, neutral intercalators lack charged functional groups and form fewer electrostatic interactions with DNA. Charged groups on the intercalators form specific contacts, such as hydrogen bonds and salt bridges with the neighboring bases,11,18,61 which also increases the enthalpic contribution. In fact, experimental studies have shown that removing a charged amino group from a classic intercalator, ethidium bromide, significantly reduced the binding affinity by at least 100-fold, and also flipped the binding energy from enthalpy-driven to entropy-driven.62 Furthermore, among the outliers, adriamycinone is also an aglycone form of doxorubicin. Therefore, it is possible that the removal of the sugar moiety not only reduces electrostatic interactions but can also alter the mode of binding. Experimental studies by Tartakoff et al. reported that adriamycinone, an aglycone of doxorubicin, does not bind to the DNA through intercalation despite retaining a planar aromatic ring structure.63 In the case of thalidomide, although experimentally measured binding energies have been reported for its binding as a possible DNA intercalator, certain other studies report that it binds potentially to DNA grooves.64 For NB506, this molecule has been reported to preferentially bind to triplex DNA among 12 different nucleic acid structures and sequences tested.65 In addition, it has been reported that the binding energy of NB506 to DNA is significantly less favorable than that of the classic intercalator, doxorubicin.57 Therefore, the disagreement between MM/PBSA-predicted and experimental binding energies for the three neutral ligands may be because of the incorrect assumptions about their binding mode in solution. Furthermore, the experimental binding energies of the intercalators compiled from the literature were obtained using different techniques, such as spectrophotometric titration,46–48,50,51,54,56,66,67 isothermal titration calorimetry,14,49,53 and equilibrium binding assays.52,55,57,58 Deviations by up to ca. 2.0 kcal mol−1 were observed in the experimental values of certain DNA-intercalator complexes.68,69 This is due to variations in the methods or experimental conditions such as DNA sequence, buffer composition, pH, and ionic strength. Such differences may also influence the observed correlations with computational predictions. Lastly, beyond these listed reasons, having more accurate predicted energies for the cationic intercalators than for neutral ones, may be attributed to the more consistent treatment of coulombic interactions relative to other energy terms in the MM/PBSA method (the energy decomposition analysis will be discussed at the end of this section).
Although adding entropy and deformation energy corrections improved the agreement between corrected MM/PBSA energies and experimental values, the overall correlation was reduced, with an R2 value of 0.32 (see Fig. 5) (compared to R2 of 0.45 in Fig. 4). This is even lower than the correlation obtained with docking scores. However, when the three outliers (NB506, thalidomide, and adriamycinone, all of which are neutral) were excluded from the analysis, the R2 value improved to 0.70. Corrected MM/PBSA binding energies (R2 = 0.70) outperformed both uncorrected MM/PBSA energies and docking scores in ranking cationic ligands (R2 = 0.34 and R2 = 0.30, respectively). Furthermore, since the experimental reference values were obtained from different methods, the correlation analysis was repeated with experimental binding energies measured using exclusively isothermal titration calorimetry. This further improved the correlation, (R2 = 0.85). To validate the predictive capability implied by this correlation (i.e. R2 = 0.70 when experimental methods are not filtered), binding energies were estimated for two additional charged DNA-intercalator complexes, DNA-9AA1+ and DNA-amonafide1+. The corrected MM/PBSA binding energies of these two complexes, obtained from ensemble MD simulations of 25 replicas, were −5.2 ± 2.6 kcal mol−1 and −6.5 ± 4.1 kcal mol−1, respectively. Using these values in the regression relationship obtained from the set of charged intercalators (y = 1.27x + 1.68, R2 = 0.70 from Fig. 5), the experimental binding energies are predicted to be −5.4 ± 2.0 kcal mol−1 and −6.4 ± 3.2 kcal mol−1 for DNA-9AA1+ and DNA-amonafide1+, respectively. These values are well aligned with the actual experimental binding energies, −6.6 kcal mol−1 for 9AA1+ and −6.9 kcal mol−1 for amonafide1+. This agreement indicates that the regression relationship can be used to predict the binding energies of additional intercalators. However, in this case, the corrected MM/PBSA energies alone perform equally well, as they are not only reliable for ranking but also reasonably accurate in absolute terms. Overall, these results demonstrate that corrected MM/PBSA energies from ensemble MD simulations reliably predict the binding energies of cationic intercalators. Such predictive capability is particularly valuable in cases where new intercalators are difficult or costly to synthesize, have limited stability, or require challenging or resource-intensive experimental assays to determine their binding affinities. In these situations, this established regression relationship enables the simulated MM/PBSA binding energies to be translated into estimated experimental values, thereby offering a practical and efficient alternative for preliminary screening and prioritization of candidate molecules.
Furthermore, MM/PBSA energy decomposition was performed to partition the total binding energy into individual interaction components. This analysis identifies the relative contributions of van der Waals (VDWAALS) and electrostatic (EEL) interactions, as well as polar (EPB) and non-polar (ENPOLAR) solvation terms (see Table 2). This analysis reveals differences in interaction patterns among ligands with different charges and chemical structures.
| Intercalator | VDWAALS (kcal mol−1) | EEL (kcal mol−1) | EPB (kcal mol−1) | ENPOLAR (kcal mol−1) |
|---|---|---|---|---|
| a The energies are taken from ref. 20. | ||||
| Quinacrine2+ | −56.8 ± 3.7 | −1034.1 ± 26.9 | 1042.1 ± 26.3 | −4.3 ± 0.2 |
| Mitoxantrone2+ | −60.2 ± 5.7 | −1054.1 ± 35.3 | 1061.5 ± 34.4 | −4.2 ± 0.2 |
| Coralyne1+ | −51.3 ± 1.7 | −509.9 ± 5.6 | 522.6 ± 7.5 | −3.6 ± 0.1 |
| Sanguinarine1+ | −47.2 ± 1.1 | −514.9 ± 4.5 | 525.3 ± 5.0 | −3.3 ± 0.1 |
| Chelerythrine1+ | −47.8 ± 0.9 | −510.5 ± 2.6 | 521.6 ± 3.0 | −3.4 ± 0.1 |
| Idarubicin1+ | −60.3 ± 4.1 | −525.6 ± 19.7 | 549.6 ± 16.9 | −4.1 ± 0.1 |
| Doxorubicin1+ a | −68.2 ± 2.9 | −554.3 ± 12.0 | 577.6 ± 11.6 | −4.8 ± 0.2 |
| Ellipticine1+ | −44.4 ± 0.4 | −511.2 ± 3.9 | 518.8 ± 2.6 | −2.9 ± 0.0 |
| Thionine1+ | −38.9 ± 0.3 | −510.1 ± 8.5 | 514.4 ± 6.3 | −2.7 ± 0.0 |
| Ethidium1+ | −42.3 ± 0.2 | −499.1 ± 1.5 | 506.5 ± 1.2 | −3.1 ± 0.1 |
| Berberine1+ | −45.3 ± 1.0 | −500.6 ± 2.5 | 511.9 ± 3.2 | −3.3 ± 0.1 |
| Acridine1+ | −34.4 ± 0.3 | −515.9 ± 4.0 | 521.6 ± 3.4 | −2.5 ± 0.0 |
| Proflavine1+ | −36.2 ± 0.8 | −512.9 ± 5.9 | 518.4 ± 4.0 | −2.7 ± 0.1 |
| Adriamycinone | −47.4 ± 1.4 | −5.2 ± 2.3 | 32.9 ± 2.2 | −3.3 ± 0.1 |
| NB506 | −55.7 ± 2.7 | −32.0 ± 10.8 | 63.2 ± 10.9 | −4.2 ± 0.2 |
| Thalidomide | −33.8 ± 2.5 | −14.3 ± 1.9 | 29.1 ± 1.8 | −2.6 ± 0.1 |
In all 16 DNA-intercalator complexes, VDWAALS is a consistently large and favorable (highly negative) contributor to the binding (see Table 2). The van der Waals term captures the π–π stacking interactions of the planar aromatic intercalators with DNA base pairs, which is a key feature of the intercalation mechanism.11,18,70 For example, the van der Waals term was reported to be the major stabilizing factor for the intercalation of doxorubicin1+ within DNA.18–20 This term correlates with the size and planarity of the intercalators.71 In other words, intercalators with large, planar, and rigid polyaromatic scaffolds exhibit more favorable van der Waals energies due to strong stacking interactions with neighboring DNA bases. For example, molecules with a large aromatic surface area, e.g. doxorubicin1+ (−68.2 ± 2.9 kcal mol−1), mitoxantrone2+ (−60.2 ± 5.7 kcal mol−1), and idarubicin1+ (−60.3 ± 4.1 kcal mol−1), had the largest VDWAALS contribution. Whereas, smaller or less planar intercalators, such as acridine1+ (−34.4 ± 0.3 kcal mol−1) and proflavine1+ (−36.2 ± 0.8 kcal mol−1), had smaller VDWAALS contribution. The flexibility or rigidity of the intercalator structure also influenced fluctuations in the VDWAALS term across replicas of each complex. Rigid and smaller intercalators, such as ethidium1+ and thionine1+, had relatively small standard deviations in the VDWAALS term (±0.2 and ± 0.3 kcal mol−1, respectively). On the other hand, flexible intercalators, such as mitoxantrone2+ and idarubicin1+, which have multiple rotatable bonds, had larger fluctuations in VDWAALS term (ca. ±4 and ±6 kcal mol−1, see Table 2).
The EEL and EPB are strongly dependent on the charge of the ligand. Compared to the neutral intercalators, many of the cationic intercalators had large attractive coulombic interactions, particularly with the negatively charged phosphate backbone of DNA (see Table 2). Dicationic intercalators, such as mitoxantrone2+ and quinacrine2+, had the largest EEL (−1054.1 ± 35.3 kcal mol−1 and −1034.1 ± 26.9 kcal mol−1, respectively) and EPB (1061.5 ± 34.4 kcal mol−1 and 1042.1 ± 26.3, respectively) contributions. Monocationic intercalators had approximately half of these contributions. As expected, the EEL and EPB values for the neutral intercalators were relatively very low (below 32.0 ± 10.8 for EEL and 63.2 ± 10.9 kcal mol−1 for EPB). The electrostatic interaction energy in the gas phase (listed as EEL in Table 2) is largely counterbalanced by the polar solvation energy. Therefore, the net electrostatic contribution (difference between EEL and EPB) provides a relatively small contribution to the binding of intercalators to the DNA. This observation is consistent with findings from previous studies.18–20 It is worth noting that both EEL and EPB show large standard deviations, especially for dicationic intercalators. For example, mitoxantrone2+ has a standard deviation of ± 35.3 and ± 34.4 for EEL and EPB, respectively. Small fluctuations in either term can lead to large errors in their combined contribution. For example, if EEL is −500 kcal mol−1 and EPB is 500 kcal mol−1, their net contribution is close to zero. However, a small fluctuation of only 1 to 2% in either term (i.e. ±5 to ±10 kcal mol−1) can change the net value by several kcal mol−1. This relates to the poor reproducibility of MM/PBSA energy estimated from single trajectories. The ENPOLAR term accounts for the hydrophobic effects. All 16 intercalators showed a favorable ENPOLAR term, ranging from roughly −2.7 ± 0.0 to −4.8 ± 0.2 kcal mol−1 in the ensemble averages. Overall, the cationic intercalators had relatively low uncertainty in the electrostatic (<4%) and polar solvation (<3%) terms, while neutral intercalators had significantly higher uncertainties in the electrostatic terms (EEL and EPB), with fluctuations exceeding 30% of the predicted value for NB506.
![]() | ||
| Fig. 6 Standard deviations of the corrected MM/PBSA energies (in kcal mol−1) with respect to the number of replicas of 10 ns MD simulations for the cationic DNA-intercalator systems. | ||
Having a cutoff error of ca. 1 kcal mol−1 has been reported as the gold standard in advanced free energy calculations.4 From Fig. 6, targeting a 1 kcal mol−1 uncertainty appears to be a practical compromise between precision and computational cost. The bootstrap analyses in Fig. 6 show that sampling with an increased number of replicas decreased the standard deviation in the corrected MM/PBSA energies, until a plateau is reached. In other words, after a certain optimal number of replicas, the energy is effectively converged and running more replicas will have only negligible improvements in precision. The minimum number of replicas required to predict binding energies varies from system to another. Proflavine required only 2 replicas to fall below an accepted standard deviation threshold of 1 kcal mol−1, while, on the other extreme, 20 replicas were needed for the idarubicin1+ system to meet the same threshold. The DNA-intercalator systems that required more than ten replicas were coralyne1+, idarubicin1+, quinacrine2+, and mitoxantrone2+. The standard deviation of the corrected MM/PBSA energies for these systems was greater than 3.5 kcal mol–1. For sanguinarine1+, acridine1+, and ethidium1+, five replicas were required. Overall, most complexes with cationic intercalators (11 systems) had reached ca. 1 kcal mol−1 uncertainty using up to 10 independent replicas.
Supplementary information (SI): the RMSD plots for 25 independent MD simulation replicas of the DNA-thionine1+ complex. See DOI: https://doi.org/10.1039/d6ra00032k.
| This journal is © The Royal Society of Chemistry 2026 |