Tanay
Debnath
a and
G. Andrés
Cisneros
*ab
aDepartment of Physics, University of Texas at Dallas, Richardson, Texas 75080, Dallas, USA. E-mail: andres@utdallas.edu
bDepartment of Chemistry and Biochemistry, University of Texas at Dallas, Richardson, Texas 75080, Dallas, USA
First published on 8th February 2024
DNA polymerases are fundamental enzymes that play a crucial role in processing DNA with high fidelity and accuracy ensuring the faithful transmission of genetic information. The recognition of unnatural base pairs (UBPs) by polymerases, enabling their replication, represents a significant and groundbreaking discovery with profound implications for genetic expansion. Romesberg et al. examined the impact of DNA containing 2,6-dimethyl-2H-isoquiniline-1-thione: D5SIC (DS) and 2-methoxy-3-methylnaphthalene: DNAM (DN) UBPs bound to T. aquaticus DNA polymerase (Taq) through crystal structure analysis. Here, we have used polarizable and nonpolarizable classical molecular dynamics (MD) simulations to investigate the structural aspects and stability of Taq in complex with a DNA duplex including a DS–DN pair in the terminal 3′ and 5′ positions. Our results suggest that the flexibility of UBP-incorporated DNA in the terminal position is arrested by the polymerase, thus preventing fraying and mispairing. Our investigation also reveals that the UBP remains in an intercalated conformation inside the active site, exhibiting two distinct orientations in agreement with experimental findings. Our analysis pinpoints particular residues responsible for favorable interactions with the UBP, with some relying on van der Waals interactions while other on Coulombic forces.
Several potential UBPs have been synthesized in recent years30–42 among them d5SICS (DS)-dNaM (DN) have emerged as one of the proficiently replicated unnatural base pairs.30,31 Previous reported work revealed that DS–DN form unconventional structures inside a DNA duplex. Bertz et al. reported two crystal structures of a DNA polymerase in complex with DNA containing a DS–DN incorporation.30,31 The first report involves a crystal structure of DNA polymerase I from T. aquaticus (Taq) in a closed ternary complex with DN in the templating position, and DS- triphosphate (DSTP) as substrate (PDB ID: 3SV3).30 In this structure, DN and DSTP exhibit Watson–Crick–Franklin (WCF) like parallelly stacked geometry.
Later they also reported the binary complex of Taq with the artificial base pair DS–DN in the post-insertion site. Crystal structures showed that in the binary complex of the polymerase, DS–DN forms two distinct intercalated non-Watson–Crick–Franklin (nWCF) structures in the terminal position of the DNA duplex. In one structure (PDB ID: 4C8L),31 the primer-DS is located externally with respect to the substrate-DS; whereas in another (PDB ID: 4C8O)31 the positioning becomes opposite.
Different computational techniques have been employed to predict the impact of DS–DN inside DNA in aqueous solvent, which predict both WCF and nWCF structures of the UBP inside DNA without polymerase systems.51–58 Our another theoretical work51 suggests that the inclusion of a DS–DN pair at the terminal position of DNA increases the flexibility of these bases, with occasional mispairing and fraying.
Romesberg et al. further reported individual residue contacts between Taq and the UBP-incorporated DNA based on the X-ray crystal structures.31 Although this investigation provided valuable insights into possible molecular interactions, a comprehensive interplay between Taq and UBP-incorporated-DNA including their dynamic characteristics, remains a subject yet to be thoroughly investigated. The full scope of how a polymerase orchestrates the interactions, and the dynamic nature of these connections remains an unsolved avenue for further exploration and scientific inquiry.
Here, we have employed classical molecular dynamics simulation with both AMOEBA and AMBER force fields to investigate the dynamical characteristics of UBP-incorporated DNA in binary complex with Taq and the molecular level interactions of UBP incorporated DNA with Taq. The reminder of the paper is organized as follows: The next section describes the setup of the simulation systems, simulation procedures, and analyses. Subsequently, analysis of the MD simulations is presented and discussed, followed by concluding remarks. This work is intended to provide novel insights on the DNA polymerase-bound behaviour of DNA that incorporates an experimentally synthesized and tested UBP that can be replicated, translated and transcribed in vitro. Our results uncover the specific drivers for the stabilization and selectivity of specific pairing of the UBP in the active site, which can help drive the field of synthetic biology forward.
500 frames have been extracted from the entire trajectory of each replicate for the analysis. Energy decomposition analysis (EDA) has been employed to investigate the intermolecular interactions between the UBP and residues of the rest of the systems for the entire trajectory using 12
500 frames for two replicates. As both the replicates show similar outcomes, the rest of the analysis has been done for one replicate. Amber-EDA was employed to calculate the nonbonded intermolecular interaction energies.79
000 AMOEBA water molecules and neutralized using Na+ in the center of the box with volume 90 × 90 × 90 Å3 using packmol. After that, the system was heated to 300 K in 4 simulation steps (2 ns each) with an NVT ensemble removing all positional restrains (100.0–0.0 kcal Å−1). After the equilibration step, MD simulations were carried out for 50 ns in an NPT ensemble (1 atm and 300 K). The Monte Carlo barostat and the Bussi thermostat were used to keep the pressure and temperature fixed respectively. The duration of the time step was 2 fs using RESPA integrator. The smooth particle mesh Ewald (sPME) method81 was used in the calculation of charge, atomic multipole, and polarization interactions. A value of 10 Å was used for the cutoff distance value for van der Waals potential energy interactions and the real-space distance cutoff in the Ewald summation.82 Geometry sampling was done every 5 ps, which led to generate total 10
000 frames.
| RMSD/STDV | |||
|---|---|---|---|
| Rep1 | Rep2 | Rep3 | |
| AMOEBA | |||
| EXTSYN | 2.71/0.37 | 1.56/0.23 | |
| INTSYN | 2.17/0.29 | 2.23/0.24 | |
| AMBER | |||
| EXTSYN | 2.11/0.26 | 2.40/0.28 | 2.21/0.30 |
| INTSYN | 2.58/0.30 | 2.40/0.31 | 2.58/0.41 |
| INTANTI | 2.65/0.40 | 2.29/0.23 | 2.72/0.46 |
| EXTANTI | 2.87/0.40 | 2,93/0.39 | 2.79/0.45 |
The distance between DS and DN (dDS–DN) (Table 2 and Fig. S3, ESI†) based on both AMOEBA and AMBER simulation, indicating a reduced distance compared to that of a nBP of ∼10.5 Å. We have also measured the distance (Fig. 2) and angles (Fig. S4, ESI†) of UBs with respect to their adjacent nBP. It has been found that throughout the simulation the gap between dDS–DC and dDN–DG is always positive indicating the asymmetric nature of the DNA strand with external intercalation of DS and DN (Fig. 2). From angle analysis the asymmetric nature of DNA is also observed since <DS–P–DC is found to be greater than <DN–P–DG throughout entire trajectory of the simulation. In the crystal structure a similar trend for the UB–NB and <UB–P–NB have been observed.
| Distance/STDV | |||
|---|---|---|---|
| Rep1 | Rep2 | Rep3 | |
| AMOEBA | |||
| EXTSYN | 8.08/0.45 | 8.10/0.38 | |
| INTSYN | 9.10/0.32 | 7.52/0.62 | |
| AMBER | |||
| EXTSYN | 8.60/0.77 | 8.86/0.87 | 7.89/1.13 |
| INTSYN | 9.63/0.44 | 9.67/0.47 | 9.71/0.38 |
| INTANTI | 9.06/0.73 | 9.07/0.76 | 9.19/0.86 |
Similar to AMBER, simulations with the AMOEBA force field for 50 ns for EXTSYN also result in an intercalated DS–DN structure with no conformational change and no significant distortion inside the polymerase. Here also, the DS–DN distance is found to be shorter than that of nBPs. AMOEBA also predicts greater dDS–DC and <DS–P–DC than dDN–DG and <DN–P–DG (Fig. 2 and Fig. S4, ESI†) and throughout the simulations supporting an external intercalation of the UBP leading to the maintenance of an asymmetric DNA strand structure. From the AMBER and AMOEBA simulations, it becomes apparent that both yield mostly similar results in terms of RMSD values and related geometric parameters. Furthermore, the simulations show minimal deviations from the actual crystal structures, suggesting that the molecular dynamics (MD) simulations align closely with the experimental observation of binary Taq with UBP-incorporated DNA. Interestingly, with no-overhang DNA, the simulation with AMBER also predicts similar stability and structural features indicating an overhanging base doesn’t significantly impact UBP stability when the dsDNA is in complex with Taq. The associated RMSD, RMSF and geometrical parameters are presented in Fig. S6 (ESI†).
The INT structure has been obtained through inter-strand flipping of the UBP in the EXT conformer (Fig. 4). For the INTSYN conformer, the intercalation of DS–DN was also maintained during the entire simulation with both force fields, with low RMSD value (average RMSD for each replicate <2.6 Å for AMBER) (Table 1 and Fig. S2, ESI†). RMSF values obtained from the AMBER simulated system indicates that the fluctuation of all the residues including UBP are low. Similar to EXTSYN, the INTSYN system exhibits no conformational change and no significant distortion, suggesting that in this case the flexibility of the UBP has also been prevented by Taq. It is also evident (Fig. 3) that along with most of the protein–DNA residues, DS and DN also display low RMSF values confirming the relatively low fluctuating nature of the UBP.
![]() | ||
| Fig. 4 Conformational change through Intra- and Inter-strand flipping observed during the MD simulations. (Figure taken from preprint with permission51). | ||
The pattern of fluctuations of the polymerase residues as observed from the RMSF analysis is similar to EXTSYN, suggesting that a conformational change of the UBP doesn’t influence the characteristics of the system. In this case also the calculated dDS–DN distance, derived from both AMOEBA and AMBER force fields, is found to be lower than that of nBPs. Asymmetricity of the DNA duplex is reflected in the associated uneven dUB–NB (Fig. 2) and <UB–P–NB (Fig. S4, ESI†) obtained from both AMOEBA and AMBER simulations. It is observed that here dDS–DC and <DS–P–DC are found to be lower than dDN–DG and <DN–P–DG respectively, which is in agreement with the crystal structure. Comparing these geometrical parameters between EXTSYN and INTSYN, contrasting trajectories have been observed from which two conformers can be distinguished. Here also there is no significant deviations between the AMBER and AMOEBA results as both predict similar structural behaviour. We have further simulated no-overhang INTSYN system with the AMBER force field from which it is evident that in absence of hanging base pair also UBP can stabilize an intercalated structure with low RMSD values and similar RMSF and geometrical parameters (Fig. S7, ESI†). It also suggests that hanging base pair has no significant influence on the formation of nWCF UBP-incorporated DNA strand in Taq.
Despite the conformational dissimilarities between EXTSYN and INTSYN, they exhibit mostly similar RMSF pattern (ΔRMSF∼0). (Fig. S5, ESI†) This is further supported by the cross-correlation analyses for both systems (Fig. S8, ESI†), which demonstrates similar correlation patterns among the residues for both EXTSYN and INTSYN. Normal mode analysis also indicates that exclusive dominance of the 1st normal mode has been observed with similar vibration pattern (ESI†).
To investigate the effect of conformational change on Taq, we have generated the EXTANTI system via an intra-strand flipping of the DS in the EXTSYN structure where the sulfur of DS and methoxy group of DN are on opposite sides. As discussed in our previous article,51 the UBP can stay in both SYN and ANTI forms within DNA in aqueous solution with spontaneous intra-strand flipping. As depicted in Fig. 5, O–S distance for INTSYN is lower than INTANTI. But simulations of UBP-intercalated DNA bound to Taq suggests that the EXTANTI conformer of UBP inside the DNA duplex is disfavored, leading to a non-pairing between DS and DN inside the DNA duplex (Fig. 1). From all three replicates it has been found that DS–DN fails to generate an intercalated structure, and rather DS is flanked out from the DNA duplex. Comparatively higher RMSD value (Table 1 and Fig. S2, ESI†) suggests that EXTANTI induces larger structural changes compared with the SYN conformation. Notably, some specific polymerase residues mainly from the finger region of the polymerase exhibit higher RMSF (ΔRMSF(SYN-ANTI) > 0) values (Fig. S5, ESI†). From cross-correlation analysis also, significant deviation of movement correlation among the polymerase residues has been observed between SYN and ANTI conformation (Fig. S8, ESI†).
![]() | ||
| Fig. 5 O–S distance for INTSYN and INTANTI conformers obtained from AMBER simulation. O–S distance of crystal structure of INTSYN is 7.4 Å. Distance value are in Å. | ||
INTSYN is further reoriented through intra-strand flipping the DS which leads to the production of INTANTI structure (Fig. 1). Despite the conformational change, this UBP conformation results in an intercalated structure in Taq with low RMSD value throughout the simulation (Fig. S2, ESI†). RMSF analysis suggests that apart from terminal residues, the fluctuation of most polymerase residues is below 4.5 Å, indicating low degree of flexibility. Here also it is observed that for ANTI conformer, polymerase residues from finger region fluctuate more compared to SYN indicating perturbation arises on conformational change on the finger region. Average DS–DN distance is found to be ∼9.1 Å, indicating minor reduction of the distance as obtained from SYN conformer (Table 2). The persistence of intercalated geometry of the UBP suggests that, unlike EXT, the INT conformer doesn’t result in mispairing through intra-strand flipping, indicating that both SYN and ANTI orientations are tolerated by Taq for INT UB conformers.
We have further performed EDA analysis to identify and quantify individual interactions between individual protein residues and the UBP (Fig. S9 and S10, ESI†). It has been observed that attractive Coulomb interactions (ECoul) arise mostly from positively charged residues. Arg and Lys, from all three polymerase subdomains (Fig. 6 and 7). For all the conformers, R587 and R746 exhibit the highest ECoul with the UBP through interactions with the phosphate backbone of 3′-DS and 5′-DN respectively. Interestingly, these two residues are identified in the interacting zone with the phosphate backbone of DS and DN from the crystal structure analysis of 4C8L (EXTSYN) but not for 4C8O (INTSYN) by Romesberg et al. R573 is also found to be strongly interacting with the UBs having higher negative ECoul for all the conformers but only reported for EXTSYN from crystal structure analysis. These findings lead to create a scope to extend the experimental investigation through mutagenesis techniques to identify the Taq residues that interact with UBP in different conformers. EDA also predicts that residues residing in the finger region of Taq interacting with the UBP through vdW interactions. Tyr671 is known as the steric gate, which is crucial for rNTP discrimination. Interestingly, in the case of UBPs, Tyr671 exhibits stabilizing vdW interactions with the exposed UBs (DS for EXTSYN and DN for INTSYN) which is also in agreement with the findings from crystal structure analysis. Nevertheless, a disparity between experimental observations and our EDA results becomes evident concerning Glu615. While crystal structure analysis propose that Glu615 forms attractive interactions with DS, our EDA findings indicate that while the van der Waals interaction is indeed attractive, the Coulombic interaction is significantly repulsive in nature and consequently the total non-covalent interaction energy between DS and Glu615 becomes positive. This is consistent with the fact that Glu615, being a negatively charged residue in physiological pH exhibits repulsive Coulombic interactions with the phosphate backbone of the UBP.
Comparing the EDA results of Taq-UBP-incorporated DNA with the 3.2 Å resolution mapped structures of Taq-nB-DNA83 it is observed that interaction profile (EvdW) looks mostly similar for both the cases. A notable distinction observed is that, in the case of nBP, Tyr671 and Phe667 form interactions with the 5'- and 3'-BP positions, respectively. In contrast, when the UBP is incorporated into DNA, significant interactions primarily occur between the exposed UBs (3'-DS for EXTSYN and 5'-DN INTSYN) and Tyr671. Additionally, in the INTSYN scenario, instead of the Phe667-5'-BP interaction, we observed EvdW interaction between Phe667 and the 3'-DS.
We have further calculated the total inter-molecular interaction (interaction between UBP and sum of all the Taq residues) differences (ΔETot) between different conformers. The difference in total interaction energy ΔETot between EXTSYN and INTSYN is found to be insignificant (−3.5 kcal mol−1) suggesting the similar stability of the UBP inside Taq for both of these conformers. However, EXTANTI displays a significant increase of ΔETot as compared to EXTSYN (ΔETot = 70.4 kcal mol−1) implying intra-strand flipping is not favored for EXT conformer, which is consistent with this conformation not being structurally stabilized. For INT, ΔETot between SYN and ANTI is found to be −12.4 kcal mol−1 indicating despite energetic difference INTANTI is also significantly stabilized inside Taq.
Mutation of R660S of the homologous E. coli DNA polymerase I (Klenow fragment, KF) conducted by Yosida et al. revealed a reduction in transitions from T to C.84 Thompson et al. reported changes in fidelity by mutating 26 amino acids in the KF, and found that DNA mismatches are recognized by two important factors; free energy difference for the partitioning of the DNA primer terminus (i) between the polymerase and exonuclease sites for several mispairs, (ii) between the residues near the active site and the mismatched pairs.46 They concluded that residues N845 and R668 are required for recognition of correct mispairs. Singh and Modak also reported that residues N845, Q849, R668, H881, and Q677 are part of a hydrogen bond track. Computational studies have also been done to investigate the effect of mutation of DNA polymerase I towards mispairing of the base pairs in the terminal positions.49,85
Collectively, previous studies confirmed that mutation of KF residues R668, R682, E710 and N845 (R573, R587, E615 and Asn750 for Taq) are the key residues controlling the fidelity within the KF84–87 whereas E742 and A743 are found to be crucial for the elongation activity.87 Our studies also identified similar residues with significant interactions (E) with UBP particularly for EXTSYN, INTSYN and INTANTI. Interestingly these residues also exhibit significant difference in interactions (ΔE) on conformational change. It has been observed that Coulomb interaction profile (ECoul) almost looks similar apart from few residues (R573, R587, E615 and R746) displaying significant altered interactions (ΔECoul ≠ 0) from EXTSYN to INTSYN. (Fig. 8) Similar vdW interaction profile (EvdW) has also been observed for these two conformers with few residues exhibiting altered vdW interactions (ΔEvdW ≠ 0) on finger region. Intra-strand flipping between SYN and ANTI also leads to minimal perturbation of Coulomb and vdW interaction between UBP and Taq (Fig. 9). The difference in inter-molecular interactions between the different UBP orientations results in almost all residues exhibiting similar (de)stabilizing roles, albeit there are some exceptions, among them R573, R587, E615, E742 and R746 are significant. However, the difference of interaction profile is prominent between SYN and ANTI for EXT with several residues showing altered interactions indicating the EXTANTI is not stabilized by Taq, leading to non-pairing of the UBP. The extent of non-covalent interactions between DS and DN is also obtained from EDA. Interaction between DS and DN is found to be highest for EXTSYN (EvdW = −10.4 kcal mol−1) followed by INTSYN (EvdW = −9.9 kcal mol−1) and INTANTI (EvdW = −9.2 kcal mol−1) and least for EXTANTI (EvdW = −3.5 kcal mol−1) indicating apart from EXTANTI intercalation persist for all three conformers. During the examination of adjacent nBP interactions, EDA analysis indicates that the neighboring base pair exhibits relatively lower stability when contrasted with other base pairs, which suggest that the structure and stability of adjacent BP have been affected by the presence of the unconventional orientation of UBP.
![]() | ||
| Fig. 8 Difference in interaction profiles (ΔECoul and ΔEvdW) between EXTSYN and INTSYN. Energy values are in kcal mol−1. | ||
![]() | ||
| Fig. 9 Difference in interaction profiles (ΔECoul and ΔEvdW) between (A) EXTSYN and EXTANTI, (B) (A) INTSYN and INTANTI. Energy values are in kcal mol−1. | ||
Footnote |
| † Electronic supplementary information (ESI) available: Additional details of MD, EDA (PDF). Additional ESI for the initial coordinates and parameters for all of the studied systems (ESI-1.zip) (ZIP). NMA movies of all the conformers (NMA.zip) (ZIP). See DOI: https://doi.org/10.1039/d3cp05571j |
| This journal is © the Owner Societies 2024 |