Waro Nakanishi*,
Satoko Hayashi* and
Taro Nishide
Faculty of Systems Engineering, Wakayama University, 930 Sakaedani, Wakayama 640-8510, Japan. E-mail: nakanisi@sys.wakayama-u.ac.jp; hayashi3@sys.wakayama-u.ac.jp; Tel: +81 73 457 8252
First published on 1st July 2020
The intrinsic dynamic and static nature of each HB in the multi-HBs between nucleobase pairs (Nu–Nu′) is elucidated with QTAIM dual functional analysis (QTAIM-DFA). Perturbed structures generated using coordinates derived from the compliance constants (Cii) are employed for QTAIM-DFA. The method is called CIV. Two, three, or four HBs are detected for Nu–Nu′. Each HB in Nu–Nu′ is predicted to have the nature of CT-TBP (trigonal bipyramidal adduct formation through charge transfer (CT)), CT-MC (molecular complex formation through CT), or t-HBwc (typical HB with covalency), while the vdW nature is predicted for the C–H⋯X interactions, for example. Energies for the formation of the pairs (ΔE) are linearly correlated with the total values of Cii−1 in Nu–Nu′. The total Cii−1 values are obtained by summing each Cii−1 value, similarly to the case of Ohm's law for the parallel connection in the electric resistance. The total ΔE value for a nucleobase pair could be fractionalized to each HB, based on each Cii−1 value. The perturbed structures generated with CIV are very close to those generated with the partial optimization method, when the changes in the interaction distances are very small. The results provide useful insights for better understanding DNA processes, although they are highly enzymatic.
Fig. 1 Molecular graphs for the nucleobases adenine (A), guanine (G), cytosine (C), thymine (T) and uracil (U), optimized with MP2/BSS-B′a (see Table 1 for BSS-B′a). |
BSS | H, C, N, O | BSS | H, C, N, O |
---|---|---|---|
a The 6-311+G(3d) basis set being employed for C.b The 6-311+G(d) basis set being employed for C. | |||
BSS-A | 6-311++G(3df,3pd) | BSS-A′ | 6-311+G(3df,3pd) |
BSS-B′a | 6-311+G(3df,3pd)a | BSS-B′b | 6-311+G(3df,3pd)b |
BSS-C | 6-311++G(3df,3p) | BSS-C′ | 6-311+G(3df,3p) |
BSS-D | 6-311++G(3d,3p) | BSS-D′ | 6-311+G(3d,3p) |
The QTAIM approach, introduced by Bader,16,17 enables us to analyze the nature of chemical bonds and interactions.18–29 The bond critical point (BCP, *16,17,30) is an important concept in QTAIM corresponding to the point where ρ(r) (charge density) reaches a minimum along the interatomic (bond) path, while it is a maximum on the interatomic surface separating the atomic basins. ρ(r) at the BCP is denoted by ρb(rc) in this paper, as are other QTAIM functions, such as the total electron energy density Hb(rc), potential energy density Vb(rc) and kinetic energy density Gb(rc) at the BCP. A chemical bond or an interaction between A and B is denoted by A–B, which corresponds to the bond path (BP) in QTAIM. We use A–*–B for the BP, where the asterisk emphasizes the existence of a BCP in A–B.16,17,30 Eqn (1), (2) and (2′) represent the relations among Gb(rc), Vb(rc), Hb(rc), and ∇2ρb(rc).
Hb(rc) = Gb(rc) + Vb(rc) | (1) |
(ℏ2/8m)∇2ρb(rc) = Hb(rc) − Vb(rc)/2 | (2) |
(ℏ2/8m)∇2ρb(rc) = Gb(rc) + Vb(rc)/2 | (2′) |
Interactions are classified by the signs of ∇2ρb(rc) and Hb(rc). Hb(rc) must be negative when ∇2ρb(rc) < 0, as confirmed by eqn (2), since Vb(rc) < 0 at all BCPs. Interactions are called shared shell (SS) interactions when ∇2ρb(rc) < 0 and closed-shell (CS) interactions when ∇2ρb(rc) > 0.16 In particular, CS interactions are called pure CS (p-CS) interactions when Hb(rc) > 0 and ∇2ρb(rc) > 0. We call interactions where Hb(rc) < 0 and ∇2ρb(rc) > 0 regular CS (r-CS) interactions, which clearly distinguishes these interactions from the p-CS interactions. The signs of ∇2ρb(rc) can be replaced by those of Hb(rc) − Vb(rc)/2 because (ℏ2/8m)∇2ρb(rc) = Hb(rc) − Vb(rc)/2 (see eqn (2)). Again, the details are explained later.
Experimental chemists have recently used QTAIM to explain their results by considering chemical bonds and interactions. Indeed, Hb(rc) − Vb(rc)/2 = 0 corresponds to the borderline between the classic covalent bonds of SS and the noncovalent interactions of CS, but Hb(rc) = 0 appears to be buried in the noncovalent interactions of CS. As a result, it is difficult to characterize the CS interactions of van der Waals (vdW) interactions, typical hydrogen bonds (t-HBs), interactions in molecular complexes formed through charge transfer (CT-MCs), trihalide ions (X3−), and interactions in trigonal bipyramidal adducts formed through CT (CT-TBPs), based on the signs of Hb(rc) − Vb(rc)/2 and/or Hb(rc). How can such CS interactions be classified and characterized effectively? It is essential for experimental chemists.
We proposed QTAIM dual functional analysis (QTAIM-DFA), based on the QTAIM approach, to classify and characterize the various CS interactions more effectively.31–36 QTAIM-DFA is very useful for experimental chemists to analyse their own chemical bond and interaction results based on their own expectations. In QTAIM-DFA, Hb(rc) are plotted versus Hb(rc) − Vb(rc)/2 at BCPs, which incorporates the classification of interactions by the signs of ∇2ρb(rc) [=(8m/ℏ2)(Hb(rc) − Vb(rc)/2)] and Hb(rc) (see, eqn (2)). In this treatment, both axes of the plot are given by the common unit of energy. As a result, four-function calculations can be applied to analyze the plot, which leads to the analysis of the interactions in a unified form.
The signs of the first derivatives of Hb(rc) − Vb(rc)/2 and Hb(rc) (d(Hb(rc) − Vb(rc)/2)/dr and dHb(rc)/dr, respectively, where r is the HB distance) are used to characterize CS interactions in QTAIM-DFA, in addition to those of Hb(rc) − Vb(rc)/2 and Hb(rc). In our treatment, data from the perturbed structures around the fully optimized structures are employed, in addition to those from the fully optimized structures. Data from the fully optimized structure are analyzed using polar coordinate (R, θ) representation, which corresponds to the static nature of interactions.32–37 Each interaction plot, containing data from both the perturbed and fully optimized structures, includes a specific curve that provides important information about the interaction. This plot is expressed by (θp, κp), where θp corresponds to the tangent line of the plot and κp is the curvature. θ and θp are measured from the y-axis and the y-direction, respectively. The concept of the dynamic nature of interactions has been proposed based on (θp, κp).32–37 We call (R, θ) and (θp, κp) QTAIM-DFA parameters (see Fig. 4 for the definition, as illustratively exemplified by NH–*–N of A–A).
How can the perturbed structures for effective analysis with QTAIM-DFA be generated? Accordingly, we very recently proposed a highly reliable method to the generate perturbed structures for QTAIM-DFA.38 The method, which is called CIV, employs the coordinates derived from the compliance constants Cii for internal vibrations.39,40 Eqn (3) defines Cij as the partial second derivative of the potential energy due to an external force, where i and j refer to internal coordinates and the external force components acting on the system fi and fj correspond to i and j, respectively.39 While the off-diagonal elements Cij (i ≠ j) in eqn (3) correspond to the compliance coupling constants, the diagonal elements Cii represent the compliance constants for an internal coordinate i.
Cij = ∂2E/∂fi∂fj | (3) |
The Cij value given in eqn (3) corresponds to a lower numerical value (i) of a compliance constant representing a stronger bond (j); that is, Cij measures the flexibility (or compliance) of a particular bond. The applications of CIV to CS interactions are substantially more effective than those to SS interactions in QTAIM-DFA.38 The Cii values and the coordinates corresponding to Cii (Ci) were calculated using the Compliance 3.0.2 program, released by Grunenberg and Brandhorst.41–43 The dynamic nature of interactions based on the perturbed structures with CIV is described as the “intrinsic dynamic nature of interactions” since the coordinates are invariant to the choice of coordinate system. The mechanism for the formation the Nu–Nu′ pairs will also be clarified in more detail based on the Cii parameters.
QTAIM-DFA is applied to standard interactions, and rough criteria that distinguish the interaction in question from others are obtained. QTAIM-DFA has excellent potential for evaluating, classifying, characterizing, and understanding weak to strong interactions according to a unified form.32–37 QTAIM-DFA and the criteria are explained in the Appendix of the ESI using Schemes SA1–SA3, Fig. SA1, SA2, Table SA1 and eqn (SA1)–(SA7).† The basic concept of the QTAIM approach is also explained.
Indeed, the understanding of HBs has been considerably growing recently,1–9,44,45 but evaluating, characterizing, and understanding the nature of each HB in multi-HBs, especially in nucleobase pairs, is inevitably needed to obtain a better understanding of DNA processes. How can the dynamic and static nature of each HB in the multi-HBs between Nu–Nu′, where the multi-HBs are formed in close proximity in space and interact mutually and strongly with each other, be clarified? Grunenberg and Brandhorst calculated the strength of each HB of the multi-HBs in the A–T and C–G pairs by applying the compliance constants.42,46 The elucidation of the intrinsic dynamic and static nature of each HB in multi-HBs, exemplified by the acetic acid dimer and derivatives, was attempted by employing the perturbed structures generated with CIV to examine the effective applicability of QTAIM-DFA to the system.47
We consider QTAIM-DFA to be well suited to elucidate the nature of each HB in the multi-HBs between Nu–Nu′ by employing the perturbed structures generated with CIV, with above discussion in mind. The method enables us to classify and characterize the nature of the interaction and the results will be very useful when experimental chemists analyze their own chemical bond and interaction results based on their own expectations. This is another purpose of this work. Weak interactions in Nu–Nu′ may sometimes be called HBs in this paper, even if they should be assigned to other categories of interactions. Herein, we present the results of investigations on the intrinsic dynamic and static nature of each HB in the multi-HBs between nucleobase pairs. Each HB interaction in Nu–Nu′ can be classified and characterized effectively with QTAIM-DFA, employing the perturbed structures generated with CIV. The criteria are employed in this process as reference. The behavior of the pairs is also discussed based on the nature.
Eqn (4) explains the method used to generate the perturbed structures with CIV. The i-th perturbed structure in question (Siw) is generated by the addition of the coordinates Ci, derived from Cii, to the standard orientation of a fully optimized structure (So) in the matrix representation. The coefficient giw in eqn (4) controls the structural difference between Siw and So: giw is determined to satisfy eqn (5) for r, where r and ro stand for the HB distances in the perturbed and fully optimized structures, respectively, and ao is the Bohr radius (0.52918 Å). The Ci values of five digits are used to predict Siw.
The perturbed structures were also generated by the partial optimization method (POM)31,33 of the Z-matrix and/or ModRedundant types,34 where the HB distances in question (r) in the perturbed structures were fixed to satisfy eqn (5).
Siw = So + giw·Ci | (4) |
r = ro + wao (w = (0), ±0.025 and ±0.05; ao = 0.52918 Å) | (5) |
y = co + c1x + c2x2 + c3x3 | (6) |
QTAIM functions were calculated using the same method as in the optimizations, unless otherwise noted, and were analyzed with the AIM200052 and AIMAll53 programs. Hb(rc) is plotted versus Hb(rc) − Vb(rc)/2 for five data points of w = 0, ±0.025 and ±0.05 in eqn (5). Each plot is analyzed using a regression curve of the cubic function, as shown in eqn (6), where (x, y) = (Hb(rc) − Vb(rc)/2, Hb(rc)) (Rc2 (square of the correlation coefficient) > 0.99999 is typical).37
AH–*–B in Nu–Nu′b (symmetry: no.c) | ρb(rc) (eao−3) | c∇2ρb(rc)d (au) | Hb(rc) (au) | Re (au) | θf (°) | Ciig (Å mdyn−1) | θph (°) | κpi (au−1) | Predicted nature |
---|---|---|---|---|---|---|---|---|---|
a See Table 1 for BSS-B′a.b Data are given at the BCPs.c Numbers given for the interactions are the same as those in Fig. 2 and 4.d c∇2ρb(rc) = Hb(rc) − Vb(rc)/2, where c = ℏ2/8m.e R = (x2 + y2)1/2, where (x, y) = (Hb(rc) − Vb(rc)/2, Hb(rc)).f θ = 90° − tan−1(y/x).g Defined in eqn (3) in the text.h θp = 90° − tan−1(dy/dx).i κp = |d2y/dx2|/[1 + (dy/dx)2]3/2. | |||||||||
N–*–HN in A–T (C1: 1) | 0.0498 | 0.0094 | −0.0143 | 0.0171 | 146.7 | 3.12 | 182.4 | 8.5 | r-CS/CT-TBP |
NH–*–O in A–T (C1: 2) | 0.0291 | 0.0114 | −0.0012 | 0.0115 | 95.9 | 5.78 | 145.4 | 115.3 | r-CS/t-HBwc |
CH–*–O in A–T (C1: 3) | 0.0059 | 0.0025 | 0.0007 | 0.0026 | 74.5 | 16.31 | 80.6 | 64.9 | p-CS/vdW |
N–*–HN in A–T (Cs: 4) | 0.0498 | 0.0094 | −0.0143 | 0.0171 | 146.7 | 3.12 | 182.4 | 8.5 | r-CS/CT-TBP |
NH–*–O in A–T (Cs: 5) | 0.0291 | 0.0114 | −0.0012 | 0.0115 | 95.9 | 5.78 | 145.4 | 115.2 | r-CS/t-HBwc |
CH–*–O in A–T (Cs: 6) | 0.0059 | 0.0025 | 0.0007 | 0.0026 | 74.5 | 16.30 | 80.6 | 64.9 | p-CS/vdW |
NH–*–O in C–G (C1: 7) | 0.0449 | 0.0134 | −0.0096 | 0.0165 | 125.5 | 3.20 | 169.9 | 11.4 | r-CS/CT-MC |
N–*–HN in C–G (C1: 8) | 0.0377 | 0.0099 | −0.0062 | 0.0117 | 122.2 | 2.15 | 175.0 | 30.8 | r-CS/CT-MC |
O–*–HN in C–G (C1: 9) | 0.0305 | 0.0118 | −0.0017 | 0.0119 | 98.2 | 4.08 | 148.3 | 101.3 | r-CS/t-HBwc |
NH–*–N in A–A (C1: 10) | 0.0289 | 0.0093 | −0.0018 | 0.0095 | 100.9 | 5.74 | 158.6 | 99.6 | r-CS/CT-MC |
N–*–HC in A–A (C1: 11) | 0.0119 | 0.0045 | 0.0013 | 0.0047 | 74.1 | 17.10 | 75.6 | 55.8 | p-CS/vdW |
N–*–HN in A–C (C1: 12) | 0.0391 | 0.0101 | −0.0071 | 0.0123 | 125.1 | 3.70 | 174.1 | 23.3 | r-CS/CT-MC |
NH–*–O in A–C (C1: 13) | 0.0364 | 0.0135 | −0.0042 | 0.0141 | 107.5 | 3.72 | 158.2 | 40.2 | r-CS/CT-MC |
N–*–HN in A–G (C1: 14) | 0.0424 | 0.0098 | −0.0091 | 0.0133 | 132.8 | 3.52 | 178.6 | 22.5 | r-CS/CT-MC |
NH–*–O in A–G (C1: 15) | 0.0361 | 0.0125 | −0.0044 | 0.0133 | 109.5 | 4.45 | 161.0 | 45.0 | r-CS/CT-MC |
CH–*–HN in A–G (C1: 16) | 0.0056 | 0.0026 | 0.0009 | 0.0027 | 71.1 | 29.31 | 78.8 | 111.5 | p-CS/vdW |
N–*–HN in A–U (C1: 17) | 0.0500 | 0.0093 | −0.0145 | 0.0172 | 147.2 | 3.10 | 182.6 | 8.2 | r-CS/CT-TBP |
NH–*–O in A–U (C1: 18) | 0.0289 | 0.0114 | −0.0011 | 0.0115 | 95.5 | 5.79 | 141.5 | 117.4 | r-CS/t-HBwc |
CH–*–O in A–U (C1: 19) | 0.0060 | 0.0025 | 0.0007 | 0.0026 | 74.5 | 16.06 | 80.3 | 77.2 | p-CS/vdW |
N–*–HN in C–C (C1: 20) | 0.0488 | 0.0099 | −0.0134 | 0.0167 | 143.5 | 2.63 | 180.6 | 2.3 | r-CS/CT-TBP |
NH–*–O in C–C (C1: 21) | 0.0421 | 0.0131 | −0.0079 | 0.0153 | 121.0 | 3.86 | 168.1 | 17.8 | r-CS/CT-MC |
O–*–HC in C–C (C1: 22) | 0.0050 | 0.0021 | 0.0006 | 0.0022 | 73.0 | 14.60 | 82.4 | 61.9 | p-CS/vdW |
N–*–HN in C–T (C1: 23) | 0.0406 | 0.0096 | −0.0083 | 0.0127 | 130.7 | 4.77 | 178.1 | 24.7 | r-CS/CT-MC |
NH–*–O in C–T (C1: 24) | 0.0348 | 0.0125 | −0.0037 | 0.0130 | 106.5 | 4.81 | 158.6 | 55.6 | r-CS/CT-MC |
O–*–O in C–T (C1: 25) | 0.0026 | 0.0013 | 0.0006 | 0.0014 | 67.2 | 32.13 | 86.3 | 344.8 | p-CS/vdW |
N–*–HN in C–U (C1: 26) | 0.0410 | 0.0096 | −0.0085 | 0.0129 | 131.6 | 4.73 | 178.5 | 23.5 | r-CS/CT-MC |
NH–*–O in C–U (C1: 27) | 0.0347 | 0.0125 | −0.0036 | 0.0130 | 106.2 | 4.79 | 158.4 | 55.9 | r-CS/CT-MC |
O–*–O in C–U (C1: 28) | 0.0028 | 0.0014 | 0.0006 | 0.0015 | 67.9 | 30.90 | 87.4 | 325.6 | p-CS/vdW |
NH–*–O G–G (Ci: 29) | 0.0500 | 0.0136 | −0.0124 | 0.0184 | 132.4 | 2.86 | 172.4 | 7.6 | r-CS/CT-MC |
O–*–HN G–G (Ci: 30) | 0.0083 | 0.0044 | 0.0015 | 0.0046 | 71.6 | 12.98 | 73.0 | 10.5 | p-CS/vdW |
N–*–HN in G–T (C1: 31) | 0.0416 | 0.0100 | −0.0087 | 0.0133 | 130.8 | 3.90 | 177.1 | 19.7 | r-CS/CT-MC |
NH–*–O in G–T (C1: 32) | 0.0335 | 0.0123 | −0.0030 | 0.0127 | 103.7 | 4.92 | 155.6 | 68.9 | r-CS/CT-MC |
NH–*–O in G–U (C1: 33) | 0.0419 | 0.0138 | −0.0072 | 0.0155 | 117.5 | 3.09 | 165.7 | 21.7 | r-CS/CT-MC |
O–*–HN in G–U (C1: 34) | 0.0404 | 0.0127 | −0.0070 | 0.0145 | 118.8 | 4.32 | 167.5 | 22.2 | r-CS/CT-MC |
NH–*–O in T–T (C1: 35) | 0.0375 | 0.0129 | −0.0051 | 0.0139 | 111.4 | 4.29 | 163.5 | 34.1 | r-CS/CT-MC |
O–*–HN in T–T (C1: 36) | 0.0375 | 0.0129 | −0.0051 | 0.0139 | 111.4 | 4.29 | 163.5 | 34.1 | r-CS/CT-MC |
NH–*–O in T–T (Ci: 37) | 0.0375 | 0.0129 | −0.0051 | 0.0139 | 111.4 | 4.29 | 163.5 | 34.1 | r-CS/CT-MC |
NH–*–O in T–U (C1: 38) | 0.0381 | 0.0130 | −0.0054 | 0.0141 | 112.6 | 4.17 | 164.2 | 30.9 | r-CS/CT-MC |
O–*–HN in T–U (C1: 39) | 0.0366 | 0.0128 | −0.0046 | 0.0136 | 109.8 | 4.42 | 162.5 | 38.2 | r-CS/CT-MC |
NH–*–O in U–U (C1: 40) | 0.0373 | 0.0129 | −0.0050 | 0.0138 | 111.1 | 4.29 | 163.3 | 34.5 | r-CS/CT-MC |
O–*–HN in U–U (C1: 41) | 0.0373 | 0.0129 | −0.0050 | 0.0138 | 111.1 | 4.29 | 163.3 | 34.5 | r-CS/CT-MC |
NH–*–O in U–U (Cs: 42) | 0.0373 | 0.0129 | −0.0050 | 0.0138 | 111.1 | 4.29 | 163.3 | 34.5 | r-CS/CT-MC |
O–*–HN in U–U (Cs: 43) | 0.0373 | 0.0129 | −0.0050 | 0.0138 | 111.1 | 4.29 | 163.3 | 33.6 | r-CS/CT-MC |
Fig. 2 Molecular graphs for nucleobase pairs (Nu–Nu′), with the contour plots of ρ(r), evaluated with MP2/BSS-B′a. The numbers for the bonds are the same as those in Fig. 4 and Table 2. Bond critical points (BCPs) are denoted by red dots, ring critical points (RCPs) are denoted by yellow dots and bond paths (BPs) are denoted by pink lines. Oxygen, nitrogen, carbon and hydrogen atoms are in red, blue, black and gray, respectively. Contour plots are drawn on the planes containing at least one side of the HB interaction. The contours (eao−3) are at 2l (l = ±8, ±7, … and 0). |
The r(H, B) values in Nu–Nu′ evaluated with the various methods are plotted versus those evaluated with MP2/BSS-A′. The plot is shown in Fig. S1 of the ESI.† This plot gave very good correlations, as shown in the figure. The high similarities in r(H, B) correspond to the high similarities of the structures of Nu–Nu′ optimized with the methods employed in the calculations.43,46,55 The similarities are excellent, especially for the structures optimized with MP2/BSS-B′a, MP2/BSS-B′b and MP2/BSS-A′, although frequency analysis could not be performed on those with MP2/BSS-A′.
Fig. 3 Negative Laplacians of ρ(r) for the A–T and C–G pairs, calculated with MP2/BSS-B′a. Positive and negative areas are in blue and red lines, respectively. |
For example, the ΔEES values evaluated with MP2/BSS-B′a are −70.3, −70.6 and −123.5 kJ mol−1 for A–T, A–U and C–G, respectively. The values for A–T and A–U are very close to each other due to their structural similarity.13 The ΔEZP values are plotted versus the ΔEES values calculated with MP2/BSS-B′a. The plot, which is shown in Fig. S2 of the ESI,† gives a very good correlation (ΔEZP = 0.968ΔEES + 1.80: Rc2 = 0.9993). Therefore, either ΔEES or ΔEZP can be employed for the discussion of the energy terms.
The nature of each HB in multi-HBs of Nu–Nu′ will be clarified by employing QTAIM-DFA.
QTAIM functions are calculated at each BCP on the BP corresponding to each HB in Nu–Nu′. Table 2 collects the ρb(rc), Hb(rc) − Vb(rc)/2 (=(ℏ2/8m)∇2ρb(rc)) and Hb(rc) values evaluated with MP2/BSS-B′a,46 where each HB in a nucleobase pair is numbered in the order of decreasing ρb(rc) values. Hb(rc) is plotted versus Hb(rc) − Vb(rc)/2 for the data shown in Table 2, together with those data from the perturbed structures generated with CIV. Fig. 4 illustrates the plots. Fig. 4a shows the whole picture of the plots, and Fig. 4b presents the magnified plots that appeared in the p-CS region of Hb(rc) − Vb(rc)/2 > 0 and Hb(rc) > 0. The data (points) in Fig. 4 are divided into three groups: (a) NH–*–N appeared in the r-CS region of Hb(rc) − Vb(rc)/2 > 0 and Hb(rc) < 0, (b) NH–*–O appeared in the r-CS region and (c) very weak O–*–O and CH–*–X (X = O, N and HN) interactions appeared in the p-CS region, where the weaker NH–*–O interaction in G–G (30) is also contained. The three groups are called G(A), G(B) and G(C), respectively, here. Relative to those from G(B), data from G(A) appear more on the left and lower sides overall. The results would show that interactions in G(A) are stronger than those corresponding to G(B) as a whole. As shown later, interactions in G(C) are predicted to have the vdW nature. The QTAIM-DFA parameters of (R, θ) and (θp, κp) are calculated for each HB in Nu–Nu′ by analyzing each plot shown in Fig. 4, according to eqn (SA3)–(SA6) of the ESI.† The (θp, κp) values calculated with CIV should be denoted by (θp:CIV, κp:CIV); however, we will use (θp, κp) in place of (θp:CIV, κp:CIV) for simplification of the notation. Table 2 collects the (R, θ) and (θp, κp) values evaluated with MP2/BSS-B′a, together with the Cii values related to the perturbed structures. Similar results calculated with the various methods other than MP2/BSS-B′a are collected in Tables S4 and S5 of the ESI.†
Fig. 4 Plots of Hb(rc) versus Hb(rc) − Vb(rc)/2 for each HB in Nu–Nu′, evaluated with MP2/BSS-B′a. For the whole picture (a) and the magnified image for the pure CS region (b). The numbers for the interactions are the same as those in Fig. 2 and Table 2, respectively. Two streams appear in the plots of (a) by NH–*–N and NH–*–O, which are shown by the solid and hollow marks, respectively. The definitions of (R, θ) and (θp, κp) are also illustrated. |
Each HB in Nu–Nu′ is classified and characterized based on the (R, θ, θp) values evaluated with MP2/BSS-B′a. The results are discussed in the following.
The parameters given in bold are superior to those given in plain font in the classification and characterization of interactions, where those in plain font are given as the tentative ones. The classic chemical bonds of SS interactions (180° < θ) are not detected in the HBs collected in Table 2. As a result, each HB in Nu–Nu′ can be classified and characterized using the (θ, θp) values in place of (R, θ, θp). If the data of an HB appear in the CT-TBP region, for an example, the HB interaction is recognized to have the CT-TBP nature.
The (θ, θp) values are (67.2–74.5°, 73.0–87.4°) for O–*–O in C–U (28) and C–T (25); CH–*–O in C–C (22), A–T (3, 6) and A–U (19); CH–*–HN in A–G (16); CH–*–N in A–A (11); and the weaker O–*–HN in G–G (30). Therefore, the interactions are classified as p-CS interactions (45° < θ < 90°) and characterized as having the vdW nature (45° < θp < 90°), which is denoted by p-CS/vdW. The ∠NHO angle for the weaker NH–*–O interaction in G–G (30) is 135.0° (≪180°); therefore, it is much weaker than expected. The NH–*–O interactions in A–T (2, 5) and A–U (18) along with the weaker O–*–HN in C–G (9) are predicted to be r-CS/t-HBwc since the (θ, θp) values are (95.5–98.2°, 141.5–148.3°) (90° < θ; θp < 150°), although the weaker NH–*–O in C–G (9) seems fairly close to the borderline area with r-CS/CT-MC, of which (θ, θp) = (98.2°, 148.3°). The NH–*–O interactions in A–C (13), A–G (15), C–C (21), C–T (24), C–U (27), G–G (29), G–T (32), G–U (33, 34), T–T (35, 36, 37), T–U (38, 39) and U–U (40, 41, 42, 43), together with the stronger NH–*–O in C–G (7), are predicted to have the r-CS/CT-MC nature since the (θ, θp) values are (103.7–132.4°, 155.6–172.4°) (150° < θp < 180°). On the other hand, the (θ, θp) values for N–*–HN in A–T (1, 4), A–U (17) and C–C (20) are (143.5–147.2°, 180.6–182.6°); therefore, the interactions are predicted to have the r-CS/CT-TBP nature (θp > 180°), while the NH–*–N interactions in A–A (10), A–C (12), A–G (14), C–T (23), C–U (26) and G–T (31) along with the weaker NH–*–N in C–G (8) are predicted to be of the r-CS/CT-MC nature since (θ, θp) = (100.9–132.8°, 158.6–178.6°) (150° < θp < 180°). The NH–*–N interactions in A–G (14), C–T (23) and C–U (26) seem fairly close to the borderline area with r-CS/CT-TBP (θp = 180°) since the θp values are 178.6°, 178.1° and 178.5°, respectively, which are fairly close to 180°. The results are summarized in Table 2. The nature of each HB in the multi-HBs between Nu–Nu′, calculated with MP2/BSS-B′a, together with the number, is illustrated in Fig. S3 of the ESI.†
The total orders for NH–*–N and NH–*–O, based on θ and θp, are shown in eqn (7) and (8), respectively. The NH⋯N interactions are again demonstrated to be stronger than the NH⋯O interactions, overall. The orders shown in eqn (7) and (8) are similar with each other, although the similarity is not necessarily. These results would arise from the specific nature of each HB in multi-HBs of Nu–Nu′. The applicability of QTAIM-DFA, which employs the perturbed structures generated with CIV, is also demonstrated to elucidate the nature of each HB of the multi-HB system in Nu–Nu′. There are some differences, however. The differences in the orders are shown by italic. The differences seem large for NH–*–O (G–G: 29), NH–*–N (C–G: 8), NH–*–O (A–C: 13) and NH–*–N (A–A: 10), among them, as shown by italic.
For both NH–*–N and NH–*–O, based on θ:
NH–*–N (A–U: 17) ≥ NH–*–N (A–T: 1) > NH–*–N (C–C: 20) > NH–*–N (A–G: 14) ≥ NH–*–O (G–G: 29) ≥ NH–*–N (C–U: 26) ≥ NH–*–N (G–T: 31) ≥ NH–*–N (C–T: 23) > NH–*–O (C–G: 7) ≥ NH–*–N (A–C: 12) > NH–*–N (C–G: 8) > NH–*–O (C–C: 21) > NH–*–O (G–U: 34) > NH–*–O (G–U: 33) > NH–*–O (T–U: 38) > NH–*–O (T–T: 35, 36) ≥ NH–*–O (U–U: 40, 41) > NH–*–O (T–U: 39) ≥ NH–*–O (A–G: 15) > NH–*–O (A–C: 13) > NH–*–O (C–T: 24) ≥ NH–*–O (C–U: 27) > NH–*–O (G–T: 32) > NH–*–N (A–A: 10) > NH–*–O (C–G: 9) > NH–*–O (A–T: 2) ≥ NH–*–O (A–U: 18) ≫ NH–*–O (G–G: 30) | (7) |
For both NH–*–N and NH–*–O, based on θp:
NH–*–N (A–U: 17) ≥ NH–*–N (A–T: 1) > NH–*–N (C–C: 20) > NH–*–N (A–G: 14) ≥ NH–*–N (C–U: 26) ≥ NH–*–N (C–T: 23) > NH–*–N (G–T: 31) > NH–*–N (C–G: 8) > NH–*–N (A–C: 12) > NH–*–O (G–G: 29) > NH–*–O (C–G: 7) > NH–*–O (C–C: 21) > NH–*–O (G–U: 34) > NH–*–O (G–U: 33) > NH–*–O (T–U: 38) > NH–*–O (T–T: 35, 36) > NH–*–O (U–U: 40, 41) > NH–*–O (T–U: 39) > NH–*–O (A–G: 15) > NH–*–N (A–A: 10) > NH–*–O (C–T: 24) ≥ NH–*–O (C–U: 27) ≥ NH–*–O (A–C: 13) > NH–*–O (G–T: 32) > NH–*–O (C–G: 9) > NH–*–O (A–T: 2) > NH–*–O (A–U: 18) ≫ NH–*–O (G–G: 30) | (8) |
After elucidation of the nature of each HB in Nu–Nu′, the next extension is to consider the behavior of the HBs.
Good correlations are detected for the relations. The R values are plotted versus ρb(rc) for each HB in Nu–Nu′, as shown in Fig. S4 in the ESI.† The plot can be analyzed as three correlations of G(A), G(B), and G(C), which are closely related to the plot shown in Fig. 4. The data point for the weaker NH–*–O in G–G (30) is just on the correlation line for G(B); therefore, it is added to G(B) in the analysis. The correlations are shown in Table 3 (entries 1–3).
Entry | Correlation | a | b | c | Correlation with n |
---|---|---|---|---|---|
a Evaluated with MP2/BSS-B′a.b Data from weaker NH–*–O of G–G (30) being added to G(B).c Omitting the data from weaker NH–*–O of G–G (30).d Omitting the data from C–G. | |||||
1 | R vs. ρb(rc) | 0.388 | −0.003 | 0.963 | Fig. S4 (G(A): 10) |
2 | R vs. ρb(rc) | 0.320 | 0.002 | 0.993 | Fig. S4(G(B): 20b) |
3 | R vs. ρb(rc) | 0.355 | 0.001 | 0.992 | Fig. S4 (G(C): 7c) |
4 | θ vs. ρb(rc) | 2110.5 | 42.5 | 0.985 | Fig. 5 (G(A): 10) |
5 | θ vs. ρb(rc) | 1811.6 | 43.4 | 0.989 | Fig. 5 (G(B): 19) |
6 | ΔE vs. (1/Cii)Nu–Nu′ | −121.1 | −7.52 | 0.954 | Fig. 6 (15) |
7 | ΔE vs. (1/Cii)Nu–Nu′ | −136.6 | −0.02 | 0.956 | Fig. 6 (14d) |
8 | (w′/w)POM vs. (w′/w)CIV | 1.021 | −0.001 | 0.9997 | Fig. 7 (15) |
9 | (w′/w)POM vs. (w′/w)CIV | 1.046 | −0.004 | 0.997 | Fig. S11 (15) |
The results seem to promise similar relations between the parameters. Fig. 5 shows the plot of θ versus ρb(rc). The plot is analyzed as three correlations for G(A) of NH–*–N, G(B) of NH–*–O and G(C) of vdW interactions. The correlations are shown in Table 3 (entries 4 and 5), except for the very poor correlation for G(C), which is given in the figure. The plot of θ versus R is illustrated in Fig. S5 of the ESI.† The plot is also analyzed as two correlations, similar to the case of the plot in Fig. 5. The correlations are given in the figure.
Fig. 5 Plots of θ versus ρb(rc) for each HB in Nu–Nu′, calculated with MP2/BSS-B′a. While data for G(A) of NH–*–N are shown by black solid circles, those for G(B) of NH–*–O are by red solid circles, together with those for G(C) of CH–*–X (X = O, N and HN) and O–*–O by blue hole circles. The numbers for the interactions are the same as those in Table 2 and Fig. 4. |
Good linear correlations are not found in the plots of θp versus ρb(rc) and θp versus R. The plot of θp versus θ also does not give a good linear correlation. Instead, the relation between θp and θ is analyzed using a cubic function as a regression curve. The correlation was much improved when analyzed as two correlations, which are shown in Fig. S6 of the ESI.† The correlations are given in the figure.
ΔE = a(1/Cii) + b | (9) |
(1/Cii)Nu–Nu′ = Σk(1/Cii)Nu–Nu′:k | (10) |
The ΔE values are plotted versus (1/Cii)Nu–Nu′ for Nu–Nu′ in Fig. 6. A (very) good correlation was obtained for the plot, which is shown in Table 3 (entry 6). In this case, a y-intercept value (b in eqn (9)) very close to zero is obtained (b = 0.02 kJ mol−1) if data from C–G are omitted from the correlation, although the correlation seems not very improved. The correlation is shown in Table 3 (entry 7). The inverse proportion also holds for the multi-HB system of Nu–Nu′ in this case. The constant value (in ΔE·Cii = constant), as the averaged value of ΔE·Cii for Nu–Nu′, is evaluated to be −137.04 without C–G. The constant value for Nu–Nu′ (−137.04) is close to but somewhat smaller than that reported for the neutral mono-HB species (−165.64) in magnitude.45 The constant value for all Nu–Nu′ is evaluated to be −135.96, which is very close to that without the data from C–G. The results show that the compliance constants (Cii) are closely related to ΔE for the formation of not only the neutral mono-HB species but also the multi-HB system of Nu–Nu′. A similar mechanism would be operative in both processes of ΔE and Cii in the multi-HB systems of Nu–Nu′. Eqn (10) reminds us that the total value of resistance of a parallel connection should be calculated for each one according to Ohm's law for the electric resistance of resistors connected in parallel.56
The total contributions of ΔE and Cii should be calculated as the summations of the contributions from each HB. As a result, it is expected that the ΔE value for a nucleobase pair can be fractionalized to each HB in the multi-HB system of the Nu–Nu′. Based on the good relation with eqn (9) and (10) shown in Fig. 6 (see entry 6 or 7 in Table 3), the ΔE value for a nucleobase pair is expected to be fractionalized to each HB (ΔEe) by the ratio of 1/Cii of each HB, according to eqn (11), where ΔEe:1 and (1/Cii)Nu–Nu′:1 stand for the fractionalized energy to the first HB and for the 1/Cii value of the first HB in the Nu–Nu′, respectively. The results are collected in Table 4.
ΔEe:1:ΔEe:2:… = (1/Cii)Nu–Nu′:1:(1/Cii)Nu–Nu′:2:… | (11) |
Nu–Nu′ (sym) | ΔE | ΔEe (no.b) | ΔEe (no.b) | ΔEe (no.b) |
---|---|---|---|---|
a The values are given in kJ mol−1.b The number for each HB, containing the vdW interaction, is the same as that given in Table 2. | ||||
A–T (C1) | −70.3 | −40.6 (1) | −21.9 (2) | −7.8 (3) |
C–G (C1) | −123.5 | −37.7 (7) | −56.2 (8) | −29.6 (9) |
A–A (C1) | −35.4 | −26.5 (10) | −8.9 (11) | |
A–C (C1) | −73.1 | −36.6 (12) | −36.5 (13) | |
A–G (C1) | −80.5 | −42.1 (14) | −33.3 (15) | −5.1 (16) |
A–U (C1) | −70.6 | −40.8 (17) | −21.9 (18) | −7.9 (19) |
C–C (C1) | −102.6 | −55.1 (20) | −37.6 (21) | −9.9 (22) |
C–T (C1) | −64.0 | −29.9 (23) | −29.7 (24) | −4.4 (25) |
C–U (C1) | −64.9 | −30.3 (26) | −29.9 (27) | −4.6 (28) |
G–G (Ci) | −117.1 | −48.0 (29) | −10.6 (30) | |
G–T (C1) | −65.4 | −36.5 (31) | −28.9 (32) | |
G–U (C1) | −74.2 | −43.3 (33) | −30.9 (34) | |
T–T (C1) | −60.0 | −30.0 (35) | −30.0 (36) | |
T–U (C1) | −59.9 | −30.8 (38) | −29.1 (39) | |
U–U (C1) | −59.8 | −29.9 (40) | −29.9 (41) |
Similar relation is also observed in the di-HB system of acetic acid dimer and the related species. The results are explained in Fig. S7 and Table S6 of the ESI.†
The mutual interactions between HBs must also be of very importance in the multi-HBs of Nu–Nu′, which would be clarified through the detailed analysis of Cij (i ≠ j) for the multi-HBs.46a
PNu–Nu′ = ΣkPNu–Nu′:k | (12) |
The ΔE values are plotted versus RNu–Nu′, θNu–Nu′ and θp:Nu–Nu′, and the plots are shown in Fig. S8–S10 of the ESI.† The correlation is greatly improved by analyzing the plot as two or three correlations instead of one correlation. The correlations are shown in the figure. The correlation for ΔE versus θp:Nu–Nu′ seems poorer than that for ΔE versus θNu–Nu′.
It is also instructive to clarify the structural feature in the perturbed structures of Nu–Nu′ to discuss the behavior of each HB of Nu–Nu′ in more detail, which is examined in the following.
rk1 = rk1o + wk1ao | (13) |
rk2 = rk2o + wk2ao | (14) |
rk3 = rk3o + wk3ao | (15) |
The structural feature in the perturbed structures of Nu–Nu′ is examined by dividing them into four groups, G(AT), G(CG), G(AA) and G(TT).57 Nu–Nu′ of A–T, C–G, A–A and T–T are the typical members of the groups, respectively. The feature is discussed by employing A–T, C–G, A–A and T–T, together with U–U. The feature in UU will supply a small structural difference from that in TT, although UU belongs to G(TT).
The values of and are calculated for each HB in A–T, C–G, A–A, T–T and U–U with MP2/BSS-B′a at wii = 0.05 by applying in eqn (13)–(15).32–38 The values are collected in Table S7 of the ESI.† Small differences in between T–T and U–U are detected. A positive value of implies that the minor (HB) interaction in Nu–Nu′ moves in the same direction as the major interaction. On the other hand, relative to the major interaction, the minor HB interaction moves in the inverse direction for negative values. Compared to that of the major interaction, the magnitudes in the movement of the minor HB interactions would be negligible when the values are close to zero. Fig. 7 shows the plot of versus for the HB interactions. The plot gave an excellent correlation, which is shown in Table 3 (entry 8). The results show that the perturbed structures generated with CIV and POM are very close to each other, approximately at wii = 0.05, in the multi-HB system of Nu–Nu′, as well as in the case of the mono-HB system.45
Fig. 7 Plot of versus for each HB of multi-HB system in A–T, C–G, A–A, U–U and T–T, calculated at wii = 0.05 with MP2/BSS-B′a. |
What happens if the H⋯B distance (r(H, B)) in each HB of Nu–Nu′ is elongated further, where Δr(H, B) (=r(H, B) − ro(H, B)) is defined as the difference in H⋯B distance between the perturbed structure and the fully optimized structure. Relative to that of M06-2X/BSS-A, the reliability of M06-2X/BSS-C′ is confirmed for the optimizations. That is, the ro(H, B) values calculated with M06-2X/BSS-C′ differ from the corresponding values calculated with M06-2X/BSS-A by less than 0.01 Å in magnitude (see Table S1 of the ESI†). Therefore, these perturbed structures are calculated with POM by fixing the r(H, B) distances in question in the wider range of −0.05 Å ≤ Δr(H, B) ≤ 0.50 Å for all HBs in A–T, C–G, A–A, T–T and U–U with M06-2X/BSS-C′ for improved calculation cost. The results are summarized in Table S8 of the ESI† in the form. The ΔEESps (=EESps − EESo) values are also calculated for each HB in A–T, C–G, A–A, T–T and U–U based on the partially optimized structures. The EESps values are the energies of the perturbed structures at r(H, B) on the energy surface, and the EESo values are those for the fully optimized structures. The magnitudes of the differences between ΔEESps calculated with M06-2X/BSS-C′ and those calculated with M06-2X/BSS-A are less than 0.3 kJ mol−1 for A–T, C–G, A–A, T–T and U–U if the corresponding values are compared at Δr(H, B) = 0.025 Å (see Table S3 of the ESI†). The results again support the reliability of M06-2X/BSS-C′ relative to M06-2X/BSS-A in the optimizations.
The perturbed structures of A–T, C–G, A–A, T–T and U–U are also generated by employing CIV with M06-2X/BSS-C′ in a wider range of −0.1 ≤ wii ≤ 1.0 (cf.: −0.05 Å ≤ Δr ≤ 0.50 Å for POM). The wij values of the minor HBs are calculated, corresponding to wij at wii = 0.05 for the Nu–Nu′. The results are also summarized in Table S8 of the ESI† in the form. The values are plotted versus calculated at wii = 0.05 with M06-2X/BSS-C′, as shown in Fig. S11 of the ESI.† The plot also gives a very good correlation, which is shown in Table 3 (entry 9). The quality of the correlation based on M06-2X/BSS-C′ is noticeably the same as that of the correlation based on MP2/BSS-B′a.
The ΔEESps values are plotted versus a wide range of −0.05 Å ≤ Δr(H, B) ≤ 0.50 Å and −0.1 ≤ wii ≤ 1.0 for each HB in A–T, C–G, A–A, T–T and U–U evaluated with POM and CIV, respectively. The plot is illustrated in Fig. S12 of the ESI,† where r(H, B) in the x axis with POM is replaced by wii. As shown in the figure, the differences in ΔEESps between the structures evaluated with CIV and those evaluated with POM are negligible at approximately wii < 0.2. Indeed, the ΔEESps curves evaluated with CIV show a similar trend as those evaluated with POM for wii < 0.3, but overall, the curves begin to grow rather exponentially for wii > 0.4 as wii increases. The results show that the perturbed structures generated with POM and CIV are very similar for wii < 0.2 and similar for 0.2 < wii < 0.3 but become different for 0.4 < wii.
The gradient for ΔEESps is largest for N–H⋯N in C–G, which must be the reflection of the largest magnitude of ΔEES for C–G (−117.2 kJ mol−1) among A–T, C–G, A–A, T–T and U–U. The gradient for ΔEESps decreases in the order shown in eqn (16). The order seems to not necessarily reflect the strength of each HB in the A–T and C–G pairs.
N–H⋯N (C–G) ≫ N–H⋯O (C–G) ≥ O⋯H–N (C–G) > N–H⋯N (A–T) > N–H⋯O (U–U) > N–H⋯O (T–T) > N–H⋯N (A–A) ≈ N–H⋯O (A–T: j = 1) > C–H⋯N (A–A) > N–H⋯O (A–T: j = 3) | (16) |
The gradient increased when POM or CIV is applied to the central N–H⋯N interaction for both the A–T and C–G pairs. The behavior of ΔEESps evaluated with POM may correspond to that in the initial stage for the scission of Nu–Nu′ to Nu and Nu′ under the simple mechanism for each HB. Such large ΔEESps values must be effectively decreased by the enzyme-catalyzed reactions in vivo at approximately room temperature. However, it is helpful to understand the behavior of HBs in Nu–Nu′ through a simple mechanism.
Indeed, the behavior of HBs, containing those of multi-HBs in Nu–Nu′, will be revealed in more detail, if the magnitudes in the movement of HBs is directly investigated. The NVT ensemble method seems typical one of such methods.58 The predicted nature will change depending on the quality of the calculation levels, especially for weak HBs. However, the results in the framework of QTAIM-DFA with CIV should be reasonable, if calculated with MP2/BSS-B′a.
Many multi-HB systems play a crucial role in the chemical and biological sciences, not only in vitro but also in vivo. Each HB in such multi-HB systems will interact mutually and strongly with each other due to their close proximity in space. It is of very interest if the proposed method can open the door to elucidate each HB in such multi-HB systems, although some devices seem necessary for the effective analysis.
Footnote |
† Electronic supplementary information (ESI) available: QTAIM-DFA approach, computational data, and the fully optimized structures given by Cartesian coordinates, together with total energies of the nucleobase pairs (Nu–Nu′). See DOI: 10.1039/d0ra01357a |
This journal is © The Royal Society of Chemistry 2020 |