Jan
Novotný‡
a,
Sophia
Bazzi‡§
abc,
Radek
Marek
ac and
Jiří
Kozelka
*bd
aCEITEC – Central European Institute of Technology, Masaryk University, Kamenice 5/A4, CZ-625 00 Brno, Czech Republic
bDepartment of Condensed Matter Physics, Faculty of Science, Masaryk University, Kotlářská 2, CZ-611 37 Brno, Czech Republic
cNational Center for Biomolecular Research, Faculty of Science, Masaryk University, Kamenice 5/A4, CZ-625 00 Brno, Czech Republic
dUniversité Paris Descartes, UMR 8601 CNRS, 45, rue des Saints-Pères, 75270 Paris, France. E-mail: kozelka.jiri@gmail.com
First published on 17th June 2016
Lone-pair–π (lp–π) interactions have been suggested to stabilize DNA and protein structures, and to participate in the formation of DNA–protein complexes. To elucidate their physical origin, we have carried out a theoretical multi-approach analysis of two biologically relevant model systems, water–indole and water–uracil complexes, which we compared with the structurally similar chloride–tetracyanobenzene (TCB) complex previously shown to contain a strong charge-transfer (CT) binding component. We demonstrate that the CT component in lp–π interactions between water and indole/uracil is significantly smaller than that stabilizing the Cl−–TCB reference system. The strong lp(Cl−)–π(TCB) orbital interaction is characterized by a small energy gap and an efficient lp–π* overlap. In contrast, in lp–π interactions between water and indole or uracil, the corresponding energy gap is larger and the overlap less efficient. As a result, water–uracil and water–indole interactions are weak forces composed by smaller contributions from all energy components: electrostatics, polarization, dispersion, and charge transfer. In addition, indole exhibits a negative electrostatic potential at its π-face, making lp–π interactions less favorable than O–H⋯π hydrogen bonding. Consequently, some of the water–tryptophan contacts observed in X-ray structures of proteins and previously interpreted as lp–π interactions [Luisi, et al., Proteins, 2004, 57, 1–8], might in fact arise from O–H⋯π hydrogen bonding.
![]() | ||
Fig. 1 (a) lp–π interaction stabilizing Z-DNA. (b) lp–π interaction suggested to operate between a conserved water molecule and tryptophan W48 in the engrailed homeodomain and its mutants. (c) lp–π interaction observed in the X-ray structure of the charge-transfer complex between Br− and TCP. Reproduced with permission from ref. 5, 26 and 13, respectively. |
Not less intriguing was the finding made in 2004 by Luisi et al. who searched the protein data bank (PDB) for water–tryptophan and water–histidine contacts and found hundreds of cases where a water molecule contacts an indole or an imidazole residue along the normal to the ring plane through an endocyclic N atom.6 For instance, a specific water–tryptophan contact, interpreted as lp–π interaction, was found to stabilize the engrailed homeodomain and to participate in the recognition of its cognate DNA.6,7 Since in some of the protein structures, the water molecule was likely to use both H-atoms to donate hydrogen bonds to other neighboring residues, the authors concluded that these water–indole contacts must represent lp–π interactions. An example is shown in Fig. 1b, where the K52E mutant of the engrailed homeodomain (shown in red) binds to the water molecule with two H-bond accepting residues, leaving only the lone pairs of oxygen free for contacting the indole side-chain of tryptophan W48.
Also in 2004, Kochi et al. presented crystal structures of adducts between tetraalkylammonium halides and tetracyanopiperazine (TCP), featuring quite spectacular halide–TCP contacts. Fig. 1c displays the detail of the TCP–(NEt4Br) 3:
2 cocrystal.8 The bromide⋯ring interactions, oriented roughly along the normals to the TCP planes, show a striking similarity to the O⋯ring interactions shown in Fig. 1a and b.
The TCP–Br− interaction shown in Fig. 1c belongs formally to a subclass of lp–π interactions named anion–π interactions.4,9–17 Anion–π interactions were theoretically predicted to attract anions such as Cl− or Br− toward the center of substituted/heterocyclic aromatic systems.18–20 Subsequent experimental and theoretical work8,10 revealed that in many cases, anions would bind to substituted aromatic systems in a different, “off-center” mode, reminiscent of that seen in Jackson–Meisenheimer σ-complexes.21 Accordingly, Hay, Berryman et al. distinguished between three anion–arene binding motifs: (i) noncovalent anion–π interactions, (ii) weakly covalent σ-complexes, and (iii) strongly covalent σ-interactions.10,12 Some confusion has arisen from the fact that many authors understand that under anion–π interactions the first group can be exclusively found and argue that anion–π interactions are noncovalent interactions, driven by electrostatic, polarization, and dispersion forces (ref. 4, 17, 22, 23 and references therein). A recent IQA analysis by Foroutan-Nejad and Marek challenged this view, indicating that not only the off-center halide–arene complexes but also the symmetric anion–π interactions can profit from significant “multicenter covalency”.24
The TCP–bromide interaction shown in Fig. 1c belongs to the off-center weakly covalent σ-complexes characterized by a substantial lp → π* CT stabilization.10 Considering the structural similarity between the three interactions shown in Fig. 1, one may wonder whether these interactions could be similar in nature. That is, could the CT component, evidently important in the latter case, contribute significantly also to the stabilization of the two former ones? The present article examines this question. We used a multi-approach theoretical analysis to elucidate the physical origin of water–indole and water–uracil lp–π interactions, in view of their possible role in the stabilization of proteins, nucleic acids, and protein–nucleic acid complexes. To evaluate the influence of the lp donor and that of the lp acceptor on the nature of the interaction, we included in our analysis the chloride–TCB interaction as a reference system with an established significant charge-transfer component.10 In addition, water–TCB and chloride–indole interactions were investigated as well, which enabled us to make cross-comparisons. Finally, for the water–indole system, we considered also N–H⋯O and O–H⋯π hydrogen bonding orientations, as these binding modes were previously predicted to be more stable than lp–π interactions.25
Both water–indole hydrogen-bonding systems, indole–N (O–H⋯π) and indole–NH (N–H⋯O), were constrained to be perfectly linear, the former aligned with the normal to the indole plane through N, and the latter having the N–H bond coinciding with the axis of one lone pair of the water oxygen atom.
Table 1 lists the components of the interaction energy evaluated using three different decomposition schemes (see Methods). For a comprehensive comparison of the decomposition methods and the definition of the individual components, the reader is referred to the excellent recent review by Phipps et al.27 The most striking observation is that the chloride–TCB system is by far the most stable, its interaction energy being 5 to 10 times larger than that of any of the other systems. The decompositions show, in mutual agreement, that this stabilization arises from larger electrostatic (ES), polarization (POL), charge transfer (CT) and dispersion (DISP) components, which is somewhat balanced by larger Pauli repulsion (PAULI in EDA, included in EX-REP of SAPT and in DEF of NEDA). These individual energy contributions are discussed in the following subsections.
Method | Lp-donor | H2O | Cl− | ||||||
---|---|---|---|---|---|---|---|---|---|
Arene | Indole–N lp⋯π | Indole–N O–H⋯π | Indole–NH N–H⋯O | Uracil–N lp⋯π | Uracil–M lp⋯π | TCB–C(H) lp⋯π | Indole–N lp⋯π | TCB–C(H) lp⋯π | |
a Data for the LCP found between water O-atom and uracil N1-atom. Another LCP could be identified between O and C4, with ρ(LCP) = 0.005 e a0−3, corresponding interatomic DI = 0.018. b Closest atom–atom or atom–centroid distance between the fragments. c van der Waals radii taken from Bondi,29 correction for H from Rowland and Taylor.30 d Total interaction energy between fragments relaxed in the complex. | |||||||||
AIM | ρ(rc) [e a0−3] | 0.007 | 0.014 | 0.024 | 0.007 | 0.005a | 0.008 | 0.010 | 0.023/0.034 |
DI | 0.042 | 0.041 | 0.086 | 0.047 | 0.030 | 0.043 | 0.087 | 0.158/0.305 | |
SAPT | ES | −2.3 | −3.5 | −8.3 | −3.4 | −3.1 | −4.1 | −4.2 | −39.0/−47.3 |
EX-REP | 3.1 | 4.6 | 8.4 | 3.4 | 2.8 | 3.8 | 10.5 | 37.8/54.0 | |
POL | −0.6 | −1.4 | −2.6 | −0.6 | −0.6 | −0.8 | −7.1 | −15.6/−24.3 | |
CT | −0.1 | −0.3 | −0.7 | −0.1 | −0.1 | −0.1 | −0.9 | −3.7/−6.5 | |
DISP | −2.5 | −3.3 | −2.8 | −2.7 | −2.7 | −2.8 | −5.4 | −10.9/−12.3 | |
TOTd | −2.3 | −3.6 | −5.4 | −3.3 | −3.5 | −4.0 | −6.3 | −27.7/−29.8 | |
EDA | ES | −2.1 | −3.3 | −7.9 | −3.6 | −3.4 | −4.2 | −1.5 | −33.8/−42.0 |
PAULI | 0.8 | 1.8 | 6.1 | 1.0 | 0.4 | 1.7 | 1.8 | 23.0/38.3 | |
ORB | −0.6 | −2.1 | −3.2 | −0.6 | −0.6 | −1.0 | −5.9 | −18.0/−29.2 | |
TOTd | −1.8 | −3.6 | −4.9 | −3.2 | −3.6 | −3.5 | −5.6 | −28.8/−32.9 | |
NEDA | ES | −2.0 | −4.5 | −9.5 | −3.5 | −3.6 | −4.1 | −3.1 | −44.8/−53.0 |
POL | −2.6 | −2.5 | −4.9 | −2.3 | −3.2 | −3.2 | −14.8 | −9.8/−14.8 | |
CT | −2.5 | −3.8 | −13.7 | −2.9 | −2.2 | −1.5 | −2.0 | −14.3/−36.3 | |
EX-CORR | −3.7 | −3.5 | −3.9 | −3.2 | −4.1 | −3.6 | −5.9 | −7.6/−10.9 | |
DEF | 9.2 | 10.7 | 25.5 | 9.0 | 9.7 | 8.9 | 21.0 | 52.0/85.4 | |
TOTd | −1.6 | −3.7 | −6.5 | −3.0 | −3.4 | −3.5 | −4.8 | −24.5/−29.6 | |
NBO |
![]() |
−0.13 | −1.68 | −7.45 | −0.34 | −0.29 | −0.41 | −0.26 | −5.3/−21.0 |
ε π*/σ* | −3.1 | 239.8 | 224.0 | −19.5 | −21.3 | −47.7 | 80.4 | 45.0/36.8 | |
ε lp | −155.0 | −152.2 | −293.7 | −193.3 | −264.8 | −220.3 | 5.6 | −39.3/−59.0 | |
ε π*/σ* − εlp | 151.9 | 392.0 | 517.7 | 173.8 | 243.5 | 172.6 | 74.8 | 81.4/95.8 | |
F lpπ*/σ* | 3.1 | 18.2 | 44.0 | 5.6 | 6.3 | 5.6 | 3.1 | 15.1/32.0 | |
S lpπ*/σ* | 0.03 | 0.15 | 0.28 | 0.05 | 0.06 | 0.05 | 0.03 | 0.13/0.21 | |
![]() |
−0.47 | −0.29 | −8.25 | −0.46 | −0.53 | −1.01 | −1.22 | −9.0/−30.1 | |
![]() |
−0.38 | −1.86 | −0.11 | −0.05 | −0.05 | — | — | −0.47/−0.90 | |
ΔE(2)total | −0.85 | −2.15 | −8.36 | −0.51 | −0.58 | −1.01 | −1.22 | −9.45/−31.00 | |
q(arene) [e] | −0.003 | 0.002 | −0.021 | −0.003 | −0.002 | −0.006 | −0.029 | −0.276/−0.358 | |
d X–Y [Å] | 3.12 | 2.29 | 1.99 | 3.07 | 3.10 | 3.08 | 3.25 | 2.78/2.55 | |
d X–Y (vdW)c [Å] | 3.07 | 2.64 | 2.61 | 3.07 | — | 3.22 | 3.30 | 3.45 |
According to Table 1, the Cl−–TCB interaction has by far the strongest ES component (−35 to −50 kcal mol−1). For comparison, the Cl−–indole interaction is one order of magnitude weaker. In order to estimate the contribution of long-range electrostatics to this difference, we have evaluated the ESP isosurfaces around indole, uracil, and TCB, as shown in Fig. 3. Clearly, TCB has a region of strongly positive ESP at the π-surface of the benzene ring, and especially along the normal to the ring through the C(H) carbon, whereas at the π-surface of indole, the ESP is close to zero or negative. Uracil appears as an intermediate case, with a positive ESP at the π-face, but with a smaller magnitude than in the case of TCB. The difference between the three heterocycles is even more clearly apparent from Fig. 4, where the ESP along the normals through C(H) of TCB, N of indole, and the centroid of the six-membered ring of uracil is plotted as a function of the distance from the ring plane. Fig. 4 shows that the ESP is strongly positive for TCB, slightly positive for uracil, and negative for indole. Thus, the long-range electrostatic forces for the Cl−–TCB interaction are attractive, whereas those for the Cl−–indole interaction are weakly repulsive. The negative sign of the ES component for the Cl−–indole interaction might therefore seem, at first glance, counter-intuitive, and must arise from a small but nonzero contribution from the penetration energy.
![]() | ||
Fig. 3 ESP for indole, uracil, and TCB on isosurfaces corresponding to the electron density of 0.04 a.u. The color scale runs from −0.06 to +0.06 a.u. |
Fig. 5 thus demonstrates that both long-range electrostatics and penetration energy contribute to the highly negative ES component characterizing the Cl−–TCB complex. The large extent of the short-range electrostatic energy, that is, of the penetration energy, is, in turn, indication that the charge transfer, arising from overlap of occupied orbitals of one partner with unoccupied orbitals of the other partner, operates. Without the charge transfer operating, the increase of the penetration energy upon approaching the partners is counterbalanced by the enhancement of exchange-repulsion (also exponentially increasing), which prevents a further approach. Thus, the large penetration energy involved in the Cl−–TCB complex can be related to the efficient charge transfer (see Table 1 and section “Charge transfer” below).
The different ESPs around TCB, indole, and uracil (Fig. 3) are also reflected in the ES energies calculated for the complexes with water (Table 1), although to a smaller extent. The water–TCB lp–π interaction has a larger ES component than the water–uracil and the water–indole lp–π interactions. The negative ESP along the normal to the indole plane through N can be also related to the fact that the ES component is larger for the water–indole O–H⋯π interaction compared to that for the lp⋯π interaction (see the Discussion of hydrogen bonding versus lp⋯π interaction below).
Polarizability tensor diagonal components [Å3] | |||
---|---|---|---|
α xx | α yy | α zz | |
TCB | 210 | 143 | 63 |
Indole | 134 | 104 | 54 |
Uracil | 72 | 87 | 36 |
Water | 9 | 8 | 6 |
Cl− | 10 |
It has to be noted that the efficient charge transfer operating in the Cl−–TCB complex and enabling an approach well below the sum of van der Waals radii (Table 1) contributes to a further increase of both POL and DISP components, both increasing with the negative sixth power of the distance.
Table 1 shows that the second-order perturbation energy , arising from the single lp → π* orbital interaction, makes a dominant contribution to the CT energy of the Cl−–TCB lp–π complex. The same statement applies to the indole–water classical N–H⋯O hydrogen bond, where the dominant charge transfer goes from the lp of the water oxygen to the antibonding σ* orbital of the N–H bond of indole.38 All the other lp–π systems have
energies of the order of a few tenths of kcal mol−1, which are values similar to the
stabilization energies of very weak hydrogen bonds, such as CH4⋯OH2.38 The
energy for chloride–TCB of 21 kcal mol−1, on the other hand, corresponds to
values of strong hydrogen bonds, such as FH⋯OH2.38 The
energy for the chloride–TCB interaction is also more than twice as large as the
energy calculated for the classical indole–water N–H⋯OH2 hydrogen bond, 7.5 kcal mol−1 (Table 1).
The energy for the fully optimized chloride–TCB complex of 21.0 kcal mol−1 is similar to the value of 21.2 kcal mol−1 obtained by Berryman et al.10 It is noteworthy that for the symmetrical complex where Cl− approaches the ring centroid of TCB, the ORB component calculated with the EDA method is −9.7 kcal mol−1, about one third of the value of −29.2 kcal mol−1 obtained for the fully optimized off-center complex (Table 1 and Table S1, Fig. S1, ESI†). This indicates that while in the centrosymmetric approach the orbital interaction is somewhat weaker than that in the off-center geometry, it is not neglectable, and classifying this interaction as “non-covalent” or “electrostatic” appears misleading. Our data concurs with the previous finding of a significant global delocalization index for this symmetrical interaction arrangement.24
The second-order CT-stabilization energy calculated for all donor–acceptor orbital pairs, ΔE(2)total, can be decomposed into the components corresponding to CT from H2O/Cl− orbitals to the ring system, , and those corresponding to the CT in the opposite direction,
(bottom part of Table 1). For all the lp–π complexes except the water–indole complex, the CT is practically unidirectional, i.e.
. For the water–indole lp–π complex, the
and
components are of similar magnitude, and ΔE(2)total is composed of several small CT interactions in both directions.
Fig. 6 displays the electron deformation density associated with natural orbitals for chemical valence (NOCV)39 channels with the highest stabilization energy identified for the lp–π interactions Cl−–TCB, water–TCB, and water–indole, and for the two water–indole hydrogen-bonded complexes. The identification of the Cl−–TCB lp–π CT as the orbital interaction with the highest stabilization orbital energy, followed by the indole–water N–H⋯O hydrogen bond, corresponds to the CT-component ranking in Table 1, confirming again the dominant role of these orbital interactions. We observe that greater stabilization energy goes generally along with a greater amount of transferred charge; however, there is no obvious quantitative correlation. Note that in the two hydrogen-bonding modes of the water–indole interaction the roles of donor and acceptor are inversed, as manifest in the inverse sign of Δq.
In contrast to the above hypothesis, our results, summarized in Table 1, indicate that water–uracil and water–indole lp–π interactions are governed by relatively weak forces, composed of small (≤3.5 kcal mol−1) contributions of ES, POL, CT, and DISP components. On perusal of Table 1, we can see that for all these water–π-ring lp–π interactions, the ES component happens to be of a similar extent as the total interaction energy, that is, all the other components sum up to approximately zero. Such incidental cancelling of energy components can, however, give rise to misinterpretations, and can lead, for instance, to interpretations of hydrogen bonding as either purely electrostatic, or purely covalent.33 We have already reported such mutual cancelling of non-electrostatic components for other water–π-ring lp–π interactions,44 and explained thus previous observations that lp–π interactions involving water, alcohols, or ethers as lp donors can be seemingly described as predominantly electrostatic interactions.45 This cancelling of components also explains why classical force-fields are so successful in portraying non-bonding forces in biomolecules, without taking polarization or charge transfer specifically into account.
Particularly interesting is our finding that water associations with the π-face of indole are more likely to occur via O–H⋯π hydrogen bonding than via an lp–π interaction. Although our calculations do not account for solvation or effects of neighboring residues, which may reverse the in vacuo stabilities, this result raises the question whether the out-of-plane (π-surface) contacts between water and tryptophan frequently observed in protein structures may not arise, at least in some cases, from O–H⋯π hydrogen bonding rather than from lp–π interactions, as previously suggested.6 In fact, since both the O–H⋯π and the lp–π interactions are relatively weak, it is conceivable that in protein structures, the O–H⋯π and lp–π states constitute a dynamic equilibrium which would provide an entropic advantage for water molecules in such sites.
(i) At the “strongly-binding” (ETOT ≈ −30 kcal mol−1) end of the spectrum, we find the Cl−–TCB lp–π complex characterized by a considerable shortening of the distance between the closest atoms beyond the sum of van der Waals radii (Δr ≈ 0.9 Å in the fully relaxed complex) and by a high CT contribution, arising from an efficient overlap between the donor lp and acceptor π* orbitals and a small gap between their orbital energies. A significant charge transfer stabilization () was already found previously for this complex by Berryman et al.10 Those authors referred to this complex class as “weakly covalent donor to π–acceptor complexes”, as opposed to strongly covalent analogs with F− where
is one order of magnitude larger and the interaction energy approaches values typical of covalent bonds.10 The shortening of the interfragment distance beyond the van der Waals distance is accompanied by an enhancement of penetration energy (contributing about 50% to the ES component) and of Pauli repulsion, both of which increase exponentially with decreasing distance. The close approach of the fragments also causes an increase of the POL and DISP components, corresponding to their dependence on R−6. It is therefore important to note that the large ES, POL, and DISP components of the Cl−–TCB lp–π interaction are, in part, a consequence of the large CT which enables a significant shortening of the interatomic distance beyond the van der Waals contact. However, not the entire extent of the ES, POL, and DISP components is due to the close approach of the fragments induced by the orbital interaction: their long-range parts are significant as well. For the ES component, we have demonstrated in Fig. 5 that the long-range part, LRES, is also greater in magnitude than, for instance, that of the Cl−–indole interaction. As for POL and DISP, we can roughly infer the long-range contributions from the zz components of the polarizability tensors (Table 2): at the van der Waals distance, these components for Cl−–TCB are expected to be slightly larger than those for Cl−–indole, and considerably larger than those for all the other lp–π interactions.
(ii) The intermediate class (−7 kcal mol−1 < ETOT < −3 kcal mol−1) is represented by the water–uracil lp–π complexes, the water–TCB lp–π complex, and the Cl−–indole complex. In this class, the CT component is at least one order of magnitude smaller than that of the first class, and the interfragment separation is within 0.2 Å from the van der Waals distance. Within this group, the Cl−–indole complex differs from the others by repulsive long-range electrostatics (Fig. 5), and by larger polarization energy, arising from the polarizing power of the chloride ion.
For the water–uracil and water–TCB lp–π interactions, the stabilization energy is composed of small amounts of ES, POL, CT, and DISP, neither of which exceeds −4.2 kcal mol−1 in any of the three decomposition analyses. The water–uracil(M) lp–π total interaction energy of −3.4 ± 0.2 kcal mol−1 is identical with the value obtained previously from MP2/6-31G* calculations1 and similar to that determined for the deoxyribose–guanine lp–π interaction energy at the same level of theory.46 This suggests that the oxygen lone pairs of electrons of water and deoxyribose interact with π-systems with similar energetics, and our results for the water–uracil lp–π interaction can be thus extrapolated to deoxyribose–nucleobase lp–π interactions.
(iii) At the “weakly-binding” end of the spectrum (ETOT ≈ −2 kcal mol−1), we find the water–indole lp–π interaction, which combines repulsive long-range electrostatics with relatively low penetration energy, charge-transfer, polarization and dispersion components.
Our calculations show that the lp–π contacts that water forms with the aromatic residues of nucleic acids and proteins correspond to bonding interactions with a very weak CT component. For contacts between water molecules and the π-face of tryptophan, which have been shown to stabilize proteins and protein–DNA complexes,6 our calculations suggest that the O–H⋯π hydrogen bonding is the more likely binding mode than the lp–π interaction.
In summary, our work has shown that lone-pair–π interactions can range from weak bonding to relatively strong bonding with a significant CT contribution. A sizeable CT component requires a donor molecule with a high-energy lone pair and a π–acceptor with a relatively low-energy, empty π*-orbital, allowing for efficient overlap with the donor lp-orbital. In agreement with previous results,10 we found that for TCB an off-center interaction with an anion (chloride) results in better lp–π* overlap and stronger CT. However, our results indicate that even the symmetrical Cl−–TCB approach results in a non-neglectable orbital interaction (Fig. S1, ESI†), an effect previously termed “multi-center-covalency”.24 We conclude that considering lone-pair–π or anion–π interactions generally as “non-covalent” (ref. 4, 17, 22, 23 and references therein) does not correspond to reality.
The orbital interactions were examined within the NBO module Version 6.061 implemented in Gaussian 0959 at the same level of theory as in the AIM analysis. The total CT energies between partners, and
, were calculated considering all possible CTs from occupied to vacant orbitals, with the default threshold for intramolecular orbital transitions set to 0.05 kcal mol−1. The NEDA decomposition analysis62 was performed at the M06-2X/6-31G(d,p) level of theory using the NBO module within the GAMESS package.63,64
Footnotes |
† Electronic supplementary information (ESI) available: Fig. S1: Electrostatic and Orbital energy components of the Cl−–TCB complex in the symmetrical and off-center geometries; Table S1: EDA analysis of the Cl−–TCB complex in the symmetrical and off-center geometries, Table S2: PDB coordinate files of the analyzed complexes. See DOI: 10.1039/c6cp01524g |
‡ These authors contributed equally. |
§ Present address: Center for Free-Electron Laser Science, DESY, Notkestraße 85, Hamburg, Germany. |
This journal is © the Owner Societies 2016 |