Bridgit O.
Crews
a,
Ali
Abo-Riziq
a,
Kristýna
Pluháčková
b,
Patrina
Thompson
c,
Glake
Hill
c,
Pavel
Hobza
bd and
Mattanjah S.
de Vries
*a
aDepartment of Chemistry and Biochemistry, University of California Santa Barbara, CA 93106-9510, USA. E-mail: devries@chem.ucsb.edu
bInstitute of Organic Chemistry and Biochemistry, Academy of Sciences of the Czech Republic and Center for Biomolecules and Complex Molecular Systems, Flemingovo nam. 2, 160 10 Prague 6, Czech Republic
cDepartment of Chemistry, Jackson State University, Jackson MS 39217-0510, USA
dDepartment of Physical Chemistry, Palacky University, 771 46 Olomouc, Czech Republic
First published on 8th March 2010
Double resonance spectroscopy of clusters of guanine with aspartic acid reveals geometries similar to patterns exhibited in DNA base pairs. In the spectral region of 32800 cm−1 to 35
500 cm−1 we observe five isomers of guanine–aspartic acid clusters and assign their structures based on IR–UV hole-burning spectra and wave function theory calculations at the MP2/cc-pVDZ and MP2/cc-pVTZ levels. The calculations employed both harmonic and one-dimensional scan anharmonic approximations. Three of the isomers are similar, assigned to structures containing three hydrogen bonds and 9-enolguanine. We assign the fourth isomer to a structure containing a 9-keto tautomer of guanine and forming a triply bonded structure similar to a base pairing interaction. The fifth isomer dissociates with proton transfer upon excitation or ionization. This is the first set of experiments and high-level ab initio calculations of the isolated, microscopic interactions of an amino acid and a nucleobase, the building blocks of nucleic acids and proteins.
Although the complex of guanine with aspartic acid (Asp) might seem to be small or artificial in fact it is a real model system of interactions between a base pair and an amino acid. Both guanine and aspartic acid can be part of much larger system but still remain in the same structural complex with each other. We have found isomers in which aspartic acid binds to the Watson–Crick edge of guanine while the terminal COOH and NH2 groups of aspartic acid remain available to form peptide bonds with other amino acids. This is possible because only the CO part of the C-terminus COOH group participates in hydrogen bonding to guanine while the most important binding group appears to be the COOH group in the side-chain. In the protein data bank (www.pdb.org) there are 2318 structures containing a protein and a nucleic acid. Among these there are 208 contacts between guanine and aspartic acid within 3 Å. 47.1% of these contacts are between aspartic acid and phosphate groups; in 34.6% cases Asp is bound to the Watson–Crick edge of guanine, 11.5% of the Asp is in contact with a sugar, 5.3% of Asp are bound to the sugar edge of guanine, 1% to the Hoogsteen edge of guanine and 0.5% are in a stacked orientation with guanine.
When calculating frequencies, the complex of guanine with aspartic acid needs to be treated in an anharmonic way because as we have found for guanine,4 distinguishing among the tautomers is impossible with the harmonic approximation. The same is true for the guanine–cytosine base pair where the three hydrogen bonds exhibit strongly anharmonic character.5 Generally, rotational spectroscopic techniques provide spectra that are very difficult to unravel for larger systems and vibrational spectroscopic techniques do not usually provide enough structural information. Problems arise from nonlinear dependence of the complexity of the molecular dynamical problems on the number of actual vibrational degrees of freedom and from conformational complexity. A popular way of overcoming these problems is empirical scaling of the calculated harmonic frequencies. However, in the case of vibrational motions opposed by strongly anharmonic potentials (motions involved in hydrogen bonding) the approach is no longer reliable.6 A possible solution is an adiabatic separation of the probing molecular modes. Being only few dimensional the resulting dynamical problems are tractable in a numerically exact way for practically any shape of the corresponding effective potentials.7
Our current observations suggest that molecules other than nucleobases, such as amino acids, may also affect the excited state potential energy surface of the guanine nucleobase by clustering in geometries very similar to the geometry exhibited in base pairing. The fact that amino acids form complexes with guanine in base pair like structural motifs also highlights the idea that these interactions may serve to distinguish among bases in protein–nucleic acid interactions.2,8
For the naming of the clusters we have tried to remain consistent with that previously published on the structure of the isolated nucleobase guanine.14 Keto clusters refer to any cluster in which guanine is in its ‘keto’ form, identified by a keto oxygen on carbon “6” of the purine ring structure. Enol structures refer to clusters in which guanine is in its ‘enol’ form, identified by an hydroxyl group on carbon “6” of the purine ring structure, and ‘oxo-imino’ forms are identified by a keto oxygen on carbon “6” of the purine ring structure combined with an imino nitrogen group on carbon “2” of the purine ring structure. The number which precedes the keto, enol, or oxo-imino refers to the placement of the mobile protons which attach to different atoms of the purine ring structure to form the multiple tautomers of guanine. The clusters we describe in most detail in the following discussion involve the 7,9-keto and 7,9-enol tautomers of guanine. These are clusters in which guanine is in the keto or enol form and a mobile proton is attached to either nitrogen “7” or nitrogen “9” of the purine ring structure.
We ran clusters of guanine–aspartic acid with guanine in each of its five lowest energy forms: 7, 9-keto, 7, 9-enol and 7-oxo-imino. We performed simulated annealing by running at a high temperature (800 K) for 30 ps followed by 10 ps of stepwise cooling and energy minimization. We imposed a distance restraint between guanine and aspartic acid during simulated annealing and removed it during energy minimization. The location of the restraint was varied in three rounds of annealing with 500 iterations of heating and stepwise cooling. We sorted the resulting structures according to energy, then optimized and calculated theoretical harmonic frequencies for unique structures. The resulting structures were re-optimized using B3LYP/6-31G to weed out high energy structures and minimize the number of structures to be calculated at a higher basis set; however, we found that guanine tautomerizes from enol to keto forms in this basis. When we ran identical structures in the 6-31+G* basis set, the enol forms did not tautomerize into the keto forms. We calculated B3LYP/6-31+G* energies and harmonic frequencies for approximately 30 clusters of each tautomer of guanine.
Because the calculation of the spectra must be done with the best possible geometries and most accurate method for comparison with the experimental data we re-optimized the lowest energy B3LYP structures by means of the MP2/cc-pVTZ method.15,16 According to our previous experience this is the most suitable approach for hydrogen bonded as well as for stacked complexes of this size. We used the resolution of identity approximation for all MP2 calculations.17 Each MP2/cc-pVTZ structure was then re-optimized using the MP2/cc-pVDZ method with tough convergence criteria (TURBOMOLE convergence criteria were set as scfconv <1e−8, denconv <1e−8 and gcart = 4) so that the structures were optimally minimized and frequency analysis could be done at the same level of theory. In addition to calculating interaction energies and interaction enthalpies at MP2/cc-pVDZ and MP2/cc-pVTZ levels we also included unscaled zero-point vibration energies (ZPVE) at the cc-pVDZ level of theory from harmonic vibrational frequency calculations and evaluated interaction and relative interaction enthalpies at both levels of theory.
For calculation of harmonic vibrational frequencies and IR spectra we used method NumForce as implemented in the TURBOMOLE 5.8 package program.18 This method is based on knowledge of the molecular Hessian, matrix of second derivatives of total energies. Because very fast tautomerization processes exist in the complex we were interested in the anharmonicity of the individual vibrational modes. Therefore we also calculated one-dimensional anharmonic Hamiltonians and potential energy functions evaluated by means of the MP2/cc-PVDZ method. A typical stretch grid was constructed in the following way: hydrogen bonds were shortened by up to 0.25 Å with 0.025 Å steps and elongated by 1 Å with 0.05 Å steps up to 2 Å. This approach allowed good sampling of the potential in all regions and smoothing by fitting to polynomials expressed in terms of the Morse variable y.
y = 1 − e−a(R–Re) |
The methodology for assigning spectra was as follows. First we calculated unscaled MP2/cc-pVDZ infrared spectra using a harmonic approximation for all complexes, for keto and enol forms of guanine and for normal and protonated aspartic acid. Next we calculated one-dimensional potential energy scans of relevant hydrogens for protonated aspartic acid and for the most stable complexes in each of four structural groups: two keto and two enol complexes, each time one being triply and one doubly hydrogen bonded. After fitting these scans we obtained both harmonic and anharmonic frequencies of these vibrations. By comparing harmonic frequencies from one-dimensional scans and standard harmonic spectra we concluded how much each individual mode is coupled with other modes in the molecule. After comparing the harmonic and anhamormonic frequencies we obtained the amount of anharmonicity in each mode and calculated suitable scaling factors for each vibration. These we then applied to standard harmonic spectra. The resulting spectra therefore account for both anharmonicity and mode coupling. The intensities were taken directly from standard harmonic spectra because they are not as strongly influenced by anharmonicity as the vibrational frequencies themselves. Finally the spectra were compared to the experimental ones and if needed shifted by a constant to minimize the differences.
![]() | ||
Fig. 1 Lowest energy structures of 9-keto and 9-enol guanine–aspartic acid clusters calculated at MP2/cc-pVTZ level. Both relative interaction energies in black and relative interaction enthalpies in red in parenthesis are BSSE corrected and in kcal mol−1. All energies are relative to structure 9KN. |
The structures in Fig. 1 are labeled according to tautomeric form and relative energy calculated at the B3LYP/6-31+G* level. 9E1 is the lowest energy cluster found with guanine in 9-enol form, 9K2 is the second lowest energy cluster found for guanine in 9-keto form, etc. Structures 9EN and 9KN were not predicted by B3LYP/6-31+G* to be in the set of the most stable structures but they appeared after re-optimization at the MP2/cc-pVTZ level of theory, whereby the 9KN structure emerged as the global minimum. Relative interaction electronic energies and enthalpies (in parentheses) are given in kcal mol−1 and are relative to the lowest energy cluster we found, 9KN. The lowest energy structure contains 9-keto guanine in a ‘base pairing’ type configuration in which aspartic acid binds to the base pairing site of guanine through three hydrogen bonds. 9K1 and 9K5 have a similar type of interaction, slightly higher in energy by 1.88 and 2.40 kcal mol−1, respectively. The 9-enol guanine structures 9EN and 9E7 are analogous but higher in energy by 4.93 and 4.60 kcal mol−1, respectively. Structure 9E1 exhibits a bifurcated hydrogen bond between both oxygens of the COOH group of aspartic acid and hydrogen in the NH2 group of guanine. This structure is only 2.56 kcal mol−1 higher in energy than the most stable structure, 9KN. However, this binding motif is quite unique for the isolated molecules because when aspartic acid is part of a protein the COOH group is part of a peptide bond and it is likely that this type of interaction cannot occur. Complexes with two H-bonds are less stable than complexes with three H-bonds and the 9-keto complexes generally are lower in energy than 9-enol complexes. Structure 9K6 is shown for comparison as it presents the only structure where aspartic acid is bonded to the sugar edge of guanine. However this complex has only two H-bonds and is 11.04 kcal mol−1 higher in energy. All of the clusters involve hydrogen bonds with the side-chain of aspartic acid.
![]() | ||
Fig. 2 (A) Mass spectrum of guanine–aspartic acid. (B) R2PI spectrum of guanine–aspartic acid with unique isomer origins labeled I–IV as revealed by UV–UV hole burning. |
We also observe a peak for protonated aspartic acid in the mass spectrum. Since aspartic acid does not absorb at any of the UV wavelengths in these experiments, the only possibility is that this peak arises from an excited or ionized guanine–aspartic acid cluster that undergoes proton transfer and dissociation to form protonated aspartic acid. This cluster corresponds to isomer V as determined by double resonance spectroscopy, described below.
![]() | ||
Fig. 3 IR hole-burning spectrum of isomer I. Stick spectra show anharmonic frequencies calculated for the lowest energy keto structures. See Table 1 for relative intensities. Color coding indicates modes as shown in the 9KN structure at the top and specified in the text. The spectrum in the inset is from protonated aspartic acid fragments, due to isomer V. |
![]() | ||
Fig. 4 IR hole-burning spectrum of isomers II–IV. Stick spectra show anharmonic frequencies calculated for the lowest energy enol structures. See Table 1 for relative intensities. Color coding indicates modes as shown in the 9EN structure at the top and specified in the text. The spectrum in the inset is from protonated aspartic acid fragments, due to isomer V. |
Hessian | Scan | ||
---|---|---|---|
9E3 | Harmonic freq. | Intensities | Anharmonic freq. |
COOH HB Asp | 2787.2 | 2787.26 | — |
COOH Asp C-term Asp | 3761.3 | 73.53 | 3611.606 |
OH HB G | 3273.3 | 3273.32 | — |
NH2s HB G | 3589.3 | 58.33 | 3454.726 |
N9H G | 3674 | 106.03 | 3537.703 |
NH2a HB G | 3718 | 52.76 | 3580.936 |
NH2s Asp | 3511.8 | 3.81 | 3438.052 |
NH2a Asp | 3594.8 | 8.49 | 3519.309 |
9E7 | Harmonic freq. | Intensities | Anharmonic freq. |
NH2s HB G | 3545 | 326.92 | 3408.014 |
NH2a HB G | 3693 | 118.79 | 3556.857 |
N9H G | 3675 | 104.9 | 3538.778 |
COOH Asp C-term | 3484 | 359.67 | 3237.038 |
OH HB G | 3323 | 1930.7 | 3006.189 |
COOH HB | 2668 | 2898.07 | 1812.5 |
NH2a Asp | 3616 | 9.4 | 3460.512 |
NH2s Asp | 3523 | 11.97 | 3371.511 |
9EN | Harmonic freq. | Intensities | Anharmonic freq. |
NH2s HB G | 3546 | 287.64 | 3408.975 |
NH2a HB G | 3689 | 130.51 | 3553.005 |
N9H G | 3674 | 95.41 | 3537.815 |
COOH Asp C-term | 3754 | 77.68 | 3487.899 |
OH HB G | 3331 | 1792.91 | 3013.426 |
COOH HB | 2681 | 2851.97 | 1821.332 |
NH2a Asp | 3614 | 23.05 | 3458.598 |
NH2s Asp | 3517 | 12.5 | 3365.769 |
9E1 | Harmonic freq. | Intensities | Anharmonic freq. |
NH2s HB G | 3574 | 160.27 | 3435.893 |
NH2a HB G | 3702 | 93.49 | 3565.525 |
N9H G | 3675 | 107.08 | 3538.778 |
COOH Asp C-term | 3421 | 380.19 | 3178.504 |
OH HB G | 3316 | 1973.38 | 2999.856 |
COOH HB | 2435 | 3697.09 | 1654.212 |
NH2a Asp | 3635 | 24.6 | 3478.695 |
NH2s Asp | 3532 | 12.92 | 3380.124 |
Hessian | |||
9KN | Harmonic freq. | Intensities | Anharmonic freq. |
COOH HB | 2833 | 2413.18 | 2310.878 |
COOH Asp C-term | 3382 | 348.31 | 3134.267 |
N3H HB G | 3422 | 678.44 | 3262.743 |
NH2s HB G | 3511 | 403.24 | 3353.303 |
N9H G | 3671 | 96.54 | 3534.815 |
NH2a HB G | 3674 | 122 | 3537.256 |
NH2s Asp | 3515 | 30.29 | 3441.185 |
NH2a Asp | 3624 | 34.82 | 3547.896 |
9K1 | |||
COOH HB | 2881 | 2281.48 | 2350.032 |
COOH Asp C-term | 3437 | 580.14 | 3185.239 |
N3H HB G | 3427 | 491.81 | 3267.511 |
NH2s HB G | 3491 | 588.66 | 3334.201 |
N9H G | 3675 | 116.12 | 3538.666 |
NH2a HB G | 3671 | 93.74 | 3534.367 |
NH2s Asp | 3521 | 11.49 | 3447.059 |
NH2a Asp | 3627 | 18.21 | 3550.833 |
9K5 | |||
COOH HB | 2948 | 2220.38 | 2404.684 |
COOH Asp C-term | 3444 | 418.54 | 3191.726 |
N3H HB G | 3402 | 855.5 | 3243.674 |
NH2s HB G | 3543 | 294.76 | 3383.865 |
N9H G | 3672 | 103.73 | 3535.777 |
NH2a HB G | 3682 | 128.47 | 3544.958 |
NH2s Asp | 3520 | 0.79 | 3446.08 |
NH2a Asp | 3621 | 12.26 | 3544.959 |
Hessian | |||
9K2 | Harmonic freq. | Intensities | Anharmonic freq. |
COOH HB | 2869 | 2690.82 | 2340.243 |
COOH Asp C-term | 3497 | 250 | 3240.844 |
N3H HB G | 3256 | 1878.95 | 3104.469 |
NH2s HB G | 3584 | 71.58 | 3508.736 |
N9H G | 3671 | 103.62 | 3534.815 |
NH2a HB G | 3712 | 60.93 | 3634.048 |
NH2s Asp | 3530 | 38.21 | 3455.87 |
NH2a Asp | 3628 | 29.62 | 3551.812 |
9K3 | |||
COOH HB | 2989 | 1949.07 | 2438.127 |
COOH Asp C-term | 3762 | 72 | 3486.432 |
N3H HB G | 3224 | 2349.25 | 3073.958 |
NH2s HB G | 3585 | 67.32 | 3509.715 |
N9H G | 3671 | 103.3 | 3534.815 |
NH2a HB G | 3714 | 57.53 | 3636.006 |
NH2s Asp | 3517 | 4.16 | 3443.143 |
NH2a Asp | 3610 | 7.98 | 3534.19 |
9K4 | |||
COOH HB | 2915 | 2491.1 | 2377.766 |
COOH Asp C-term | 3757 | 71.71 | 3481.799 |
N3H HB G | 3237 | 2111.26 | 3086.353 |
NH2s HB G | 3587 | 69.52 | 3511.673 |
N9H G | 3671 | 104.93 | 3534.815 |
NH2a HB G | 3717 | 60.8 | 3638.943 |
NH2s Asp | 3513 | 3.98 | 3439.227 |
NH2a Asp | 3601 | 10.21 | 3525.379 |
The IR spectra of isomers II–IV are very similar, while that of isomer I is distinctly different. This observation strongly suggests that isomers II–IV are variations of similar cluster structures, possibly differing only by small changes in amino acid conformation or by 7H versus 9H tautomer form. The latter explanation is less likely taking into consideration the large shift between electronic origins of the 7H enol and 9H enol forms of free guanine which are shifted by more than 1600 cm−1 relative to each other.19–21 Isomer I on the other hand exhibits a significantly different IR spectrum, suggesting that it belongs to a very different cluster structure. The four lowest energy calculated structures belong to two families: triply hydrogen bonded structures with keto guanine (9Kx) versus triply hydrogen bonded structures with enol guanine (9Ex). Clusters which are only doubly bonded structures are at least 3 kcal mol−1 higher in energy. Therefore we propose that the two types of observed IR spectra (for isomer I versus isomers II–IV) represent those two families of structures. Comparison with the corresponding calculated frequencies provides a better fit for the enol structures for isomers II–IV and the keto structures for isomer I, as further detailed below.
Fig. S3 of the ESI† shows one-dimensional potential hydrogen scans of five hydrogens (COOHHB,Asp, COOHAsp–C terminus, N9HG, N3HHB,G and NH2–s,HB,G). The colors of individual curves correspond to those in Fig. 3. The hydrogens involved in hydrogen bonds exhibit strongly anharmonic behavior and cannot be correctly described by harmonic potentials. The calculated spectra in Fig. 3 and 4 include the anharmonic correction. An even more complete correction would require two-dimensional scans for the NH2 group as well as inclusion of the movement of heavy atoms along the hydrogen bond coordinate.
Energetics alone are not enough to make an assignment, as previously demonstrated by the lack of observation of the lowest energy guanine monomers in R2PI experiments.22 We do note an intriguing similarity between the origins of the R2PI spectra of isomer I and the guanine–cytosine base pair, suggesting that the electronic configuration of guanine is similar in these two clusters, which we expect if the structure and bonding pattern of guanine is very similar in the two clusters. This observation, illustrated in Fig. 5, suggests that guanine in isomer I has a similar electronic configuration and structure to that of guanine in the Watson–Crick base pair which originates from a keto tautomer, and is simulated by 9-substituted guanine and 1-substituted cytosine in place of the DNA backbone.23,24 The spectra are similar in the frequencies of their electronic origins. The R2PI spectra of both guanine–aspartic acid and guanine–cytosine base pairs begin near 32800 cm−1 and continue as broad spectra for several hundred wavenumbers. Guanine–aspartic acid isomers II–IV, which are assigned as 9-enol containing tautomers and 9-enol guanine monomer have electronic origins far blue-shifted, near 33
700 cm−1 and 34
800 cm−1, respectively. The guanine dimer is also suspected to contain 9-keto guanine in a similar base pairing configuration. Its R2PI spectrum is slightly less red-shifted and not as broad; however, a lower energy symmetric structure has, so far, not been observed in the gas phase, presumably due to a short excited state lifetime.
Footnote |
† Electronic supplementary information (ESI) available: Fig. S1–S3. See DOI: 10.1039/b925340h |
This journal is © the Owner Societies 2010 |