Yan
Zhao
,
Benjamin J.
Lynch
and
Donald G.
Truhlar
Department of Chemistry and Supercomputing Institute, University of Minnesota, 207 Pleasant Street S.E., Minneapolis, MN 55455-0431, USA
First published on 10th December 2004
We have developed a new kind of multi-coefficient correlation method (MCCM) by empirically mixing correlated wave function methods and density functional methods. The new methods constitute a generalization of hybrid density functional theory and may be called multi-coefficient extrapolated density functional theory. Results by the new methods are compared to those obtained by G3SX, G3SX(MP3), CBS-Q and MCCM/3 for calculations of atomization energies, barrier heights, ionization potentials and electron affinities. These results show that the multi-coefficient extrapolated density functional theory is more accurate for thermochemistry and thermochemical kinetics than the pure wave function methods of comparable cost. As a byproduct of this work we optimized a new hybrid meta density functional theory called TPSS1KCIS, which has excellent performance for thermochemistry.
Among the wavefunction-based approaches, the Gaussian-n theories and their variants developed by Pople and coworkers,1–3 the complete basis set (CBS) family of methods by Petersson and coworkers,4,5 the Weizmann-n theories and their variants of Martin and coworkers,6,7 and the multi-coefficient correlation methods (MCCMs)8–15 of our group are the most popular schemes which achieve “chemical accuracy” (±1 kcal mol−1 for bond energies and barrier heights). These methods are classified as multilevel methods because they involve combining more than one level of electronic structure theory and/or different one-electron basis sets, usually with empirical parameters, to try to extrapolate to a more accurate result than the most accurate component calculation, and this multi-level approach is a very powerful alternative to single-level ab initio methods for many types of calculations. Unfortunately, the computational cost of most of the above mentioned methods such as G2,1 G3,2 CBS-Q,4 CBS/APNO,4 W1,6 W2,6 and multi-coefficient G3 (MCG3)11,15 formally scales as N7, where N is the number of atoms. If one wants to calculate consistent gradients or Hessians (for example, in dynamics calculations or geometry optimizations), these methods are very expensive, and sometimes prohibitively so.
A previous paper15 developed a suite of MCCMs of varying accuracy and cost, namely the MCCM/3 suite, that is suitable for a variety of problems with different sizes of molecules and different accuracy requirements. The recommended methods in the MCCM/3 suite are MC-CO/3, MC-UT/3, MC-QCISD/3 and MCG3/3. A particularly powerful version of the MC method is the MC-QCISD method,13,15 which scales as N6. In the limit of only a single coefficient, the MCCM methods reduce to the older scaling-all-correlation (SAC) method,16 which scales as N5. Although the MCCM and SAC methods have empirical parameters, the guiding principle of scaling the correlation energy and extrapolating to an infinite one-electron basis set provide a rational basis for the functional forms in which the parameters are embedded.
In addition to these WFT approaches, DFT-type methods17–63 have been widely used for thermochemistry and thermochemical kinetics due to their excellent cost-to-performance ratio. Hybrid DFT is a particularly powerful subset of DFT methods in which nonlocal Hartree–Fock exchange is combined with density functionals in the one-electron Hamiltonian (i.e., the Fock–Kohn–Sham operator) that determines the orbitals. Like the Gaussian-n and MCCM theories, hybrid DFT theories are all semiempirical. Recently we optimized a 1-parameter DFT model for kinetics, namely BB1K based on Becke’s 1988 gradient corrected exchange functional (Becke88 or B)20 and Becke’s 1995 kinetic-energy-dependent dynamical correlation functional (Becke95 or B95).26 BB1K is a hybrid meta DFT method which has better performance for kinetics than the pervious successful non-meta MPW1K model.42 (The descriptor “meta” means that the “density functional” depends not just on the density and its gradient but also on the orbitals in the form of a kinetic energy density.) More recently we optimized two hybrid DFT models based on the modified Perdew and Wang exchange functional31 (MPW) and the B95 meta correlation functional.26 One is called MPW1B95,63 and it is a hybrid DFT model for general applications in thermochemistry. Our assessments show that MPW1B95 has better performance for noncovalent interactions than the also very successful B1B9526 and B97-247 models. The other model we developed is called MPWB1K,63 and it is a one-parameter hybrid meta DFT method for kinetics. The MPWB1K method gives good results for thermochemistry, thermochemical kinetics, hydrogen bonding and weak interactions. Although one sometimes makes a distinction between pure DFT, where the density functionals depend explicitly on the density and its derivatives, and meta, hybrid, and hybrid meta DFT, where the functionals also depend explicitly on Kohn–Sham orbitals and (in meta methods) on their derivatives, we note that technically these are all DFT since the generalized Kohn–Sham orbitals are functionals of the Kohn–Sham potential, which is a functional of the density.19,38 When, for either conceptual or operational reasons, we want to remain mindful of the explicit orbital dependence density functionals that depend on orbitals may be called orbital dependent DFT (or ODDFT) functionals. Nevertheless, when we call a method DFT, we do exclude ODDFT; we need to keep the terminology manageable.
The inclusion of the Hartree–Fock component in hybrid DFT is not the only way to combine the WFT methods with DFT methods. Another combination is to use hybrid DFT to optimize geometries and calculate zero point vibrational energy (ZPVE). For example, the wave function-based G3SX,3 W1,6 and CBS-QB35 methods use the B3LYP25 hybrid DFT method to optimize geometries and calculate the ZPVE. Recently we generalized the multi-coefficient approach to directly mix WFT energies and gradients with DFT ones.64 In particular, we used the multi-coefficient approach to mix the SAC method8,12,15,16,65 with a hybrid meta DFT or hybrid DFT method. The resulting methods are called the multi-coefficient three-parameter Becke88-Becke95 (MC3BB) method64 and multi-coefficient three-parameter modified Perdew-Wang (MC3MPW) method.64 These two methods can also be viewed as a semiempirical hybrid of the SAC method and the hybrid meta DFT or hybrid DFT method, respectively, so we also labeled the MC3BB method as a doubly hybrid meta DFT method, and we call the MC3MPW method a doubly hybrid DFT method. The MC3BB and MC3MPW methods give better performance for the atomization energies and barrier heights calculations than the hybrid meta DFT and hybrid DFT methods, but at only slightly increased cost. A key distinction between these MC methods and previous empirical DFT methods that involve more than one linear parameters (such as B3PW9123 and B3LYP25) is the explicit attempt to extrapolate to a complete basis set and full configuration interaction; for this reason we call the new methods multi-coefficient extrapolated DFT.
An important conclusion of our recent paper63 mentioned above is that the modified Perdew and Wang exchange functional (MPW)31 is the overall best available exchange functional, and the Becke’s 1995 correlation functional (B95)26 is the best correlation functional for complementing it. Therefore, in the present paper, we will develop a new doubly hybrid meta DFT method based on combining MPW, B95 and SAC. Following the spirit of standard procedures for naming DFT methods, the new method is called the multi-coefficient three-parameter modified Perdew-Wang-Becke95 (MC3MPWB) method, similarly we optimize another MC3 method called MC3TS in which the exchange is treated by the functional of Tao et al.59 (called TPSS), and correlation is treated by the method of Savin and coworkers36,37,51 (sometimes called KCIS, but here abbreviated as S).
The present article is, however, primarily addressed to the question of whether we can profitably combine higher-level WFT methods with DFT. To answer this, we will develop a series of doubly hybrid methods that directly mix the WFT-based MCCMs with DFT using semiempirical parameters. We will compare the accuracy of these new multi-coefficient extrapolated DFT methods to wave-function based methods such as the CBS-Q, G3SX and the MCCM/3 suite and to nonextrapolated DFT methods such as the popular B3LYP hybrid DFT method and a new DFT method, presented in the present article, called TPSS1KCIS (which is explained in section 3).
One line of research seeks DFT methods where the parameters are obtained from general principles rather than semiempirically, but the present work de-emphasized that distinction in light of the thin line35 between theoretical considerations and empiricism and in light of lack of a unique theoretical justification for the functional forms themselves. As clarified by Becke,29 the fundamental value of the semiempirical hybrid approach is not diminished by parametrization. In fact, a flexible parametrization has implications for the limits achievable by a particular combination of elements and functional dependences, and it provides a guide into the areas where further development and new insights are required.29 This is the spirit of the present work. At the same time, the resulting parametrized methods can be very useful for practical work until such (unknown) time when additional breakthroughs may allow a more fundamental approach, and the development of useful methods for practical applications is the immediate goal of our work.
Section 2 presents the databases used in the present work. Section 3 discusses the theory and parametrization of the new methods. Section 4 presents results and discussion.
F = 0.5[MUEPB(MGAE109/04) + MUE(HTBH38/04)] | (1) |
E(L/B2|B1) = E(L/B2) − E(L/B1) | (2) |
E(L2|L1/B) = E(L2/B) − E(L1/B) | (3) |
The new methods are constructed by taking linear combinations of DFT methods with WFT single-level methods such as Hartree–Fock (HF) theory, Møller-Plesset perturbation theory (MP2,69 MP4SDQ70), quadratic configuration interaction with single and double excitations (QCISD),71 and QCISD with quasiperturbative connected triples71 (QCISD(T)). The geometries of all species in all calculations on the three databases of section 2.1–2.3 were optimized at the QCISD level with the MG3 basis set. The MG3 basis set,11 also called G3LargeMP2,72 is the same as 6-311++G(3d2f, 2df, 2p)73 for H–Si, but improved2 for P–Ar. The QCISD/MG3 geometries for all calculations in this paper can be obtained from the Truhlar group database website.74
We also used four basis sets in the parametrization of the new multi-coefficient hybrid methods, namely, the 6-31G(d)70,75 6-31+G(d,p),70,75 6-31G(2df,p)70,75 and MG3S basis set. The MG3S basis set53 is the same as MG3 (explained above) except that it omits diffuse functions on hydrogens. The following abbreviations for standard basis sets are used throughout this paper:
Dd 6-31G(d) |
DIDZ 6-31+G(d,p) |
D2dfp 6-31G(2df,p) |
![]() | (4) |
The three DFTX methods we used are: MPWBX, MPWX and TSX. MPWBX is a hybrid meta DFT method which uses modified Perdew-Wang exchange and the Becke95 correlation functional with X% HF exchange. MPWX is a hybrid DFT method which uses MPW exchange and Perdew-Wang 1991 (PW91) correlation22 again with X% HF exchange. TSX is a hybrid meta DFT method which uses the TPSS exchange59 and the correlation functional of Savin and coworkers36,37,51 (as stated above, this is abbreviated S in the present work although it is sometimes called51 KCIS) Note that X is a parameter that is optimized along with the ci parameters. We optimized the c1–c8 coefficients and X simultaneously against the databases described in section 2, and they are listed in Table 1.
Scaling | Method | c 1 | c 2 | c 3 | c 4 | c 5 | c 6 | c 7 | c 8 | X |
---|---|---|---|---|---|---|---|---|---|---|
a See text for the definition of these parameters and also see text for the exchange and correlation functionals used in each methods. | ||||||||||
N 7 | MCG3-MPWB | 1.085 | 0.276 | 0.465 | 1.329 | 1.255 | 0.889 | 1.057 | 0.727 | 34 |
N 7 | MCG3-MPW | 1.096 | 0.422 | 0.622 | 1.299 | 1.270 | 0.742 | 1.078 | 0.815 | 32 |
N 7 | MCG3-TS | 1.082 | 0.551 | 0.734 | 1.295 | 1.185 | 0.626 | 0.933 | 0.776 | 19 |
N 6 | MCQCISD-MPWB | 1.138 | 1.161 | 1.260 | 1.282 | 0.640 | 34 | |||
N 6 | MCQCISD-MPW | 1.163 | 1.106 | 1.324 | 1.331 | 0.711 | 32 | |||
N 6 | MCQCISD-TS | 1.127 | 1.140 | 1.309 | 1.144 | 0.673 | 19 | |||
N 6 | MCUT-MPWB | 1.102 | 1.171 | 1.346 | 1.404 | 0.526 | 29 | |||
N 6 | MCUT-MPW | 1.134 | 1.092 | 1.416 | 1.545 | 0.643 | 17 | |||
N 6 | MCUT-TS | 1.104 | 1.125 | 1.391 | 1.241 | 0.598 | 13 | |||
N 5 | MCCO-MPWB | 0.828 | 1.678 | 0.267 | 34 | |||||
N 5 | MCCO-MPW | 0.957 | 1.546 | 0.386 | 32 | |||||
N 5 | MCCO-TS | 0.952 | 1.435 | 0.401 | 22 | |||||
N 5 | MC3BB | 1.332 | 0.205 | 39 | ||||||
N 5 | MC3MPW | 1.339 | 0.266 | 38 | ||||||
N 5 | MC3MPWB95 | 1.196 | 0.257 | 38 | ||||||
N 5 | MC3TS | 1.256 | 0.336 | 31 | ||||||
N 4 | TPSS1KCIS | 13 |
![]() | (5) |
Note that the three DFTX methods used in this work each has its own advantages. The use of MPW exchange and PW91 correlation does not require a computer code that includes kinetic energy density, but combining MPW exchange with B95 correlation is more accurate. Combining TPSS exchange and S correlation is even more accurate and leads to more stable SCF iterations.
We also developed three methods that scale as N6 and that are based on the MC-UT/3 method.15 In each of them, we replace the HF/MG3S component of the MC-UT/3 method by one of the three DFTX methods, resulting respectively in three multi-coefficient extrapolated DFT methods of a type that we will call doubly hybrid MC-UT methods. Again we also add a MP2/DIDZ calculation. The new methods can be written as:
![]() | (6) |
![]() | (7) |
We also optimized two new MC3-type methods, namely MC3MPWB and MC3TS. Both of them are analogs to MC3BB and MC3MPW, and they scale as N5. The MC3-type methods can be described as:
E(MC3-type) = c2[E(HF/DIDZ) + c1ΔE(MP2|HF/DIDZ)] + (1 − c2)E(DFTX/MG3S) + ESO | (8) |
As mentioned in the introduction, the DFTX method in MC3MPWB is MPWBX , whereas in MC3TS it is TSX. The optimized c1, c2 and X for MC3MPWB and MC3TS as well as the parameters in the MC3BB and MC3MPW methods are listed in Table 1.
Figs. 1–5 are the coefficient trees for the new multi-coefficient hybrid methods.
![]() | ||
Fig. 1 Coefficient tree for the new N7 multi-coefficient hybrid methods. |
![]() | ||
Fig. 2 Coefficient tree for the new N6 multi-coefficient hybrid methods based on MC-QCISD/3. |
![]() | ||
Fig. 3 Coefficient tree for the new N6 multi-coefficient hybrid methods based on MC-UT/3. |
![]() | ||
Fig. 4 Coefficient tree for the new N5 multi-coefficient hybrid methods based on MC-CO/3. |
![]() | ||
Fig. 5 Coefficient tree for the N5 MC3-type multi-coefficient hybrid methods. |
All calculations were performed with the GAUSSIAN0376 computer program. In all of the calculations, the spin–orbit stabilization energy was added to atoms and open-shell molecules for which it is nonzero, as described previously.11
MPWB MPW Exchange + B95 correlation |
MPW MPW Exchange + PW91 correlation |
TS TPSS Exchange + KCIS correlation |
Quantityb | Itemc | BPMb | MCG3-MPWB | MCG3-TS | MCG3-MPW | G3SX | MCG3/3 | G3SX(MP3) | CBS-Q |
---|---|---|---|---|---|---|---|---|---|
a QCISD/MG3 geometries are used. b All mean unsigned error (MUE) in atomization energies are in a per pond basis (MUEPB). These errors are computed by dividing the MUE for the entire data set or the subset corresponding to a given row by the average number of bonds per molecules (BPM) for that set. AMUE is defined as 1/3 times mean unsigned error per bond in atomization energies plus 1/3 times mean signed error in barrier heights plus 1/6 each times the mean unsigned errors in ionization potentials and electron affinities. RMSE is the root mean square error for the 173 data without weighting. c Number in parentheses is number of data for that row. d The cost is the sum of the times to calculate the gradients for the two molecules, 1-phosphinomethanol and 2,2-dichloro-1-ethanol, with a single 500 MHz R14000 processor on a Silicon Graphics Origin 3800 normalized by dividing by the sum of the times for MP2/6-31G(2df,p) gradient calculations on the same computer. e C4H6 (bicyclobutane). | |||||||||
MUEPB | Atomization energy (109) | 4.71 | 0.16 | 0.16 | 0.16 | 0.18 | 0.22 | 0.21 | 0.29 |
HCO compound (54) | 6.23 | 0.08 | 0.09 | 0.08 | 0.11 | 0.13 | 0.13 | 0.21 | |
Containing second row atom (34) | 2.78 | 0.37 | 0.39 | 0.39 | 0.44 | 0.52 | 0.50 | 0.55 | |
Other (21) | 3.84 | 0.24 | 0.20 | 0.21 | 0.18 | 0.23 | 0.19 | 0.30 | |
MUE | Barrier height (38) | 0.54 | 0.56 | 0.54 | 0.67 | 0.84 | 0.81 | 0.73 | |
MUE | Ionization potential (13) | 0.67 | 0.73 | 0.69 | 1.07 | 0.95 | 1.34 | 1.26 | |
MUE | Electron affinity (13) | 0.86 | 0.82 | 0.97 | 1.06 | 0.92 | 1.00 | 1.12 | |
AMUE | All data (173) | 0.49 | 0.50 | 0.51 | 0.64 | 0.66 | 0.73 | 0.74 | |
RMSE | All data (173) | 0.95 | 1.00 | 1.00 | 1.19 | 1.27 | 1.32 | 1.67 | |
Max (−) | All data (173) | −2.86 (BCl3) | −3.16 (PF3) | −3.55 (PF3) | −5.17 (CH3CH2O) | −4.79 (NF3) | −5.71 (CH3CH2O) | −4.09 (C4H6) e | |
Max (+) | All data (173) | 3.12 (O2−) | 3.00 (O2−) | 3.07 (O2−) | 3.75 (AlCl3) | 2.94 (CN) | 3.92 (AlCl3) | 8.33 (C2Cl4) | |
Cost d | 46.2 | 46.2 | 46.1 | 460 | 45.0 | 66 | 79 |
Table 2 summarizes the errors of various methods. In the tables, “HCO compounds” are compounds containing at most hydrogen, carbon and oxygen atoms. From Table 2, we can see that all three new N7 hybrid multi-coefficient methods outperform the G3SX method, which is one of the most accurate and most expensive Gaussian-n type methods. Note that the G3SX method is about 10 times more expensive than the new multi-coefficient hybrid methods.
If we compare the new hybrid methods to the MCG3/3 method, we can see that the costs of the new hybrid multi-coefficient methods are about the same as the MCG3/3 method, but the MUE for atomization energy calculated by the new methods is reduced about 25% as compared to the MCG3/3 method.
Table 2 also shows that CBS-Q and G3SX(MP3) are significantly more expensive than the new hybrid methods, but much less accurate. The DFT component in the new kind of method contains information (such as static correlation, which is present in DFT exchange) that is not present in the perturbative triple excitation method, QCISD(T). In principle, the DFT-type calculation also explicitly takes into account core correlation effects, but the basis sets used here do not have enough flexibility in the core region to treat this reliably. The semiempirical parameters determined in this paper combine WFT methods with DFT in a way that empirically blends these features to extrapolate toward complete configuration interaction.
The magnitude of worst errors for the new methods is much less than that for the G3SX, G3SX(MP3) and CBS-Q methods.
From Table 2, we can see that the main sources of error in atomization energies for the new methods are due to the molecules containing second row atoms. This is also the problem for other methods in Table 2. The relatively worse performance for the second row molecules is assumed to be due largely to the imperfection of the basis set.
Quantity | Item | MCQCISD-TS | MCQCISD-MPWB | MCQCISD-MPW | MC-QCISD | QCISD/MG3 |
---|---|---|---|---|---|---|
a QCISD/MG3 geometries are used. b See notes below Table 2 for the definition of MUEPB, AMUE and cost. | ||||||
MUEPB | Atomization energy (109) | 0.17 | 0.20 | 0.20 | 0.36 | 5.01 |
HCO compound (54) | 0.09 | 0.10 | 0.09 | 0.21 | 4.44 | |
Containing second row atom (34) | 0.41 | 0.43 | 0.48 | 0.70 | 6.05 | |
Other (21) | 0.26 | 0.38 | 0.34 | 0.56 | 6.26 | |
MUE | Barrier height (38) | 0.78 | 0.66 | 0.69 | 1.22 | 2.83 |
MUE | Ionization potential (13) | 0.99 | 1.28 | 0.87 | 1.95 | 3.28 |
MUE | Electron affinity (13) | 0.69 | 0.72 | 0.85 | 1.38 | 5.20 |
AMUE | All data (173) | 0.60 | 0.62 | 0.59 | 1.08 | 4.03 |
RMSE | All data (173) | 1.07 | 1.22 | 1.24 | 2.02 | 21.70 |
Max (−) | All data (173) | −3.71 (P2) | −4.06 (N2) | −5.62 (Si2H6) | −7.39 (P2) | −61.50 (C5H5N) |
Max (+) | All data (173) | 3.28 (C2Cl4) | 4.06 (C2Cl4) | 4.05 (C2Cl4) | 6.24 (AlCl3) | 9.68 (O−) |
Cost | 6.0 | 6.0 | 6.0 | 5.5 | 55 |
The performances of the three hybrid methods in Table 2 are quite similar, however, the three new N6 methods based on MC-QCISD give different performances. The MCQCISD-TS method, which is based on the TS DFT method, gives the overall best performance as indicated by its low AMUE and RMSE for all data. Note that this N6 method also outperforms the G3SX method and all other previous N7 methods listed in Table 2, and the cost of G3SX is about 100 times bigger than the new N6 hybrid methods in Table 3 even for the two modest size molecules on which we base our timings. The magnitude of worst errors of the MCQCISD-TS method is also smaller than the G3SX, G3SX(MP3) and CBS-Q methods.
From Table 1, we can see that the percentage of HF exchange in the DFT calculation in the MCQCISD-TS method is 19%, and this low HF exchange percentage causes the barrier heights calculated by the MCQCISD-TS method to be slightly less accurate than the other two hybrid methods in Table 3. All new hybrid methods in Table 3 have analytical gradients in the GAUSSIAN03 program.76
The single-level N6 calculation in Table 3, QCISD/MG3, gives a big systematic error as compared to all the multilevel methods in this paper.
The mean errors for the new hybrid N6 methods based on MC-UT/3 are given in Table 4. All three new hybrid methods in Table 4 do not require a post-SCF iterative process, and they outperform the MC-QCISD/3 method.
Quantity | Item | MCUT-TS | MCUT-MPWB | MCUT-MPW | MC-UT |
---|---|---|---|---|---|
a QCISD/MG3 geometries are used. b See notes below Table 2 for the definition of MUEPB, AMUE and cost. | |||||
MUEPB | Atomization energy (109) | 0.23 | 0.26 | 0.26 | 0.45 |
HCO compound (54) | 0.13 | 0.12 | 0.12 | 0.25 | |
Containing second row atom (34) | 0.46 | 0.52 | 0.57 | 0.85 | |
Other (21) | 0.37 | 0.52 | 0.49 | 0.84 | |
MUE | Barrier height (38) | 0.79 | 0.71 | 0.78 | 2.52 |
MUE | Ionization potential (13) | 1.05 | 1.22 | 1.40 | 1.93 |
MUE | Electron affinity (13) | 0.77 | 0.74 | 1.12 | 1.43 |
AMUE | All data (173) | 0.64 | 0.65 | 0.77 | 1.55 |
RMSE | All data (173) | 1.50 | 1.71 | 1.77 | 3.05 |
Max (−) | All data (173) | −10.65 (CN) | −11.88 (CN) | −11.77 (CN) | −19.15 (P2) |
Max (+) | All data (173) | 3.88 (C2Cl4) | 5.05 (C2Cl4) | 4.80 (C6Cl6) | 5.32 (AlF3) |
Cost | 5.5 | 5.5 | 5.5 | 5.3 |
If we compared the new hybrid methods in Table 4 to those in Table 3, we find that the hybrid methods in Table 3 significantly outperform the ones in Table 4. This result shows the superiority of QCISD to MP4SDQ for use as a component in multilevel methods. However the MCUT methods have the advantage that the N6 step in Table 4 is noniterative, whereas the N6 step in Table 3 is iterative.
Quantity | Item | MCCO-TS | MCCO-MPWB | MCCO-MPW | MC3BB | MC3MPWB | MC3TS | MC-CO | MC3MPW |
---|---|---|---|---|---|---|---|---|---|
a QCISD/MG3 geometries are used. b See notes below Table 2 for the definition of MUEPB, AMUE, and cost. c C5H8 (spiropentane). | |||||||||
MUEPB | Atomization energy (109) | 0.41 | 0.49 | 0.57 | 0.61 | 0.67 | 0.69 | 0.67 | 0.82 |
HCO compound (54) | 0.29 | 0.30 | 0.33 | 0.39 | 0.42 | 0.36 | 0.30 | 0.41 | |
Containing second row atom (34) | 0.74 | 0.91 | 1.23 | 1.17 | 1.30 | 1.81 | 1.52 | 2.12 | |
Other (21) | 0.55 | 0.80 | 0.81 | 0.88 | 1.01 | 0.77 | 1.19 | 1.00 | |
MUE | Barrier height (38) | 0.97 | 1.12 | 1.18 | 0.73 | 0.79 | 0.71 | 3.12 | 0.76 |
MUE | Ionization potential (13) | 1.43 | 1.56 | 2.34 | 2.17 | 2.42 | 2.34 | 2.09 | 2.34 |
MUE | Electron affinity (13) | 1.77 | 1.59 | 2.93 | 4.25 | 4.42 | 4.57 | 2.09 | 3.54 |
AMUE | All data (173) | 0.99 | 1.06 | 1.46 | 1.51 | 1.63 | 1.62 | 1.96 | 1.51 |
RMSE | All data (173) | 2.27 | 2.57 | 3.14 | 3.30 | 3.48 | 3.87 | 4.14 | 4.27 |
Max (−) | All data (173) | −9.88 (CN) | −10.58 (CN) | −12.10 (CN) | −10.30 (CN) | −10.82 (CN) | −15.52 (SiF4) | −21.66 (CN) | −16.14 (SiF4) |
Max (+) | All data (173) | 6.70 (C5H5N) | 8.01 (C2Cl4) | 9.36 (C6H6) | 9.94 (C5H8) c | 9.63 (C5H8) c | 12.32 (C6H6) | 12.01 (AlCl3) | 12.32 (C6H6) |
Cost | 5.2 | 5.2 | 5.2 | 2.0 | 2.0 | 2.0 | 4.9 | 1.8 |
The overall performances of the three hybrid methods based on MC-CO are better than the MC3-type hybrid methods. However, in part because the way they were optimized, the MC3-type methods are better for barrier heights calculations, and they have lower cost, so the MC3-type methods are very suitable for dynamics calculation.
Notice that for all the doubly hybrid methods in Table 1, the last coefficient (c8, c5, c3, or c2, depending on the method) is the fraction of WFT in the doubly hybrid method, and this fraction decreases as the level of WFT is lowered. Thus the highest-level methods are 72.7–81.5% WFT, plus the additional Hartree–Fock exchange in the DFT part, whereas the lowest-level ones are 20.5–33.6% WFT, plus the additional Hartree–Fock exchange in the DFT part. This is a very reasonable trend, and it increases our confidence in the physicality of the doubly hybrid methods.
Quantity | Item | MPW1B95 | TPSS1KCIS | MPW1PW91 | MPWB1K | B3LYP | TPSSh |
---|---|---|---|---|---|---|---|
a QCISD/MG3 geometries are used. All DFT calculations are performed with MG3S basis set. b See notes below Table 2 for the definition of MUEPB, AMUE and cost. c C5H8 (spiropentane). | |||||||
MUEPB | Atomization energy (109) | 0.63 | 0.67 | 0.89 | 0.98 | 0.92 | 0.98 |
HCO compound (54) | 0.48 | 0.40 | 0.51 | 0.60 | 0.51 | 0.59 | |
Containing second row atom (34) | 0.97 | 1.49 | 2.06 | 1.56 | 2.39 | 2.28 | |
Other (21) | 0.84 | 0.86 | 1.13 | 1.94 | 0.92 | 1.12 | |
MUE | Barrier height (38) | 3.02 | 4.69 | 3.55 | 1.29 | 4.23 | 5.97 |
MUE | Ionization potential (13) | 2.14 | 2.63 | 3.72 | 2.05 | 4.72 | 3.17 |
MUE | Electron affinity (13) | 2.91 | 2.81 | 2.62 | 4.11 | 2.29 | 2.81 |
AMUE | All data (173) | 2.06 | 2.69 | 2.54 | 1.78 | 2.88 | 3.31 |
RMSE | All data (173) | 3.84 | 4.46 | 4.97 | 5.15 | 5.79 | 5.90 |
Max (−) | All data (173) | −10.38 (SiF4) | −21.61 (SiF4) | −24.59 (SiF4) | −16.83 (SiF4) | −26.71 (SiCl4) | −23.47 (SiF4) |
Max (+) | All data (173) | 12.73 (C5H8) c | 8.96 (O+) | 9.05 (O2+) | 10.23 (OH−) | 12.95 (O+) | 13.54 (Si2H6) |
Cost | 1.8 | 1.8 | 1.7 | 1.8 | 1.7 | 1.8 |
Deviation (calculation − experiment) | |||||
---|---|---|---|---|---|
Molecule | Experimental b atomization energy | MCG3-TS | MCQCISD-TS | G3SX | G3SX(MP3) |
a QCISD/MG3 geometry is used for O3, and B3LYP/6-31(2df,p) geometries are used for P4, ClCN, OCS, C8H18, and C10H8. b The experimental atomization energy of O3 is from our Database/4.74,77 The experimental value for ClCN and OCS are taken from Martin and Oliveira.6 The experimental value for P4 is obtained by adding the scaled fundamental zero point vibrational energy (ZPVE) to D0 and D0 and fundamental ZPVE are taken from the Computational Chemistry Comparison and Benchmark Data Base (CCCBDB).78 The experimental values for C8H18 and C10H8 are obtained by subtracting the thermal vibrational-rotational-translational contribution and the scaled ZPVE (0.9854) at the B3LYP/6-31(2df,p) level and the experimental heat of formation79 of both molecules. c See notes below Table 2 for the definition of MUE, MUEPB. The average number of bonds per molecule is 7.67 for the TAE6/04 database. | |||||
O3 | 146.10 | −1.25 | −5.27 | 0.87 | 0.08 |
ClCN | 284.52 | 0.21 | −0.03 | 0.16 | 0.38 |
OCS | 334.24 | 1.79 | 0.77 | 2.06 | 2.61 |
P4 | 289.90 | −0.90 | −0.75 | −8.96 | −11.21 |
C8H18 (n-octane) | 2478.23 | −1.07 | −1.54 | 0.69 | 0.09 |
C10H8 (naphthalene) | 2162.16 | −0.96 | −2.59 | 0.14 | 0.76 |
MUEc | 1.03 | 1.83 | 2.14 | 2.52 | |
MUEPBc | 0.13 | 0.24 | 0.28 | 0.33 |
O3 is a very difficult case for CCSD(T) calculations with basis set extrapolation; the W2 theory gives an error of about 3 kcal mol−1.7 Helgaker and coworkers80 used CCSD(T)/cc-pcV(56)Z calculations with core correlation and relativistic contribution, and they still had an error about 2.6 kcal mol−1. Table 7 shows that G3SX and G3SX(MP3) work well for this molecule. The MCG3-TS method gives a 1.25 kcal mol−1 error and MCQCISD-TS method gives a 5.27 kcal mol−1 error. Apparently triple excitations are important for treating this molecule.
For the two molecules, ClCN and OCS, that are not in our training set and not in G3/99 set, our two methods perform better than G3SX and G3SX(MP3).
P4 is a difficult case for the scaled G3 methods, and it is the source of the maximum error for all scaled G3 methods and their variants.68Table 7 shows that our two new methods perform well for this molecule, and the errors for both multi-coefficient extrapolated DFT methods are less than 1 kcal mol−1.
For the two largest molecules in the G3/99 test, C8H18 (n-octane) and C10H8 (naphthalene), our new methods perform worse than G3SX and G3SX(MP3).
The MUEs and MUEPBs in Table 7 are consistent with previous tables. Both our N7 method and our N6 method outperform G3SX and G3SX(MP3).
From Table 7, we can conclude that our parametrizations are not overfitted by the training set; that is, they can usefully be applied to the systems that are not in the training set.
The new methods are more accurate than pure WFT multilevel methods and they have lower cost. In particular we developed several new methods (MCG3-MPWB, MCG3-MPW, MCG3-TS and MCQCISD-TS) that outperform G3SX and several less expensive methods (MCQCISD-MPWB, MCQCISD-MPW) that outperform G3SX(MP3). An even less expensive method (MCUT-TS) still outperforms CBS-Q.
During the course of the development of modern DFT methods, it has clearly been established that including some HF exchange provides accuracy that can not be obtained by pure DFT exchange functionals, Becke’s hybrid DFT methods, justified by adiabatic connection theory,24 revolutionalized quantum chemistry. Now hybrid DFT and hybrid meta DFT have become the most popular DFT methods in computational chemistry. On the other hand, in the present study, we found that the DFT calculations contain information which cannot be easily mimicked by WFT methods even with quasiperturbative triple excitations. Our empirical multi-coefficient approach combines the two different types of contributions together, and the new resulting methods are more accurate and more efficient than either type of methods used separately. We have named the new methods as extrapolated versions of DFT, but one can also consider them to involve improved version (based on DFT) of the high-level correction1,2 of conventional multilevel WFT theory.
Given that the number and quality of the exchange and correlation functionals increase year by year, we can expect that even more efficient and accurate hybrid multi-coefficient methods can be constructed for theoretical chemistry in the future.
Footnote |
† Electronic supplementary information (ESI) available: Databases for parametrization and testing. See http://www.rsc.org/suppdata/cp/b4/b416937a/ |
This journal is © the Owner Societies 2005 |