Andrew J.
May
a,
Edward
Valeev
bc,
Robert
Polly
d and
Frederick R.
Manby
*a
aSchool of Chemistry, University of Bristol, Cantocks Close, Bristol, UK BS8 1TS. E-mail: fred.manby@bris.ac.uk
bSchool of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, 30332-0400, USA
cOak Ridge National Laboratory, Oak Ridge, TN, 37830-8050, USA
dInstitut für Nukleare Entsorgung, Forschungszentrum Karlsruhe, Postfach 3640, 76021, Karlsruhe, Germany
First published on 10th June 2005
The explicitly correlated second order Møller–Plesset (MP2-R12) methods perform well in reproducing the last detail of the correlation cusp, allowing higher accuracy than can be accessed through conventional means. Nevertheless in basis sets that are practical for calculations on larger systems (i.e., around triple- or perhaps quadruple-zeta) MP2-R12 fails to bridge the divide between conventional MP2 and the MP2 basis set limit. In this contribution we analyse the sources of error in MP2-R12 calculations in such basis sets. We conclude that the main source of error is the choice of the correlation factor r12. Sources of error that must be avoided for accurate quantum chemistry include the neglect of exchange commutators and the extended Brillouin condition. The generalized Brillouin condition is found not to lead to significant errors.
Despite countless innovations in the field of explicitly correlated electronic structure theory5–8 there is really only one class of methods that can truly claim to have escaped the restriction to tiny molecules. These methods—now collectively labelled ‘R12’—were introduced in 1985 by Kutzelnigg9 and their success pivots on the ingenious use of the resolution of the identity (RI) to avoid the difficult many-electron integrals.9
The RI is based on 1-electron projection operators of the form
![]() ![]() | (1) |
〈ijm|r−112r23|mlk〉
≈
〈ijm|r−112![]() = 〈ij|r−112|mp′〉〈mp′|r12|kl〉. | (2) |
At the level of second order Møller–Plesset perturbation theory (MP2) the first order pair function is written
|uij〉
=
tijab|ab〉
+
tijkl![]() | (3) |
![]() ![]() ![]()
| (4) |
The last two of these operators, O12 and
12, lead to formally equivalent first order ansätze; the first is distinct. In the particular case where the RI is performed in the MO basis all of these operators lead to the same working equations.11 Since details can be found elsewhere11,13,20 we will not examine the equations that arise in each case; we will however identify those approximations that are commonly made in deriving the working equations.
First it is convenient to assume that the Fock equations have been solved exactly for the occupied space: |i〉
=
εi|i〉. This approximation is referred to in the literature as the generalized Brillouin condition (GBC),13 and allows the simplification of certain expressions by deleting commutators of the form [
1
+
2,
O12]. If either
P12 or
12 are used, the analogous commutators can only be eliminated under the more stringent assumption of the extended Brillouin condition (EBC),13 namely that the Fock equations hold exactly for all MOs:
|p〉
=
εp|p〉. In this case the EBC has the additional benefit that the coupling between the conventional and R12 parts of the first order wavefunction vanishes.
Developing the formalism one also find terms of the form [1
+
2, f12]. Clearly the commutators involving the local potential terms vanish, and one has the identity
[![]() ![]() ![]() ![]() ![]() ![]() |
[![]() ![]() ![]() ![]() | (5) |
These three approximations (GBC, EBC, [1, f12]
= 0) are present in various combinations in different flavours of MP2-R12 theory, as illustrated in Table 1. The methods that use the projector
P12
(ansatz1) are less accurate and will not be considered here. Of the remaining methods there is an increasing computational cost as one moves down the table. The highest scaling process in MP2-R12/2*A′ scales like
(o8), but this typically remains modest compared with other O(N5) steps. However in MP2-R12/2A′, an additional step that scales O(o6v2)
(eqn. (79) of ref. 11) is needed and this step quite quickly dominates the computational work. The original method MP2-R12/2B11 adds to this problem by requiring 4-index integrals with two indices drawn from the (generally quite large) RI basis set. The latter problem has recently been circumvented.22
The R12 methods perform very well in describing the last detail of the correlation cusp, efficiently making up the difference between large-basis conventional methods and the basis set limit. For more extended systems, though, large atomic orbital (AO) basis sets become impractical, and one is restricted (at the moment) to basis sets of triple-zeta or at most quadruple-zeta size. Unfortunately the performance in these smaller basis sets is much less satisfactory. For neon in cc-pVTZ the MP2 and MP2-R12/2*A′ correlation energies are −264 mEh and −292 mEh, respectively, compared to the basis set limit −320 mEh. It is dissappointing that MP2-R12 methods only make up about half of the energy gap to the basis set limit, and this is quite typical.
In the preceding text two distinct sources for this 30 mEh error have been identified: (1) the RI for many-electron integrals; (2) the additional approximations GBC, EBC, [1, f12]
= 0; and to these we can add a third possibility, namely (3) the quality of the ansatz for the first order wavefunction.
In this paper we assess the relative impact of these three sources of error, by scrutinizing the existing literature and by performing new benchmark calculations, and produce clear recommendations for superior explicitly correlated MP2 methods. It is anticipated that the central conclusions for the MP2 level of theory will be applicable to higher levels of theory such as CCSD-R1216,18 and CCSD(R12).19
Our benchmarks are based on valence correlation energies of the test set of 20 small first-row molecules from the textbook of Helgaker et al.23 Most calculations have been performed with our DF-MP2-R12 and DF-MP2-F12 codes21,24,25 using density fitting to compute all 4-index quantities. In such cases we have used the large cc-pV5Z density fitting basis sets of Hättig.26 The errors arising from density fitting are much smaller than the other errors we are considering, as shown elsewhere.24 Computations without the assumption of the EBC and GBC were carried out with the developmental version of the Massively Parallel Quantum Chemistry Program (MPQC).27
Over the past few years, though, there have been several developments. First amongst these in chronology as well as importance is the use of RI approximations in an auxiliary basis set11 (ABS). This allows one to study the effect of the accuracy of the integrals independently, and, from a more pragmatic point of view, allows one to converge the accuracy of the many electron integrals whilst retaining a reasonably modest basis set for the molecular orbitals.
The idea has been extended to formulations that need only RIs in the orthogonal complement of the AO basis.10 Density fitting (DF) can also be used as an alternative to RIs for the 3-electron integrals30 although this does not appear to offer significant advantages; however combined RI–DF approaches offer enhanced efficiency24 and accuracy.31 Finally one can use numerical quadrature for the many electron integrals as shown by Boys and Handy7 and recently rediscovered by Ten-no.32
Although we do not intend to benchmark all of these methods here, we will compare calculations using different ABS expansions to estimate the magnitude of the error that can be expected from the use of the conventional RI in an auxiliary basis. Valence MP2-R12/2*A′ correlation energies are plotted in Fig. 1 for AO basis set cc-pVTZ using uncontracted cc-pVnZ sets for the RI basis, with n = T, Q, 5. To highlight the differences between the RI basis sets, the data are presented relative to the MP2/cc-pVTZ correlation energies. Analogous data are presented in Fig. 2 for the cc-pVQZ AO basis. It can be seen that the differences between the two largest RI basis sets are at most of the order of a few mEh. This is using the most straightforward ABS RI implementation11 rather than more refined models;10,31 and we have also used basis sets that were not specifically optimized for this role.
![]() | ||
Fig. 1 MP2-R12 corrections using the cc-pVTZ orbital basis, and using uncontracted cc-pVnZ basis sets for the RI with n = T, Q, 5. |
![]() | ||
Fig. 2 MP2-R12 corrections using the cc-pVQZ orbital basis, and using uncontracted cc-pVnZ basis sets for the RI with n = Q, 5. |
Concerning the evaluation of 3-electron integrals we observe that: (1) the errors using the straightforward ABS method in standard basis sets leads are small; (2) the errors can be further reduced by more advanced methods;10,31 (3) the computational cost rises only linearly with RI basis size.22
It is therefore both technically straightforward and computationally feasible to reduce the RI errors way below the target accuracies required for standard chemical applications. Having said that, even with the currently available extensions to the RI approach, it will probably not be feasible to treat heavy elements. Elements with occupied f-orbitals require RI basis sets saturated to
= 6 with the RI-DF technique31 and to
= 9 with conventional RI approaches.
The effect of deleting exchange commutators is not negligible. The mean difference in valence correlation energies between the MP2-R12/2A′ and MP2-R12/2B methods for eight small first row systems is 5.5 mEh and 2.4 mEh for cc-pVTZ and cc-pVQZ, respectively.11 From this we can derive two conclusions: (1) the discrepancy is certainly large enough to be of concern; (2) it is not the principal source of the 30 mEh short-fall in correlation energy for the neon atom.
This error may instead arise from the Brillouin conditions. Fortunately it is now possible to eliminate both the GBC and EBC assumptions from MP2-R12 theory33 so we are in a position to quantify the impact of these two approximations—details of the theory will appear separately.33 MP2-R12 energy corrections are shown in Figs. 3 and 4 for the cc-pVTZ and cc-pVQZ basis sets. Data are presented for the case where neither Brillouin condition is assumed (labelled ‘exact’); for the case where only the GBC is assumed; and for the case where both the GBC and EBC are assumed. It can be seen that the impact of the GBC in these calculations is completely negligible. In the cc-pVTZ basis the mean absolute error arising from the GBC is less than 0.1 mEh and the maximum error is only 0.4 mEh. In the cc-pVQZ basis set the errors are marginally larger; this arises from the fact that the triple-zeta basis set appears to hit a ‘sweet-spot’, with the GBC errors being significantly smaller than those found with either cc-pVDZ or cc-pVQZ basis sets. This phenomenon need not distract us here, and will be discussed elsewhere.33
![]() | ||
Fig. 3 MP2-R12 corrections to valence correlation energies of the test molecules in mEh, using the cc-pVTZ AO basis set. The first set of data comes from MP2-R12 calculations without the assumption of either the GBC or the EBC (exact); the second and third rely on the GBC and both the GBC and EBC, respectively. Details of the calculations are given in the text. |
![]() | ||
Fig. 4 As for Fig. 3 but using the cc-pVQZ AO basis set. |
The effect of the EBC can be seen to be significantly larger than that of the GBC, but certainly not large enough to account for the disappointing performance of MP2-R12 methods. For both basis sets the mean absolute error from the combined effect of the GBC and EBC is around 1.5 mEh. In any case the EBC can readily be avoided by choosing the projection operator O12 given in eqn. (4) and following the procedure of Klopper and Samson.11
Quite a lot is now known about the evaluation of the necessary 2-electron integrals that arise with these and other correlation factors. For example, formulae have been published for integrals with the following 2-electron kernels: a simple Gaussian exp(−αr212);5,6 the damped factor r12exp(−αr212);39 the damped Coulomb form r−112exp(−αr212);21 arbitrary powers and a Gaussian rk12exp(−αr212);40 and a simple Slater exp(−αr12).38
Two of us have previously published a description of the MP2-F12 method in which an arbitrary correlation factor is chosen in place of r12,21 built as a frozen linear combination of Gaussian geminals:
![]() | (6) |
Our calculations are performed at the MP2-{R12,F12}/2*A′ levels of theory using cc-pVTZ and cc-pVQZ for the AO basis and uncontracted cc-pV5Z for the RI. The MP2-F12 and MP2-R12 corrections are compared with the difference between conventional MP2 and the MP2 basis set limit in Figs. 5 and 6. The basis set limit numbers were obtained by extrapolating cc-pV5Z and cc-pV6Z correlation energies using the cubic formula of Helgaker et al.43
![]() | ||
Fig. 5 MP2-R12 and MP2-F12 corrections to valence correlation energies of the test molecules in mEh using the cc-pVTZ AO basis set. For comparison the difference between MP2/V[5,6]Z and MP2/VTZ correlation energies are also shown. Details of the calculations are given in the text. |
![]() | ||
Fig. 6 As for Fig. 5 but using the cc-pVQZ AO basis set. |
It can be seen that substituting r12 by exp(−r12) recovers almost all of the difference between MP2-R12 and the basis set limit. For cc-pVTZ the MP2-R12 correlation errors are in error by an average of 21 mEh; this is reduced to just 1.8 mEh using MP2-F12. In the cc-pVQZ basis, the mean error is reduced from 7.5 mEh to 0.6 mEh.
It has been shown conclusively that the last factor is by far the most significant. For explicitly correlated methods that aspire to chemical accuracy in modest (triple- or quadruple-zeta) basis sets it is therefore essential that r12 is replaced by a more appropriate correlation factor.
The exact form of this factor could be determined in various ways. The most obvious is to construct the correlation factor as a linear combination of suitable functions—e.g. Gaussian geminals—and minimize the Hylleraas functional with respect to the coefficients. This has to be done independently for each pair to maintain extensivity and orbital invariance:
|uij〉
=
tijab|ab〉
+
tijkl,γ![]() | (7) |
The errors that arise from the deletion of exchange commutators and from the EBC are too large to ignore in triple- and quadruple-zeta basis sets. It does not appear to be necessary to drop the GBC so that the main structure of Klopper and Samson’s theory11 appears to remain extremely useful. We are therefore currently implementing the MP2-F12/2B method in the hope that it will yield higher accuracy than is currently possible.
This journal is © the Owner Societies 2005 |