Samuel O.
Odoh
a,
Giovanni Li
Manni
ab,
Rebecca K.
Carlson
a,
Donald G.
Truhlar
*a and
Laura
Gagliardi
*a
aDepartment of Chemistry, Chemical Theory Center, Supercomputing Institute, University of Minnesota, 207 Pleasant Street SE, Minneapolis, MN 55455-0431, USA. E-mail: gagliardi@umn.edu; truhlar@umn.edu
bMax-Planck Institut für Festkörperforshung, Heisenbergstraße 1, 70569 Stuttgart, Germany
First published on 16th December 2015
Multi-configuration pair-density functional theory (MC-PDFT) has proved to be a powerful way to combine the capabilities of multi-configuration self-consistent-field theory to represent the an electronic wave function with a highly efficient way to include dynamic correlation energy by density functional theory. All applications reported previously involved complete active space self-consistent-field (CASSCF) theory for the reference wave function. For treating large systems efficiently, it is necessary to ask whether good accuracy is retained when using less complete configuration interaction spaces. To answer this question, we present here calculations employing MC-PDFT with the separated pair (SP) approximation, which is a special case (defined in this article) of generalized active space self-consistent-field (GASSCF) theory in which no more than two orbitals are included in any GAS subspace and in which inter-subspace excitations are excluded. This special case of MC-PDFT will be called SP-PDFT. In SP-PDFT, the electronic kinetic energy and the classical Coulomb energy, the electronic density and its gradient, and the on-top pair density and its gradient are obtained from an SP approximation wave function; the electronic energy is then calculated from the first two of these quantities and an on-top density functional of the last four. The accuracy of the SP-PDFT method for predicting the structural properties and bond dissociation energies of twelve diatomic molecules and two triatomic molecules is compared to the SP approximation itself and to CASSCF, MC-PDFT based on CASSCF, CASSCF followed by second order perturbation theory (CASPT2), and Kohn–Sham density functional theory with the PBE exchange–correlation potential. We show that SP-PDFT reproduces the accuracy of MC-PDFT based on the corresponding CASSCF wave function for predicting C–H bond dissociation energies, the reaction barriers of pericyclic reactions and the properties of open-shell singlet systems, all at only a small fraction of the computational cost.
Generally, most of the configurations in the FCI expansion of the active space in CASSCF computations make only small contributions to the total wave function. As a result, Ruedenberg and coworkers suggested that these configuration state functions (CSFs) are “deadwood” that can be excluded without significantly affecting the accuracy of the results.4 The generalized active space (GAS),5,6 restricted active space (RAS),7,8 occupation restricted multiple active spaces (ORMAS)9 and Split-GAS10,11 approaches are some of the frameworks that attempt to remove deadwood CSFs by partitioning the active space into subspaces. We have previously shown that active spaces larger than the CAS(18,18) limit can be attained with the generalized active space self-consistent-field theory, GASSCF.6,10,11
These MCSCF-type wave functions (CASSCF, GASSCF, etc.) can recover static correlation effects well, but are impractically slowly convergent (with respect to active space size) for the dynamic correlation energy, which is necessary for chemically accurate energetic calculations. For higher accuracy they can be used as zeroth-order reference functions in post-SCF perturbative, multireference coupled-cluster (CC), or multireference configuration interaction (CI) calculations to obtain a good approximation to the dynamic correlation energy. CASPT2 is a popular example that applies second-order perturbation theory to a CASSCF zero-order wave function.12,13 Such approaches, while capable of high accuracy,8,14 are however not suited for studying large systems because their computational costs rise rapidly with system size.
We have recently proposed an approach for treating strongly correlated systems at much lower computational costs than CASPT2, by combining CASSCF with density functional theory (DFT). This approach is called multiconfiguration pair-density functional theory (MC-PDFT).15,16 It may be considered to be a multiconfigurational analog of Kohn–Sham16 density functional theory16,17 (KS-DFT). In KS-DFT, the energy is computed as the kinetic energy and classical Coulomb energy of a Slater determinant (which is a single-configuration reference wave function) and a one-electron integral over an exchange–correlation functional of the one-electron density of the Slater determinant. The classical Coulomb energy includes the nuclear attraction of the electrons, the classical interelectronic repulsion of the electronic charge density, and the nuclear repulsion. The exchange–correlation density functional includes electron exchange, electron correlation, and the difference between the exact kinetic energy and that computed from the Slater determinant. The exact exchange–correlation density functional is unknown, so one uses approximations. In MC-PDFT, the energy is computed as the kinetic energy and classical Coulomb energy of an MCSCF reference wave function and a one-electron integral over an on-top density functional of the one-electron density and the on-top pair density of the reference wave function. The on-top density functional includes electron exchange, electron correlation, and the difference between the exact kinetic energy and that computed from the reference wave function. The MC-PDFT energy may be written as
(1) |
KS-DFT is usually applied full self-consistently; that is, the exchange–correlation functional is included during the SCF step. MC-PDFT can also in principle be applied fully self-consistently, but in all work reported so far and in the present article, we carry out the MCSCF calculation by CASSCF without the on-top density functional, and then calculate the final energy post-SCF from eqn (1). In this post-SCF mode, MC-PDFT is like the perturbation theory, multireference CC, and multireference CI wave function methods in that it attempts to use an MCSCF method to obtain a balanced reference wave function in an SCF step and to calculate an accurate energy in a post-SCF step. However, in the case of MC-PDFT, the cost of the post-SCF density functional step is negligible (if coded efficiently) compared to the cost of the SCF step, whereas in the wave function methods like CASPT2, the post-SCF step is more expensive than the SCF step. The cost of the SCF step though is still prohibitive for large systems if one uses CASSCF as the MCSCF method. In the present article we test whether MC-PDFT can yield accurate results when based on a GASSCF wave function. In particular, we present a systematic way to choose the active space in GASSCF theory. This new way of choosing the active space is called the separated-pair (SP) approximation. The method is intermediate between generalized valence bond (GVB) theory and complete active space self-consistent-field (CASSCF) theory. We then use SP and CASSCF as reference wave functions for MC-PDFT. The MC-PDFT method based on a CASSCF and a SP reference wave function will be labeled as CAS-PDFT and SP-PDFT, respectively, when it is desired to distinguish the kind of MCSCF wave function being used as the reference.
The next section presents the relevant theory and defines the separated pair (SP) approximation. We then provide computational details, test sets, results, and discussion.
In the present work, we only use GAS subspaces in which each subspace contains at most two orbitals, and interspace excitations are not allowed. A GASSCF calculation with these restrictions will be called the separated pair (SP) approximation, and when the number of subspaces is m, it will be abbreviated SP-m. If each subspace contains two electrons in two orbitals, this would be specified in the language of GASSCF as GAS-m(2m,2m) with the additional specification that no inter-subspace excitations are allowed. For singlet systems with an even number of electrons, we typically do have two electrons in two orbitals in each subspace, and the two orbitals in a given subspace are usually a bonding orbital and the corresponding antibonding orbital. This is reminiscent of the generalized valence bond perfect pairing (GVB-PP) algorithm,21 but it is more general. The GVB-PP approximation has subspaces of two electrons in two orbitals coupled to a singlet; this involves two or three configurations, depending on symmetry. In the SP approximation, when there are two electrons in two orbitals, they may be coupled into either a singlet or a triplet, and the various triplet pairs may be coupled in all possible ways to obtain CSFs with the desired overall spin symmetry of the system (for example, if the overall wave function is a singlet, one may have CSFs where four of the pairs are triplets and all the others are singlets, and the four triplet pairs may be coupled to each other in a variety of ways to obtain an overall singlet); thus the SP approximation involves more possible configurations than does the GVB-PP approximation. Nevertheless, the SP approximation greatly reduces the number of CSFs in the CI expansion as compared to CASSCF. It is also important to note that we carry out a FCI expansion in each GAS subspace. This is because we allow both singles and double excitations in each subspace containing just two orbitals. The SP approximation is more similar to the generalized valence bond restricted pairing (GVB-RP) approximation22 than to GVB-PP. A key advantage of SP and GVB-RP is that, unlike GVB-PP, they allow dissociation to high-spin fragments.21,22
In the SP approximation, every GAS subspace contains one electron in one or two orbitals or two or three electrons in two orbitals, depending on the system. Intersubspace excitations are always excluded. For closed-shell systems (and for open-shell singlets that can be made by breaking a bond in a closed-shell system) an SP-m approximation always corresponds to GAS-m(2m,2m). But the value of m depends on which pairs are included in the active space and which are treated as inactive (doubly occupied in all CSFs), and that is an individual choice. For example, we treat the molecular orbitals with parentage in the 2s atomic orbitals as active for C2 but inactive for N2, O2, and F2. Moss and coworkers,23 in their GVB-CI calculations on O2, also removed the fully occupied 1σg, 1σu, 2σg and 2σu molecular orbitals from the active space. For F2, we also treat the molecular orbitals with parentage in the 2px and 2py orbitals as inactive.
The SP approximation we used for the carbon dimer, C2, is shown in Fig. 1. This molecule has a closed-shell singlet ground state, and the orbitals shown in Fig. 1 correspond formally (at equilibrium) to a double bond and a ground state configuration of 2σg2, 2σu2, 1πux2, 1πuy2. This corresponds to a double bond as the occupied 2σu orbital is actually of antibonding character. Within C1 symmetry, there are 150 CSFs in this reference for the closed-shell singlet, as compared to 1764 CSFs in the analogous CAS(8,8) reference (the analogous GVB-PP wave function would have only 16 CSFs). We note that the SP-4 reference correctly dissociates to two high-spin (3P) carbon atoms, just like the CAS(8,8) reference.
The SP scheme for open-shell systems depends on the type of open-shell character. The SP-3 approximation that we used for O2 is shown in Fig. 2. O2 differs from C2 in that the σ bonding combination of 2p orbitals lies higher in energy than the π bonding combination for C2 but lower for O2. In O2, as already mentioned, the 2σ and 2σ* molecular orbitals (which are predominantly formed from the 2s atomic orbitals) are kept inactive. Therefore the SP-3 approximation that we used for O2 has GAS1 containing two electrons in the 3σg and 3σu orbitals (which are predominantly formed from the 2pz atomic orbitals), GAS2 containing three electrons in the 1π(px) and 1π*(px) orbitals, and GAS3 containing three electrons in the 1π(py) and 1π*(py) orbitals. This is a GAS-3(8,6) reference. It contains 20 CSFs in comparison to 378 CSFs for the full valence CAS(12,8) and 105 CSFs for CAS(8,6). This GAS-3(8,6) reference also correctly dissociates into two 3P oxygen atoms. The SP approximations we used for SO and S2 are isoelectronic to that for O2.
For the Cr dimer, Cr2, we calculated the potential energy curve with an SP-6 approximation, equivalent to GAS-6(12,12), with the twelve valence orbitals coupled in six GAS subspaces and two active electrons in each GAS subspace. Within D2h symmetry, there are 1516 CSFs in the SP-6 CI expansion, as compared to 28784 CSFs in the analogous CAS(12,12) CI expansion. The SP-6 approximation is sufficiently complete that the dimer correctly dissociates to two high-spin (7S Cr) atoms.
For methylene triplet or methylene open-shell singlet, a full valence CAS is (6,6). We can think of CH2 as derived from methane by dissociating two C–H bonds, and the antibonding orbitals associated with those bonds have left with the hydrogens. Thus these systems each have two singly occupied orbitals, which are taken as their own GAS subspaces with one electron in one orbital in each. In addition, they have two GAS spaces that each have two electrons in two orbitals. Thus the separated pair approximation we use is SP-4, which is shorthand in this case for GAS-4(6,6).
There are two important points to note. First, the SP approximation allows one to design GAS subspaces that contain only the bonding and antibonding orbitals necessary to describe a particular process. For example to compute the C–H dissociation energies of acetylene, ethylene and ethane, we included only orbitals relevant to C–H bonding in the SP active space. This formally leads to a SP-3 active space for both acetylene and ethynyl, an SP-4 active space for both ethylene and vinyl, and an SP-6 active space for both ethane and ethyl. We illustrate this feature with several examples. For the ethyl radical, a full valence CAS space would be (13,13) with seven bonds. We think of this as derived from ethane by removing a hydrogen atom, and the antibonding orbital accompanies it. Constructing GAS subspaces with the same logic as explained above for methylene then yields an SP-7 approximation that is equivalent to GAS-7(13,13). However, when we study C–H bond dissociation in this paper, we treat the C–C bonding orbital as inactive and use an SP-6 approximation corresponding to GAS-6(11,11). Ethynyl has a full-valence CAS of size (9,9). Since we are interested in C–H bond dissociation, we made the four electrons in π and π* orbitals inactive, which yields an SP-3 approximation equivalent to a GAS-3(5,5) reference. Vinyl has a full valence CAS of size (11,11). Since we are interested in C–H bond dissociation in, and because we are interested in seeing the effect of aggressively reducing the size of the active space, we removed the four electrons in the CC bond and the associated σ, σ*, π, and π* orbitals from the active space, which yields an SP-4 approximation, equivalent to GAS-4(7,7).
Second, the SP-1 approximation is equivalent to CASSCF(2,2), a case which applies to lithium hydride (LiH), as an example. In addition, as we are performing a full CI for each subspace, SP and SP-PDFT are size consistent in so far as the active space is chosen correctly. For all other molecules, the SP approximation used here involves an active pair for all or some of the bonds, as specified in each case. Nonbonding valence orbitals and core orbitals are always doubly occupied.
Fig. 3 Mean absolute errors (MAE) with respect to experimental values of the calculated bond distances of eleven main-group diatomic molecules and the chromium dimer, Cr2, obtained with several computational approaches (left). The MAE obtained without the results for Cr2 (labeled as MAE-11) is shown on the right. The theoretical methods are grouped into three classes. The first are based on CASSCF (CASSCF, CASPT2, CASPT2-0, CAS-tPBE, CAS-ftPBE). The second are based on the SP approximation (SP, SP-tPBE, SP-ftPBE). The third is Kohn–Sham DFT with the PBE exchange–correlation functional. All experimental data were obtained from ref. 40. |
It has previously been recognized that CASSCF solutions generally lead to equilibrium bond lengths that are too long,13 and our results are consistent with this. CASSCF has a mean absolute error (MAE) of 0.146 Å when compared to experimental data. This statistic is however dominated by the result obtained for Cr2, for which CASSCF overestimates the equilibrium bond length by 1.52 Å. Without the results obtained for Cr2, the MAE of CASSCF (labeled as MAE-11) is 0.021 Å. This is similar to previous results.13 The MAE-11 of KS-PBE (0.022 Å) is similar to that of CASSCF. However, we find that the CAS-tPBE and CAS-ftPBE methods reduce the MAEs of both CASSCF and KS-PBE by in excess of 50%. As shown in Fig. 3, CAS-tPBE (MAE of 0.012 Å and MAE-11 of 0.010 Å) and CAS-ftPBE (MAE of 0.009 Å and MAE of 0.008 Å) perform as well as the much more expensive CASPT2 method (MAE of 0.011 Å and MAE-11 of 0.011 Å), with CAS-ftPBE being the best of the three approaches for bond distances. Without the IPEA shift, CASPT2 (labeled as CASPT2-0), performs poorly for Cr2, resulting in a MAE of 0.076 Å. Even without the data for Cr2, CASPT2-0 (MAE-11 of 0.012 Å) is not as good as CAS-tPBE and CAS-ftPBE.
Examination of Table S1† (tables and figures numbered with the prefix S are in the ESI†) indicates that CASPT2 and CASPT2-0 significantly outperform CAS-PDFT and SP-PDFT only for the bond length of the fluorine molecular dimer, F2. CAS-tPBE and CAS-ftPBE underestimate the bond-length of F2 by 0.021–0.023 Å, while CASPT2 and CASPT2-0 overestimate it by 0.011–0.014 Å. For the highly multireference systems (B2, C2, and Cr2), CAS-tPBE and CAS-ftPBE perform better than CASPT2 and CASPT2-0 for B2 and C2, while CAS-ftPBE gives a similar error as CASPT2 for Cr2.
When comparing SP and CASSCF, we see that restricting the active space with the SP approximation only marginally alters the MAE and MAE-11 of the calculated bond distances of these diatomic molecules. The largest difference between the results obtained with CASSCF and SP was found for B2 and Cr2. In all other cases, the difference between these methods is in the range 0.002–0.007 Å, as shown in Table S1.† More importantly, there is no noticeable difference in the MAE obtained for SP-PDFT (SP-tPBE and SP-ftPBE) and MC-PDFT (CAS-tPBE and CAS-ftPBE), as shown in Fig. 3. Indeed, SP-PDFT performs equally as well as CAS-PDFT in all the cases that were tested; details are in Table S1.†
Expt. | LiH | HF | B2 | C2 | CO | S2 | SO | NH | N2 | O2 | F2 | Cr2 | MAEa | MAE-11 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
57.7 | 141.3 | 70.0 | 146.0 | 256.2 | 100.8 | 123.5 | 78.5 | 228.5 | 120.3 | 38.2 | 33.9 | |||
a MAE = mean absolute error and MAE-11 = mean absolute error without the data for Cr2. b These rows give the numbers of CSFs in the CASSCF and SP calculations. | ||||||||||||||
CAS CSFsb | 3 | 15 | 1512 | 1764 | 1176 | 378 | 378 | 45 | 1176 | 378 | 36 | 28784 | ||
CASSCF | −13.3 | −26.8 | −11.0 | −1.3 | −4.4 | −26.3 | +36.5 | −13.7 | −16.0 | −31.9 | −19.1 | −30.8 | 19.3 | 18.2 |
CASPT2 | −3.9 | −1.5 | −3.7 | +0.1 | −4.0 | −1.1 | +1.5 | +1.4 | −7.1 | −0.1 | −0.5 | −5.8 | 2.6 | 2.3 |
CASPT2-0 | −3.5 | −2.2 | −4.0 | −3.6 | −9.6 | −4.4 | +3.2 | −0.3 | −16.5 | −1.0 | −1.8 | −7.8 | 4.8 | 4.6 |
CAS-tPBE | −3.4 | +4.8 | −2.5 | −1.3 | +0.4 | +6.6 | +4.8 | +4.5 | −1.6 | +8.6 | +12.6 | −18.5 | 5.8 | 4.6 |
CAS-ftPBE | −2.7 | +5.7 | −2.3 | +3.8 | −0.5 | +1.9 | 0.3 | +9.4 | +4.0 | +1.0 | +10.8 | −5.1 | 4.0 | 3.9 |
SP CSFsb | 3 | 3 | 100 | 150 | 37 | 20 | 20 | 4 | 37 | 20 | 3 | 1516 | ||
SP | −13.3 | −26.8 | −15.9 | −7.1 | −8.1 | −26.8 | +37.1 | −15.2 | −24.7 | −32.6 | −21.6 | −31.2 | 21.7 | 20.8 |
SP-tPBE | −3.4 | +4.8 | +0.7 | −1.2 | +0.2 | +6.3 | +4.4 | +5.0 | −3.2 | +8.1 | +10.2 | −19.4 | 5.6 | 4.3 |
SP-ftPBE | −2.7 | +5.7 | −1.2 | +3.6 | +0.5 | +1.7 | +0.6 | +9.5 | +2.5 | +0.6 | +9.0 | −6.1 | 3.6 | 3.4 |
PBE | −4.2 | +0.7 | +7.0 | +10.2 | +11.9 | +12.0 | +14.6 | +18.4 | +14.9 | +22.9 | +14.8 | +10.5 | 11.8 | 12.0 |
KS-PBE calculations perform much better than either CASSCF or SP in nearly all cases, as expected since neither CASSCF nor SP include dynamic correlation. However KS-PBE also has a rather large MAE (11.8 kcal mol−1) as well as a large MAE-11 (12.0 kcal mol−1). The only system for which KS-PBE approaches chemical accuracy is hydrogen fluoride (HF), a system in which the dominant configuration has a weight of 99.9%. This is by all measures a single-reference system.
The importance of including dynamic correlation for correctly computing the dissociation energies of these diatomic molecules is seen by comparing CASSCF with CAS-tPBE, CAS-ftPBE, CASPT2, and CASPT2-0 as well as by comparing SP with SP-tPBE and SP-ftPBE. The CAS-PDFT and SP-PDFT methods both perform very well for B2 and C2, which are two systems with strong multireference character (the dominant configuration has a weight of less than 80%).41 They reduce the MAEs and MAE-11s of CASSCF and SP by factors of about 4 and 6 respectively. For the systems presented in Table 1, the MAEs and MAE-11s of CAS-PDFT and SP-PDFT are close to those of CASPT2. When comparing CAS-PDFT and SP-PDFT to CASPT2, one has to bear in mind that the latter incorporates an empirical IPEA shift, specifically designed to improve agreement with experimental results; 2.2 (2.3) kcal mol−1 separates the MAE (MAE-11) of CASPT2 and CASPT2-0, indicating the importance of the empirical IPEA shift.26
The CAS-PDFT and SP-PDFT results are almost as good as CASPT2 and CASPT2-0. Examination of Table 1 shows that the worst results for CAS-PDFT and SP-PDFT are obtained for F2 (and Cr2 in the case of CAS-tPBE and SP-tPBE). It is particularly encouraging that SP-tPBE and SP-ftPBE essentially match CAS-tPBE and CAS-ftPBE, which are based on full-valence CASSCF solutions; this is one of the key findings of this paper, and it is important because SP-PDFT can treat much larger systems that CAS-PDFT. In principle, as CASSCF is affordable to upwards of 35 million CSFs, it should be possible to create SP solutions that approach that limit as well. As such one can envisage using SP and SP-PDFT for systems that are unaffordable for CASSCF and CAS-PDFT. As examples, SP and SP-PDFT can be used to describe the full π/π* manifold of chrysene (C18H12) as well as the full valence space of benzene–tetracyanoethylene complexes.
Two other interesting points are (1) that the results are stable as far as replacing tPBE by ftPBE or vice versa and (2) that ftPBE results in significant improvements in the results obtained for Cr2 (both for bond distances, Table S1† and for bond dissociation energies, Table 1), suggesting that it might be particularly well suited for transition metal systems.
For O2 and N2 in the bonding regions (∼0.9–1.2 Å), the total electronic energy obtained with SP deviates from the CASSCF energy by about 3–10 kcal mol−1 as shown in Table 2. This is because some of the CSFs deleted in going from the complete active space to the separated-pair active space contribute nonnegligible amounts of dynamic correlation in these cases. At greater internuclear separations (2.5 and 5.0 Å), the differences between the total energies obtained with CASSCF and SP become small, as also shown in Table 2. In contrast, the total energies obtained with SP-tPBE and SP-ftPBE are much closer to those of CAS-tPBE and CAS-ftPBE respectively. In Table 2, we see that the largest difference between the total energies obtained with the SP-PDFT and CAS-PDFT approaches are about 1.7 kcal mol−1, which shows that the PDFT approach recovers the static and dynamic correlation energy that were neglected by using the approximate SP approximation in a variational wave function calculation. This is extremely encouraging. The ability of PDFT to recover these electron correlation effects is the reason why the potential energy curves obtained with SP-PDFT are closer to those obtained with CAS-PDFT in Fig. SI1† than SP is to CASSCF.
R (Å) | % weight of dominant configuration CASSCF:SP | CASSCF:SP | CAS-tPBE:SP-tPBE | CAS-ftPBE:SP-ftPBE | |
---|---|---|---|---|---|
N2 | 0.9 | 96.0:96.6 | 7.1 | 1.4 | 1.2 |
1.2 | 90.6:91.2 | 9.4 | 1.6 | 1.7 | |
2.5 | 12.4:12.1 | 0.54 | 0.26 | 0.27 | |
5.0 | 6.3:8.3 | 0.002 | 0.001 | 0.001 | |
O2 | 1.1 | 96.0:96.3 | 3.6 | 0.28 | 0.29 |
1.2 | 94.2:94.6 | 4.0 | 0.033 | 0.046 | |
2.5 | 34.2:33.8 | 0.26 | 0.21 | 0.20 | |
5.0 | 25.0:33.3 | 0.044 | 0.030 | 0.038 | |
Cr2 | 1.6 | 57.6:56.4 | 8.8 | 1.0 | 1.1 |
1.8 | 28.8:25.9 | 3.7 | 1.5 | 1.8 | |
2.6 | <2:<2 | 0.3 | 0.03 | 0.004 | |
5.0 | <2:<2 | <0.001 | 0.002 | 0.002 |
We emphasize that the SP-PDFT and CAS-PDFT agree well both in the bonding regions of N2 and O2, where there are dominant configurations with greater than 90% weight, and in the limit of dissociation, where there are many configurations with appreciable weights. We see similar effects in Cr2. These results are shown in Table 2. It is also interesting to probe the origins of the agreement between SP-PDFT and CAS-PDFT by examining the approximate occupation numbers of the correlated orbitals in the SP and CASSCF solutions on which they are based, respectively. If we examine C2 at equilibrium, the CASSCF solution has occupation numbers of (1.9840:0.0136); (1.5960:0.3997); (1.8913:0.1121); (1.8911:0.1123), in order, for the pairs shown in Fig. 1. The SP solution has very similar occupation numbers of (1.9892:0.0108); (1.6051:0.3949); (1.8924:0.1074); (1.8924:0.1074). This suggests that the CASSCF and SP solutions result in comparable density and on-top pair density, a fact that is sufficient for quantitatively accurate SP-PDFT calculations, even though the parent SP solution is higher in energy than the analogous CASSCF solution by 0.0116 Hartrees (11.6 mH).
For CH2, a full valence CASSCF(6,6) wave function is used for subsequent CASPT2, CASPT2-0, and CAS-PDFT calculations. In C1 symmetry, this active space choice results in 189 and 175 CSFs for the 3B1 and 1A1 states, respectively. For SP and SP-PDFT calculations, we used the SP-4 active space, as described above. This leads to a total of 25 and 17 CSFs for the 3B1 and 1A1 states, respectively.
For ozone, we used a CAS(12,9) reference for CASSCF and an SP-3 reference for the SP approximation, with the latter resulting in the reduction of the number of CSFs from 2520 to 37. In essence 98.5% of the CSFs in the CASSCF(12,9) solution are completely neglected in the SP-3 approximate wave function. These active space schemes are illustrated in Fig. 4.
CASSCF | CASPT2 | CASPT2-0 | CAS-tPBE | CAS-ftPBE | SP | SP-tPBE | SP-ftPBE | KS-PBE | Expt.44,48,a | |
---|---|---|---|---|---|---|---|---|---|---|
a To obtain these values, the thermal correction to the enthalpy at 298 K obtained by KS-PBE/aug-cc-pVTZ is added to the empirical atomization ΔH298 of CH2. The frequency component of this correction was scaled using a scaling factor obtained from ref. 35 for this model chemistry. | ||||||||||
Methylene | ||||||||||
R e (Å) | ||||||||||
3B1 | +0.005 | −0.006 | −0.005 | −0.002 | −0.003 | +0.005 | −0.003 | −0.004 | +0.000 | 1.085 |
1A1 | +0.017 | +0.005 | +0.005 | +0.011 | +0.010 | +0.023 | +0.012 | +0.006 | +0.015 | 1.107 |
HCH (degrees) | ||||||||||
3B1 | −4.2 | −1.9 | −1.3 | +0.7 | +1.2 | −4.1 | +2.0 | +3.4 | +0.0 | 135.5 |
1A1 | −1.1 | −0.5 | −0.7 | +0.1 | +1.0 | −1.5 | +0.5 | +2.6 | −1.6 | 102.4 |
AE of 3B1 (kcal mol−1) | −19.5 | +0.9 | −0.8 | +2.5 | +8.1 | −22.4 | +2.4 | +7.1 | +8.2 | 186.2 |
S–T gap (kcal mol−1) | +1.5 | +2.7 | +4.8 | −1.2 | −2.3 | +4.5 | −0.8 | −2.7 | +6.9 | 8.6 |
Ozone | ||||||||||
R e (Å) | +0.005 | +0.007 | +0.007 | −0.006 | −0.005 | +0.002 | −0.009 | −0.012 | −0.003 | 1.278 |
OOO (degrees) | −0.6 | −0.2 | −0.4 | +0.6 | +0.4 | −2.2 | +1.8 | +1.6 | +1.4 | 116.8 |
AE (kcal mol−1) | −46.2 | +2.2 | −4.8 | +27.7 | +21.4 | −81.0 | +29.9 | +27.5 | +41.9 | 142.5 |
CAS-PDFT reduces the errors in the calculated structural properties of CH2 to within the margins provided by CASPT2 and CASPT2-0. In general, the C–H bond lengths obtained with CAS-PDFT are within 0.004–0.007 Å of the values obtained with CASPT2 and CASPT2-0. This is the case for the 3B1 and 1A1 states.
For the 3B1 state, CAS-PDFT overestimates bond angles by about 0.7–1.5° while CASPT2 underestimates them by about 1.9°. Compared with CAS-PDFT, SP-PDFT gives almost the same C–H bond lengths, and the bond angles are about 2° larger. We note that Jensen and Bunker obtained a bond angle of 133.9° for the 3B1 state.45 This is 1.6° below the experimental value shown in Table 3, and indicates that the results obtained with CASPT2, CASPT2-0, CAS-PDFT and SP-PDFT are within the range of available experimental data.
For the 1A1 state, CAS-PDFT overestimates the bond angle by up to 1.0°, while CASPT2 and CASPT2-0 underestimate by 0.5 and 0.7°, respectively. Similar to the situation for the 3B1 state, SP-PDFT results in slightly larger bond angles.
An earlier approach for combining MCSCF-type methods with DFT has been described by Cremer and coworkers.46 This method, which they call CAS-DFT, does not perform as well as CAS-PDFT and SP-PDFT for predicting the structural properties of CH2.47 It overestimates the C–H bond length of the 3B1 state by 0.017 Å. For the 1A1 state of CH2, it overestimates the C–H bond length by 0.031 Å.
To calculate the atomization energy of CH2 and O3, the C–H and O–O bond lengths are stretched to 12 Å, while keeping the equilibrium bond angle fixed at the value obtained with each method. (Our general conclusions remain unchanged if we use the experimental value of the bond angle.) CASSCF and SP underestimate the atomization energy of the 3B1 state of CH2 by about 20 kcal mol−1 while PBE overestimates by about 8.5 kcal mol−1, as shown in Table 3. Calculations with CAS-tPBE and SP-tPBE bring the error down to below 3.0 kcal mol−1, which is similar to CASPT2, which overestimates the bond energy by 1.2 kcal mol−1. Inclusion of the gradient of the on-top pair density, results in errors of 8.4 and 7.4 kcal mol−1 for CAS-ftPBE and SP-ftPBE, respectively, still better than KS-PBE but much worse than tPBE.
Table 3 shows that the CAS-tPBE and SP-tPBE calculations underestimate the adiabatic singlet–triplet gap by about 1.0 kcal mol−1 while CAS-ftPBE and SP-ftPBE calculations underestimate the separation by 2.3 and 2.7 kcal mol−1, respectively. The earlier CAS-DFT approach also underestimates the gap (by 1.7 kcal mol−1).47 These results are quite encouraging when compared to CASPT2 and CASPT2-0, which overestimate the separation by 2.7 and 4.8 kcal mol−1, respectively.
In the present work, we have used an even more restrictive framework, namely SP-3, which becomes GASSCF-3(6,6)-0e in the general notation. The first two subspaces each contain a coupled pair of σ and σ* orbitals while the third space contains a coupled pair of π and π* orbitals. In contrast, we placed 12 electrons in 9 orbitals for the CASSCF calculations. The nine orbitals are those formed by combination of the 2px, 2py, and 2pz orbitals of the three oxygen atoms, as shown in Fig. 5. The dominant configuration in the CASSCF(12,9) wave function has a weight of only 84%, showing that this system has significant multi-reference character.
Table 3 shows that the optimized geometry of O3 obtained with CASSCF is in good agreement with the results of Tsuneda et al.,51 who used a similar active space with the cc-pVTZ basis sets augmented with s, p, and d diffuse functions. Also, the structural parameters obtained with CASPT2 are in agreement with the reports of Ljubic and Sabljic, who used the same active space.50 The bond lengths and bond angle obtained with SP-PDFT are similar to those obtained with CAS-PDFT, despite the fact that the underlying SP wave function contains only about 1.4% of the number of CSFs in the CASSCF solution. CAS-PDFT slightly underestimates the O–O distances and slightly overestimates the bond angle, while CASPT2 and CASPT2-0 have opposite behaviors.
Krishna and Jordan have previously reported that CASSCF underestimates the atomization energy of O3 by about 57.7 kcal mol−1.52Table 3 shows that CASSCF and the SP approximation are both poor for calculating the atomization energy of O3. These are the two methods that do not attempt to include most of the dynamic electron correlation. On the other hand, PBE overestimates the atomization energy by about 42 kcal mol−1 but CAS-tPBE and CAS-ftPBE reduce the error of PBE by 14 and 20 kcal mol−1, respectively. SP-tPBE and SP-ftPBE behave similarly to CAS-tPBE and CAS-ftPBE respectively. However, CASPT2 and CASPT2-0 perform best for predicting the atomization energy of O3.
C2H2 → C2H˙ + H˙ |
C2H4 → C2H3˙ + H˙ |
C2H6 → C2H5˙ + H˙ |
The calculated energies for these reactions are compiled in Table 4, where they are compared to experimental values estimated by adding the thermal correction to the enthalpy at 298 K obtained by KS-PBE/aug-cc-pVTZ to the empirical ΔH298 reported by Blanskby and Ellison.53 The frequency component of this correction was scaled using scaling factors obtained from ref. 35 for this model chemistry. We used full valence active spaces for the CASSCF calculations in this table: CAS(10,10), CAS(9,9), CAS(12,12), CAS(11,11), CAS(14,14) and CAS(13,13) for acetylene, ethynyl, ethylene, vinyl, ethane and ethyl respectively. With C1 symmetry, these active spaces result in 19404, 8820, 226512, 104544, 2760615 and 1288287 CSFs respectively. In Table 4 we report the number of CSFs only for the parent compound and not for the dissociation radical species. The active spaces used in SP and SP-PDFT calculations are presented in the computational details section and are illustrated for ethane and the ethyl radical in Fig. 5. Only orbitals with significant C–H character are included in the SP active spaces. As previously noted, we used SP-3, SP-3, SP-4, SP-4, SP-6 and SP-6 active spaces for the acetylene, ethynyl, ethylene, vinyl, ethane and ethyl, respectively. These result in 37, 17, 150, 76, 3012 and 1704 CSFs respectively, a significant reduction compared with the full CASSCF calculation.
CASSCF | CAS CSFsa | CASPT2 | CASPT2-0 | CAS-tPBE | CAS-ftPBE | SP | SP CSFsa | SP-tPBE | SP-ftPBE | KS-PBE | Expt.b | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
a These columns give the numbers of CSFs in the CASSCF and SP calculations for the compound. b Thermal correction to the enthalpy at 298 K obtained by KS-PBE/aug-cc-pVTZ are added to empirical enthalpy ΔH298 data.53 | ||||||||||||
Acetylene | 123.7 | 19404 | 136.1 | 134.8 | 140.7 | 141.4 | 115.7 | 37 | 141.5 | 141.3 | 138.2 | 140.9 |
Error | −17.2 | −4.8 | −6.1 | −0.2 | +0.5 | −25.2 | +0.6 | +0.4 | −2.7 | |||
Ethylene | 106.3 | 226512 | 114.9 | 113.7 | 116.7 | 116.9 | 93.8 | 150 | 117.4 | 116.0 | 113.6 | 119.5 |
Error | −13.2 | −4.6 | −5.8 | −2.8 | −2.6 | −25.7 | −2.1 | −3.5 | −5.9 | |||
Ethane | 99.7 | 2760615 | 106.4 | 105.1 | 106.1 | 105.7 | 68.8 | 3012 | 112.3 | 111.3 | 104.6 | 110.4 |
Error | −10.7 | −4.0 | −5.3 | −4.3 | −4.7 | −41.6 | +1.9 | +0.9 | −5.8 | |||
MSE | −13.7 | −4.5 | −5.7 | −2.4 | −2.3 | −30.8 | 0.1 | −0.7 | −4.8 | |||
MAE | 13.7 | 4.5 | 5.7 | 2.4 | 2.6 | 30.8 | 1.5 | 1.6 | 4.8 |
In all the three cases presented in Table 4, CAS-PDFT performs much better than CASPT2, CASPT2-0, or PBE, and SP-tPBE and SP-ftPBE do even better with MAEs of only 1.6 and 1.7 kcal mol−1, respectively. This is another demonstration that PDFT effectively recovers correlation that is left behind by the active space restrictions of the SP approximation, even though we were very aggressive in including only a small number of pairs. Fig. S2† shows that the SP-PDFT potential energy curves for C–H cleavage in acetylene, ethylene, and ethane match those obtained with CASPT2 and CAS-PDFT very well, so the active space restrictions do not distort the shape of the potential energy curves.
In general we are presenting the SP-PDFT results for just one small active space, as previously indicated. We did do some testing to see the effect of using different choices of which pair of orbitals to include, and we found that the effect of adding or removing spectator pairs was small. For example we used SP-5 for ethylene and vinyl and found a difference in the calculated C–H dissociation energy of only 0.2 kcal mol−1 as compared to the SP-4 results presented in the table.
CASSCF | CAS CSFsa | CASPT2 | CASPT2-0 | CAS-tPBE | CAS-ftPBE | SP | SP CSFsa | SP-tPBE | SP-ftPBE | KS-PBE | Expt.b | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
a These columns give the numbers of CSFs in the CASSCF and SP calculations. b Scaled zero point energy corrections obtained at the B3LYP/6-31G(d) level are added to experimental ΔH‡0K data to obtain the last column.33,54 | ||||||||||||
1 | 36.0 | 20 | 34.0 | 32.5 | 36.0 | 36.2 | 36.6 | 10 | 36.2 | 36.3 | 32.1 | 33.6 |
Error | +2.4 | +0.4 | −1.1 | +2.4 | +2.6 | +3.0 | +2.6 | +2.7 | −1.5 | |||
2 | 45.3 | 175 | 28.8 | 27.7 | 30.0 | 30.0 | 58.3 | 37 | 30.3 | 30.8 | 25.5 | 30.2 |
Error | +15.1 | −1.4 | −2.5 | −0.2 | −0.2 | +28.1 | +0.1 | +0.6 | −4.7 | |||
3 | 39.1 | 1764 | 25.5 | 24.4 | 25.8 | 25.8 | 52.1 | 150 | 25.5 | 25.9 | 23.2 | 29.5 |
Error | +9.6 | −4.0 | −5.2 | −3.7 | −3.7 | 22.0 | −4.0 | −3.6 | −6.3 | |||
4 | 50.8 | 20 | 36.4 | 35.4 | 31.9 | 33.1 | 60.1 | 10 | 32.0 | 32.9 | 31.3 | 38.8 |
Error | +12.0 | −2.4 | −3.4 | −6.9 | −5.7 | +21.3 | −6.8 | −5.9 | −7.5 | |||
5 | 43.9 | 20 | 25.5 | 25.4 | 21.0 | 22.3 | 40.7 | 10 | 20.9 | 21.9 | 22.8 | 25.8 |
Error | +18.1 | −0.3 | −0.4 | −4.8 | −3.5 | +14.9 | −4.8 | −3.8 | −3.0 | |||
MSE | 11.4 | −1.5 | −2.5 | −2.6 | −2.1 | 17.9 | −2.6 | −2.0 | −4.6 | |||
MAE | 11.4 | 1.7 | 2.5 | 3.6 | 3.1 | 17.9 | 3.7 | 3.2 | 4.6 |
Fig. 6 The barriers for these five pericyclic reactions were calculated with SP-PDFT and other theoretical methods. |
SP and CASSCF overestimate the reaction barriers by 15 kcal mol−1 or more, in nearly all cases. Table 5 shows that the only exception is for reaction 1, for which they overestimate it by only 2.4 and 3.0 kcal mol−1 respectively. SP-PDFT and CAS-PDFT greatly improve on CASSCF and SP, reducing the MAEs by factors of 3–6. This is similar to the situation found for reactions involving small molecules.19 The MAEs of SP-PDFT and CAS-PDFT are still somewhat large, at least when compared to CASPT2. However, regarding the central topic of this manuscript, we find that the calculated reaction barriers are stable to the use of a restricted SP wave function; that is, SP-PDFT and CAS-PDFT yield similar results for each reaction and overall have similar MAEs.
Fig. 7 Illustrations of the structures of twisted ethylene, 1,4-didehydrobenzene and α,3-didehydrotoluene. |
CASSCF | CAS CSFsa | CASPT2 | CASPT2-0 | CAS-tPBE | CAS-ftPBE | SP | SP CSFsa | SP-tPBE | SP-ftPBE | KS-PBEb | Expt. | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
a These columns give the numbers of CSFs in the CASSCF and SP calculations. For B and C, we give the number of CSFs in the triplet state first. b We compare the variational energies obtained at the PBE level. c For A, we give the energy of twisted ethylene relative to planar ethylene. For B and C, we give the energies of the singlet states relative to the triplet state. | ||||||||||||
A | 73.1 | 226512 | 69.7 | 67.6 | 72.6 | 69.5 | 72.0 | 3 | 74.3 | 72.5 | 65.8 | |
B | −2.6 | 2352; 1764 | −4.7 | −4.5 | −3.9 | −4.0 | −3.7 | 204; 200 | −3.9 | −4.0 | −3.6 | −3.5 ± 0.5 (ref. 55) |
C | −2.9 | 2352; 1764 | −1.9 | −1.7 | −0.3 | −0.4 | +0.7 | 204; 200 | −0.3 | −0.3 | −0.7 | −2.5 ± 2.5 (ref. 56) |
For the gap between twisted and planar ethylene, SP-tPBE and SP-ftPBE provide similar results that are 2–3 kcal mol−1 above those obtained with CAS-tPBE and CAS-ftPBE respectively, as shown in Table 6. The results obtained with CAS-ftPBE and CASPT2 are the same. The results obtain for the fully translated functionals improve upon CAS-tPBE and SP-tPBE. Lischka and coworkers obtained a value of 69.2 kcal mol−1 at the MR-CISD+Q/SA-3-RDP level while using similar basis sets.34 This is close to the values obtained by CAS-ftPBE and CASPT2.
For 1,4-didehydrobenzene, the results obtained with CAS-tPBE, CAS-ftPBE, SP-tPBE and SP-ftPBE fall within the error bar of the experiment, which is −3.5 ± 0.5 kcal mol−1. In contrast, CASPT2 and CASPT2-0 fall outside this range; they overestimate even as compared to the high end of the experimental results55 by 0.7 and 0.5 kcal mol−1, respectively. CAS-DFT predicted an energy separation of −2.6 kcal mol−1 between the 1Ag and 3B1u states of 1,4-didehydrobenzene.47 For α,3-didehydrotoluene, CAS-tPBE, CAS-ftPBE, SP-tPBE and SP-ftPBE correctly predict that the 1A′′ state is more stable than the 3A′′ state. The separations provided by all these methods are similar and fall within the range of available experimental data. They are however about 1.5 kcal mol−1 smaller than the separations predicted by CASPT2 and CASPT2-0. Unfortunately, available experimental reports only indicate that the splitting should be lesser than 5.0 kcal mol−1.
We subsequently considered whether the SP approximation is useful for multi-configuration pair-density functional theory (MC-PDFT), and to put this in context we first contrast MC-PDFT to Kohn–Sham density functional theory (KS-DFT). In KS-DFT, ones represents the density by a Slater determinant, and one writes the total energy as the sum of the kinetic energy computed from the Slater determinant by standard wave function methods, the Coulomb energy computed classically from the density, and a remainder. The remainder is a functional of the density and is called the exchange–correlation energy, and it includes not just the deviation of the true potential energy from the Coulomb energy computed classically from the density, but also the deviation of the true kinetic energy from the Slater-determinant kinetic energy. In MC-PDFT, we represent the density and the on-top pair density by a multi-configurational wave function, and we write the total energy as the sum of the kinetic energy computed from the multi-configurational wave function by standard wave function methods, the Coulomb energy computed classically from the density, and a remainder. The remainder is a functional of the density and the on-top pair density and is called the on-top energy, and it includes both the deviation of the true potential energy from the Coulomb energy computed classically from the density and the deviation of the true kinetic energy from the multi-configurational-wave-function kinetic energy. In most previous attempts to combine multi-configurational wave functions with density functional theory, one writes the total energy as the sum of the energy computed by wave function theory from the multi-configurational wave function plus a remainder. Because the energy computed by wave function theory from the multi-configurational wave function includes some of the effect of electron correlation on the true potential energy, one must be careful not to include this portion of the correlation energy in the remainder; this can be called the double counting problem. Because we use only the kinetic energy of the multi-configurational wave function, we avoid this double counting problem. Note though that we do not know an exact on-top functional, just as an exact exchange–correlation functional is not known in KS-DFT, so our treatment is not exact. A major goal of both KS-DFT and MC-PDFT is to find a better approximation to the corresponding exact functional. One motivation for MC-PDFT is that it might be “easier” to find a good on-top functional than to find a good exchange–correlation functional for two reasons: (1) our kinetic energy is based on a representation that better conforms to the true wave function when (as is often the case) the system is inherently multi-configurational (as, for example, for describing the breaking of a covalent bond), and (2) our functional is allowed to depend on the on-top pair density, which brings in extra information. Many years of development have gone into modern exchange–correlation functionals,57 whereas for on-top functionals we are still using first-generation approximations. The present paper, however, is not about improving the functional but rather about testing how many and what kind of configurations need be present in the multi-configurational wave function in order obtain reasonable results with simple density functionals. We found that the new SP approximation, discussed in the previous paragraph for wave function theory, provides an economical multi-configurational wave function that yields good accuracy with MC-PDFT. Thus we have presented a version of MC-PDFT called separated-pair pair-density functional theory, abbreviated SP-PDFT. The SP-PDFT method uses a separated-pair (SP) wave function to generate the kinetic and classical Coulomb contributions to the total electronic energy, and the remainder of the total electronic energy is computed from a functional of the total density and the on-top pair density taken from the SP wave function. The SP-PDFT methods can therefore be used for large systems for which conventional CASSCF calculations, CASPT2, and CAS-PDFT are unaffordable.
Sometimes the SP approximation wave function calculations agree well with the CASSCF ones; in other cases they are less accurate, as would be expected. But even in cases where the energetic results obtained by wave function theory from the SP approximation are less accurate than those obtained by CASSCF, we show that the SP approximation is able to produce an accurate enough kinetic energy and on-top pair density that the SP-PDFT results are in generally good agreement with the CAS-PDFT results. The tests included in this article include structural properties and bond dissociation energies of eleven diatomic and two triatomic molecules, the C–H dissociation energies of prototypical organic systems, the reaction barriers of pericyclic reactions, and the description of open-shell singlet species. In all the cases that were tested, SP-PDFT provides approximately the same accuracy as CAS-PDFT. In most cases, both SP-PDFT and CAS-PDFT provide similar levels of accuracy as the much more expensive CASPT2 approach; the only exception to this is for the reaction barriers of pericyclic reactions.
The key result for wave function theory is that the SP approximation often agrees quite well with CASSCF, at greatly reduced cost, and this extends the usefulness of the method to bigger systems. The key result for density functional theory is that the quality of results obtained from MC-PDFT calculations remains largely unchanged even with drastic reductions in the number of included CSFs, as we have in made in the SP-PDFT variant of the method. In addition the SP-PDFT approach, just as is the more general MC-PDFT framework, is free of double-counting of electron correlation energies. This double-counting problem plagues nearly all other hybrid approaches for combining CASSCF and KS-DFT.
Future work that one can anticipate includes testing the performance of the SP and SP-PDFT methods for transition metal complexes, developing better on-top functionals of the MCSCF density and on-top pair density, and developing an orbital optimization algorithm that includes the on-top functional in the self-consistent-field step.
Footnote |
† Electronic supplementary information (ESI) available: Optimized structures of all molecules. Details of the active spaces used in CASSCF and SP computations. See DOI: 10.1039/c5sc03321g |
This journal is © The Royal Society of Chemistry 2016 |