Kenji
Sugisaki
*abc,
Takumi
Kato‡
d,
Yuichiro
Minato
d,
Koji
Okuwaki
e and
Yuji
Mochizuki
ef
aDepartment of Chemistry, Graduate School of Science, Osaka City University, 3-3-138 Sugimoto, Sumiyoshi-ku, Osaka 558-8585, Japan. E-mail: sugisaki@osaka-cu.ac.jp
bJST PRESTO, 4-1-8 Honcho, Kawaguchi, Saitama 332-0012, Japan
cCentre for Quantum Engineering, Research and Education (CQuERE), TCG Centres for Research and Education in Science and Technology (TCG CREST), 16th Floor, Omega, BIPL Building, Blocks EP & GP, Sector V, Salt Lake, Kolkata, 700091, India
dBlueqat Inc., 2-24-12 Shibuya, Shibuya-ku, Tokyo 150-6139, Japan
eDepartment of Chemistry, Faculty of Science, Rikkyo University, 3-34-1 Nishi-ikebukuro, Toshima-ku, Tokyo 171-8501, Japan
fInstitute of Industrial Science, The University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo 153-8505, Japan
First published on 15th March 2022
Variational quantum eigensolver (VQE)-based quantum chemical calculations have been extensively studied as a computational model using noisy intermediate-scale quantum devices. The VQE uses a parametrized quantum circuit defined through an “ansatz” to generate approximated wave functions, and the appropriate choice of an ansatz is the most important step. Because most chemistry problems focus on the energy difference between two electronic states or structures, calculating the total energies in different molecular structures with the same accuracy is essential to correctly understand chemistry and chemical processes. In this context, the development of ansatzes that are capable of describing electronic structures of strongly correlated systems accurately is an important task. Here we applied a conventional unitary coupled cluster (UCC) and a newly developed multireference unitary coupled cluster with partially generalized singles and doubles (MR-UCCpGSD) ansatzes to the quasi-reaction pathway of Be insertion into H2, LiH molecule under covalent bond dissociation, and a rectangular tetra-hydrogen cluster known as a P4 cluster; these are representative systems in which the static electron correlation effect is prominent. Our numerical simulations revealed that the UCCSD ansatz exhibits extremely slow convergence behaviour around the point where an avoided crossing occurs in the Be + H2 → BeH2 reaction pathway, resulting in a large discrepancy of the simulated VQE energy from the full-configuration interaction (full-CI) value. By contrast, the MR-UCCpGSD ansatz can give more reliable results with respect to total energy and the overlap with the full-CI solution, insisting the importance of multiconfigurational treatments in the calculations of strongly correlated systems. The MR-UCCpGSD ansatz allows us to compute the energy with the same accuracy regardless of the strength of multiconfigurational character, which is an essential property to discuss energy differences of various molecular systems.
A quantum–classical hybrid algorithm known as a variational quantum eigensolver (VQE) was proposed in 20147,8 as the alternative computational model in the noisy intermediate-scale quantum (NISQ)9 era. The VQE uses a quantum processing unit (QPU) for the preparation of approximated wave function by utilizing parameterized quantum circuits and evaluation of energy expectation values, and a classical processing unit (CPU) for the variational optimizations. The parametrized quantum circuit is defined by an empirical “ansatz”, and thus the ansatz used in the computation determines the accuracy of the wave function and energy. Intuition-oriented ansatzes of chemists such as unitary coupled cluster (UCC) ansatz,10–12 and adaptive derivative-assembled pseudo-Trotter ansatz (ADAPT),13 qubit coupled cluster,14 and more heuristic ansatzes called “hardware-efficient” ones15 have been well investigated. Development of new ansatzes16–25 is one of the mainstream of the VQE study, and other developments such as qubit reductions by utilizing natural orbitals,26,27 extension of ansatzes for larger systems,28–31 introduction of error mitigation techniques,32–34 spatial and spin symmetry adaptations,35–38 reduction of the number of qubit measurements,39–42 applications for electronically excited states,32,43–46 and proof-of-principle demonstration on quantum devices47–51 have also been documented. Recent reviews in this field can be found elsewhere.52–56
From the viewpoint of chemistry, we emphasize that most problems in chemistry focus on the energy differences between two or larger numbers of electronic states or geometries, rather than the total energies themselves. To understand the chemistry and chemical processes correctly and to make quantum computers useful in the investigations of real-world chemistry problems, calculating various electronic structures and molecular systems with the same accuracy is essential. Most theoretical studies on VQE-based energy calculations focus on the simple potential energy curves by stretching covalent bonds or changing bonding angles, or simple concerted chemical reactions. It is interesting whether VQE is able to correctly describe more complex chemical reactions; for example, the reactant and product have different electronic configurations, and an avoided crossing between the ground and excited states is involved in the reaction pathway. If an avoided crossing is present in the reaction process, the wave function around the crossing point cannot be well approximated by a single Slater determinant like Hartree–Fock (HF) and by a single configuration state function. In such systems both dynamic and static (or non-dynamic) electron correlation effects are prominent, and non-variational single-reference (SR) molecular orbital theories such as the second-order Møller–Plesset (MP2) and a coupled cluster with singles and doubles (CCSD) become less reliable.57–59 Multireference (MR) treatment is more feasible for the study of such strongly correlated systems.
In this work, we mainly focus on a beryllium atom insertion reaction into H2 to generate a BeH2 molecule illustrated in Fig. 1 as the representative example of the chemical reactions in which S0–S1 avoided crossing is involved, to study how the performance of the VQE changes for weakly and strongly correlated regimes and to examine the MR ansatz for strongly correlated systems. This reaction pathway has been precisely investigated as the model system of MR electronic structure treatments.60–67 As clearly seen in Fig. 1, this reaction pathway contains avoided crossing at R(Be⋯H2) ∼ 2.75 Bohr, and it is a good testing ground for the sophisticated quantum chemical calculations using the VQE. This reaction pathway was studied by Metcalf and co-workers by means of the VQE using a double unitary coupled cluster (DUCC) ansatz, which effectively downfold correlation effects into the reduced-size orbital space.25 They reported a DUCC ansatz with 6 orbitals that provides substantial improvement of the bare Hamiltonian in the active space. Evangelista studied this reaction pathway using various SR-CC methods including traditional CCSD, UCCSD, variational CCSD and extended CCSD in the context of classical computing, finding that these SR-CC approaches break down near the MR region.67 It should also be noted that this system is illustrative in teaching physical chemistry including molecular orbital theory and valence bond descriptions. We examined numerical simulations of the VQE along the quasi-reaction pathway in C2v symmetry by using a traditional UCCSD ansatz and a multireference unitary coupled cluster with partially generalized singles and doubles (MR-UCCpGSD) ansatz, focusing on the accuracy of wave functions and energies, and convergence behaviour of the variational optimization in the VQE. We also applied the MR-UCCpGSD ansatz to the LiH molecule under bond dissociation (R(Li–H) = 3.0 and 4.0 Å), and a rectangular tetra-hydrogen atom cluster known as a P4 model,68 which also exhibits strong multiconfigurational characters.
In order to execute the VQE, the wave function is mapped onto qubits and the second quantized Hamiltonian in eqn (1) is transformed into the qubit Hamiltonian in eqn (2) that is described by a linear combination of Pauli strings in eqn (3), by applying fermion–qubit transformations.3,69–73
![]() | (1) |
![]() | (2) |
![]() | (3) |
and ap in eqn (1) are the creation and annihilation operators, respectively, acting on the p-th spin orbital. hpq and hpqrs are one- and two-electron molecular orbital integrals, respectively. Throughout this paper we use indices i, j, and k for occupied; a, b, and c for unoccupied; and p, q, r, and s for general spin orbitals. We used u and v for the indices for general molecular orbitals. Pauli strings Pn are shown in eqn (2) and the number of qubits used for wave function storage N in eqn (3) depends on the fermion–qubit transformation method being adopted. In this study we used a Jordan–Wigner transformation (JWT),3 in which each qubit possesses an occupation number of a particular spin orbital; the qubit is in the |1〉 state if the spin orbital is occupied, otherwise |0〉. N equals the number of spin orbitals included in the active space.
An approximate wave function is generated on the QPU by using a parametrized quantum circuit defined through an ansatz, and thus selection of an appropriate ansatz is the most important process. Ansatzes used in VQE-based quantum chemical calculations can be roughly classified into two categories; chemistry inspired and hardware efficient.56 The most famous chemistry inspired ansatz is the UCC ansatz defined as in eqn (4) and (5).10
![]() | (4) |
![]() | (5) |
In eqn (4), is the reference wave function and the HF wave function |ΨHF〉 is usually used. T defined in eqn (5) is the operator describing electron excitations from the occupied orbitals to the virtual orbitals in the reference wave function. Compared with the traditional CC method, the UCC wave function takes into account electron de-excitations from the unoccupied to occupied orbitals (T†) and electron excitations T. The UCC energy can be computed variationally, although the number of UCC applications in classical computing is very limited.67,74–76 One of the reasons is that the Baker–Campbell–Hausdorff (BCH) expansion of the similarly transformed Hamiltonian
does not terminate and therefore truncation of the expansion is necessary. By contrast, preparation of the UCC wave function on a quantum computer is straightforward, because
is a unitary operator. Since the CCSD(T)77 (single and double excitation operators are considered, and effects of connected triples are taken into account through many body perturbation theory) method is regarded as the “gold standard” in quantum chemistry, the UCCSD can be a practical tool for reliable quantum chemical calculations on quantum computers. However, it is well known that the approximation of CCSD becomes worse when the static electron correlation effect is significant and wave function cannot be well approximated at the HF level. As a result, traditional CCSD cannot describe the potential curve associated with covalent bond dissociation correctly where static correlation is crucial and it sometimes gives the energy substantially lower than the full-CI value.58,59 By contrast, UCCSD is solved variationally and thus an upper bound condition is always satisfied. However, as Evangelista reported, the UCCSD energy largely deviates from the full-CI value around the transition structure of Be + H2 → BeH2 reaction due to slow convergence behaviour of the BCH expansion.67 Following the ab initio molecular orbital theory for classical computers, we expect that MR extension of the UCCSD ansatz is promising for the description of strongly correlated systems.
So far, VQE with the multiconfigurational wave functions based on the UCC with generalized singles and doubles (UCCGSD) and the k-UpCCGSD ansatzes have been proposed.17,23,24 The UCCGSD ansatz takes into account occupied → occupied (e.g., ) and unoccupied → unoccupied (
) excitations, in addition to occupied → unoccupied (
) excitations included in the conventional UCCSD ansatz.17 In the k-UpCCGSD ansatz, fully generalized singles (
) and generalized pair-double excitations (
) are considered, and the cluster operator is applied k times to the reference wave function, with each k having variationally independent amplitudes.17,24 Here, we examine the most straightforward extension of the UCC ansatz to the MR regime by using the MR-UCCpGSD ansatz as the modification of the reported MR ansatzes. In the MR-UCCpGSD the reference wave function |Ψ0〉 is not a single determinant but a multiconfigurational (MC) wave function such as complete active space self-consistent-field (CASSCF) wave functions,78,79 as defined in eqn (6) and (7).
![]() | (6) |
![]() | (7) |
Here, is the Slater determinant included in |ΨMC〉. As the excitation operator T we take into account all possible symmetry-adapted single and double excitation operators from each reference Slater determinant |ΦI〉, as in eqn (8).
![]() | (8) |
Here, the summations run over the spin orbitals those i and j are occupied and a and b are unoccupied in . For example, in this study we used the CASSCF(2e,2o) wave function as the |ΨMC〉 in a BeH2 system, which consists of two Slater determinants: |2220000000〉 and |2202000000〉. Here, 2 and 0 specify the occupation number of molecular orbitals. In this case we consider one- and two-electron excitation operators and their complex conjugates from these two determinants. Therefore, in the MR-UCCpGSD calculations up to four-electron excitation operators from the HF configuration are involved. In our formulation of the MR-UCCpGSD, the occupied → occupied and the unoccupied → unoccupied excitations outside of the active space are not included, because these terms have zero contributions to the reference wave function as the connected terms
. They can have nonzero contributions as the disconnected terms such as
and
, but we assume that such contributions are small. Similar to the UCCGSD ansatz, the cluster operator is applied only once. Thus, the number of variables in MR-UCCpGSD is smaller than that of UCCGSD. In this context we used the term “partially generalized” for the name of ansatz. As easily expected, the same excitation operator can be derived from different reference determinants. In this case we merge the operators and treat them as one excitation operator. In the traditional state-specific MRCC calculations on classical computers, (i) optimization of excitation amplitudes tia and tijab and (ii) diagonalization of an effective operator to optimize |ΨMC〉 are iterated until converge.64–66 In our MR-UCCpGSD ansatz, by contrast, the abovementioned step (ii) is not included. Instead, excitation/de-excitation operators within the active space (HOMO–LUMO two-electron excitation/de-excitation operator in the case of a BeH2 system with CASSCF(2e,2o) reference) are included in the cluster operators to describe relaxation of the CI coefficients in |ΨMC〉.
It should be noted that the same excited function can be generated from different cluster operators in the MR-UCCpGSD ansatz. For example, |2200200000〉 can be generated by the operation and
. In our implementation of the MR-UCCpGSD ansatz, different cluster amplitudes are assigned for these excitations. However, by comparison with the MRCI wave function, only one CI coefficient per excited function is determined from the variational principle, and thus redundancy of the cluster operators is present. The existence of such variational parameter redundancy can retard variational optimization, but to our knowledge, this aspect has not been well discussed in the preceding work.24 It should also be noted that this parameter redundancy problem has a connection with the barren plateau problem80 often discussed in the VQE with hardware-efficient ansatzes. Removal of the linearly dependent variables is important to accelerate the VQE convergence, which is left as a future study.
Once an energy expectation value is computed on the QPU, the CPU executes variational optimization of the parameters. The variables are excitation amplitudes tia and tijab in eqn (5) and (8) for the UCCSD and MR-UCCpGSD ansatzes, respectively. Variational optimizations are often carried out by using gradient-free algorithms such as Nelder–Mead, Powell, and constrained optimization by linear approximation (COBYLA) methods.
As naturally expected, variational optimization in VQE cycles converges faster if the initial estimates of the parameters are closer to the optimal values. In the UCCSD ansatz, an approach to use the MP2 amplitudes is given in eqn (9) as the initial amplitudes were proposed.81
![]() | (9) |
In the MP2 framework one electron excitation amplitudes tia are zero due to Brillouin's theorem.82 However, if we consider the second order wave function in the perturbation theory, we can derive eqn (10),83 which can be used as the initial estimate of tia amplitudes.
![]() | (10) |
Note that in the presence of non-dynamic electron correlation, the reorganization from the HF description should be substantial through sizable contributions from single excitations.84,85 We expect that nonzero tia initial amplitudes can accelerate the convergence of the VQE, especially when static electron correlation is prominent. Using a partial renormalization with size-consistency, the scaled amplitudes in eqn (11) can also be derived86 and tested in this study.
![]() | (11) |
Importantly, only the spatial symmetry-adapted excitation operators have nonzero amplitudes by adopting the initial amplitude estimation methods described here. Excitation operators giving zero initial amplitudes have no contribution to the ground state wave function when they appear in the connected terms. Thus, application of perturbation theory-based initial amplitude estimation is useful not only to find good initial estimates of variables but also to automatically select excitation operators that give a nonzero contribution to the ground state wave function. For the MR-UCCpGSD calculations, we extended the initial amplitude estimation technique described above to the MR regime as follows. Assume that the reference wave function is described by a linear combination of Slater determinants as in eqn (7). The initial amplitudes for MR-UCCpGSD can be computed by a linear combination of the product of the CI expansion coefficient cI and the MP2 amplitudes computed by using each Slater determinant as in eqn (12), for two electron excitation amplitudes, for example.
![]() | (12) |
Calculations of the amplitudes tijab(ΦI) using eqn (9)–(11) require orbital energies. We used the CASSCF canonical orbital energies for the 1 1A1 ground state for the initial amplitude estimations. The UCCSD and MR-UCCpGSD energies using initial (unoptimized) amplitudes are lower than the energy of reference wave functions (see Table S1 in the ESI† for details). It should be emphasized that the ability to use perturbation theory to estimate good initial amplitudes with a low computational cost described here is one of the major advantages of the UCCSD and the MR-UCCpGSD ansatzes.
Because molecules exhibit different molecular properties at different spin multiplicities, spin symmetry-adapted treatment is very important in quantum chemical calculations. Spin symmetry adaptation can be accomplished by considering the spin symmetry-adapted excitation operators for the singlet defined in eqn (13).
![]() | (13) |
In the UCCSD and MR-UCCpGSD ansatzes the spin symmetry adaptation can be done by using the same excitation amplitudes for the spin α → α and β → β excitations, as in eqn (14).
![]() | (14) |
For two-electron excitation operators, we assumed the spin independence of the amplitudes in the Goldstone diagram82 to ensure the wave function being spin symmetry-adapted.
Point | X | Y | Z |
---|---|---|---|
A | 0.0 | ±2.540 | 0.00 |
B | 0.0 | ±2.080 | 1.00 |
C | 0.0 | ±1.620 | 2.00 |
D | 0.0 | ±1.390 | 2.50 |
E | 0.0 | ±1.275 | 2.75 |
F | 0.0 | ±1.160 | 3.00 |
G | 0.0 | ±0.930 | 3.50 |
H | 0.0 | ±0.700 | 4.00 |
I | 0.0 | ±0.700 | 6.00 |
J | 0.0 | ±0.700 | 20.00 |
For the numerical simulations of the VQE-based UCCSD and MR-UCCpGSD calculations, we developed a python program by utilizing OpenFermion87 and Cirq88 libraries. We used a quantum state vector simulator to calculate the energy expectation value, which calculates the expectation value of operator A using the state vector directly, 〈A〉 = 〈Ψ|A|Ψ〉, rather than a statistical sampling of the measurement outcome. This corresponds to the infinite number of repetitive measurements. Needless to say, in the real quantum devices only a finite number of measurements are available, but we are interested in the accuracy and the validity of the ansatz, and hence a quantum state vector simulator is more suitable for this purpose. The quantum circuits for the UCCSD and MR-UCCpGSD ansatzes are constructed by adopting the first order Trotter decomposition with the Trotter slice number n = 1. It is known that the Trotterized UCC ansatz depends on the term ordering of cluster operators.89 In this work we adopted a magnitude ordering, in which excitation operators are ordered by the absolute value of initial amplitudes. Magnitude ordering is known to exhibit a smaller Trotter error than lexicographical ordering.90 We also evaluated the effect of Trotter term ordering by randomly shuffling the terms for UCCSD/STO-3G simulations of the BeH2 system at point E, obtaining that the standard deviation is less than 1 kcal mol−1 for ten simulations (see Table S3 in the ESI†). The HF calculations and computations of one- and two-electron atomic orbital integrals were performed by using the GAMESS-US program package.91 One- and two-electron molecular orbital integrals were prepared by using our own AO → MO integral transformation program. All the numerical simulations were executed on Linux workstations with Intel Xeon Gold 6134 processors.
![]() | ||
Fig. 4 The VQE-UCCSD simulation results of the LiH molecule with R(Li–H) = 3.0 Å. Top: The energy difference from the full-CI value. Bottom: The square overlap with the full-CI wave function. |
Convergence behaviour of the VQE-UCCSD simulations at point E are plotted in Fig. 6, and those of other points is summarized in Fig. S5 in the ESI.† To estimate the number of iterations required to achieve convergence, we attempted to fit the energy difference plot in the range between 1000 and 10000 iterations with an exponential function, obtaining ΔE = 123.68x−0.303 with R2 = 0.9827 (see Fig. S6 in the ESI†). Needless to say, there is no theoretical background for the energy change in the optimizations to be in the form of an exponential function. However, if we assume that the VQE optimization proceeds along the exponential function, we need about 8
000
000 iterations to achieve 1.0 kcal mol−1 of deviation from the full-CI value. It should also be noted that the square overlap between the UCCSD and the full-CI wave functions is at most 0.783 even after 10
000 iterations at point E. These results clearly exemplify the fact that convergence of the UCCSD ansatz is very slow if the square overlap between the reference and full-CI wave functions is small. One of the major reasons for this extreme slow convergence behaviour is that the number of excitation operators with sizable excitation amplitudes are large at point E, due to the poorness of the HF approximation and significant contributions from the T1 terms. The wave function of strongly correlated systems are highly entangled, and thus acquiring optimal parameters becomes a more difficult task. This aspect can be supported by the traditional CCSD, CCSD(T), quadratic configuration interaction with singles and doubles (QCISD),93 and Brueckner doubles (BD)94 calculations (see Section S9 in the ESI†). It should also be noted that in the VQE-DUCC study by Metcalf and co-workers, point E showed a larger deviation from the full-CI energy (27 mHartree) compared with other points (12–17 mHartree).25 This fact also exemplifies the complexity of the electronic structure at point E. Another possible reason is that the initial amplitudes evaluated using eqn (9)–(11) are not so good for strongly correlated electronic structures because of the slow convergence of perturbation expansion.
![]() | ||
Fig. 6 Convergence behaviour of the VQE-UCCSD and MR-UCCpGSD simulations at point E. (a) The energy difference from the full-CI and (b) the square overlap with the full-CI wave function. |
To investigate the convergence behaviour in more detail, we examined the UCCSD simulations at point E by using the STO-3G basis set. The number of variables is reduced from 151 to 44 by employing the STO-3G basis set, and thus VQE optimization converges faster. The UCCSD/STO-3G simulation converged after 3329 iterations, giving ΔEUCCSD–full-CI = 2.866 kcal mol−1 and |〈ΨUCCSD|Ψfull-CI〉|2 = 0.963. From this result the UCCSD seems to give accurate energy if a sufficient number of iterations are possible. We also checked the reference orbital dependence on the convergence behaviour by using the optimized orbital of the CASSCF(2e,2o)/STO-3G as the SR-UCCSD simulations. After 3496 iterations, we obtained that ΔEUCCSD–full-CI = 2.864 kcal mol−1 and |〈ΨUCCSD|Ψfull-CI〉|2 = 0.962, confirming that reference orbital dependence is very small. Nevertheless, it is natural to assume that the number of iterations required for convergence depends on the choice of reference orbitals. In this context, orbital optimized VQE techniques19,20 are promising candidates to accelerate the convergence behaviour. In addition, we performed the k-UpCCGSD/STO-3G simulations with k = 1, 2, and 3 at point E for comparison. The simulations did not converge after 10000 iterations for k = 3. The energies obtained from the unconverged 3-UpCCGSD simulations are ΔE = 10.231 and 6.907 kcal mol−1 for the HF and CASSCF reference orbitals, respectively. The 3-UpCCGSD energies are slightly larger than the UCCSD ones, but we cannot exclude the possibility that 3-UpCCGSD can have a lower energy than UCCSD and MR-UCCpGSD, by improving the initial amplitudes or by taking more iterations (see Section S10 in the ESI† for details).
Point | y | ΔE2c–full-CI/kcal mol−1 | |〈Ψ2c|Ψfull-CI〉|2 |
---|---|---|---|
C | 0.0008 | 28.536 | 0.952 |
D | 0.2991 | 35.591 | 0.933 |
E | 0.7851 | 43.553 | 0.905 |
F | 0.6125 | 48.498 | 0.886 |
G | 0.0373 | 42.417 | 0.903 |
H | 0.0007 | 40.998 | 0.906 |
Noticeably, the two-configurational wave function constructed by using the diradical character y has a larger square overlap with the full-CI wave function compared with the unconverged UCCSD wave function after 10000 iterations at point E due to the extremely slow convergence behaviour of the UCCSD ansatz, although the energy difference between the two-configurational wave function and the full-CI is notably large due to the lack of the dynamic electron correlation effect. One of the anticipated applications of the VQE is the preparation of approximate wave functions for the input of the QPE-based full-CI.96 The QPE utilizes a projective measurement to obtain the full-CI energy, and therefore using the wave function having a large overlap with the full-CI is very important.3 In this context, the conventional SR-UCCSD is less appropriate than the two-configurational wave functions constructed using the diradical characters for the initial wave functions of the QPE,95 unless this slow convergence problem is resolved. We attempted to carry out VQE simulations with the MR-UCCpGSD ansatz for the points D, E, and F, which have large diradical characters. However, using the two-configurational wave functions directly as the reference wave functions in the MR-UCCpGSD simulations is not plausible, because the two-configurational wave functions constructed by utilizing the diradical character y are not spatial symmetry adapted, and therefore the number of excitation operators with nonzero contributions to the electronic ground state will increase. Instead, we performed the CASSCF(2e,2o) calculations and use the CASSCF wave function as the reference in the MR-UCCpGSD calculations. Note that the computational cost for CASSCF increases exponentially against the size of active space, and thus the reference CASSCF calculation itself becomes a bottleneck in the MR-UCCpGSD calculations for large systems. The application of the spatial symmetry recovery technique97 to the natural orbitals of the BS-UHF wave function is a plausible solution.
The MR-UCCpGSD results are also plotted in Fig. 5, and the convergence behaviour of the UCCSD and MR-UCCpGSD calculations at point E are depicted in Fig. 6. The energy expectation value at point E drastically improved by adopting the MR approach. The MR-UCCpGSD calculation did not converge even after 10000 iterations, but the deviation from the full-CI energy is calculated to be 2.143 kcal mol−1 and 〈ΨMR-UCCpGSD|Ψfull-CI〉|2 = 0.950, which is greatly improved by the SR-UCCSD results. By fitting the plot of energy difference with an exponential function, we obtained ΔE = 1289.2x−0.693 with R2 = 0.9932 (see Fig. S6 in the ESI†). From this, we expect that about 30
000 iterations are needed to achieve the energy deviation from the full-CI within 1 kcal mol−1. This value should be compared with ca. 8
000
000 iterations in the conventional SR-UCCSD. However, 30
000 iterations are still too many for practical use. The MR-UCCpGSD simulations took more time (33 days) compared with the SR-UCCSD simulations (18 days), because the MR-UCCpGSD ansatz includes a larger number of variables, and the parametrized quantum circuit is deeper. Note that our numerical simulations were performed without parallelization, and therefore the simulation can be accelerated by high performance computing with parallelization.
The current simulations are based on gradient-free optimization, and gradient-based optimization may accelerate the convergence. We tested the gradient-based optimizations in the UCCSD/STO-3G and MR-UCCpGSD/STO-3G calculations at point E using the BFGS algorithm in conjunction with the gradient estimation based on the 2-point finite difference method. By adopting the BFGS algorithm the optimization converged a smaller number of iterations (35 and 63 iterations for UCCSD and MR-UCCpGSD, respectively) giving ΔE(VQE–full-CI) = 2.579 and 1.647 kcal mol−1 for UCCSD and MR-UCCpGSD, respectively. However, we need a larger number of function evaluations (3566 and 9512 for UCCSD and MR-UCCpGSD) than COBYLA (3329 and 4917) to achieve convergence. We emphasize that implementation of methods that can reduce the number of function evaluations or application of more sophisticated optimization algorithms such as an approach to calculate analytical gradients by adopting a parameter shift rule98 or DIIS-based algorithm99 is essential for the practical use of the VQE for quantum chemical calculations. Another possible approach to improve the convergence behaviour is adopting the sequential optimization approaches. Nevertheless, it should be emphasized that the MR-UCCpGSD ansatz shows faster convergence behaviour compared with UCCSD at strongly correlated systems and it can give a more reliable wave function. Multiconfigurational treatment is quite powerful to study the strongly correlated systems on quantum computers.
The results are summarized in Table 3. In all the geometries being studied, the MR-UCCpGSD ansatz needs more iterations than the UCCSD ansatz, but the deviations of the computed energies from the full-CI values are smaller for MR-UCCpGSD ansatz. Importantly, the UCCSD gives large ΔE values at the geometries with a large diradical character, y, although the MR-UCCpGSD gives ΔE < 0.13 kcal mol−1 and an almost constant error for all the points being studied. These results exemplify that the MR-UCCpGSD ansatz is capable of describing the electronic structures of intermediate and strongly correlated systems with the same accuracy, which is essential to correctly understand chemical phenomena from quantum chemical calculations.
System | Geometry | y(PUHF) | ΔE(VQE–full-CI)/kcal mol−1 | |〈ΨVQE|Ψfull-CI〉|2 | Number of function evaluations | |||
---|---|---|---|---|---|---|---|---|
UCCSD | MR-UCCpGSD | UCCSD | MR-UCCpGSD | UCCSD | MR-UCCpGSD | |||
LiH | R(Li–H) = 3.0 Å | 0.5015 | 0.111 | 0.024 | 0.9987 | 0.9996 | 1013 | 1060 |
LiH | R(Li–H) = 4.0 Å | 0.8334 | 0.156 | 0.022 | 0.9985 | 0.9996 | 680 | 957 |
P4 cluster | α = 1.80 Bohr | 0.3895 | 0.537 | 0.125 | 0.9990 | 0.9999 | 2247 | 4814 |
P4 cluster | α = 1.90 Bohr | 0.6850 | 1.259 | 0.105 | 0.9965 | 0.9999 | 2361 | 3234 |
P4 cluster | α = 1.99 Bohr | 0.9692 | 2.685 | 0.122 | 0.9921 | 0.9998 | 2847 | 3558 |
P4 cluster | α = 2.01 Bohr | 0.9696 | 2.668 | 0.117 | 0.9922 | 0.9999 | 2680 | 3330 |
P4 cluster | α = 2.10 Bohr | 0.7148 | 1.242 | 0.113 | 0.9965 | 0.9999 | 2427 | 3053 |
P4 cluster | α = 2.20 Bohr | 0.4860 | 0.560 | 0.093 | 0.9989 | 0.9999 | 2051 | 3134 |
By contrast, the MR-UCCpGSD ansatz gives an energy much closer to the full-CI one at the geometry nearby the avoided crossing, exemplifying that the MR treatment is more feasible for the accurate descriptions of the electronic structures of strongly correlated systems. The reaction energy barrier computed using the MR-UCCpGSD ansatz is still overestimated, but significant improvement of the transition energy is observed by applying the MR approach. Importantly, in the calculations of the LiH molecule and P4 cluster, the deviation of the UCCSD energies from the full-CI values depends on the diradical character y and hence multiconfigurational character. By contrast, the MR-UCCpGSD ansatz gives similar ΔE(VQE–full-CI) values regardless of the magnitude of open shell electronic configurations. This feature is very important for the application of the VQE to various chemical systems and for correctly understanding the chemical phenomena from the quantum chemical point of view.
The reaction pathway of Be atom insertion into a H2 molecule being investigated is a representative system, in which both dynamic and static electron correlation effects are prominent. However, investigating other strongly correlated systems is nevertheless very important to further disclose the characteristics of the UCCSD and the MR-UCCpGSD ansatzes in more detail. There are many molecular systems whose dynamic and static electron correlation effects play significant roles, such as the electronic ground state of ozone,100 the out-of-plane transition state of the cis–trans isomerization reaction of diazene,101 zigzag edges of graphene nanoribbons,102,103 and so on. Molecules having such complicated electronic structures are the systems in which sophisticated quantum chemical calculations are truly desirable, and thus the importance of MR treatments cannot be overemphasized. Applications of the MR-UCCpGSD ansatz for other strongly correlated systems are ongoing and will be discussed in the forthcoming paper.
Footnotes |
† Electronic supplementary information (ESI) available: the Gaussian basis set used for the BeH2 system; the UCCSD and MR-UCCpGSD energies and wave functions with initial amplitudes; Trotter term ordering dependence; VQE-UCCSD simulations of LiH; initial amplitude dependence on the UCCSD wave functions; RHF, CASSCF, UCCSD, MR-UCCpGSD, and full-CI energies of BeH2; singlet and triplet instabilities; convergence behaviour of the VQE-UCCSD simulations of BeH2; fitting the energy difference plots; CCSD, CCSD(T), BD, and QCISD calculations; and k-UpCCGSD simulations. See DOI: 10.1039/d1cp04318h |
‡ Present address: NTT DATA Corporation, Toyosu Center Bldg. Annex, 3–9, Toyosu 3–chome, Koto–ku, Tokyo 135-8671, Japan. |
This journal is © the Owner Societies 2022 |