 Open Access Article
 Open Access Article
      
        
          
            Huan 
            Ma
          
        
       a, 
      
        
          
            Jie 
            Liu
a, 
      
        
          
            Jie 
            Liu
          
        
       *a, 
      
        
          
            Honghui 
            Shang
          
        
      *c, 
      
        
          
            Yi 
            Fan
          
        
      b, 
      
        
          
            Zhenyu 
            Li
*a, 
      
        
          
            Honghui 
            Shang
          
        
      *c, 
      
        
          
            Yi 
            Fan
          
        
      b, 
      
        
          
            Zhenyu 
            Li
          
        
       ab and 
      
        
          
            Jinlong 
            Yang
ab and 
      
        
          
            Jinlong 
            Yang
          
        
       *ab
*ab
      
aHefei National Laboratory, University of Science and Technology of China, Hefei 230088, China. E-mail: liujie86@ustc.edu.cn; jlyang@ustc.edu.cn
      
bHefei National Research Center for Physical Sciences at the Microscale, University of Science and Technology of China, Hefei 230026, China
      
cState Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China. E-mail: shanghonghui@ict.ac.cn
    
First published on 16th February 2023
Exploring the potential applications of quantum computers in material design and drug discovery is attracting more and more attention after quantum advantage has been demonstrated using Gaussian boson sampling. However, quantum resource requirements in material and (bio)molecular simulations are far beyond the capacity of near-term quantum devices. In this work, multiscale quantum computing is proposed for quantum simulations of complex systems by integrating multiple computational methods at different scales of resolution. In this framework, most computational methods can be implemented in an efficient way on classical computers, leaving the critical portion of the computation to quantum computers. The simulation scale of quantum computing strongly depends on available quantum resources. As a near-term scheme, we integrate adaptive variational quantum eigensolver algorithms, second-order Møller–Plesset perturbation theory and Hartree–Fock theory within the framework of the many-body expansion fragmentation approach. This new algorithm is applied to model systems consisting of hundreds of orbitals with decent accuracy on the classical simulator. This work should encourage further studies on quantum computing for solving practical material and biochemistry problems.
The difference between the available and actually required quantum resources in practical quantum simulations has renewed the interest in divide and conquer (DC) based methods.15–19 Realistic material and (bio)chemistry systems often involve complex environments, such as surfaces and interfaces. To model these systems, the Schrödinger equations are much too complicated to be solvable. It therefore becomes desirable that approximate practical methods of applying quantum mechanics be developed.20 One popular scheme is to divide the complex problem under consideration into as many parts as possible until these become simple enough for an adequate solution, namely the philosophy of DC.21 The DC method is particularly suitable for NISQ devices since the sub-problem for each part can in principle be solved with fewer computational resources.15–18,22–25 One successful application of DC is to estimate the ground-state potential energy surface of a ring containing 10 hydrogen atoms using the density matrix embedding theory (DMET) on a trapped-ion quantum computer, in which a 20-qubit problem is decomposed into ten 2-qubit problems.18
DC often treats all subsystems at the same computational level and estimates physical observables by summing up the corresponding quantities of subsystems, while in practical simulations of complex systems, the particle–particle interactions may exhibit completely different characteristics in and between subsystems. Long-range Coulomb interactions can be well approximated as quasiclassical electrostatic interactions since empirical methods, such as empirical force filed (EFF) approaches,26 are promising to describe these interactions. As the distance between particles decreases, the repulsive exchange interactions from electrons having the same spin become important so that quantum mean-field approaches, such as Hartree–Fock (HF), are necessary to characterize these electronic interactions. Furthermore, van der Waals interactions, such as hydrogen bond and stacked π–π bond interactions and (near-)degenerate electronic interactions should be evaluated using more accurate wave function methods.27,28 Therefore, after decomposing a complex system into many subsystems, treating interactions in and between them with different computational models according to the characteristics of these interactions is an optimal way to approximate the exact results with moderate computational cost. This strategy is often referred to as multiscale modeling that aims at solving problems at multiple scales of space and/or time.29–31
One of the most widely used multiscale models is the combination of classical and quantum mechanical (QM) principles for theoretical simulations of large biological molecules and condensed matter systems. Here, QM models are computationally demanding due to the high computational complexity of QM methods. In particular, when the QM models using the high-level wave function theory (WFT) are employed to describe the central part of a complex system, the size of the QM region to be chosen is extremely limited. Due to the potential computational power of quantum computing, QM models using quantum algorithms are a feasible way to overcome the compromise between accuracy and size. In this work, we refer to this strategy that integrates quantum computing into multiscale models as multiscale quantum computing (see a typical example in Fig. 1). Quantum algorithms are expected to provide an exact solution of the Schrödinger equation in polynomial time,10,11 which enables us to enlarge the size of the QM region without loss of efficiency.
Analogous to classical multiscale modeling, one is able to employ molecular mechanical (MM) approaches, semiempirical approaches or even quantum mean-field approaches, such as HF or density functional theory (DFT), to describe the environment and the interaction between the molecule and environment. The correlation energy should in principle be fully accounted for using quantum computing models when fault-tolerant quantum computers are available. In the NISQ era, the major challenge of multiscale quantum computing is analogous to that of classical multiscale modeling, that is, how to use limited quantum resources to approximate the exact results. In this work, we divide the QM region into fragments and build the QM model using energy-based fragmentation approaches.32 Furthermore, in each fragment, one can decompose the correlation effect into dynamic and static correlations, leaving only the latter one to the quantum computer in order to reduce quantum circuit sizes and depths. Thus, this scheme is able to extend applications of quantum simulations to complex systems.
In Section 2, we present a detailed introduction to a general framework of multiscale quantum computing for complex system simulations, including typical computational models, DC strategies and correlation energy decomposition schemes. In Section 3, we propose a practical scheme for implementing multiscale quantum computing in quantum chemistry. We focus on modeling the QM part with high accuracy on NISQ devices since its combination with low-level computational methods for modeling other surrounding parts has been extensively studied.29–31 To adapt our algorithm to contemporary quantum resources, we first employ the many-body expansion (MBE) fragmentation approach to partition the QM part of a system into small fragments. The accuracy of the MBE approach can be systematically improved toward the accurate results by including high-order many-body corrections. In each fragment, the quantum algorithm is employed to exactly solve complete active space (CAS) problems to describe the static correlation energy. The perturbation theory is applied to recover the dynamic correlation energy that is also essential in accurately characterizing energy profiles. In comparison with the DMET33 that was recently introduced in quantum computing, applying perturbation theories, such as second-order Møller–Plesset perturbation theory (MP2), in the MBE approach is straightforward. As pilot applications, our methods are used to model systems consisting of hundreds of orbitals with decent accuracy. This demonstrates that our method would be particularly applicable to near-term quantum devices.
A popular multiscale scheme to deal with complex systems at the electronic scale is to combine QM models with MM models.30 The total energy of a QM/MM system can be written as
| Etotal = EQM + EMM + EQM–MM | (1) | 
When commercial quantum computers become a reality, it is expected that quantum resources will be sufficient to implement the QM model straightforwardly within the framework of quantum computing. In the case of NISQ devices, it is necessary to further decompose the QM problem into sub-problems that are treated at different levels, leaving the most time-consuming sub-problem to quantum computing. Fig. 1 shows a schematic structure of multiscale quantum computing using a protein–ligand system as an example. This complex system is first divided into a molecule and a molecular environment as is commonly performed in the QM/MM scheme. The environment can be represented by a variety of low-scaling computational models, ranging from EFFs to semiempirical wave function approaches, or even to DFT. The central region should include the ligand molecule and surrounding segments of the protein, leading to a QM system containing tens or even hundreds of atoms. We further divide this QM system into many fragments, with intra-fragment interactions described by the WFT and high-order inter-fragment interactions described by the HF theory. Finally, the orbital space is divided into an active space and a frozen space. In the active space, the CAS configuration interaction (CI) problem is diagonalized on a quantum computer. After this three-step DC procedure, the CASCI problem is usually simple enough to implement on NISQ devices with low-depth quantum circuits.
| ĤQMtotal = ĤQM + ![[V with combining circumflex]](https://www.rsc.org/images/entities/i_char_0056_0302.gif) es + ![[V with combining circumflex]](https://www.rsc.org/images/entities/i_char_0056_0302.gif) pol | (2) | 
![[V with combining circumflex]](https://www.rsc.org/images/entities/i_char_0056_0302.gif) es and
es and ![[V with combining circumflex]](https://www.rsc.org/images/entities/i_char_0056_0302.gif) pol are electrostatic and polarization potentials, respectively.
pol are electrostatic and polarization potentials, respectively.
        Classical potential energy function-based approaches ignore the exchange interactions. As such, they are not suitable for describing the situation where the electron density distribution changes greatly because the exchange interactions become important in this case. Alternatively, it is possible to construct a more accurate embedding potential from QM methods. Frozen-density embedding theory (FDET) constructs an exact embedding potential through a constrained optimization of the total electron density.36 Given that the FDET embedding potential has the form of a universal functional of charge densities, the formalism is in principle rigorous if the functional is exact. In practical simulations, charge densities can be determined from a non-self-consistent-field optimization or even from individual calculations of fragments to save the computational cost. The embedding potential can also be constructed from wave function-based approaches. The X-Pol method is such a theory in which the embedding potential is formulated based on the wave function of other surrounding fragments.37
When accuracy in the laboratory, often referring to chemical accuracy (within 1 kcal mol−1), is required, WFT approaches are an appealing option since their accuracy can be systematically improved toward the full configuration interaction (FCI). While, due to the exponentially increasing computational cost, the largest exact diagonalization calculations were only carried out for a CAS configuration interaction (CASCI) problem with 24 electrons in 24 spatial orbitals. For nondegenerate ground-state problems, the CCSD(T) method in the complete basis set limit is considered as the gold standard for predicting relative energies of different molecular and material conformations.39,40 When systems consist of transition metal atoms, multi-reference approaches, such as CASSCF and MRCI, are necessary for describing the strong correlation effect. Both single-reference and multi-reference approaches have an unaffordable computational complexity for accurate simulations of complex systems.
Quantum algorithms are anticipated to provide an exponential speedup for exactly solving quantum chemistry problems. Quantum phase estimation (QPE) is considered a quantum algorithm that can realise quantum advantage in chemical and material simulations.41 It is worth discussing whether the QPE algorithm has real exponential acceleration when solving general quantum chemistry problems. The possibility that QPE can only achieve a polynomial acceleration still presents significant progress in quantum chemistry.42 In the current stage, the variational quantum eigensolver (VQE) is an alternative choice for exploring the potential applications of quantum computing in quantum chemistry.5
To find an exact solution to the Schrödinger equation it requires the implementation of state-of-the-art wave function ansatzes in the VQE. These ansatzes often result in deep quantum circuits that go beyond the capacity of current quantum devices even for few qubit VQE calculations due to coherence time. For example, the unitary coupled cluster (UCC) with single and double excitations (UCCSD) is one of the most widely used wave function ansatzes in the VQE. Its circuit depth scales as quartic as the system size. In addition, the UCCSD ansatz is unable to describe the strong correlation effect, such as the triple bond dissociation of molecular nitrogen.43 Many recent studies have contributed to the development of adaptive VQE algorithms to reduce circuit depth.44–48 Adaptive derivative-assembled pseudo-trotter (ADAPT) ansatz44 builds a very compact representation of electronic wave functions while maintaining high accuracy. The qubit coupled cluster (QCC) represents the electronic wave function in the qubit representation. Furthermore, in combination with the effective Hamiltonian theory, the QCC maintains a constant circuit depth for electronic structure simulations.49
(1) The size of subsystems that determines the number of qubits used in quantum computing can adapt to available quantum resources. And then massively parallel quantum simulations of subsystems can be performed on NISQ devices. Finally, properties of subsystems are combined to obtain the corresponding quantities of the full system. This maximizes the utilization efficiency of small-scale quantum computational resources in practical applications.
(2) The circuit depth in each subsystem simulation is significantly reduced in comparison with that in the simulation of the full system. For example, the circuit depth of UCCSD scales as  with N being the system size. If the full system is divided into Ns subsystems, the circuit depth for each subsystem is in principle reduced by Ns4. Low-depth circuits can be effectively combined with error mitigation techniques to reduce the total error rate.
 with N being the system size. If the full system is divided into Ns subsystems, the circuit depth for each subsystem is in principle reduced by Ns4. Low-depth circuits can be effectively combined with error mitigation techniques to reduce the total error rate.
In the age of quantum computing without error correction, it is appealing to explore these two aspects by combining the VQE with the DC philosophy to enable large-scale simulations on NISQ devices.
Two popular DC schemes for quantum chemistry simulations are fragment-based approaches32 and quantum embedding theories.33 Fragment-based approaches usually partition a system according to chemical information, such as chemical functional groups and chemical bonds. A variety of fragment-based approaches, such as the fragment molecular-orbital method,50 the molecular fragmentation with conjugated caps method,51,52 and the (generalized) many-body expansion,53–55 have been proposed for studying complex systems. The accuracy of fragment-based approaches depends on molecular fragmentation schemes and posteriori corrections to high-order many-body interactions.32 In contrast, quantum embedding theories provide a more flexible way to partition large systems. The interaction between the subsystem and the environment is described by including bath orbitals in subsystem calculations. However, applying posteriori corrections to the DMET is not a trivial task since it involves a self-consistent iteration procedure. A pilot implementation of the DMET within the framework of the VQE has been recently realized for a ring of 10 hydrogen atoms on a trapped-ion quantum computer.18 A numerical simulation of C18 with the cc-pVDZ basis (up to 144 qubits) has been performed with the DMET-VQE method.19 Although these preliminary studies have demonstrated the advantage of the DC strategy in quantum computing, the application of the DC approaches within the VQE framework to practical quantum chemistry problems is still an open question.
Assuming that the static and dynamic correlation can be respectively treated variationally and perturbatively, an inverse scheme, namely ∼first dynamic then static∼, should also work for recovering the total correlation energy. One such framework is the effective Hamiltonian, which incorporates the dynamics correlation by a similarity transformation of the system Hamiltonian. In the context of quantum computing, this transformation often adopts a unitary form in order to construct a Hermitian form of the effective Hamiltonian. And then the diagonalization of the effective Hamiltonian is simplified to a CAS problem within a small active space. In the double UCC method, electronic excitations are decoupled into two disjoint sets so that the correlation effect for a subset of excitations can be downfolded into an effective Hamiltonian.59 In addition, the effective Hamiltonian is also able to account for the basis set convergence problem by incorporating explicit inter-electron coordinates, as is performed in the F12 methods.60,61
Before introducing theoretical methods, we first specify the notations to be used in the following. In the MBE method, different fragments are labeled with the Greek alphabets α, β, γ…. For HF molecular orbitals (MOs) in each fragment, the occupied orbitals and the virtual orbitals are labeled with i, j, k… and a, b, c…, respectively and p, q, r, s… label general MOs. Given a set of HF orbitals ϕp, the second-quantized Hamiltonian can be written as:
|  | (3) | 
|  | (4) | 
|  | (5) | 
 are mapped to qubit operators using the Jordan–Wigner or Bravyi–Kitaev transformation. The energy can be obtained by measuring and then summing up the expectation values of Pauli strings {
 are mapped to qubit operators using the Jordan–Wigner or Bravyi–Kitaev transformation. The energy can be obtained by measuring and then summing up the expectation values of Pauli strings {![[P with combining circumflex]](https://www.rsc.org/images/entities/i_char_0050_0302.gif) } ∈ {I, σx, σy, σz}⊗n as
} ∈ {I, σx, σy, σz}⊗n as|  | (6) | 
Hereafter, we will drop the label “QM” for simplicity since the methods introduced in the following focus on QM problems.
|  | (7) | 
|  | (8) | 
|  | (9) | 
If one includes third-order corrections, the total energy is
|  | (10) | 
Note that the total energy is exact if no truncation is applied to eqn (7). When the MBE is truncated at second or third order, additional corrections to the long-range interactions are necessary to recover the exact results. The most popular scheme for MBE corrections is the electrostatic embedding approach,53 in which the energies are revised by the electrostatic interactions between the n-mer and the rest of the fragments,
| En-mer = 〈Ψ|Ĥn-mer + ![[V with combining circumflex]](https://www.rsc.org/images/entities/i_char_0056_0302.gif) es|Ψ〉 | (11) | 
To further improve the accuracy of the MBE, it is also possible to use low-scaling computational methods, such as semiempirical approaches and HF, to describe the high-order many-body interactions. A simple way to implement this strategy is the ONIOM method71 with the total energy expressed as
| Etotal = EMBEhigh + Elow − EMBElow | (12) | 
One intrinsic drawback of fragment-based methods is the need to break chemical bonds resulting from molecular fragmentation. The hydrogen atom is usually chosen to saturate the severed bonds as shown in Fig. 2. It is important to guarantee that the net number of capped hydrogen atoms is zero for a valid fragmentation method. This condition is automatically satisfied in MBE methods.
|  | ||
| Fig. 2 Fragmentation strategy for the MBE2 method. (a) Pentenol molecule; (b) monomers and (c) dimers. Hydrogen atoms highlighted in yellow are link atoms added to saturate the severed bonds. | ||
| |Ψ(k)ADAPT〉 = (eθk ![[small tau, Greek, circumflex]](https://www.rsc.org/images/entities/i_char_e0b8.gif) k)⋯(eθ1 ![[small tau, Greek, circumflex]](https://www.rsc.org/images/entities/i_char_e0b8.gif) 1)|Ψ0〉 | (13) | 
 :
:|  | (14) | 
|  | (15) | 
At the kth iteration, the energy is minimized through a conventional VQE procedure:
|  | (16) | 
The residual gradient of the ith operator ![[small tau, Greek, circumflex]](https://www.rsc.org/images/entities/i_char_e0b8.gif) i in
i in  must be evaluated at each iteration to determine the sequence of the pseudo-Trotter ansatz:
 must be evaluated at each iteration to determine the sequence of the pseudo-Trotter ansatz:
|  | (17) | 
The operator ![[small tau, Greek, circumflex]](https://www.rsc.org/images/entities/i_char_e0b8.gif) (k + 1) with the largest pre-estimated gradient R(k) is selected from
(k + 1) with the largest pre-estimated gradient R(k) is selected from  and added at the next iteration to update the wave function ansatz
 and added at the next iteration to update the wave function ansatz
|  | (18) | 
The convergence criteria are defined as the norm of pre-estimated gradients less than a specified threshold ε, that is,
|  | (19) | 
The fermionic creation and annihilation operators are mapped onto qubit operators {σx, σy, σz} using transformations, such as the Jordan–Wigner73,74 or Bravyi–Kitaev.75–77 The operator pool in adaptive VQE algorithms can be constructed based on a variety of operator forms. In addition to fermionic operators, another popular form is the qubit excitation operator. As introduced in the qubit-ADAPT-VQE ansatz,78 Pauli string operators with a maximal length of 4 are used to reduce the number of C-NOT gates in the quantum circuit. One scheme to construct qubit operators is to choose individual Pauli strings after the fermionic operators in the unitary coupled cluster ansatz are mapped onto qubit operators. The qubit operator generally has the form:
|  | (20) | 
In contrast to the qubit-ADAPT-VQE, the qubit excitation based (QEB) VQE employs a qubit operator form that explicitly relates to creation and annihilation operators of qubits.79 In qubit-ADAPT-VQE and QEB-VQE, Pauli Z strings are discarded in order to construct shallow circuits. As a consequence, more circuit parameters may be required to restore the antisymmetry of the wave function.
![[F with combining circumflex]](https://www.rsc.org/images/entities/i_char_0046_0302.gif) as the unperturbed Hamiltonian and the fluctuation potential
 as the unperturbed Hamiltonian and the fluctuation potential ![[V with combining circumflex]](https://www.rsc.org/images/entities/i_char_0056_0302.gif) as the perturbation. The overall MP2 correlation energy is defined as:
 as the perturbation. The overall MP2 correlation energy is defined as:|  | (21) | 
|  | (22) | 
In the framework of a multiscale quantum algorithm, the correlation energy defined in the active space is extracted using quantum algorithms. And we use MP2 to compute the remaining part of the correlation energy as an a posteriori correction to quantum algorithms, which is defined as:
|  | (23) | 
 indicates the active space. If we consider
 indicates the active space. If we consider  as the central part, eqn (23) can be regarded as a MP2 correction in the ONIOM form. As such, we can replace MP2 with arbitrary correlated methods, such as coupled cluster with single and double excitations (CCSD) and second-order algebraic-diagrammatic construction, to account for the dynamic correlation.
 as the central part, eqn (23) can be regarded as a MP2 correction in the ONIOM form. As such, we can replace MP2 with arbitrary correlated methods, such as coupled cluster with single and double excitations (CCSD) and second-order algebraic-diagrammatic construction, to account for the dynamic correlation.
      
      
        
        • Dividing the full system into fragments with appropriate sizes.
• Determining the n-mers required to compute high-order many-body corrections.
• Adding link atoms at appropriate positions if necessary.
• Performing HF calculation for every n-mer structure.
• Deciding the active spaces for the VQE algorithm and performing VQE calculations.
• Adding the MP2 correction corresponding to frozen orbitals.
• Calculating the total energy following the MBE energy expression.
In this work, the C–C bonds are severed when fragments are defined for C18. A hydrogen atom is placed at a position rcap between two carbon atoms at r1 and r2. The direction of the C–H bond is the same as that of the original C–C bond, and its bond length is set to rC–H = 1.061. The coordinate of rcap is
|  | (24) | 
One of the most important merits of the MBE-VQE method is the completely independent calculations for subsystems that can be performed with massive parallelization. And this parallelization does not require extensive modification to existing VQE algorithms. Note that it is necessary to avoid severing multiple bonds in the MBE-VQE method because this will result in relatively large errors.
Fig. 3 shows PESs and their error curves with respect to the FCI results for H10. The errors are defined as deviations from the total energy, namely EMBE-VQE − EFCI (or EDMET-VQE − EFCI). The MBE2-VQE method and the DMET-VQE method use the same number of qubits, but the MBE2-VQE method has a lower average deviation (2.42 millihartree for MBE2-VQE and 9.00 millihartree for DMET-VQE). As shown in Fig. 3, it is clear that, as the bond length increases, the total energy deviations decrease in all three methods. This is consistent with the intuition that DC methods are more accurate when the interactions between fragments become smaller. When the hydrogen atoms are well separated, namely when the H–H bond length is larger than 2.5 Å, the errors in both MBE2-VQE and DMET-VQE are less than 1 millihartree. In comparison with MBE2-VQE, the overall errors of MBE3-VQE are much smaller. The MBE3-VQE has an average deviation of 0.45 millihartree and all MBE3-VQE results have reached chemical accuracy.
Another symmetric geometry, a doubly Hückel aromatic cumulenic D18h structure of only C![[double bond, length as m-dash]](https://www.rsc.org/images/entities/char_e001.gif) C double bonds, has also been considered.91 Therefore, cyclo[18]carbon is a good candidate to assess the MBE-VQE method. In the following calculations, we can demonstrate that the MBE-VQE method can be applied to covalent systems and gives a quite reliable description of systems that have strong intrafragment interactions but weak interfragment interactions. In addition, it is well known that the MBE method is not an appropriate method for describing delocalized correlated systems. The MBE-VQE calculations of the cyclo[18]carbon system also show the same conclusion.
C double bonds, has also been considered.91 Therefore, cyclo[18]carbon is a good candidate to assess the MBE-VQE method. In the following calculations, we can demonstrate that the MBE-VQE method can be applied to covalent systems and gives a quite reliable description of systems that have strong intrafragment interactions but weak interfragment interactions. In addition, it is well known that the MBE method is not an appropriate method for describing delocalized correlated systems. The MBE-VQE calculations of the cyclo[18]carbon system also show the same conclusion.
We apply the MBE-VQE method to study relative energies of the cyclo[18]carbon ring. A series of cyclo[18]carbon structures are generated by fixing the diameter of the ring to 7.31 Å while changing θ and BLA. Fig. 4 shows the relative energies (defined as the energy difference with the lowest point) as a function of the θ or BLA. The cc-pVDZ basis-set is utilized in all of the calculations. Considering the existence of triple bonds in cyclo[n]carbon molecules, an appropriate way to set fragments is to take two carbon atoms as one fragment. Such a fragment breaks the original single bond between fragments, so hydrogen link atoms should be added. In this situation, a monomer in the MBE calculation is an ethyne molecule, two carbon atoms and two hydrogen atoms form a total of 38 MOs. A dimer is either a butadiyne molecule or two ethyne molecules, with 66 or 76 MOs. For monomers, the five orbitals (two occupied and three virtual) closest to the Fermi level are chosen to be the active space for VQE calculation. For dimers, nine orbitals (three occupied and six virtual) form the active space. The MP2 correction to the VQE is used for describing the correlation effect resulting from orbitals not included in the active space. In the DMET-VQE simulations of cyclo[18]carbon, a single carbon atom is considered as one fragment and the other carbon atoms are considered as the environment. In each fragment, four valence orbitals (2s orbital and three 2p orbitals) are treated as impurity orbitals. The rest orbitals in that fragment are treated at the Hartree–Fock level. Four bath orbitals are constructed with orbitals from environmental carbon atoms.
|  | ||
| Fig. 4 Relative energies as a function of θ and bond length alternation (BLA) computed with MBE-VQE (without HF correction, abbreviated as “-woHF”) and DMET-VQE algorithms. | ||
Fig. 4 shows that MBE-VQE agrees very well with CCSD when θ is small and BLA is large. As BLA approaches 0, the deviations resulting from MBE keep getting larger. This is because when BLA gets smaller, the bond structure exhibits a transition from an alternating single bond and triple bond to uniform double bonds. In this case, breaking the double bonds inevitably introduces large numerical errors in MBE and DMET methods. In addition, it is also improper to add a hydrogen atom as the link atom, because the bonds we serve cannot be treated as a single bond anymore. Instead of the link atom approach, ghost orbitals introduced in the FMO method92 may be a more appropriate choice to saturate the carbon atoms. However, the introduction of the FMO method cannot fix the problem of severing delocalized orbitals in the case of a small BLA. In addition, cyclo[18]carbon is a very special case in which we cannot avoid severing double bonds since it has a ring structure. In most cases, we can always choose an appropriate fragmentation scheme to avoid breaking important chemical bonds, especially when we have enough computational resources. It is also worth mentioning that CCSD may not be accurate enough to study cyclo[18]carbon when all C–C bond lengths are almost equivalent. In such a conformation, there are many degenerate orbitals so that the correlation effect becomes very strong. Table 3 shows the energy deviation of the MBE-VQE method and DMET-VQE method with respect to the CCSD method. The values of BLA at the most stable conformation are estimated to be 0.11 and 0.10 Å with the MBE-VQE and DMET-VQE methods, respectively. When the value of BLA is larger or close to that of the equilibrium geometry, the MBE-VQE method yields an excellent agreement with the CCSD method. While, as the value of BLA decreases, the MBE-VQE method deteriorates very quickly since the fragmentation method is not suitable for partitioning these strong correlation problems.
| BLA | MBE-VQE | DMET-VQE | 
|---|---|---|
| 0.201 | −0.08 | 21.73 | 
| 0.188 | −0.40 | 19.35 | 
| 0.176 | −0.50 | 15.08 | 
| 0.163 | −0.64 | 12.05 | 
| 0.151 | −0.39 | 8.90 | 
| 0.138 | −0.84 | 5.87 | 
| 0.126 | −0.37 | 3.11 | 
| 0.113 | 0.00 | 0.65 | 
| 0.101 | 0.97 | −0.44 | 
| 0.088 | 2.26 | −2.22 | 
| 0.075 | 4.20 | −3.32 | 
| 0.063 | 6.79 | −3.88 | 
| 0.050 | 10.55 | −5.15 | 
| 0.038 | 15.66 | −5.38 | 
| BLAmin | 0.113 | 0.101 | 
Fig. 5 shows eight of the most stable water hexamer structures determined in ref. 94. An annealing strategy is used to find a series of stable conformations at the MP2/6-311++g** level. The most stable structure is estimated to be the prism one. The total energies of these structures are calculated with the MBE-VQE method. The DMET-VQE method is also used as a comparison. In water hexamers, each water molecule is specified as a fragment. The cc-pVDZ basis set is employed. In the MBE-VQE method, a monomer contains one water molecule with 24 molecular orbitals and a dimer has two water molecules with 48 molecular orbitals. To achieve the best accuracy with tolerable computing resources, we choose a small active space for VQE calculations. For monomers, two occupied MOs with the lowest energy are treated as frozen orbitals, and the following four MOs are chosen as the active space to construct the effective Hamiltonian; for dimers, four MOs with the lowest energy are frozen and the following eight MOs form the active space. So in the MBE-VQE calculations, active spaces containing up to 16 qubits are used. Due to the small active space used in the VQE, the neglect of other orbitals is corrected by including a MP2 correction using eqn (23).
|  | ||
| Fig. 5 A series of typical structures of water hexamers, (a) boat, (b) book, (c) cage, (d) chair, (e) cyclic (f) prism, (g) prism-book (p-b for short) and (h) twisted-boat (t-boat for short). | ||
Fig. 6 shows the relative energies of eight different structures of water hexamers. Both the MBE-VQE and DMET-VQE methods give a correct prediction of the most stable water hexamer, namely the prism one. As such, relative energies are defined with respect to the energy of the prism water hexamer. For DMET-VQE, the relative energy of the cyclic structure is significantly underestimated with respect to the CCSD results, in which the energy of the cyclic structure is much higher than those of the book, cage and prism-book structures. In addition, all relative energies of other seven hexamers predicted by DMET-VQE with respect to the prism hexamer are systematically higher than the corresponding ones predicted by CCSD. As shown in Table 4, the average error of the DMET-VQE method is as large as 5.91 millihartree. As discussed in ref. 84, the accuracy of the DMET method can be improved by increasing the size of fragments, but this also results in higher computational cost. An alternative scheme to improve the DMET results is to perform posterior corrections after the DMET calculations. The DMET energy is automatically adjusted by iteratively relaxing one-particle orbitals in fragments, accompanied by a change in the global particle number and chemical potential. Therefore, formulating posterior corrections beyond the active space using the perturbation theory is very involved, requiring a comprehensive consideration of the choice of bath orbitals, the construction of one-particle orbitals and the fragment Hamiltonian. In the MBE-VQE method, we describe the correlation energy missing in the active space calculations by adding MP2 corrections. The MBE-VQE results shown in Table 4 reveal that such corrections are able to accurately restore the relative energies of eight water hexamers, with an average error of 0.58 millihartree, with respect to those from CCSD.
|  | ||
| Fig. 6 Relative energies (in hartree) of different water hexamer structures computed by the MBE-VQE algorithm and DMET-VQE algorithm. The energy of the prism hexamer is used as a reference value. | ||
| Configure | DMET-VQE | MBE2-VQE | 
|---|---|---|
| Boat | 8.90 | 0.76 | 
| Book | 6.68 | 1.35 | 
| Cage | 6.54 | 0.00 | 
| Chair | 7.61 | 0.40 | 
| Cyclic | 2.26 | 0.36 | 
| Prism | 0.00 | 0.00 | 
| Prism-book | 6.67 | 0.70 | 
| Twisted-boat | 8.61 | 1.08 | 
| AE | 5.91 | 0.58 | 
In energy-based fragmentation approaches, the total energy is assembled from the energies of subsystems. If the electronic structure of a subsystem is significantly different from the corresponding one in the whole system, it may result in large errors in the total energy. Therefore, we should avoid breaking bonds that leads to a remarkable change in electronic environments, such as delocalized chemical bonds and bonds involved in strongly correlated interactions, when defining fragments in the MBE-VQE algorithm. As a result, the MBE-VQE algorithm may make it difficult to describe transition metal coordination compounds or organometallic compounds because breaking metal–ligand covalent bonds makes it unlikely to maintain an accurate estimate of the energy. However, apart from this limitation, namely including these bonds in one fragment, the MBE-VQE algorithm is still expected to work for these systems. In addition, the MBE-VQE algorithm may make it difficult to describe nonlocal electronic excitations and magnetic systems, such as frustrated spin systems.
It is worth mentioning that the numerical results presented in this work are evaluated using a quantum simulator. The implementation of quantum algorithms on noisy quantum devices is quite challenging due to error accumulation. For the MBE-VQE method, the total energy is obtained by collecting many energy terms of monomers, dimers, trimers, … Therefore, in the worst case, the total error will be NtermsδE. Here, Nterms is the number of the energy terms and δE is the largest error in the energy calculations, while, in the best case, if the errors due to noise are systematically positive (or negative), a large part of the errors will cancel each other. On the other hand, when a quantum simulation of the full system is conducted on noisy quantum devices, the quantum circuit required to achieve a high accuracy will be much more complicated than those for subsystems. As the size and depth of quantum circuits increase, error accumulation due to noise will become more serious. Therefore, further studies are necessary to assess the performance of the MBE-VQE method on noisy quantum devices. In addition, in the long term, the error from the MBE-VQE method may become important if the error from noise in a small-scale quantum simulation can be remarkably suppressed.
It is well known that MBE methods suffer from the basis-set superposition error (BSSE), which originates from the fact that molecular properties calculated using the basis set of a single molecular fragment differ from those calculated using the basis set of the entire molecule. The impact of BSSE becomes more significant when diffuse basis functions are used.99,100 This is because diffuse functions centered on one fragment are more likely to have a remarkable overlap with the basis functions centered on its neighbouring fragments. Previous studies suggested several methods to avoid or minimize the BSSE effect in the MBE method.99,100 One simple strategy is to use a large basis set without diffuse functions. Another effective method is the counterpoise (CP) method,101,102 in which the energies of subsystems are calculated with the entire molecule's basis functions. In order to improve the results of the MBE-VQE calculations, it is necessary to include the BSSE effect in future studies.
The MBE-VQE algorithm has been applied to study the cyclo[18]carbon molecule and water hexamers. It gives a good description of the relative energies of these two systems with respect to the CCSD method. For the cyclo[18]carbon molecule, the equivalent bond length predicted by the MBE-VQE algorithm agrees well with the experimental results. For water hexamers, the MBE-VQE algorithm gives a good estimation of the relative stability of different configurations. The methods are expected to find promising applications in fields such as biochemistry, catalysis or other complicated chemical reactions. As discussed in Section 2, the combination of computational methods at different levels of resolution is diverse in the sense that exploring multiscale quantum computing is still an open problem in the NISQ era.
Most chemical reactions happen in complex environments, such as solution or proteins. Therefore, in order to apply quantum computers to design new catalysts or develop new drugs, we must consider the environmental effect in order to obtain reliable results. Classical multiscale simulation methods have been developed for many decades. In such a framework, the active region is often treated with quantum mechanical methods. While, limited by the computing power of classical computers, mean-field methods, such as density functional theory, or coupled cluster methods are often used to describe the electronic structure of active regions. The multiscale quantum computing framework proposed in this work can utilize quantum algorithms that can provide quantum advantage when fault-tolerant quantum computers become a reality such that it will be superior to classical multiscale simulation methods in the future. As such, the multiscale quantum computing framework provides a viable strategy for applying quantum computers to complicated material and biological molecular simulations.
 can be decomposed into two subsystems
 can be decomposed into two subsystems  and
 and  , namely
, namely  . An arbitrary quantum state |Ψ〉 of
. An arbitrary quantum state |Ψ〉 of  can be in general expressed in the Hilbert space of |ΨAI〉 ⊗ |ΨBJ〉,
 can be in general expressed in the Hilbert space of |ΨAI〉 ⊗ |ΨBJ〉,|  | (25) | 

 of dimension dA (dB). In quantum chemistry simulations, molecular or material systems are often composed of multiple subsystems that are small enough to enable high-level quantum mechanical calculations.
 of dimension dA (dB). In quantum chemistry simulations, molecular or material systems are often composed of multiple subsystems that are small enough to enable high-level quantum mechanical calculations.  can be considered one of the local subsystems and
 can be considered one of the local subsystems and  is considered the environment that is composed of the rest of the subsystems. The central idea of the DMET is to find the Schmidt decomposition of the wave function |Ψ〉. By the singular value decomposition of the coefficients C,
 is considered the environment that is composed of the rest of the subsystems. The central idea of the DMET is to find the Schmidt decomposition of the wave function |Ψ〉. By the singular value decomposition of the coefficients C,|  | (26) | 
 and its DMET bath,
 and its DMET bath,|  | (27) | 
|  | (28) | 
A simple construction of the embedding Hamiltonian in the basis of Schmidt states is
|  | (29) | 
|  | (30) | 
|  | (31) | 
If the wave function of the full system is known a priori, the bath orbitals and the embedding Hamiltonian are well defined. However, due to the high computational complexity of obtaining an exact wave function of the full system, an approximate wave function from the low-level electronic structure calculation, such as HF, is used to construct the bath orbitals. These bath orbitals can be self-consistently improved by solving the embedded problems with a high-level quantum mechanical method. Given the wave function approximation introduced in the DMET, the embedding density matrix for the subsystem SA may differ from the exact one. To ensure that the total number of electrons in the subsystems adds up to NAe, a global potential μ is applied to fix the embedded Hamiltonian:
|  | (32) | 
Furthermore a cost function is needed to evaluate the general potentials used. As was mentioned before, the mean-field ground state will cause differences in the density matrix, so the variance of the density matrix is a good cost function to use:
|  | (33) | 
In ref. 84, there are some of the cost functions summarized to optimize the low-level Hamiltonian and correlation potential, including fragment plus bath fitting, fragment only fitting, and single-shot embedding. In this work, the simplest cost function
|  | (34) | 
|  | (35) | 
| This journal is © The Royal Society of Chemistry 2023 |