Open Access Article
Nikolay V.
Tkachenko
*abcd,
Hang
Ren
a,
Wendy M.
Billings
a,
Rebecca
Tomann
a,
K. Birgitta
Whaley
*ae and
Martin
Head-Gordon
*ade
aDepartment of Chemistry, University of California, Berkeley, CA 94720, USA. E-mail: nikolay.tkachenko@ou.edu; whaley@berkeley.edu; m_headgordon@berkeley.edu
bMaterials Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
cDepartment of Chemistry and Biochemistry, University of Oklahoma, Norman, Oklahoma 73019, USA
dInstitute for Decarbonization Materials, University of California, Berkeley, California 94720, USA
eChemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
First published on 5th November 2025
Near-term quantum devices require wavefunction ansätze that are expressive while also of shallow circuit depth in order to both accurately and efficiently simulate molecular electronic structure. While the unitary coupled cluster ansatz (e.g., UCCSD) has become a standard, the high gate count associated with the implementation of this limits its feasibility on noisy intermediate-scale quantum (NISQ) hardware. k-Fold unitary cluster Jastrow (uCJ) ansätze mitigate this challenge by providing O(kN2) circuit scaling and favorable linear depth circuit implementation. Previous work has focused on the real orbitalrotation (Re-uCJ) variant of uCJ, which allows an exact (Trotter-free) implementation. Here we extend and generalize the k-fold uCJ framework by introducing two new variants, Im-uCJ and g-uCJ, which incorporate imaginary and fully complex orbital rotation operators, respectively. Similar to Re-uCJ, both of the new variants achieve quadratic gate-count scaling. Our results focus on the simplest k = 1 model, and show that the uCJ models frequently maintain energy errors within chemical accuracy (∼1 kcal mol−1). Both g-uCJ and Im-uCJ are more expressive in terms of capturing electron correlation and are also more accurate than the earlier Re-uCJ ansatz. We further show that Im-uCJ and g-uCJ circuits can also be implemented exactly, without any Trotter decomposition. Numerical tests using k = 1 on H2, H3+, Be2, C2H4, C2H6 and C6H6 in various basis sets confirm the practical feasibility of these shallow Jastrow-based ansätze for applications on near-term quantum hardware.
A wide variety of algorithms have been proposed for efficient electronic structure calculations on quantum computers.1,2,18–20 While the quantum phase estimation (QPE) algorithm21 offers a scalable pathway for addressing the electronic structure problem,22 the high circuit costs of this algorithm and its variants23 appear to require fault-tolerant quantum hardware.24–26 The majority of algorithms proposed for the simulation of electronic structure on near-term noisy, intermediate scale quantum (NISQ) devices fall either within the category of variational optimization algorithms27–31 or lie within the broad class of quantum subspace diagonalization (QSD)32–35 algorithms. In the variational optimization category, the variational quantum eigensolver (VQE)27 is perhaps the most well-known. VQE uses a quantum computer to prepare trial wavefunctions and a classical computer to optimize the parameters of these wavefunctions. While initial studies indicated significant promise for ground-state energy calculations,27,28 subsequent work showed that the classical optimization component is non-trivial, due to the frequent occurrence of barren plateaux that arise from shallow or even flat energy landscapes.36–39 Several extensions and modifications of VQE have since been developed to improve its accuracy and applicability.40–49 For example, PermVQE47 introduces correlation-informed qubit permutation to the optimization process, allowing the accuracy of energy predictions to be improved without increasing the circuit depth. Another example is ADAPT-VQE,48 which optimizes construction of the quantum circuit by selectively adding operators that provide the greatest reduction in energy at each iteration. The more recent qubit-ADAPT-VQE49 further enhances the performance of this adaptive approach by optimizing the operator selection at the qubit-operator level, thereby reducing the number of quantum gates required for efficient simulations.
An alternative class of quantum algorithms for calculating electronic energies on NISQ or near-term hardware is provided by subspace diagonalization algorithms. These focus on capturing electron correlation by expanding the state basis. They are hybrid algorithms that generally involve classical postprocessing of a Hamiltonian eigenvalue problem with the matrix elements evaluated on a quantum processor. These algorithms can be further classified according to the methods used to define the basis states, which may or may not be orthogonal. For instance, the Quantum Subspace Expansion (QSE) builds on VQE outputs by constructing an expanded subspace from the original VQE solution, e.g., â†pâq|ΨVQE〉.50,51 It is also possible to achieve provable convergence with respect to growth of the subspace when the set of expanded states forms a Krylov basis that is generated by the repeated application of the matrix of interest to an initial guess vector.52 Several algorithms have been proposed along this direction, including quantum filter diagonalization,33 quantum Lanczos,53 and quantum Davidson methods.54 A different form of subspace diagonalization is provided by the non-orthogonal quantum eigensolver (NOQE),55 which is a multi-reference method for systems with both strong and weak electronic correlations that offers both algorithmic and practical quantum advantages, and has recently been shown to provide a complexity theoretic quantum speedup for such systems.56
The work we report here focuses on the adaptations needed to successfully use cluster wavefunctions as reference states for systems with both strong and weak electronic correlations. As such, it is relevant to both variational methods and the NOQE. Classical CC theory converges only slowly with rank for strongly correlated systems57 due to both its nonvariational character and the nature of the cluster expansion.58 The variational issue is well solved via the VQE and unitary CC (UCC) theory.27,59–63 However, the large number of variational parameters, even at the lowest singles and doubles (i.e., quartic scaling amplitudes at the UCCSD level) which results in large circuit depths has motivated development of more compact alternatives.40,64,65 We focus here on the unitary cluster Jastrow approximation (uCJ),66 which builds on the well-known real-space Jastrow factors from quantum Monte Carlo, extended into Hilbert space to become cluster operators.67,68 The uCJ approach was used in the NOQE55 where the reduction in gate count was noted, and has also been successfully mapped onto aspects of both physics of strong correlation and qubit connectivity with a local uCJ extension.69,70
In this work, we introduce and systematically explore several variants of the uCJ ansätze, analyzing their circuit depth, expressibility, and accuracy by use of calculations that minimize the Hamiltonian energy for individual ansatz states. Specifically, we examine two new forms of the uCJ correlator, one with imaginary orbital rotation operators (Im-uCJ) and one with complex orbitalrotation operators (g-uCJ). We show that similar to the previously introduced Re-uCJ ansatz,55 these new ansätze also reduce the gate count relative to the widely used UCCSD ansatz, making the uCJ correlators more suitable for near-term hardware. We demonstrate that both Im-uCJ and g-uCJ can achieve high accuracy, often surpassing both UCCSD and their real-rotation counterpart (Re-uCJ), while preserving a shallow circuit depth. We benchmark these uCJ ansätze on a series of small molecules, demonstrating that cluster Jastrow correlators can serve as a key step toward practical and resource-efficient quantum algorithms for molecular electronic structure on NISQ and near-term devices.
![]() | (1) |
and Ĵ are defined as![]() | (2) |
![]() | (3) |
operator, we restrict orbital rotations to occur only within the same spin subspace. This choice constrains the number of variational parameters and simplifies the ansatz. However, lifting this restriction (allowing rotations connecting different spin states) increases the number of variational parameters (while still scaling quadratically with the number of spin orbitals) and can enhance expressibility. In this work, we primarily investigate the spin-preserving form, and will state explicitly when the spin-generalized form of
is used.
In eqn (1)–(3), p and q refer to spatial molecular orbitals, σ and τ represent spin polarization (either α or β), and |HF〉 is the mean-field restricted or unrestricted Hartree–Fock reference state. The parameter k controls how many replicas of the Jastrow type correlators are included in the ansatz. If k is not truncated, eqn (1) can be exact.59,66 Nevertheless, we shall limit ourselves to k = 1 since this choice is the simplest and cleanest, and we wish also to explore the extent to which this can provide a good balance between capturing a significant portion of electron correlations while maintaining a shallow enough ansatz to be interesting for near-term devices. For notational simplicity, we will omit the σ subscripts in Kpq,σσ, while noting that the orbital rotations only couple orbitals of the same spin. To maintain unitarity in the uCJ ansätz, the coefficients Kpq and Jpq,στ must satisfy specific conditions: the matrix K is required to be anti-Hermitian, while J must be purely imaginary and symmetric.66
Previous studies55,66,69 have explored the uCJ ansatz with real orbital rotations, which imposes the restriction that K is real and anti-Hermitian. We refer to this form as real uCJ (Re-uCJ). In this case, the effective form of the
operator can be written as:
![]() | (4) |
However, the flexibility of the uCJ ansatz also allows additional choices that to our knowledge have not yet been explored. It is particularly interesting to explore whether different choices can yield significant improvements for truncation at k = 1 that we impose throughout this work.
The first alternative we consider is the use of the
operator with the opposite restriction to that imposed in Re-uCJ, i.e., restricting K to be imaginary. We thereby obtain another form of the uCJ ansatz, which we shall refer to as imaginary uCJ (Im-uCJ):
![]() | (5) |
The second alternative is to consider the least restricted scenario for K, where it is allowed to be complex with both real and imaginary parts. This formulation provides the greatest flexibility for a given truncation of k, and therefore we refer to it as generalized uCJ (g-uCJ). The
operator for g-uCJ is written as:
![]() | (6) |
The overall scalability of the Jastrow ansatz is significantly more favorable than other UCC variants, since the number of terms in e
i and eĴi scale as O(N2), where N is the number of spin orbitals. This results in an overall O(N2) scaling of the uCJ ansatz, in contrast to the formal O(N4) scaling for a single Trotter step of a UCC ansatz with double excitations.71 Additionally, by applying a fermionic-to-spin transformation to
(e.g., the Jordan–Wigner (JW) transformation2,72), we find that the restricted Re-uCJ and Im-uCJ ansätze are both represented with a number of Pauli words that is smaller by a factor of two than that for the g-uCJ ansatz. Once the ansatz is defined, the parameters in the matrices K and J can be optimized either (i) purely classically, for example through methods such as classical variational Monte-Carlo,67,68 or (ii) within the VQE framework. In this work, we test the performance of all three uCJ ansatz variants, Re-uCJ, Im-uCJ, and g-uCJ.
and Ĵ from the fermionic to a qubit representation, the question arises of how to accurately represent the exponentials of these operators. In general, once
and Ĵ are mapped into qubit space, they may consist of Pauli words that do not commute with each other. A straightforward solution to this non-commutativity is to employ the approximate Trotter decomposition. However, for the specific case of the JW transformation, the need for Trotter decomposition can be avoided. In the JW mapping, the exponentiation of the Ĵ operator is straightforward to handle, since the number operators are mapped to commuting Ẑ Pauli operators, according to![]() | (7) |
The JW mapping then allows us to express the sum of these operators as an exact product of exponentials of individual Pauli terms.
On the other hand, the JW transformed
operator generally includes non-commuting terms. Implementation of this operator presents a greater challenge, but can be realized by taking advantage of the demonstration that any real unitary orbital rotation operator can be implemented efficiently using Givens rotation operators.73 Here we extend this approach to show that the exponentials exp(
Im-uCJ) and exp(
g-uCJ) can also be represented as consecutive applications of a generalized form of Givens rotation.
The analysis in ref. 73 showed that any particle number-preserving rotation operator of the single-particle basis
![]() | (8) |
real fermionic rotations of the form pq(θk) = exp(θk(â†pâq − â†qâp)), | (9) |
The proof is based on the equivalence of application of the orbital rotation operator
pq(θk) to the unitary Û(u) and rotations of matrix u
pq(θk)Û(u) = Û(rpq(θk)u), | (10) |
![]() | (11) |
![]() | (12) |
![]() | (13) |
, followed by the inverse of the sequence of two-qubit rotations
pq(−θk):![]() | (14) |
This implementation can be directly applied to the Re-uCJ ansatz by choosing the unitary operator u as
| u = exp(K), | (15) |
operator, which thus represents a real orbital rotation unitary and can be then implemented by using eqn (14). However, the decomposition in ref. 73 does not directly apply to the Im-uCJ and g-uCJ ansätze, which involve imaginary and complex orbital rotations, respectively, and therefore the corresponding matrices u are no longer real. To extend the method to be applicable with complex u, we use a generalized Givens rotation of the form![]() | (16) |
![]() | (17) |
and
. With these generalized rotation operators in hand, we can apply the same procedure derived in ref. 73 and summarized above, but replacing
pq by
, and rpq(θ) by
. The values θ = θk and ϕ = ϕk for the complex-valued rotation matrix
that diagonalizes the corresponding complex-valued matrix u (cf.eqn (12)) are obtained from a corresponding generalization of the QR-like decomposition that is given explicitly in the SI. As a result, the general complex orbital rotation matrix for the Im-uCJ and g-uCJ ansatz can be decomposed into
generalized fermionic rotation operators, according to![]() | (18) |
By further restricting the fermionic rotations to adjacent qubits we avoid non-local lengthy JW Ẑ Pauli strings and use only local Givens rotations operators. With this realization, the operator
is effectively a two-qubit operator, which can be realized with the use of only three CNOT gates.74–76
One of the important benefits of using exact exponentiation over Trotter decomposition is the fact that we can then effectively treat biradicaloid systems when using the NOQE.55 In a typical biradicaloid, pairs of unrestricted Hartree–Fock reference states differ only by permutations of spin orbitals, allowing the same K and J matrices to be reused for each reference simply by permuting the corresponding matrix indices. This reuse significantly reduces the cost of parameter optimization, as only one set of parameters must be optimized. The energy accuracy can then be improved by incorporating additional references with no more classical preprocessing costs, although there is now the additional quantum processor expense to measure more nondiagonal overlap and Hamiltonian matrix elements. By contrast, in an approximate Trotter decomposition scheme, these parameters must generally be re-optimized for each reference state, removing the classical preprocessing efficiency that exact exponentiation affords.
g-uCJ is: g-uCJ = K13â†1a3 + K24â†2a4 + K31â†3a1 + K42â†4a2. | (19) |
![]() | (20) |
| Re(K13) = −Re(K31), Re(K24) = −Re(K42), Im(K13) = Im(K31), Im(K24) = Im(K42). | (21) |
Now, let us express the
operator in terms of Pauli words using the Jordan–Wigner transformation:
![]() | (22) |
Given the anti-Hermitian nature of the K matrix, the expression above can be rewritten as:
![]() | (23) |
Thus, there are four real parameters to optimize for this operator. However, by restricting Re(Kij) = 0 (Im-uCJ, eqn (5)) or Im(Kij) = 0 (Re-uCJ, eqn (4)), we reduce the number of parameters to two:
![]() | (24) |
![]() | (25) |
Since [
i,
j] = [â†iâi, â†jâj] = 0, and given that J is symmetric and purely imaginary, the explicit form of the Ĵ operator for our two electron, four spin–orbital singlet case is:
| Ĵ = 2[J12â†1â1â†2â2 + J14â†1â1â†4â4 + J23â†2â2â†3â3 + J34â†3â3â†4â4 ] | (26) |
![]() | (27) |
This yields four variational parameters to optimize. Note that we assumed
1
3|Ψ〉 =
2
4|Ψ〉 = 0, given the singlet multiplicity of the wavefunction for ground state H2. In the general case, without this assumption, there would be six variational parameters to optimize for J.
operator in the Re-, Im-, and g-uCJ circuits was implemented exactly through the Givens rotation as described in Section 2.2 above. For the C2H6, Be2, C2H4, and C6H6 molecules, the CASCI formalism was used with active spaces of (2e,2o),(2e,2o),(4e,4o), and (4e,4o), respectively (we denote the number of spatial orbitals o here, according to quantum chemistry convention, and use N to denote the number of spin orbitals, according to quantum computing convention). The corresponding active space orbitals are illustrated in Fig. S2. For benzene, we employed a (4e,4o) active space focusing on the frontier HOMO–LUMO, since the fully bonding π and fully antibonding π* orbitals are energetically distant and contribute minimally to the correlation energy. CASCI calculations confirm that expanding from (4e,4o) to a (6e,6o) active space changes the total correlation energy by only ≈2%. Corresponding energies for both (4e,4o) and (6e,6o) active spaces are provided in the SI file (Table S2).
To analyze the gate counts for the circuits, we performed UCCSD simulations performed with the Qiskit-nature package.78,79 For the single Trotter-step decomposed ansätze (UCCSD, Re-uCJ, Im-uCJ, and g-uCJ), the number of CNOT gates was computed by first performing the Jordan–Wigner transformation of the
1,
2, Ĵ and
operators, followed by simplifying the resulting Pauli word sums through term cancellation, and finally representing each remaining exponentiated Pauli word by efficient quantum circuits as described in ref. 80. For exact implementations of the uCJ ansätze, the exponential eĴ was similarly decomposed by obtaining its JW representation, recognizing that in this case all Pauli words contain only Z and I gates and thus pairwise commute, allowing the exact decomposition into a product of exponentials of individual Pauli words. The e
and e−
terms were implemented by applying
generalized orbital rotation operators
as described in Section 2.2, where N represents the total number of spin orbitals, and division by two accounts for spin–orbital rotations only within the same spin space.
Optimization of the K and J matrices was carried out classically with an implementation of the SLSQP minimizer,81 using a sparse-vector representation of quantum states. The optimization minimizes the expectation value of the Hamiltonian in each of the reference states, by generating 〈Ĥ〉 using uCJ states from K and J matrices and variationally optimizing the expectation value with respect to their parameters. This classical optimization represents a pre-processing stage that is not susceptible to the measurement shot noise or circuit noise endemic to quantum processors. For larger active spaces (greater than or equal to 8 qubits), optimization of the K and J matrices was first performed in a reduced space under the perfect pairing assumption (only one bonding and its corresponding antibonding orbital are entangled at a time through the K and J operators). The orbital connectivity was then gradually extended toward the fully connected case, using the optimized coefficients from the previous step as the initial guess for each subsequent stage. The sparse-vector simulator implementation was written locally, and can be accessed through GitHub.82
In order to assess the performance of the ansätze in a realistic setting on current NISQ hardware, we undertook circuit-based simulations. Two types of these simulations were performed. The first type, run on the QASM simulator, includes only shot noise from measurement sampling and was used to assess the robustness of a variational optimization on a quantum device. The Jordan–Wigner Hamiltonian was partitioned according to the qubit-wise commuting (QWC) approach83 (Table S3) where the Pauli strings are grouped into sets in which each Pauli string commutes with every other string in a qubit-wise manner. This allows all terms in a group to be sampled simultaneously. Each group can then be measured by making appropriate one-qubit rotations then measuring in the computational basis. We used 10
000 shots per group for optimization in this setting without circuit noise. Parameters were optimized with the Powell optimizer.84 The final energy evaluations with optimized parameters were repeated for over 20 independent trials using 10
000 shots per QWC group.
The second type of simulation includes both the shot noise and realistic circuit noise (including readout noise). The Qiskit Aer backend was used here to introduce hardware noise models. For these simulations we evaluated the energy expectation value on optimized uCJ ansatz states for which the K and J parameters were taken from the noiseless classical optimizations described above. The circuits were executed on the same backend for four hardware noise models. We denote these as (i) a “T1/T2″ model incorporating only decoherence noise, (ii) a “T1/T2 + dep” model that adds light single- and two-qubit depolarizing noise, (iii) a “H1-like” model that resembles the Quantinuum H1 device with asymmetric readout errors using parameters from ref. 85, and (iv) an “IBM-like” model that resembles the IBM Pittsburgh device using parameters from ref. 86. For detailed description of these noise models see Table S4 in the SI. For these simulations with circuit noise, each QWC group was evaluated with 20
000 shots and repeated over 100 independent trials with different random seeds. The IBM-like circuits (for an “IBM-like” model) were compiled into the native gate basis CZ, RZZ, RX, RZ, Sx, X, while the H1-like circuits (for “T1/T2”, “T1/T2 + dep”, and “H1-like” models) used RZ, RX, RZZ gates, corresponding to the physical single-qubit and entangling operations of the corresponding hardware platforms. All-to-all qubit connectivity was assumed in these noisy simulations. The test was conducted for H2 in the STO-3G basis at R(H–H) = 1.7 Å, (a point on PES where the energy difference between ansätze is the most pronounced).
| System | N elec | N qubit | UCCSD N2q | g-uCJ N2q | Re-uCJ N2q | Im-uCJ N2q | |||
|---|---|---|---|---|---|---|---|---|---|
| Single T-step | Single T-step | Exact | Single T-step | Exact | Single T-step | Exact | |||
| H2 (STO-3G) | 2 | 4 | 42 | 46 | 20 | 36 | 20 | 34 | 20 |
| Be2 (STO-3G, (2e, 2o)) | 2 | 4 | 42 | 46 | 20 | 36 | 20 | 34 | 20 |
| C2H6 (STO-3G, (2e, 2o)) | 2 | 4 | 42 | 46 | 20 | 36 | 20 | 34 | 20 |
| H3+ (STO-3G) | 2 | 6 | 166 | 166 | 54 | 120 | 54 | 126 | 54 |
| H2 (6-31G) | 2 | 8 | 404 | 376 | 104 | 264 | 104 | 270 | 104 |
| H4 (STO-3G) | 4 | 8 | 753 | 400 | 128 | 288 | 128 | 294 | 128 |
| C2H4 (STO-3G, (4e, 4o)) | 4 | 8 | 753 | 400 | 128 | 288 | 128 | 294 | 128 |
| C6H6 (STO-3G, (4e,4o)) | 4 | 8 | 753 | 400 | 128 | 288 | 128 | 294 | 128 |
| H2 (6-311G) | 2 | 12 | 1202 | 1319 | 192 | 894 | 192 | 902 | 192 |
We now compare the performance of the restricted versus general uCJ ansätze. For single-step Trotterized circuits, the restricted Re-uCJ and Im-uCJ variants of uCJ employ approximately two-thirds the number of two-qubit gates compared to the generalized g-uCJ ansätz. This behavior arises from the two-fold increase in the number of Pauli terms in the definition of the
operator for g-uCJ (see eqn (23)–(25)). In contrast, for the exact implementations of Im-uCJ and g-uCJ introduced in this work that use the generalized Givens rotations, the CNOT cost is similar across all three uCJ variants in their exact form (see eqn (12)–(16)), although the number of variational parameters differs between restricted and generalized variants.
As we demonstrate in the Performance analysis section below, the increased number of variational parameters in g-uCJ is offset by its greater flexibility and expressivity. In all of the considered cases, the g-uCJ ansatz recovers a larger portion of the correlation energy than its restricted counterparts and often yields results within chemical accuracy. If the number of variational parameters is a critical constraint, we recommend using the restricted variants (Re-uCJ or Im-uCJ), with the remaining correlation energy recovered using, for example, the NOQE algorithm.55
As illustrated for H2, Be2 (2e,2o), C2H6 (2e,2o), and linear H3+ in Fig. 1, the performances of the Im-uCJ and Re-uCJ ansätze show significant differences. Despite requiring the same number of two-qubit gates, Im-uCJ consistently yields more accurate results, whereas Re-uCJ sometimes converges to metastable local minima and struggles with optimization. This behavior is clearly illustrated in Fig. 1a and c. Here the Re-uCJ curves denoted as ‘dissociation’ were obtained by gradually increasing the bond distance and using the optimized coefficients from each prior step as an initial guess. Due to the lack of flexibility of the Re-uCJ ansatz, this approach accumulates errors as the bond breaks, as is also seen and well-known for RHF ansätze. The curves denoted as ‘association’ were generated by gradually decreasing the bond distance, with each calculation initialized using the optimized coefficients from the previous step. In the bond breaking regime, the energy still can be reduced by variational optimisation of the Re-uCJ parameters, but the results remain inferior to Im-uCJ (except for the C2H6 case with RCC > 2.9 Å, where Re-uCJ performs slightly better than Im-uCJ), and this variational local minimum becomes inferior to Re-uCJ ‘dissociation’ results at shorter bond distances (RHH < 1.6 Å for H2, and RCC < 2.4 Å for C2H6). We note that random initialization of the Re-uCJ parameters along the dissociation curve leads to erratic optimization behavior due to jumping between the dissociation and association curves, disrupting smoothness of the energy as a result. In contrast, Im-uCJ reliably converges to a single lower energy solution regardless of the initial guess procedure, demonstrating greater stability.
Similar results are observed for the other molecular systems shown in Fig. 1b and c, although there are some specific features worth commenting on. In particular, we see that Be2 exhibits no binding in calculations using an RHF reference, while in fact it is known from experiments to have a weak bond, with Re = 2.45 Å. Part of its strong correlation effect can be captured with a (2e,2o) active space, allowing
excitations, as illustrated in Fig. S2b. The electron correlations are stronger at Re than at dissociation, as reflected in the sharp rise in the Re-uCJ error at short distances in Fig. 1b. Similar to the dissociation of H2, for Be2 we see that Im-uCJ again provides dramatic improvement over Re-uCJ, while g-uCJ attains exactness at all distances.
To further assess the robustness of the proposed ansätze in practice, we carried out simulations that introduce both measurement shot noise as well as quantum hardware noise. First, Fig. S3 presents the PES of H2 computed with the three uCJ variants using the QASM simulator and including only statistical shot noise. The simulations show that the Im-uCJ and g-uCJ ansätze retained their higher accuracy relative to Re-uCJ, indicating that they not only achieve better energies but also retain stable optimization behavior under sampling fluctuations.
Second, to quantify the impact of quantum device noise derived from both physical gate and measurement noise sources, Table 2 reports the results of hardware-noise simulations performed for the same H2 system at an internuclear distance of 1.7 Å (the point of maximum energy deviations among the three ansätze on the PES). For this test, we used the classically optimized parameter sets and introduced representative noise models, including ones that correspond to Quantinuum H1-like and IBM-like devices (see Section 2.2 and the SI for more details). We find that across all models tested here, the relative accuracy of the ansätze remains unchanged. Specifically, g-uCJ consistently exhibits the smallest deviation from the FCI reference, while Re-uCJ exhibits the largest error. These results demonstrate that the qualitative performance order of the uCJ variants is also preserved under realistic hardware noise conditions. It is important to note, however, that even for the g-uCJ ansatz, the realistic gate, readout, and decoherence error rates (H1-like and IBM-like models) on current quantum devices (without any form of noise mitigation and error correction) produce energy deviations of approximately 1.4 × 10−2–2.6 × 10−2 a.u. relative to the exact FCI solution, as shown in Table 2. While both of these lie outside chemical accuracy (∼1 kcal mol−1, equivalent to 1.6 × 10−3 a.u.), the results with circuit noise emulating ion-trap platforms such as Quantinuum H1 exhibit smaller deviations, due to their higher single- and two-qubit gate fidelities. These results illustrate the limitations of NISQ era hardware, which are expected to recede as we move into the pre-fault-tolerant era where use of error-detecting codes with post-selection, additional layers of error mitigation, and eventually also some error correction, will significantly improve the performance.
| Ansatz | Noise model | E mean (a.u.) | σ (a.u.) | ΔE (a.u.) |
|---|---|---|---|---|
| Re-uCJ | None | −0.938640 | 0.001052 | 0.032786 |
| T 1/T2 | −0.932796 | 0.001091 | 0.038630 | |
| T 1/T2 + dep | −0.932253 | 0.001111 | 0.039173 | |
| H1-like | −0.926495 | 0.001152 | 0.044931 | |
| IBM-like | −0.916108 | 0.001205 | 0.055319 | |
| Im-uCJ | None | −0.964422 | 0.001057 | 0.007004 |
| T 1/T2 | −0.957332 | 0.001089 | 0.014095 | |
| T 1/T2 + dep | −0.956925 | 0.001110 | 0.014501 | |
| H1-like | −0.950271 | 0.001130 | 0.021155 | |
| IBM-like | −0.938314 | 0.001258 | 0.033112 | |
| g-uCJ | None | −0.971274 | 0.001126 | 0.000152 |
| T 1/T2 | −0.964271 | 0.001137 | 0.007146 | |
| T 1/T2 + dep | −0.963883 | 0.001102 | 0.007544 | |
| H1-like | −0.957142 | 0.001131 | 0.014285 | |
| IBM-like | −0.945106 | 0.001271 | 0.026319 |
While exactness for two-electron systems is a desirable property and can be achieved for H2 by optimization of both g-uCJ and UCCSD ansätze, realistic applications typically involve larger numbers of electrons in the active space. To assess the performance of the uCJ variants in such settings, we examined the case of a 4-electron, 4-orbital system corresponding to the double bond breaking in C2H4. The results are shown in Fig. 2. As can be seen here, the same general trend persists, namely that g-uCJ performs best, while Re-uCJ yields the least accurate results. Notably, g-uCJ maintains chemical accuracy up to a C–C bond distance of 1.7 Å. At larger separations, its performance degrades due to the inability to fully describe the dissociation into two 3CH2 radicals, a process that involves quadruple excitations, which requires going beyond the k = 1 level of a uCJ ansatz. Thus, in a situation where bonds with double or higher order bond character are broken at larger distances, it is recommended to go beyond the k = 1 level of the uCJ ansatz.
For larger systems, single-point energy calculations were performed using the g-uCJ, Re-uCJ, and Im-uCJ ansätze. These are summarized in Table 3, where the previously observed trends are maintained. Thus, Re-uCJ generally recovers the smallest fraction of the correlation energy, Im-uCJ performs better, and of the three ansätze, g-uCJ captures the largest fraction of the correlation energy. A comparison with UCCSD is also provided in Table 3. As shown there, g-uCJ achieves accuracy comparable to UCCSD while requiring significantly fewer two-qubit gates. Since symmetry breaking can contribute to the correlation energy in some systems, we also report the percentage of correlation energy captured by a single broken-symmetry UHF reference state, without dressing by a cluster Jastrow operator (last column of Table 3). As shown in line 2 of Table 3 for the highly correlated H4 molecule in a square geometry, using a bare UHF reference state captures approximately ≈87% of the electronic correlation energy, while applying the g-uCJ ansatz to the RHF reference improves the accuracy, recovering around ≈95% of the correlation energy and bringing the result significantly closer to the FCI energy. To further improve upon the accuracy achieved with g-uCJ, algorithms such as NOQE and, in the long term, QPE can be employed.
| System | UCCSD | g-uCJ | Re-uCJ | Im-uCJ | UHF |
|---|---|---|---|---|---|
| H2 (6-31G, RHH = 1.2 Å) | 100 | 100 | 82.88 | 99.96 | 0.06 |
| H4 (STO-3G, RHH = 1.1 Å) | 92.84 | 94.56 | 89.76 | 92.01 | 87.02 |
| C6H6 (STO-3G, (4e,4o)) | 100 | 92.26 | 59.74 | 83.97 | N/A |
We note that for even larger systems, the VQE optimization of uCJ circuits is expected to pose challenges such as barren plateaus, sensitivity to initialization and convergence difficulties. Empirically, we observe that Im-uCJ is less sensitive to initial conditions than Re-uCJ, suggesting that imaginary orbital rotations provide additional flexibility that stabilizes convergence. Similar observations have been made in recent studies, where initializing generalized gates in complex space was found to improve optimization performance, even when the exact ground state is real.87,88 This suggests that access to complex rotations provides additional flexibility that reduces the susceptibility to fall into local minima.
Interestingly, we find that the Im-uCJ ansatz exhibits a different structure. As shown in Fig. 3a, in addition to the HF and doubly excited CSFs, the wavefunction also includes contributions from the triplet Ms = 0 CSF. As is familiar in UHF, this additional flexibility lowers the energy and brings the result closer to the exact FCI energy, albeit at the cost of mixing different spin states. The contribution from the triplet CSF decreases smoothly at short distances, but, in contrast to UHF, is still non-zero at bond distances 0.25 Å shorter than Re: there is no analog of a Coulson–Fischer point, i.e., no collapse to RHF solution. Consequently, the value of 〈S2〉 deviates from zero (a pure singlet) all along the potential energy curve.
The Re-uCJ ansatz displays an even more intriguing behavior, showing very distinct wavefunction compositions for the two local minima derived from the equilibrium (dissociation ansatz) and separated atom regimes (association ansatz). With the dissociation ansatz (Fig. 3d), the wavefunction derived from equilibrium retains significant overlap with the HF state, increasing the overall energy error at large distances. Additionally, we observe contributions from the open-shell singlet CSF, indicating that while the 〈S2〉 and 〈Ms〉 are correct for a pure singlet state, the Re-uCJ ansatz exhibits spatial symmetry breaking (the open-shell singlet has ungerade (u) symmetry).
In contrast, the Re-uCJ local minimum derived from the separate atom regime (i.e., the association ansatz, Fig. 3c) behaves quite differently. Initially, in the dissociation regime, the wavefunction contains a mixture of triplet Ms = 0, doubly excited singlet, and HF singlet CSFs, but as the bond distance decreases, the wavefunction collapses into the RHF solution, rationalizing the energy curve behavior observed in Fig. 1a. Therefore this solution exhibits the analog of a Coulson–Fischer point, and appears to approach 50% triplet character at dissociation, resembling a UHF reference state. Interestingly, its spin contamination exceeds that of the Im-uCJ solution for all distances beyond about 1.2 Å as can be seen from the larger contribution form the triplet Ms = 0 CSF.
The difference in CSF composition between Re-uCJ, Im-uCJ, and g-uCJ can be rationalized by their different relative expressibility (i.e. the flexibility of their functional forms). The Re-uCJ ansatz cannot exactly reproduce the exact ground-state singlet wavefunction (which consists of a combination of Hartree–Fock and doubly excited singlet CSFs). As Fig. 3c shows, Re-uCJ introduces correlation by also introducing symmetry-breaking configurations. Additionally, limitations of the (k = 1) Re-uCJ ansatz force higher triplet vs. singlet CSF character whenever they are non-zero (giving a lower energy solution only at the dissociation limit). That is why an alternative solution involving the open-shell singlet CSF was located at smaller H–H distances (Fig. 3d). In contrast, the Im-uCJ ansatz accesses a different subspace of the Hilbert space through complex orbital rotations, allowing it to reach the doubly excited singlet CSF through variable inclusion of triplet CSF character at all H–H bond distances (Fig. 3a). In turn, g-uCJ has additional flexibility beyond either variant (Im- or Re-uCJ) individually. Thus, this more expressive circuit recovers the correct composition of CSFs observed in the exact singlet ground state wavefunction (Fig. 3b).
The exponential of the
operator (e
) can be represented as a series of Givens rotations,73 which requires
operations, where N is the number of spin–orbitals. Given that our generalized Givens rotation is essentially a two-qubit gate, it can be in general represented with 3 CNOT gates.74–76 The exponential of the Ĵ operator (eĴ), which consists of paired number operator rotations of the form e−iθ
i
j, requires two CNOT gates and one Rz gate per term. Noting that there are
distinct number operator pair products (without diagonal terms), one can then estimate the maximum number of CNOT gates as:
![]() | (28) |
![]() | (29) |
This results in total counts of at most
![]() | (30) |
Superconducting devices, by contrast, typically allow only nearest-neighbor connectivity but can exploit effective gate parallelization. In cases of limited connectivity, the exact implementation of Givens-rotation operators would still be possible without extra CNOT gate overhead, as it requires only nearest neighbor qubits. The challenge then arises for the eĴ term, since its decomposition involves factors such as eẐiẐj with distant qubits, which must be realized through chains of CNOT gates. Thus, for superconducting devices, the Local-uCJ variants in ref. 69 provide a natural alternative, by simplifying the Jastrow-factor operator to include only eẐiẐj terms that are connected on a particular device. This effectively reduces expressibility, but this effect can be mitigated by increasing k.
We would like to note that, formally, the number of parameters and entangling gates in the k-fold uCJ family scales linearly with the parameter k. As we noted above, k = 1 defines a well-behaved model chemistry that is tractable and is already able to capture a large portion of electron correlation effects. Increasing k systematically improves expressibility but comes at the price of a more challenging optimization problem and greater sensitivity to noise. For near-term devices, the recommended strategy is to begin with k = 1 and only increase k if the uCJ circuit does not achieve the desired accuracy. Overall, we expect ion-trap architectures (with high fidelities and near all-to-all connectivity) to be particularly well suited for uCJ implementations, while superconducting devices may benefit from the more restricted Local-uCJ ansatz with k > 1.
Our k = 1 results show that the restricted Im-uCJ ansatz has the best performance, offering a compelling balance between accuracy and computational efficiency. This ansatz yields shallow circuits, a moderate number of parameters for optimization, and superior performance compared to the previously proposed Re-uCJ ansatz, which often struggles with local minima and convergence issues. The generalized g-uCJ ansatz achieves near-exact accuracy, even reproducing FCI energies for small systems; however it does so at the cost of requiring a larger number of parameters.
A configuration state function (CSF) analysis provided deeper insight into the behavior of these uCJ ansätze. We observed that Im-uCJ lowers the energy at the cost of mixing spin states, while g-uCJ maintains the correct spin symmetry and achieves highly accurate results, albeit with more parameters. On the other hand, Re-uCJ exhibits a strong dependence on initialization, with distinct behaviors in association and dissociation regimes.
We also analyzed the effects on evaluation of energy expectation values in the uCJ ansatz states, of measurement shot noise and circuit noise derived from current hardware limitations on NISQ era machines that do not implement any form of error mitigation or error correction. For single reference states, while the effects of measurement shot noise were not significant, the effects of realistic circuit noise, including both gate and readout errors, were found to limit the accuracy to values outside the desired regime of chemical accuracy. This highlights the limitations of NISQ era hardware for accurate quantum chemical calculations and emphasizes the key importance of hardware developments in the pre-fault tolerant era we are entering now, to enable effective and efficient error mitigation, as well as operation of error-detecting and error-correcting codes. Given the strong performance of the Im-uCJ and g-uCJ ansätze seen in this work in the single reference context with classical pre-optimization, our future objectives are to apply the proposed uCJ circuits within the NOQE algorithm to explore their potential for multi-reference quantum eigensolvers on the pre-fault tolerant quantum hardware currently under development.
In particular, for biradical systems with two broken-symmetry UHF reference states that differ only by a permutation of α and β electrons, the non-orthogonal quantum eigensolver algorithm can improve energy estimates without requiring reoptimization of uCJ parameters. In such cases, only a single parameter set needs to be optimized for Im- or g-uCJ, with the second set readily obtained via index permutation in the K and J matrices. This analysis implies that uCJ ansätze, particularly Im- and g-uCJ, are strong candidates for near-term quantum simulations, offering viable and compact alternatives to more resource-intensive methods like UCCSD. It will also be of interest to apply these alternative unitary cluster Jastrow models to lattice based problems, expanding the capability of the local uCJ ansatz that has been applied recently to Fermi-Hubbard models.93
| This journal is © The Royal Society of Chemistry 2025 |