Open Access Article
Irma
Avdic
a,
Yuchen
Wang†
a,
Michael
Rose†
a,
Lillian I.
Payne Torres
a,
Anna O.
Schouten
a,
Kevin J.
Sung
b and
David A.
Mazziotti
*a
aDepartment of Chemistry, The James Franck Institute, The University of Chicago, Chicago, IL 60637, USA. E-mail: damazz@uchicago.edu
bIBM Quantum, IBM T.J. Watson Research Center, Yorktown Heights, NY 10598, USA. E-mail: kevinsung@ibm.com
First published on 27th March 2026
Quantum state tomography is a fundamental task in quantum information science, enabling detailed characterization of correlations, entanglement, and electronic structure in quantum systems. However, its exponential measurement and computational demands limit scalability, motivating efficient alternatives such as classical shadows, which enable accurate prediction of many observables from randomized measurements. In this work, we introduce a bi-objective semidefinite programming approach for constrained shadow tomography, designed to reconstruct the two-particle reduced density matrix (2-RDM) from noisy or incomplete shadow data. By integrating N-representability constraints and nuclear-norm regularization into the optimization, the method builds an N-representable 2-RDM that balances fidelity to the shadow measurements with energy minimization. This unified framework mitigates noise and sampling errors while enforcing physical consistency in the reconstructed states. Numerical and hardware results demonstrate that the approach significantly improves accuracy, noise resilience, and scalability, providing a robust foundation for physically consistent fermionic state reconstruction in realistic quantum simulations.
To overcome this limitation, a wide range of scalable tomographic approaches have been developed. These methods aim to infer only the most relevant or physically meaningful aspects of a quantum state rather than reconstructing the full wavefunction. Such approaches include compressed sensing,9,10 neural-network-based tomography,11,12 direct fidelity estimation,13–15 and maximum-entropy-inspired methods based on restricted Boltzmann machines,16 among others. A particularly powerful and widely adopted framework is that of classical shadows,17,18 which enables efficient estimation of a large number of observables from randomized measurements. The shadow tomography protocol constructs a compact classical representation, termed a “shadow”, from a set of randomized qubit or fermionic measurements, from which expectation values of many observables can be predicted with polynomial or even logarithmic scaling in system size. This paradigm has been successfully applied in a variety of contexts, including entanglement estimation,19 Hamiltonian learning,20–22 and reduced density matrix (RDM) reconstruction for quantum chemistry.23,24
When specialized to fermionic systems, shadow-based methods can leverage rich physical structure, such as antisymmetry,25 particle-number conservation,24 spin symmetries, and Gaussian unitary transformations,23,26,27 to further improve sampling efficiency and reconstruction fidelity. These fermionic shadow protocols enable scalable characterization of electronic structure properties and correlation functions relevant to molecular and solid-state systems, potentially bridging the gap between quantum simulation and classical post-processing. However, despite their theoretical efficiency, existing fermionic shadow tomography approaches face significant practical challenges. In particular, the presence of sampling noise, gate infidelity, and finite measurement statistics on quantum devices can severely degrade reconstruction accuracy.
In this work, we introduce an improved and optimized constrained shadow tomography framework, building on prior work,28,29 specifically tailored for the simulation of realistic fermionic systems on quantum devices. Our method formulates the reconstruction of the two-particle reduced density matrix (2-RDM) as a bi-objective semidefinite optimization, extending shadow-based tomography through a constrained formulation designed to improve robustness to noise and limited measurement data. By incorporating physical N-representability constraints directly into the optimization and adding a nuclear-norm regularization to the energy minimization, the method reconstructs a shadow-consistent 2-RDM that satisfies the N-representability conditions through an optimization that balances energy minimization against fidelity to the measured shadow data. This unified semidefinite program simultaneously mitigates errors arising from finite sampling and hardware noise while enforcing physical consistency in the resulting RDMs. Consequently, the framework can be viewed as a noise-aware, state-tailored extension of variational RDM theory,30–50 capable of producing physically valid density matrices under realistic quantum measurement conditions.
Through comprehensive numerical analysis, we explore how measurement requirements scale with system size, orbital basis dimension, and the choice of N-representability constraints, identifying practical regimes where constrained tomography yields superior performance over other approaches. Finally, we validate constrained shadow tomography through computations performed on IBM's superconducting quantum device, confirming that the advantages of constraint-enforced shadow tomography persist under realistic experimental conditions involving gate noise, readout errors, and limited sampling depth. Our results highlight the importance of integrating physical priors and optimization-based constraints into quantum state reconstruction frameworks, paving the way for accurate and scalable molecular simulation on quantum hardware.
![]() | (1) |
is the measurement channel. The estimator can then be used to predict the expectation values of many observables with provable guarantees,![]() | (2) |
which converges to Tr(Oρ) as the number of measurements M increases. For fermionic systems, the choice of measurement unitaries is key to an optimal protocol. For example, particle-number conserving transformations or fermionic Gaussian unitaries (FGUs) preserve antisymmetry and can be implemented efficiently on qubit hardware through linear optical or Majorana representations.23,25,27 These structured unitaries significantly reduce circuit depth and improve measurement fidelity compared to fully random Clifford unitaries.
The statistical efficiency of classical shadow tomography is further characterized by the shadow norm, which quantifies the variance of observable estimators under random measurements.18 For a given observable O, the estimation error scales as
![]() | (3) |
In the context of electronic structure and many-fermion simulations, the goal is often not full state reconstruction but accurate estimation of low-order observables such as the 2-RDM. Classical shadow tomography provides an efficient statistical framework for directly estimating these quantities from measurement data, enabling integration with post-processing schemes and physical consistency.
Since fermionic wavefunctions prohibitively scale exponentially with particle number, their state tomography is often simplified to an estimation of p-particle reduced density matrices (p-RDMs). If represented in the qubit basis, estimating all p-RDMs on n fermionic modes requires
measurements.54,55 Using FGUs, the sample complexity can be reduced to
.23 Considering particle-number symmetry, the required number of samples can further be reduced to
,24 with η particles. Here, the estimation error remains independent of the full Hilbert space dimension.
![]() | (4) |
creates a particle in orbital i and âi annihilates a particle in orbital i. The classical shadow representation of the 2-RDM can be expressed as![]() | (5) |
. Here, the indices denote spin orbitals, An is a one-body anti-Hermitian matrix, and n is the shadow index. Each shadow n corresponds to measuring all diagonal elements of the 2-RDM after applying the one-body unitary transformation, Ûn, to the wave function. The one-body unitaries are generated by random sampling according to the Haar measure,63 effectively rotating the orbitals into a new basis. From eqn (5), the shadow measurements can be expressed in terms of the 2-RDM elements as![]() | (6) |
By defining an objective functional of the 2-RDM, (J[2D]), such as the nuclear norm, Frobenius norm,64 or the energy expectation value,39,65,66 a convex optimization problem can be formulated with respect to the shadow constraints,
![]() | (7) |
for n ∈ [0, m], where N2
denotes the convex set of approximately N-representable 2-RDMs. For a quantum many-body system with at most pairwise interactions, minimizing the energy functional E[2D] is particularly appealing due to its direct relevance to molecular simulation. The 2-RDM can further be used to estimate the energy gradients67,68 and multipole moments.69 However, to enable practical implementation on current quantum devices, the formulation must be augmented to properly account for quantum measurement errors.
Motivated by the need to consider two complementary objectives: (1) to obtain an optimal solution to a general functional of the 2-RDM and (2) to minimize measurement errors (e.g., shot/gate noise, readout errors, limited sampling depth, etc.) affecting the 2-RDM, we cast eqn (7) into the following bi-objective semidefinite-programming optimization problem:70,71
![]() | (8) |
![]() | (9) |
2 = 2D + E1 − E2. | (10) |
The second objective function in eqn (9) minimizes the nuclear norm, or the sum of the singular values of the slack matrix, over the constraint set including the shadow constraints. The nuclear norm serves as a convex and computationally tractable surrogate for matrix rank, and efficient first-order algorithms based on singular value thresholding have been developed to solve such problems at large scale.66 Being convex and easy to optimize, the nuclear norm offers the best convex approximation of the rank function for matrices whose largest singular value is no greater than one.74 When the matrix variable is symmetric and positive semidefinite, the method relaxes to the trace heuristic, i.e., ‖X‖* = Tr(X), frequently used in systems and control studies.75,76
For practical implementation, considering the energy functional of the 2-RDM and the case where wi = w for all i, we obtain the following bi-objective semidefinite program (SDP), which includes the relaxation for rank via the nuclear norm,
![]() | (11) |
The two error matrices, E1 and E2, act as a “positive” and “negative” direction of error, respectively, increasing and decreasing the correction to the 2-RDM simultaneously. In the limit where w → 0, the SDP reduces to the variational 2-RDM algorithm,33,39 in which only the energy functional E[2D] is minimized subject to the N-representability conditions. In the limit w → ∞, the SDP gains the ability to treat arbitrary quantum states, including excited and non-stationary electronic states, within the same variational framework.77–79 In this work, we adopt equal weighting between the energy and measurement-consistency terms as a simple, minimally-biased default that avoids over-committing to either objective in the presence of noise. More generally, the w parameter may be guided by standard trade-off criteria (e.g., balancing energetic improvement against deviation from the measured shadow data). Additionally, a closely related study by the authors77 shows that reconstruction performance is often robust across a fairly broad intermediate range of w, with clear improvement over a wide range of choices.
The present algorithm enforces a necessary subset of the N-representability conditions, known as the 2-positivity (DQG) conditions.41,56,57,59,80,81 The 2-positivity conditions restrict the particle–particle RDM (2D), the hole–hole RDM (2Q), and the particle–hole RDM (2G) to be positive semidefinite where fQ and fG are linear maps connecting 2Q and 2G to 2D, respectively. These semidefinite constraints correspond to keeping the particle–particle, hole–hole, and particle-hole probability distributions nonnegative. The 2-positivity conditions incur a computational scaling of
in memory and
in floating-point operations,39 where r denotes the number of orbitals, and therefore, constrained shadow tomography maintains overall polynomial scaling. This scaling is consistent with variational 2-RDM SDP theory, where large-scale calculations have previously been demonstrated with millions of optimization variables (explicit timing data on standard hardware can be found in ref. 39). Scalability can further be modified by exploiting symmetry and solver structure, and partial constraint hierarchies (e.g., D or DQ). Performing full 2-RDM tomography on a quantum device would nominally require
distinct measurements, whereas our shadow-based technique reduces this requirement to approximately nsr2, where ns is the number of classical shadows and typically satisfies ns ≪ r2 for a desired accuracy ε.28,29
From a practical standpoint, the error-mitigating formulation presented is particularly well-suited for application on quantum devices, where finite sampling noise and gate errors can corrupt 2-RDM estimates derived from shadow tomography or other measurement schemes. By incorporating error matrices directly into variational optimization, the method provides a mechanism to regularize noisy experimental data while maintaining physical consistency through the N-representability constraints. Beyond molecular electronic structure, this framework is broadly applicable to quantum simulation tasks involving spin models, correlated materials, and quantum state tomography, where convex relaxation and data-driven corrections offer a scalable and physically grounded route to extracting reliable observables from imperfect measurements.
We begin by assessing the accuracy and consistency of the sv2RDM approach against the most-favorably scaling FCS tomography protocol.24Fig. 1 summarizes the dependence of absolute energy error, computed in the minimal basis set, and the Frobenius norm of the 2-RDM error (relative to the 2-RDM from full configuration interaction (FCI)) on system size under fixed total shot budgets for hydrogen chains of increasing length (H4–H10). Two sampling strategies were compared: the many-unitary/single-shot optimum for the FCS and the few-unitary/multiple-shot protocol for the sv2RDM. Each data point represents an average over 20 independent runs (10 for H10), with 95% confidence intervals indicated by the error bars. Both quantum tomography methods are compared with the classical v2RDM method. Across all systems, the overall trends reveal sv2RDM outperforming FCS in both total energy and 2-RDM reconstruction with respect to the FCI results, given the same total shot budget for the two protocols. Importantly, the circuits corresponding to the shot budgets used are up to two orders of magnitude lower in depth for sv2RDM, substantially improving the feasibility of constrained shadow tomography on quantum hardware. Moreover, the sv2RDM method provides greater accuracy than the classical v2RDM method with increasing system size.
Fig. 2 shows the lowest eigenvalue of the 2-RDM as a function of the total shot budget for H4 in the minimal basis, comparing the sv2RDM method with full 2-positivity conditions (DQG) to the FCS approach, which does not inherently produce N-representable matrices. As the total number of shots increases, both methods converge toward consistent eigenvalue estimates, indicating improved reconstruction of the underlying fermionic state. At low shot budgets, the FCS results exhibit larger fluctuations and unphysical negative eigenvalues, reflecting the stochastic nature of randomized measurement sampling. As expected from the scaling analysis in ref. 24, the statistical fluctuations in the FCS protocol diminish approximately as
, leading to a smooth convergence toward the physically valid spectrum at greater sampling depths. In contrast, the sv2RDM (DQG) results remain stable across all sampling regimes with the most “negative” eigenvalue being zero, demonstrating that explicit enforcement of N-representability constraints significantly mitigates noise sensitivity and maintains physically valid 2-RDM spectra.
To further address the physicality of the reconstructed RDMs, Fig. 3 compares the absolute error matrices of the 2-RDMs obtained from sv2RDM (DQG) (a) and FCS (b) for the H4 system in the minimal basis. Each matrix element represents the absolute deviation from the FCI reference value. The sv2RDM (DQG) reconstruction exhibits uniformly low and smoothly distributed errors, consistent with the enforcement of N-representability conditions. In contrast, the FCS results display more pronounced and localized deviations, across all elements, further highlighting the statistical noise associated with such stochastic measurement sampling. The visual contrast between the two panels emphasizes that the constrained sv2RDM approach yields systematically more accurate and physically consistent 2-RDMs than the unconstrained FCS method under comparable measurement resources.
Table 1 summarizes the Frobenius-norm differences between reconstructed and FCI ref. 2-RDMs across hydrogen chains of increasing length using the same total shot budgets. The SDP used in this case has w → ∞ in eqn (11). The results, including only the D condition and the full DQG constraint set, are compared against the FCS protocol. Across all system sizes, imposing N-representability conditions systematically reduces the deviation from the reference state, with the greatest improvements observed when all DQG constraints are enforced. The effect becomes more pronounced for larger systems, where unconstrained or partially constrained optimizations show growing discrepancies from the FCS method. These results demonstrate that the inclusion of full 2-positivity conditions provides superior reconstruction fidelity, particularly under limited measurement resources and increasing system complexity.
000 unitaries/1 shot vs. 16 unitaries/1000 shots for H4, 36
000 unitaries/1 shot vs. 36 unitaries/1000 shots for H6, 160
000 unitaries/1 shot vs. 160 unitaries/1000 shots for H8, and 300
000 unitaries/1 shot vs. 300 unitaries/1000 shots for H10
| H chain | Shot budget | D | DQG | FCS |
|---|---|---|---|---|
| 4 | 16 000 |
0.129 | 0.059 | 0.423 |
| 6 | 36 000 |
0.226 | 0.080 | 0.873 |
| 8 | 160 000 |
0.454 | 0.037 | 0.960 |
| 10 | 300 000 |
0.217 | 0.055 | 1.349 |
The inclusion of electronic energy minimization acts as a Hamiltonian-informed regularizer, which is particularly beneficial in molecular ground-state settings, but is not required for the reconstruction itself. In the limit w → ∞, the SDP reconstruction becomes fully independent of the Hamiltonian and may be applied to arbitrary quantum states. In this regime, we find that expectation values such as the electronic energy converge more slowly than norm-based 2-RDM metrics, consistent with the heightened sensitivity of energy observables. Representative equilibrium-point energy benchmarks are provided in the SI (Table S1). Related work by the authors77 further discusses the w → ∞ limit as a useful setting for correlated purification and error mitigation, including applications beyond ground states such as excited-state reconstruction.
Fig. 4 presents the simulated potential energy curve of N2 in the cc-pVDZ basis set with a [10,8] active space using complete active space configuration interaction (CASCI), sv2RDM, and FCS methods (SI Sec. III includes v2RDM results). The CASCI results serve as the reference for evaluating the accuracy of the reconstructed energy profiles. The FCS results exhibit noticeable deviations near the equilibrium and bond-stretching region, while the sv2RDM is able to closely reproduce reference results across the full dissociation range. These comparisons highlight that constrained semidefinite reconstructions provide quantitatively reliable potential energy surfaces even under limited measurement resources, whereas unconstrained fermionic shadow tomography approaches can struggle to maintain accuracy in strongly correlated regimes.
To evaluate the performance of constrained shadow tomography on real quantum hardware, the H4 rectangle–square–rectangle potential energy profile is computed in the minimal basis using both classical and hybrid quantum approaches (Fig. 5). Density Functional Theory (DFT) results using the B3LYP functional91,92 are shown as the classical baseline, while the sv2RDM calculations are performed using measurements collected on the ibm_fez quantum processor (details can be found in SI Section II A). The Local Unitary Cluster Jastrow (LUCJ) ansatz93 is optimized and computed classically, serving as the initially prepared quantum state and a reference for the sv2RDM calculations. The DFT results fail to reproduce the smooth energy flattening near the square geometry that signals strong static correlation. In contrast, the sv2RDM results recover the correct qualitative features, closely matching the expected multireference character. The sv2RDM energies obtained from the quantum device align well with the LUCJ benchmark within statistical uncertainty, demonstrating that error-mitigated semidefinite constraints can effectively suppress noise-induced artifacts and maintain physical consistency in near-term quantum simulations.
The scalability of the constrained reconstruction approach is further evaluated by comparing correlation energies for hydrogen chains of increasing length, computed with sv2RDM, FCI, and the LUCJ ansatz in the minimal basis (Table 2). sv2RDM energies are obtained from measurement data collected on the ibm_fez quantum processor under fixed total shot budgets scaled with system size, with up to 20 qubits (details can be found in SI Section II B). Other methods were computed classically. Across all chain lengths, sv2RDM reproduces the qualitative energy trends of the reference LUCJ results, maintaining size-extensive behavior and smooth energy variation. Though deviations from the classical references appear at all system sizes, attributable to the increased noise inherent to hardware sampling, the results highlight that, even with limited quantum resources, enforcing semidefinite constraints enables systematically scalable energy predictions for chemical systems.
000 shots for H4, 36 unitaries/10
000 shots for H6, 160 unitaries/10
000 shots for H8, and 300 unitaries/10
000 shots for H10. Hartree Fock, LUCJ, and FCI calculations are run classically. All calculations use a minimal basis set
| H chain | sv2RDM | LUCJ | FCI |
|---|---|---|---|
| 4 | 0.011 | 0.034 | 0.068 |
| 6 | 0.013 | 0.062 | 0.101 |
| 8 | 0.024 | 0.061 | 0.133 |
| 10 | 0.065 | 0.084 | 0.166 |
The results demonstrate that constrained shadow tomography provides a robust and systematically improvable framework for extracting physically meaningful quantities from both simulated and experimental quantum data. The numerical benchmarks employ Gaussian perturbations as a controlled model of finite-shot statistical noise, enabling a clear assessment of the proposed tomography's ability to suppress statistical fluctuations and reduce the number of required measurements. While this noise model does not capture realistic experimental effects, including coherent gate errors, readout bias, and correlated device effects, these are naturally present in the accompanying hardware demonstrations, which show qualitatively similar behavior. This agreement supports the robustness and measurement efficiency of the method beyond idealized shot-noise assumptions.
Across molecular benchmarks, enforcing N-representability through the 2-positivity conditions yields stable energy estimates, well-behaved eigenvalue spectra, and accurate two-particle correlations even under modest shot budgets. Comparisons with classical v2RDM, LUCJ, and FCI references confirm that the sv2RDM approach retains quantitative accuracy while remaining resilient to hardware noise and stochastic sampling errors. The constrained shadow tomography is complementary to established quantum error mitigation techniques such as symmetry verification94 and purification-based post-processing,95 which enforce physical structure at the level of the global state or density matrix. By contrast, the present approach operates directly at the level of fermionic reduced density matrices, enforcing N-representability constraints through a convex SDP, and thus provides a noise-model-independent route to suppressing unphysical errors in measured observables. This framework is naturally compatible with other mitigation layers, including correlated purification strategies,77 as well as probabilistic error cancellation methods96 that rely on explicit noise characterization. These findings establish a consistent foundation for assessing scalability, error mitigation, and resource efficiency in near-term quantum simulations, motivating a deeper discussion on methodological implications and potential pathways toward chemically accurate hybrid quantum-classical frameworks.
The presented algorithm establishes a practical foundation for advancing quantum state characterization in many-body quantum simulations. Future work may explore the incorporation of additional system symmetries or adaptive weighting strategies to further enhance RDM reconstruction fidelity. As quantum hardware continues to progress, methods that integrate physical insight, optimization principles, and statistical learning will be essential for developing reliable and scalable approaches to quantum state reconstruction. This study positions constrained shadow tomography as a powerful and generalizable framework for extracting accurate physical insight from limited quantum data, paving the way for robust, physically grounded characterization of complex quantum systems.
Footnote |
| † These authors contributed equally to this work. |
| This journal is © The Royal Society of Chemistry 2026 |