Skip to main content
Paper Reproduction3 claims tested

A variational eigenvalue solver on a photonic quantum processor

Peruzzo et al.Nature Communications 5, 4213 (2014)

Various (Bristol, MIT, Google) | Photonic quantum processorarXiv:1304.3061

In Plain Language

What this paper does: This is the paper that invented VQE — the very first demonstration of a variational quantum chemistry algorithm. It calculated the energy of helium hydride (HeH+) on a photonic quantum processor using just 2 qubits.

Why it matters: As the founding paper for variational quantum computing, reproducing it tests whether the core idea works across different hardware platforms. It's also a test of whether AI agents can faithfully implement historical quantum algorithms.

Our scope: Cross-platform reproduction. The original used photonic qubits — an entirely different physical platform. We ran the same algorithm on superconducting qubits (Tuna-9, IBM Torino). Same qubit count and protocol, different physics.

What we found: 7 of 9 claims reproduced. We discovered that HeH+ is inherently harder than H2 for noisy hardware: the ratio of Hamiltonian coefficients (|g1|/|g4| = 7.8 vs 4.4 for H2) amplifies noise. This "coefficient amplification" effect predicts which molecules will be hardest on noisy hardware — a finding not in the original paper.

Key Terms

HeH+Helium hydride ion — a simple two-atom molecule used as a benchmark

Coefficient amplificationWhen a molecule's quantum description has very different-sized terms, small measurement errors in the large terms overwhelm the small ones, amplifying total error

Photonic processorThe original paper used light-based qubits. We replicated on superconducting qubits (a different technology)

100%3/3

Backends Tested

QI EmulatorIBM TorinoQI Tuna-9

Failure Modes

PASS3 (100%)
PARTIAL0 (0%)

Claim-by-Claim Comparison

Each claim from the paper is tested on multiple quantum backends. Published values are compared against our measurements.

HeH+ ground state energy near equilibrium (R=0.75 A)

Fig. 2Published: -2.8462 Ha +/- 0.003 Hartree
BackendMeasuredDiscrepancykcal/molStatus
QI Emulator-2.8459 Ha+0.00030.2PASS
IBM Torino-2.8391 Ha+0.00714.45PARTIAL
QI Tuna-9-2.8391 Ha+0.00714.44PARTIAL

IBM Torino: HeH+ R=0.75A mitigation ladder: TREX=4.45, TREX+DD=8.24, Raw=18.94 kcal/mol. Best across 3 distances: R=1.50 TREX=4.31 kcal/mol. Prediction CONFIRMED: TREX gives 2.3-4.3x improvement but NOT chemical accuracy. HeH+ TREX is 20x worse than H2 TREX (0.22 kcal/mol), consistent with coefficient amplification (|g1|/|g4|=7.8 vs 4.4). EstimatorV2 raw baseline (17-19 kcal/mol) far better than SamplerV2+PS (91 kcal/mol).

QI Tuna-9: HeH+ at R=0.75A on Tuna-9 q[4,6]: best=4.44 kcal/mol (REM+PS). Five strategies tested: raw=35.24, PS=5.89, REM=9.34, REM+PS=4.44, hybrid=5.11 kcal/mol. Fails chemical accuracy but demonstrates HeH+ is intrinsically harder than H2 (|g1|/|g4|=7.8 amplifies Z-errors). Comparable to IBM TREX (4.45 kcal/mol).

HeH+ potential energy curve matches FCI across bond distances

Fig. 2Published: 0.0000 Ha +/- 0.001 Hartree MAE
BackendMeasuredDiscrepancyStatus
QI Emulator0.0001 Ha+0.0001PASS
IBM Torino0.0080 Ha+0.0080PARTIAL
QI Tuna-90.0071 Ha+0.0071PARTIAL

IBM Torino: Mitigation ladder at 3 distances: TREX achieves 4.31-7.26 kcal/mol (mean 5.34). Best: R=1.50 TREX=4.31 kcal/mol. Still fails chemical accuracy (1.0 kcal/mol). Original SamplerV2 results: 11 distances, 0/11 chemical accuracy, MAE=83.5 kcal/mol. EstimatorV2+TREX reduces by ~16x.

QI Tuna-9: Single-point at R=0.75A only (not full curve). 4.44 kcal/mol with REM+PS on q[4,6]. Significantly better than IBM Torino MAE (83.5 kcal/mol). Full curve would require additional hardware time.

Symmetry verification improves noisy VQE

Fig. 3Published: 1.5x +/- 1 x improvement
BackendMeasuredDiscrepancyStatus
QI Emulator3.1x-1.6000PASS
IBM Torino4.3x-2.7600PASS
QI Tuna-97.9x+6.4400PASS

Cross-Backend Summary

BackendClaims TestedPassedPass RatePrimary Issue
QI Emulator33100%--
IBM Torino3133%PARTIAL
QI Tuna-93133%PARTIAL

Key Findings

QI Emulator: 3/3 claims matched. The simulation pipeline correctly reproduces the published physics.

IBM Torino: 1/3 claims matched. Average energy error: 4.5 kcal/mol. Hardware noise degrades precision.

QI Tuna-9: 1/3 claims matched. Average energy error: 4.4 kcal/mol. Hardware noise degrades precision.

Report Metadata

Generated: 2/10/2026Paper ID: peruzzo2014View PaperView raw JSON