NVIDIA cuQuantum v25.11 Released: QEC Gets a GPU Boost
NVIDIA's cuQuantum SDK v25.11 Improves Error Correction and Large-Scale Quantum Simulation
The latest version of the cuQuantum SDK, v25.11, from NVIDIA accelerates GPU-based quantum computing simulations with high-performance libraries. This patch adds components for Pauli propagation and stabiliser simulations, which are needed to simulate huge quantum computers, quantum error correction, and AI training data. The study found that the cuQuantum cuPauliProp software can efficiently predict experimental quantities like expectation values that are too intricate for classical simulation.
The cuQuantum cuStabilizer library increases frame simulation throughput and sampling rates, which are essential for assessing quantum error correcting codes.
New high-performance libraries cuPauliProp and cuStabilizer accelerate large-scale quantum computer modelling. CuQuantum SDK 25.11 is now available from NVIDIA. Simulating and validating large-scale quantum systems may require these novel elements.
Challenge of Validation
As quantum processing units (QPUs) improve and devices scale beyond what is usually simulable, validating results is essential to ensuring output reliability. The new libraries in cuQuantum SDK v25.11 cover two key workload classes for algorithm engineering, verification, and validation for intermediate to large-scale quantum devices.
Simulating Intractable Observables with CuPauliProp Pauli propagation is a unique method for replicating large-scale quantum circuit observables, notably those with noise models of quantum processors.
Many quantum applications, such as the Variational Quantum Eigensolver (VQE) and quantum simulation of physical dynamics, require expectation value computation. Traditional simulation approaches may be too expensive to accurately mimic these expected values. Pauli propagation allows estimation by describing states and observables as weighted sums of Pauli tensor products and dynamically deleting components that significantly affect the expectation value.
Particular Uses: The approach is promising for simulating near-Clifford or noisy circuits. It has also performed well when modelling circuits that simulate the evolution of certain quantum spin systems, such as IBM's 127-qubit "utility circuits".
NVIDIA GPUs speed up Pauli propagation, allowing developers to improve classical circuit simulation with new tools. Compared to single-threaded Qiskit Pauli-Prop on current data centre CPUs, NVIDIA DGX B200 GPUs have shown multiple orders of magnitude speedups.
CuStabilizer: High-Throughput Error Correction Testing Stabiliser simulations are based on the Gottesman-Knill theorem, which asserts that Clifford gate circuits (CNOT, Hadamard, and Phase gates) may be simulated classically in polynomial time. This capacity is needed for large-scale QEC code testing and resource estimation.
Frame Simulation Acceleration: The cuStabilizer library boosts frame simulator throughput for sampling speeds. Since quantum devices are imperfect, frame simulation models quantum noise's effect on quantum state. The frame simulator monitors the change caused by adding arbitrary noisy gates if the noise-free outcome is known. Meeting Data Needs: Because the number of noisy gate possibilities increases rapidly with circuit size, error correction methods require many “shots” or samples to simulate. CuStabilizer is ideal for developing QEC codes, testing new decoders, and creating big datasets for AI decoder training.
Performance: GPU acceleration increases sampling throughput compared to cutting-edge CPU codes like Google Stim. Optimization for huge volumes of samples and qubits is achieved with NVIDIA DGX B200 GPU simulation speedups of up to 1,060x for large code distances (e.g., surface code distance 31). Pip installs cuPauliProp and cuStabilizer.









