Demonstrating quantum error mitigation on logical qubits
Abstract
A long-standing challenge in quantum computing is developing technologies to overcome the inevitable noise in qubits. To enable meaningful applications in the early stages of fault-tolerant quantum computing, devising methods to suppress post-correction logical failures is becoming increasingly crucial. In this work, we propose and experimentally demonstrate the application of zero-noise extrapolation, a practical quantum error mitigation technique, to error correction circuits on state-of-the-art superconducting processors. By amplifying the noise on physical qubits, the circuits yield outcomes that exhibit a predictable dependence on noise strength, following a polynomial function determined by the code distance. This property enables the effective application of polynomial extrapolation to mitigate logical errors. Our experiments demonstrate a universal reduction in logical errors across various quantum circuits, including fault-tolerant circuits of repetition and surface codes. We observe a favorable performance in multi-round error correction circuits, indicating that this method remains effective when the circuit depth increases. These results advance the frontier of quantum error suppression technologies, opening a practical way to achieve reliable quantum computing in the early fault-tolerant era.
I Introduction
Suppressing errors is a problem that lies at the center of quantum computing technologies. Quantum error correction and mitigation are the two generic methods of error suppression. Error correction promises to reach an arbitrarily high fidelity provided sufficient qubit resources. Recently, experiments have demonstrated a positive gain in the surface code error correction when scaling the code distance up to 7 [1]. However, it is still viewed as a long-term goal to achieve negligible infidelity through error correction, which could need millions or even more qubits and pose a challenge to the experimental technologies regarding scalability [2]. Error mitigation takes different resources, computing time, to suppress errors, being originally designed for the regime that error correction is lacking [3]. Considering the earliest applications of quantum computing, it is highly likely that they will be carried out under strict technology constraints, mainly on the qubit number. In this scenario, error correction can only attain a limited fidelity, and a practical approach taking advantage of the two error suppression methods is necessary [4, 5].
Zero-noise extrapolation (ZNE) is one of the most practical error mitigation techniques, universally applicable to quantum algorithms that evaluate expectation values [6, 7]. The central idea behind ZNE is to amplify the noise in a quantum circuit by a controllable factor , and then extrapolate the results back to , thereby inferring the behavior of a noiseless circuit. In a recent experiment, ZNE demonstrated substantial error suppression on a superconducting system with more than a hundred qubits [8], making it the only error mitigation technique successfully applied at this scale to date. Given these promising results, we consider ZNE the leading candidate for mitigating errors in quantum error correction circuits. Specifically, we apply ZNE to reduce post-correction logical errors by amplifying noise on physical qubits.
The implementation of ZNE on logical qubits faces two primary experimental challenges. First, it requires a quantum processor with high-fidelity gates and sufficient qubits, which is crucial for executing quantum error correction. Recent progress in qubit fabrication and control technologies have made superconducting qubits a promising platform for investigating quantum error correction and mitigation technologies [8, 9, 10, 11, 12, 13, 1]. In this work, we utilize two superconducting quantum processors to experimentally assess residual errors and costs in quantum error correction and mitigation, employing both the repetition code and surface code [14, 15, 16, 17]. Second, accurate error mitigation relies on measuring error rates per operation [3, 18]. While measuring logical error rates is feasible for small codes, it becomes increasingly time-consuming for larger codes due to the small logical error rates [19, 20]. Additionally, for high-encoding-rate quantum error correction codes, such as certain qLDPC codes, logical errors may involve multi-qubit correlations, making the measurement impractical [21, 22] (see Supplementary Section S2). To overcome this issue, we adopt a strategy of amplifying noise on physical qubits instead of logical qubits. By integrating error mitigation techniques with error correction, this study demonstrates a practical pathway to bridge the gap between the noisy intermediate-scale quantum (NISQ) era and the fault-tolerant quantum computing (FTQC) era, advancing the pursuit of practical quantum computing technologies.
II Results
We shall first justify the application of ZNE to quantum error correction circuits. The essential assumption made in ZNE is that the expected value of an observable is a function of the noise strength , and this function can be fitted well by simple functions such as polynomials. This assumption holds in NISQ circuits, which has been illustrated in many theoretical works and experiments [6, 7, 3, 23, 24, 25, 12]. In this paper, one of the primary goals is to demonstrate that the assumption is also valid in quantum error correction circuits.
Our first evidence is a theoretical result based on the stochastic error model: each operation in the circuit is either error-free or erroneous with a certain probability of , and we increase the probability to (with ) when amplifying the noise (Fig. 1). We apply this model to general quantum circuits with all the components necessary for quantum error correction, including mid-circuit state preparation and measurement, feedback and post-selection; these components are beyond NISQ circuits [26]. Even with these components, the expected value with errors is in the form , where is the number of operations, and is the deviation caused by errors in the circuit (see Supplementary Section S1). This polynomial expression holds for all quantum circuits and suggests the use of a polynomial fitting function.

We use different polynomial fitting functions in cases with and without error correction. Without error correction, the leading contribution of errors is due to only one error in the circuit. Therefore, we always include the linear term when choosing a polynomial fitting function, then we introduce terms sequentially for higher-order fitting; we use such polynomials in conventional ZNE. With error correction, the leading order is different. In a circuit, its capability of error correction is characterized by an effective code distance : when the number of errors is smaller than , we can always successfully detect and correct the errors. Because of this reason, the coefficients are all zero for , and the leading contribution becomes ; this coincides with numerical results on logical error rates of surface codes and bivariate bicycle codes [27, 22]. Therefore, with error correction, we choose a fitting function in the form , which has non-constant terms starting with the term.

We carry out experimental demonstrations on two superconducting quantum processors to certify that ZNE can be naturally integrated into the FTQC circuits. The processors used here, each of which contains a lattice of tens of frequency-tunable qubits featuring adjustable nearest-neighbor couplings, have performance similar to those dedicated to quantum error correction experiments in the literature [13, 12, 1]. The qubits selected for the experiments are highly coherent, with the median time being above s (see parameters in Supplementary Table S1), and the controlled -phase (CZ) gates between nearest-neighbor qubits have a median fidelity around 0.995. Since FTQC also relies heavily on measurement quality, we can differentiate between the and states of each qubit, achieving median readout fidelities of and for Processor I (with Purcell filters, -s measurement time) and Processor II (without Purcell filters, -s measurement time), respectively. This is accomplished by pumping the state of each qubit to its next-higher level. Furthermore, even without the pumping technique which is a prerequisite for feedback operations in FTQC circuits, we can still achieve a median readout fidelity of on Processor I within 0.5 s, as demonstrated in our repetition code experiment by allowing repetitive measurements on the syndrome qubits up to rounds (see next).
As the first experimental demonstration, we show that ZNE works on an example circuit with a feedback control to eliminate the bit-flip error on (Fig. 2a), where the nominal data qubits ( for , 2, and 4) are each initialized into a superposition state given by . The first 4 CNOT gates in the sequence diagram are used to encode the parity of the data qubits onto the syndrome qubits ( and ), followed by operations for algorithmic purpose, and the next 4 CNOT gates serve to decode and identify the bit-flip type of errors that may occur during the operational stage. To implement ZNE, one needs to be able to controllably amplify the errors, which can be achieved using schemes such as pulse stretching [28, 25] or subcircuit repetition [29]. Here to simulate depolarizing errors occurring at a probability of with being the scaling factor, we run types of circuit instances which correspond to all combinations of inserting one operation drawn from the list of at the operational stage for each of the 3 data qubit, based on which we construct a weighted list of circuit instances so that the insertion probabilities of , , and all equal to for each data qubit (see Supplementary Section S4 for more details). Finally all qubits are measured in the basis to yield a 5-bit binary string, , for each circuit instance, and the no correction data are directly calculated using s from all circuit instances. In the case of error correction, since a bit-flip error due to the insertion of or on the pair of data qubits and , , flips the state of , the 5-bit binary strings from all instances are selected only when to ensure no error on , following which is numerically reversed due to the feedback gate only if to eliminate the bit-flip error on . The newly processed strings are then used to calculate the data with correction.
For the case of no correction, Fig. 2b shows that the expectation values of for , for different input states parametrized by , all gradually approach zero as the noise strength increases. In comparison, the data using processed bit strings with error correction indicate that the impact of noise is reduced, resulting in a slower decline in . Experimental results closely match numerical simulations of circuits with gate and measurement errors (Fig. 2c). Furthermore, when we use ZNE to extrapolate the values at , the results show excellent agreement with numerical predictions, which demonstrates that ZNE effectively works on the circuit.
In the above-mentioned example circuit, small errors occurring at the CNOT gates and measurement cannot be corrected, so that the post-mitigation residual error at is comparable to the case without error correction. Next, we resort to fault-tolerant circuits of repetition and surface codes, where the logical error rates can be suppressed to an arbitrarily low level by increasing the code distance once physical error rates are below the threshold. On fault-tolerant circuits, we find that the post-mitigation residual error is much smaller than the case without error correction.
We implement repetition codes with distances using up to 13 qubits on Processor I, as shown in Fig. 3a. Repetition codes protect the encoded logical qubit from bit-flip errors and have been widely used as a prototype for demonstrating error correction across various quantum platforms, including ion traps [30, 31], nuclear magnetic resonance [32, 33], and superconducting qubits [34, 1]. Here, we adopt it for our first demonstration of error mitigation on fault-tolerant circuits.

The circuit of a repetition code, schematically depicted in Fig. 3b, consists of data qubits and syndrome qubits arranged alternately in a linear chain. The data qubits are initialized in the state , preparing the logical qubit in the logical state . Next, we apply error injections and perform parity-check measurements for successive rounds. The circuit terminates with an additional round of error injection and transversal measurement on data qubits. The transversal measurement facilitates a final round of parity checks and readout of the logical operator . The parity checks are designed to detect all possible bit-flip errors, whether injected deliberately or introduced by computational operations such as gates, state preparation, and measurements. Each experimental shot produces a bit string of length , recording the measurement outcomes. This bit string is then processed by a minimum-weight perfect matching (MWPM) decoder [35, 36], which outputs the correction gates. These corrections ensure faithful reconstruction of the logical state, provided that the number of bit-flip errors does not exceed ; therefore, the circuit is fault-tolerant with respect to bit-flip errors.
We evaluate the performance of error suppression methods using the expectation value of the logical operator . In an ideal noise-free circuit, the expectation value is ; the presence of noise reduces this value. For illustrative purpose, we experimentally inject Pauli errors during the operational stage with the unit probability set at , as done in the previous example. Figure 3c shows the experimental results for circuits with one round of parity-check measurements (): the expectation value decreases gradually with increasing noise strength, while error correction slows the rate of decline. As the code distance increases, the rate of decline approaches zero asymptotically, indicating a below-threshold error regime.
To further suppress errors beyond error correction, we apply ZNE to the results. Specifically, we select data points corresponding to noise strengths and perform polynomial extrapolation to obtain the error-mitigated expectation value of the logical operator, denoted as . To evaluate the performance of ZNE, we define two metrics: the bias and the sampling overhead . The bias quantifies the deviation of the error-mitigated value from the ideal expectation and is expressed as
(1) |
The sampling overhead measures the relative increase in sampling cost required to achieve the same variance for the error-mitigated value as for the uncorrected value, and it is defined as
(2) |
Unless otherwise specified, we take in what follows and iterate over all other data points for .
The - scatter plots in Fig. 3d visualize that ZNE consistently provides more accurate estimations compared to the uncorrected value at (indicated by dashed lines) across all cases. The plots also highlight a trade-off between the bias and sampling overhead : while ZNE reduces bias, it increases the sampling overhead. If the noise-boosted data points at are chosen closer to or if a higher-order extrapolation (larger ) is used, the bias generally decreases, but the sampling overhead tends to grow. Figures 3c-d further highlight the advantage of combining ZNE with error correction. When error correction is applied, the - scatter points progressively migrate toward the lower-left corner as the code distance increases, indicating simultaneous improvements in precision and efficiency. Notably, for , the residual error is reduced to approximately , with a modest sampling overhead of just .
It is important to note that error mitigation is scalable on error correction circuits as the code distance and circuit complexity increase [5]. A fundamental limitation of error mitigation methods is that they become inefficient when the product of the error rate per gate and the number of gates becomes large. This limitation is well-studied in probabilistic error cancellation, a bias-free method, where variance increases exponentially with the gate number [37, 38, 39]. In contrast, the polynomial-function ZNE has a bias that grows with the gate number [7]. Although these issues also arise in error mitigation applied to error correction circuits, the key factors become the logical error rate and the number of logical gates. Therefore, as long as the logical error rate remains sufficiently low — that is, with a sufficiently large code distance — error mitigation can be applied to circuits of arbitrary gate complexity. For instance, using the surface code with a physical error rate of per gate and a code distance of eleven, we can achieve a logical error rate of approximately [27]. This allows a circuit with logical operations to run at one logical error in one hundred circuit shots. In this case, our protocol can reduce the logical error by a factor of about with an overhead cost of due to our estimation (see Supplementary Section S7). Moreover, our experiments show that as the code distance and parity-check rounds increase, the performance of ZNE remains nearly unchanged; notice that multi-round parity checks are necessary in practical fault-tolerant quantum computing. This insight can be demonstrated in two complementary ways. First, when the error rate per parity-check round is fixed, increasing the number of parity-check rounds results in a similar relative bias and sampling overhead for ZNE, despite the growth in unmitigated bias (see Supplementary Fig. S6). Second, for a fixed total error rate in all rounds, partitioning the circuit and employing multi-round QEC significantly improves both the unmitigated and mitigated results (Fig. 3e–f). Remarkably, these enhancements are achieved while the sampling overhead remains nearly unchanged.

As a final experiment, we extend our investigation to the rotated surface code, a quantum error correction code capable of correcting both bit-flip and phase-flip errors. We implement the distance-3 rotated surface code on our Processor II, as illustrated in Fig. 4a. The encoded logical qubit is defined by two anti-commuting logical operators, and , and the system can be initialized by preparing the logical states using digital quantum circuits that consist of single- and two-qubit gates [40, 13].
We demonstrate the effectiveness of ZNE following the sequence diagram shown in Fig. 4a, with three representative initial states: , , and (Fig. 4b). In the absence of error injections, the initial states reconstructed from the corrected expectation values of the and observables locate significantly closer to the surface of the Bloch sphere compared to those reconstructed from the uncorrected values (Fig. 4b). This demonstrates reductions in both bit-flip and phase-flip errors that arise during the initial state preparation and parity measurement processes. We present numerical simulations based on the single-qubit depolarizing model, where the depolarizing rate is calibrated by matching the fidelity of the prepared logical state, . As shown in Figs. 4b-c, this noise model agrees well with the experimental data, regardless of the initial states. Detailed results for both uncorrected and corrected expectation values of the observable, with the initial state set to , are presented in Fig. 4c; see Supplementary Fig. S8 for more experimental results. To implement ZNE, we insert random Pauli gates on data qubits before and after the parity-check measurements to amplify the noise. In surface codes, quantum computing are driven by the parity-check measurements applied on a lattice deforming with time in protocols such as braiding transformation [15] and lattice surgery [41]. ZNE can be applied to such circuits through the same noise amplification strategy as taken in our experiment. By incorporating ZNE, the residual errors in and are further reduced beyond error correction (see Fig. 4d and Supplementary Fig. S8).
III Discussion
Our results demonstrate for the first time the application of quantum error mitigation on error correction circuits, effectively minimizing the impact of error correction failures. The specific method that we choose is ZNE: we amplify noise on physical qubits and utilize a selected polynomial extrapolation function, thereby adapting the method to be compatible with fault-tolerant quantum circuits. Amplifying physical error rates, a well-established and experimentally validated technique on NISQ circuits, ensures the feasibility of our approach. Moreover, the success of error mitigation methods such as ZNE and probabilistic error cancellation fundamentally depends on precise noise modeling. Our experimental results exhibit strong consistency with numerical simulations of noise models from calibration, offering a robust foundation for further performance optimization. As the main result, our approach achieves a reduction in error-induced bias, outperforming both standalone error correction and error mitigation. This bias reduction illustrates the potential to achieve reliable quantum computing on hardware that only permits limited error correction capabilities, marking a pivotal step toward large-scale quantum computing on noisy devices.
Acknowledgements
The device was fabricated at the Micro-Nano Fabrication Center of Zhejiang University. We acknowledge the support from the National Natural Science Foundation of China (Grant Nos. 92365301, 92065204, 12404574, 12274368, 12274367, 12174342, 12322414, 12404570, U20A2076, 12225507 and 12088101), the Innovation Program for Quantum Science and Technology (Grant No. 2021ZD0300200), the Zhejiang Provincial Natural Science Foundation of China (Grant Nos. LR24A040002 and LDQ23A040001), the National Key Research and Development Program of China (Grant No. 2023YFB4502600), and the NSAF (Grant No. U1930403).
Author contributions
H.X. and Y.L. proposed the ideas and conducted the theoretical analysis; A.Z., Y.G. and J.Y. carried out the experiments and analyzed the experimental data under the supervision of P.Z. and H.W.; H.L. and J.C. fabricated the device, supervised by H.W.; All authors contributed to the experimental setup, analysis of data, discussions of the results and writing of the manuscript.
References
- Acharya et al. [2024] R. Acharya, L. Aghababaie-Beni, I. Aleiner, T. I. Andersen, M. Ansmann, F. Arute, K. Arya, A. Asfaw, N. Astrakhantsev, J. Atalaya, R. Babbush, D. Bacon, B. Ballard, J. C. Bardin, J. Bausch, A. Bengtsson, A. Bilmes, S. Blackwell, S. Boixo, G. Bortoli, A. Bourassa, J. Bovaird, L. Brill, M. Broughton, D. A. Browne, B. Buchea, B. B. Buckley, D. A. Buell, T. Burger, B. Burkett, N. Bushnell, A. Cabrera, J. Campero, H.-S. Chang, Y. Chen, Z. Chen, B. Chiaro, D. Chik, C. Chou, J. Claes, A. Y. Cleland, J. Cogan, R. Collins, P. Conner, W. Courtney, A. L. Crook, B. Curtin, S. Das, A. Davies, L. D. Lorenzo, D. M. Debroy, S. Demura, M. Devoret, A. D. Paolo, P. Donohoe, I. Drozdov, A. Dunsworth, C. Earle, T. Edlich, A. Eickbusch, A. M. Elbag, M. Elzouka, C. Erickson, L. Faoro, E. Farhi, V. S. Ferreira, L. F. Burgos, E. Forati, A. G. Fowler, B. Foxen, S. Ganjam, G. Garcia, R. Gasca, Élie Genois, W. Giang, C. Gidney, D. Gilboa, R. Gosula, A. G. Dau, D. Graumann, A. Greene, J. A. Gross, S. Habegger, J. Hall, M. C. Hamilton, M. Hansen, M. P. Harrigan, S. D. Harrington, F. J. H. Heras, S. Heslin, P. Heu, O. Higgott, G. Hill, J. Hilton, G. Holland, S. Hong, H.-Y. Huang, A. Huff, W. J. Huggins, L. B. Ioffe, S. V. Isakov, J. Iveland, E. Jeffrey, Z. Jiang, C. Jones, S. Jordan, C. Joshi, P. Juhas, D. Kafri, H. Kang, A. H. Karamlou, K. Kechedzhi, J. Kelly, T. Khaire, T. Khattar, M. Khezri, S. Kim, P. V. Klimov, A. R. Klots, B. Kobrin, P. Kohli, A. N. Korotkov, F. Kostritsa, R. Kothari, B. Kozlovskii, J. M. Kreikebaum, V. D. Kurilovich, N. Lacroix, D. Landhuis, T. Lange-Dei, B. W. Langley, P. Laptev, K.-M. Lau, L. L. Guevel, J. Ledford, K. Lee, Y. D. Lensky, S. Leon, B. J. Lester, W. Y. Li, Y. Li, A. T. Lill, W. Liu, W. P. Livingston, A. Locharla, E. Lucero, D. Lundahl, A. Lunt, S. Madhuk, F. D. Malone, A. Maloney, S. Mandrá, L. S. Martin, S. Martin, O. Martin, C. Maxfield, J. R. McClean, M. McEwen, S. Meeks, A. Megrant, X. Mi, K. C. Miao, A. Mieszala, R. Molavi, S. Molina, S. Montazeri, A. Morvan, R. Movassagh, W. Mruczkiewicz, O. Naaman, M. Neeley, C. Neill, A. Nersisyan, H. Neven, M. Newman, J. H. Ng, A. Nguyen, M. Nguyen, C.-H. Ni, T. E. O’Brien, W. D. Oliver, A. Opremcak, K. Ottosson, A. Petukhov, A. Pizzuto, J. Platt, R. Potter, O. Pritchard, L. P. Pryadko, C. Quintana, G. Ramachandran, M. J. Reagor, D. M. Rhodes, G. Roberts, E. Rosenberg, E. Rosenfeld, P. Roushan, N. C. Rubin, N. Saei, D. Sank, K. Sankaragomathi, K. J. Satzinger, H. F. Schurkus, C. Schuster, A. W. Senior, M. J. Shearn, A. Shorter, N. Shutty, V. Shvarts, S. Singh, V. Sivak, J. Skruzny, S. Small, V. Smelyanskiy, W. C. Smith, R. D. Somma, S. Springer, G. Sterling, D. Strain, J. Suchard, A. Szasz, A. Sztein, D. Thor, A. Torres, M. M. Torunbalci, A. Vaishnav, J. Vargas, S. Vdovichev, G. Vidal, B. Villalonga, C. V. Heidweiller, S. Waltman, S. X. Wang, B. Ware, K. Weber, T. White, K. Wong, B. W. K. Woo, C. Xing, Z. J. Yao, P. Yeh, B. Ying, J. Yoo, N. Yosri, G. Young, A. Zalcman, Y. Zhang, N. Zhu, and N. Zobrist, Quantum error correction below the surface code threshold (2024), arXiv:2408.13687 [quant-ph] .
- Preskill [2018] J. Preskill, Quantum Computing in the NISQ era and beyond, Quantum 2, 79 (2018).
- Cai et al. [2023] Z. Cai, R. Babbush, S. C. Benjamin, S. Endo, W. J. Huggins, Y. Li, J. R. McClean, and T. E. O’Brien, Quantum error mitigation, Rev. Mod. Phys. 95, 045005 (2023).
- Piveteau et al. [2021] C. Piveteau, D. Sutter, S. Bravyi, J. M. Gambetta, and K. Temme, Error mitigation for universal gates on encoded qubits, Phys. Rev. Lett. 127, 200505 (2021).
- Suzuki et al. [2022] Y. Suzuki, S. Endo, K. Fujii, and Y. Tokunaga, Quantum error mitigation as a universal error reduction technique: Applications from the nisq to the fault-tolerant quantum computing eras, PRX Quantum 3, 010345 (2022).
- Li and Benjamin [2017] Y. Li and S. C. Benjamin, Efficient variational quantum simulator incorporating active error minimization, Phys. Rev. X 7, 021050 (2017).
- Temme et al. [2017a] K. Temme, S. Bravyi, and J. M. Gambetta, Error mitigation for short-depth quantum circuits, Phys. Rev. Lett. 119, 180509 (2017a).
- Kim et al. [2023] Y. Kim, A. Eddins, S. Anand, K. X. Wei, E. van den Berg, S. Rosenblatt, H. Nayfeh, Y. Wu, M. Zaletel, K. Temme, and A. Kandala, Evidence for the utility of quantum computing before fault tolerance, Nature 618, 500 (2023).
- O’Brien et al. [2023] T. E. O’Brien, G. Anselmetti, F. Gkritsis, V. E. Elfving, S. Polla, W. J. Huggins, O. Oumarou, K. Kechedzhi, D. Abanin, R. Acharya, I. Aleiner, R. Allen, T. I. Andersen, K. Anderson, M. Ansmann, F. Arute, K. Arya, A. Asfaw, J. Atalaya, J. C. Bardin, A. Bengtsson, G. Bortoli, A. Bourassa, J. Bovaird, L. Brill, M. Broughton, B. Buckley, D. A. Buell, T. Burger, B. Burkett, N. Bushnell, J. Campero, Z. Chen, B. Chiaro, D. Chik, J. Cogan, R. Collins, P. Conner, W. Courtney, A. L. Crook, B. Curtin, D. M. Debroy, S. Demura, I. Drozdov, A. Dunsworth, C. Erickson, L. Faoro, E. Farhi, R. Fatemi, V. S. Ferreira, L. Flores Burgos, E. Forati, A. G. Fowler, B. Foxen, W. Giang, C. Gidney, D. Gilboa, M. Giustina, R. Gosula, A. Grajales Dau, J. A. Gross, S. Habegger, M. C. Hamilton, M. Hansen, M. P. Harrigan, S. D. Harrington, P. Heu, M. R. Hoffmann, S. Hong, T. Huang, A. Huff, L. B. Ioffe, S. V. Isakov, J. Iveland, E. Jeffrey, Z. Jiang, C. Jones, P. Juhas, D. Kafri, T. Khattar, M. Khezri, M. Kieferová, S. Kim, P. V. Klimov, A. R. Klots, A. N. Korotkov, F. Kostritsa, J. M. Kreikebaum, D. Landhuis, P. Laptev, K.-M. Lau, L. Laws, J. Lee, K. Lee, B. J. Lester, A. T. Lill, W. Liu, W. P. Livingston, A. Locharla, F. D. Malone, S. Mandrà, O. Martin, S. Martin, J. R. McClean, T. McCourt, M. McEwen, X. Mi, A. Mieszala, K. C. Miao, M. Mohseni, S. Montazeri, A. Morvan, R. Movassagh, W. Mruczkiewicz, O. Naaman, M. Neeley, C. Neill, A. Nersisyan, M. Newman, J. H. Ng, A. Nguyen, M. Nguyen, M. Y. Niu, S. Omonije, A. Opremcak, A. Petukhov, R. Potter, L. P. Pryadko, C. Quintana, C. Rocque, P. Roushan, N. Saei, D. Sank, K. Sankaragomathi, K. J. Satzinger, H. F. Schurkus, C. Schuster, M. J. Shearn, A. Shorter, N. Shutty, V. Shvarts, J. Skruzny, W. C. Smith, R. D. Somma, G. Sterling, D. Strain, M. Szalay, D. Thor, A. Torres, G. Vidal, B. Villalonga, C. Vollgraff Heidweiller, T. White, B. W. K. Woo, C. Xing, Z. J. Yao, P. Yeh, J. Yoo, G. Young, A. Zalcman, Y. Zhang, N. Zhu, N. Zobrist, D. Bacon, S. Boixo, Y. Chen, J. Hilton, J. Kelly, E. Lucero, A. Megrant, H. Neven, V. Smelyanskiy, C. Gogolin, R. Babbush, and N. C. Rubin, Purification-based quantum error mitigation of pair-correlated electron simulations, Nat. Phys. 19, 1787 (2023).
- van den Berg et al. [2023a] E. van den Berg, Z. K. Minev, A. Kandala, and K. Temme, Probabilistic error cancellation with sparse Pauli–Lindblad models on noisy quantum processors, Nat. Phys. 19, 1116 (2023a).
- Xu et al. [2023] S. Xu, Z.-Z. Sun, K. Wang, L. Xiang, Z. Bao, Z. Zhu, F. Shen, Z. Song, P. Zhang, W. Ren, X. Zhang, H. Dong, J. Deng, J. Chen, Y. Wu, Z. Tan, Y. Gao, F. Jin, X. Zhu, C. Zhang, N. Wang, Y. Zou, J. Zhong, A. Zhang, W. Li, W. Jiang, L.-W. Yu, Y. Yao, Z. Wang, H. Li, Q. Guo, C. Song, H. Wang, and D.-L. Deng, Digital simulation of projective non-abelian anyons with 68 superconducting qubits, Chin. Phys. Lett. 40, 060301 (2023).
- Krinner et al. [2022] S. Krinner, N. Lacroix, A. Remm, A. Di Paolo, E. Genois, C. Leroux, C. Hellings, S. Lazar, F. Swiadek, J. Herrmann, G. J. Norris, C. K. Andersen, M. Müller, A. Blais, C. Eichler, and A. Wallraff, Realizing repeated quantum error correction in a distance-three surface code, Nature 605, 669 (2022).
- Zhao et al. [2022] Y. Zhao, Y. Ye, H.-L. Huang, Y. Zhang, D. Wu, H. Guan, Q. Zhu, Z. Wei, T. He, S. Cao, F. Chen, T.-H. Chung, H. Deng, D. Fan, M. Gong, C. Guo, S. Guo, L. Han, N. Li, S. Li, Y. Li, F. Liang, J. Lin, H. Qian, H. Rong, H. Su, L. Sun, S. Wang, Y. Wu, Y. Xu, C. Ying, J. Yu, C. Zha, K. Zhang, Y.-H. Huo, C.-Y. Lu, C.-Z. Peng, X. Zhu, and J.-W. Pan, Realization of an error-correcting surface code with superconducting qubits, Phys. Rev. Lett. 129, 030501 (2022).
- Dennis et al. [2002] E. Dennis, A. Kitaev, A. Landahl, and J. Preskill, Topological quantum memory, J. Math. Phys. 43, 4452 (2002).
- Fowler et al. [2012] A. G. Fowler, M. Mariantoni, J. M. Martinis, and A. N. Cleland, Surface codes: Towards practical large-scale quantum computation, Phys. Rev. A 86, 032324 (2012).
- Devitt et al. [2013] S. J. Devitt, W. J. Munro, and K. Nemoto, Quantum error correction for beginners, Rep. Prog. Phys. 76, 076001 (2013).
- Terhal [2015] B. M. Terhal, Quantum error correction for quantum memories, Rev. Mod. Phys. 87, 307 (2015).
- van den Berg et al. [2023b] E. van den Berg, Z. K. Minev, A. Kandala, and K. Temme, Probabilistic error cancellation with sparse Pauli–Lindblad models on noisy quantum processors, Nat. Phys. 19, 1116 (2023b).
- Eisert et al. [2020] J. Eisert, D. Hangleiter, N. Walk, I. Roth, D. Markham, R. Parekh, U. Chabaud, and E. Kashefi, Quantum certification and benchmarking, Nat. Rev. Phys. 2, 382 (2020).
- Chen et al. [2022] S. Chen, S. Zhou, A. Seif, and L. Jiang, Quantum advantages for pauli channel estimation, Phys. Rev. A 105, 032435 (2022).
- Breuckmann and Eberhardt [2021] N. P. Breuckmann and J. N. Eberhardt, Quantum low-density parity-check codes, PRX Quantum 2, 040101 (2021).
- Bravyi et al. [2024] S. Bravyi, A. W. Cross, J. M. Gambetta, D. Maslov, P. Rall, and T. J. Yoder, High-threshold and low-overhead fault-tolerant quantum memory, Nature 627, 778 (2024).
- Czarnik et al. [2021] P. Czarnik, A. Arrasmith, P. J. Coles, and L. Cincio, Error mitigation with Clifford quantum-circuit data, Quantum 5, 592 (2021).
- Strikis et al. [2021] A. Strikis, D. Qin, Y. Chen, S. C. Benjamin, and Y. Li, Learning-based quantum error mitigation, PRX Quantum 2, 040330 (2021).
- Kandala et al. [2019] A. Kandala, K. Temme, A. D. Córcoles, A. Mezzacapo, J. M. Chow, and J. M. Gambetta, Error mitigation extends the computational reach of a noisy quantum processor, Nature 567, 491 (2019).
- Chen et al. [2023] S. Chen, J. Cotler, H.-Y. Huang, and J. Li, The complexity of NISQ, Nat. Commun. 14, 6001 (2023).
- Bravyi and Vargo [2013] S. Bravyi and A. Vargo, Simulation of rare events in quantum error correction, Phys. Rev. A 88, 062308 (2013).
- Temme et al. [2017b] K. Temme, S. Bravyi, and J. M. Gambetta, Error mitigation for short-depth quantum circuits, Phys. Rev. Lett. 119, 180509 (2017b).
- He et al. [2020] A. He, B. Nachman, W. A. de Jong, and C. W. Bauer, Zero-noise extrapolation for quantum-gate error mitigation with identity insertions, Phys. Rev. A 102, 012426 (2020).
- Chiaverini et al. [2004] J. Chiaverini, D. Leibfried, T. Schaetz, M. D. Barrett, R. B. Blakestad, J. Britton, W. M. Itano, J. D. Jost, E. Knill, C. Langer, R. Ozeri, and D. J. Wineland, Realization of quantum error correction, Nature 432, 602 (2004).
- Moses et al. [2023] S. A. Moses, C. H. Baldwin, M. S. Allman, R. Ancona, L. Ascarrunz, C. Barnes, J. Bartolotta, B. Bjork, P. Blanchard, M. Bohn, J. G. Bohnet, N. C. Brown, N. Q. Burdick, W. C. Burton, S. L. Campbell, J. P. Campora, C. Carron, J. Chambers, J. W. Chan, Y. H. Chen, A. Chernoguzov, E. Chertkov, J. Colina, J. P. Curtis, R. Daniel, M. DeCross, D. Deen, C. Delaney, J. M. Dreiling, C. T. Ertsgaard, J. Esposito, B. Estey, M. Fabrikant, C. Figgatt, C. Foltz, M. Foss-Feig, D. Francois, J. P. Gaebler, T. M. Gatterman, C. N. Gilbreth, J. Giles, E. Glynn, A. Hall, A. M. Hankin, A. Hansen, D. Hayes, B. Higashi, I. M. Hoffman, B. Horning, J. J. Hout, R. Jacobs, J. Johansen, L. Jones, J. Karcz, T. Klein, P. Lauria, P. Lee, D. Liefer, S. T. Lu, D. Lucchetti, C. Lytle, A. Malm, M. Matheny, B. Mathewson, K. Mayer, D. B. Miller, M. Mills, B. Neyenhuis, L. Nugent, S. Olson, J. Parks, G. N. Price, Z. Price, M. Pugh, A. Ransford, A. P. Reed, C. Roman, M. Rowe, C. Ryan-Anderson, S. Sanders, J. Sedlacek, P. Shevchuk, P. Siegfried, T. Skripka, B. Spaun, R. T. Sprenkle, R. P. Stutz, M. Swallows, R. I. Tobey, A. Tran, T. Tran, E. Vogt, C. Volin, J. Walker, A. M. Zolot, and J. M. Pino, A race-track trapped-ion quantum processor, Phys. Rev. X 13, 041052 (2023).
- Cory et al. [1998] D. G. Cory, M. D. Price, W. Maas, E. Knill, R. Laflamme, W. H. Zurek, T. F. Havel, and S. S. Somaroo, Experimental quantum error correction, Phys. Rev. Lett. 81, 2152 (1998).
- Moussa et al. [2011] O. Moussa, J. Baugh, C. A. Ryan, and R. Laflamme, Demonstration of sufficient control for two rounds of quantum error correction in a solid state ensemble quantum information processor, Phys. Rev. Lett. 107, 160501 (2011).
- Reed et al. [2012] M. D. Reed, L. DiCarlo, S. E. Nigg, L. Sun, L. Frunzio, S. M. Girvin, and R. J. Schoelkopf, Realization of three-qubit quantum error correction with superconducting circuits, Nature 482, 382 (2012).
- Fowler [2015] A. G. Fowler, Minimum weight perfect matching of fault-tolerant topological quantum error correction in average o(1) parallel time, Quantum Info. Comput. 15, 145–158 (2015).
- Gidney [2021] C. Gidney, Stim: a fast stabilizer circuit simulator, Quantum 5, 497 (2021).
- Takagi et al. [2022] R. Takagi, S. Endo, S. Minagawa, and M. Gu, Fundamental limits of quantum error mitigation, npj Quantum Inf. 8, 114 (2022).
- Quek et al. [2024] Y. Quek, D. Stilck França, S. Khatri, J. J. Meyer, and J. Eisert, Exponentially tighter bounds on limitations of quantum error mitigation, Nat. Phys. 20, 1648 (2024).
- Tsubouchi et al. [2023] K. Tsubouchi, T. Sagawa, and N. Yoshioka, Universal cost bound of quantum error mitigation based on quantum estimation theory, Phys. Rev. Lett. 131, 210601 (2023).
- Satzinger et al. [2021] K. J. Satzinger, Y.-J. Liu, A. Smith, C. Knapp, M. Newman, C. Jones, Z. Chen, C. Quintana, X. Mi, A. Dunsworth, C. Gidney, I. Aleiner, F. Arute, K. Arya, J. Atalaya, R. Babbush, J. C. Bardin, R. Barends, J. Basso, A. Bengtsson, A. Bilmes, M. Broughton, B. B. Buckley, D. A. Buell, B. Burkett, N. Bushnell, B. Chiaro, R. Collins, W. Courtney, S. Demura, A. R. Derk, D. Eppens, C. Erickson, L. Faoro, E. Farhi, A. G. Fowler, B. Foxen, M. Giustina, A. Greene, J. A. Gross, M. P. Harrigan, S. D. Harrington, J. Hilton, S. Hong, T. Huang, W. J. Huggins, L. B. Ioffe, S. V. Isakov, E. Jeffrey, Z. Jiang, D. Kafri, K. Kechedzhi, T. Khattar, S. Kim, P. V. Klimov, A. N. Korotkov, F. Kostritsa, D. Landhuis, P. Laptev, A. Locharla, E. Lucero, O. Martin, J. R. McClean, M. McEwen, K. C. Miao, M. Mohseni, S. Montazeri, W. Mruczkiewicz, J. Mutus, O. Naaman, M. Neeley, C. Neill, M. Y. Niu, T. E. O’Brien, A. Opremcak, B. Pató, A. Petukhov, N. C. Rubin, D. Sank, V. Shvarts, D. Strain, M. Szalay, B. Villalonga, T. C. White, Z. Yao, P. Yeh, J. Yoo, A. Zalcman, H. Neven, S. Boixo, A. Megrant, Y. Chen, J. Kelly, V. Smelyanskiy, A. Kitaev, M. Knap, F. Pollmann, and P. Roushan, Realizing topologically ordered states on a quantum processor, Science 374, 1237 (2021).
- Horsman et al. [2012] D. Horsman, A. G. Fowler, S. Devitt, and R. V. Meter, Surface code quantum computing by lattice surgery, New J. Phys. 14, 123011 (2012).
- Javadi-Abhari et al. [2024] A. Javadi-Abhari, M. Treinish, K. Krsulich, C. J. Wood, J. Lishman, J. Gacon, S. Martiel, P. D. Nation, L. S. Bishop, A. W. Cross, B. R. Johnson, and J. M. Gambetta, Quantum computing with Qiskit (2024), arXiv:2405.08810 [quant-ph] .
- Fowler [2013] A. G. Fowler, Analytic asymptotic performance of topological codes, Phys. Rev. A 87, 040301 (2013).
- Watson and Barrett [2014] F. H. E. Watson and S. D. Barrett, Logical error rate scaling of the toric code, New J. Phys. 16, 093045 (2014).
- Bravyi and Kitaev [2005] S. Bravyi and A. Kitaev, Universal quantum computation with ideal clifford gates and noisy ancillas, Phys. Rev. A 71, 022316 (2005).
- Raussendorf et al. [2007] R. Raussendorf, J. Harrington, and K. Goyal, Topological fault-tolerance in cluster state quantum computation, New J. Phys. 9, 199 (2007).
- Endo et al. [2018] S. Endo, S. C. Benjamin, and Y. Li, Practical quantum error mitigation for near-future applications, Phys. Rev. X 8, 031027 (2018).
Supplementary Information for
“Demonstrating quantum error mitigation on logical qubits”
S1 Zero-noise extrapolation formulas
First, we introduce some notations and briefly review NISQ and FTQC circuits. Here, FTQC circuits refer to quantum circuits with mid-circuit state preparation and measurement, feedback and post-selection. Then, we show how to express an FTQC circuit with an example. Finally, we generalize the expression to arbitrary FTQC circuits. With the expression, we derive the formula for ZNE.
Notations. We use to denote Pauli operators on qubit-, and we use to denote the identity operator. Given an operator , we use to denote a completely positive map .
We use three types of primitive operations to implement quantum computing: gate, state preparation and measurement. We can denote them with completely positive maps. An ideal gate is denoted by , where is a unitary operator. An ideal state preparation is denoted by , which prepares qubit- in the eigenstate of with the eigenvalue . Here, is an arbitrary Pauli operator different from , and is the projection operator onto eigenstates of with the eigenvalue . An ideal measurement is denoted by , which is a measurement on qubit- in the basis. Here, is the measurement outcome. The maps and are trace-preserving; is not, but is trace-preserving.
When errors occur stochastically in an operation, the actual operation is in the form , where is the ideal operation, is the probability of errors, and is a completely positive map denoting the operation with errors.
Now, we take the Pauli error model as an example, which is a practical mode of errors in quantum computing. In the Pauli error model, the process generating Pauli errors is a map in the form , where is the error probability, and is a trace-preserving completely positive map denoting errors. If we neglect crosstalk, errors only happen on the qubit subset that an operation acts on, called the support. For a single-qubit operation on qubit-, the noise map is , where are probabilities of corresponding Pauli errors, and is the total error probability. Similarly, for a two-qubit gate on qubits and , the noise map is , where . To include crosstalk, the general noise map is in the form , where is a Pauli operator, is the probability of the Pauli error , and . Here, the Pauli operator is either a single-qubit Pauli operator or an arbitrary product of single-qubit operators, and it may act on qubits out of operation’s support. In this general Pauli-error noise map, the erroneous map is .
In the Pauli error model, the actual operation of a gate is in the form , where is the ideal gate. For a state preparation operation, the actual operation is also in the form , where is the ideal state preparation. For a measurement, the actual operation is in the form (the noise occur before the ideal operation this time), where is the ideal measurement. Substituting the expression of the noise map , we can find that actual operations are in the form , where for gates and state preparation, and for measurement.
NISQ circuits and FTQC circuits. In an NISQ circuit, qubits are prepared at the beginning and measured at the end of the circuit, and gates are applied between the state preparation and measurement. Such circuits are insufficient for the usual protocols of FTQC.
In error correction, we usually detect errors using measurements. Typically, we repeatedly measure a set of carefully chosen Pauli operators, stabilizer operators of the error correction code. Then, we send the measurement outcomes to a classical algorithm called a decoder. With the algorithm, we compute a set of Pauli gates. By applying the Pauli gates, we can correct errors on qubits. Therefore, the error correction is a typical feedback process.
We need post-selection when the magic state is used in quantum computing. Because of the error correction code, we cannot directly apply all necessary gates on logical qubits. The magic state represents a promising protocol to complete the universal gate set. In the magic-state protocol, we prepare some resource states on logical qubits and then improve their fidelities in certain distillation circuits. With the resource states distilled, we can use them to implement gates that cannot be implemented directly. The distillation circuits improve the fidelity through post-selection: the resource state is kept or discarded depending on measurements in the distillation circuit; if discarded, we need to re-prepare the state and try distillation for another round. We remark that the preparation, distillation and utilization of magic states also require feedback operations.

An example FTQC circuit. To illustrate the expression of FTQC circuits using completely positive maps, we take the circuit in Fig. S as an example, which includes feedback and post-selection operations. There are three state preparation operations and three measurements in the circuit. Depending on the measurement outcome , the number of gates is either two or three.
Using completely positive maps, we can express the ideal circuit in the following form
(S1) | |||||
Here, and are state preparation operations, are controlled-NOT gates, , and are measurements, is the measurement-dependent gate in the feedback operation, and denotes the post-selection operation.
With noise, the map of the circuit becomes
(S2) | |||||
where operations are in the from . Because the eighth operation is measurement-dependent, it has a measurement-dependent error probability and erroneous operation : and [ and ] are the error probability and erroneous operation, respectively, of the gate ().
General FTQC circuits. For a general FTQC circuit, we can express the ideal circuit consisting of operations in the form
(S3) | |||||
Here, is the completely positive map denoting the th operations. We use to represent the measurement outcome of the th operation: () if the th operation is (not) a measurement. If the th operation is a measurement, the map is a function of the measurement outcome ; even if the th operation is not a measurement, we still can express it as a function of because takes only one value anyway. We use to represent a tuple of measurement outcomes from the first to th operations, and we can define it formally with the recursion formula and the initial value . The th operation may depend on measurement outcomes of previous operations since the feedback. Therefore, the map is also a function of : If the operation is not a feedback operation, the dependence is trivial, i.e. , such as operations with in the example circuit; the eighth operation is the only one with a non-trivial dependence on previous measurement outcomes. The function describes the post-selection,
(S4) |
where denotes the set of measurement outcomes indicating success.
For the noisy circuit, its map is in the form
(S5) | |||||
where operations are in the from
(S6) | |||||
If the th operation is a feedback operation that depends on previous measurement outcomes, such as the eighth operation in the example circuit, error probability depends on ; otherwise, is a constant function; the erroneous operation is similar.
The expression in Eq. (S5) illustrates the temporal order of operations that an operation can only depend on measurement outcomes in previous operations. This temporal order is unimportant for error mitigation, and if neglect, we have a simplified expression of a noisy circuit: We use to denote all measurement outcomes in the circuit, then
(S7) |
where operations are in the from
(S8) |
Expansion formula of the observable with errors. With a circuit, we can evaluate observables that are functions of measurement outcomes, . For example, if we want to evaluate the observable in the example circuit, we take . In the case of error correction, this function takes into account the last-round error correction on the measurement outcomes.
The ideal expected value of the observable is
(S9) |
where is the initial state, and is the probability of the measurement outcome in the ideal circuit.
Because of the post-selection, the distribution may not be normalized in general. The expected value in the normalized distribution is
(S10) |
and we have the expression of by substituting into Eq. (S9). In the following, we focus on , and the result can be applied to .
With a noisy circuit, the expected value of the observable is
(S11) |
where is the probability of the measurement outcome in the noisy circuit. To explicitly express the observable as a function of error probabilities, we rewrite each actual operation in the circuit as
(S12) |
where is the ideal operation, and
(S13) |
is the deviation from the ideal operation. Substituting this expression of , we have
(S14) | |||||
where
(S15) | |||||
Eventually, we can rewrite the expression as
(S16) |
where
(S17) | |||||
and
(S18) | |||||
Here, we have used that .
In ZNE, we mitigate errors by boosting the error probability. If we boost errors in each operation with the same factor of (i.e. for all ), the computing result becomes a polynomial function of , . This formula holds for both NISQ and FTQC circuits.
S2 Correlations in logical errors of qLDPC codes
Unlike the surface code, many qLDPC codes encode a large number of logical qubits in a single block of physical qubits. Errors in these logical qubits could be significantly correlated, tending to occur simultaneously on multiple logical qubits, as illustrated in Fig. S. In the code [22], the probability that to logical qubits fail simultaneously is higher than other cases, indicating significant correlations. For such correlated errors, measuring their error rates is extremely challenging. Even assuming Pauli errors, obtaining a model of correlated errors requires measuring error rates, where is the number of logical qubits in a block.

S3 Device performance
Two quantum processors, named Processor I and Processor II, are used in this work for experiments of repetition code and surface code, respectively. Processor I consists of a transmon qubit array, from which 13 qubits are utilized in the experiment. Processor II has the same architecture as Processor I, but with a qubit array of . In the distance-3 surface code experiments, 17 qubits are used. The median and times for the qubits used in Processor I (Processor II) are s and s ( s and s), respectively. In this work, we adopt two different readout schemes: (1) conventional measurement for multi-round experiments; (2) using state to reduce measurement errors for one-round experiments. The gate and measurement errors of two processors are visualized in Fig. S.

Processor I | Processor II | |||||
---|---|---|---|---|---|---|
Parameter | Median | Mean | Stdev. | Median | Mean | Stdev. |
Qubit idle frequency, (GHz) | 4.000 | 3.962 | 0.128 | 4.123 | 4.120 | 0.131 |
Qubit anharmonicity, (MHz) | -197.3 | -197.1 | 1.6 | -214.6 | -214.7 | 2.3 |
Readout frequency, (GHz) | 6.316 | 6.299 | 0.095 | 6.416 | 6.416 | 0.086 |
Energy relaxation time, (s) | 124.36 | 111.82 | 33.22 | 127.89 | 132.44 | 24.32 |
Spin-echo dephasing time, (s) | 11.34 | 11.72 | 2.35 | 15.55 | 18.18 | 7.81 |
Readout error using Method I / Method II (%) | 0.47 / 0.87 | 0.52 / 0.82 | 0.16 / 0.18 | 0.87 / - | 0.94 / - | 0.30 / - |
1Q XEB Pauli error, (%) | 0.085 | 0.079 | 0.021 | 0.055 | 0.052 | 0.019 |
2Q XEB Pauli error, (%) | 0.56 | 0.56 | 0.14 | 0.37 | 0.40 | 0.17 |
S4 Generating circuit instances and calculating the expectations
To controllably amplify existing errors, we choose to insert an operation drawn from the list of during the operational stage for each qubit. The most intuitive way to generate adequate circuit instances for the estimation of (or ) — randomly selecting the operation with assigned probabilities for each circuit instance — can be inefficient. For example, in the scenario of using a distance-3 (7) repetition code, around () of circuit instances are devoid of any error injection when and . In addition, circuit instances with the presence of multiple injected errors, which can potentially lead to the failure of quantum error correction and contribute to the infidelity of estimation, are unlikely to be chosen. Alternatively, we can iterate over all possible circuit instances and numerically construct the desired circuit list by calculating the weighted average. For example, in the experiment related to Fig. 2 of the main text, all circuit instances were experimentally implemented for each value of , and the resulting data were processed to estimate the expectations of . However, in repetition and surface codes, the large number of circuit instances makes full implementation impractical. To address this challenge, we employ a method to select the most likely circuit instances as follows.
First, we determine the total number of circuit instances, which scales proportionally with the code distance to ensure more accurate estimations (see Table S2). Second, we consider the case of injecting errors into the circuit and calculate the required number of circuit instances based on the error-injection probability . Circuit instances are then randomly selected without replacement until the desired number is reached or all possible instances have been selected. The number of remaining circuit instances to be determined is updated accordingly. Finally, the second and third steps are iteratively repeated, starting with and continuing until the total number is reached.
The above procedure assumes perfect operations and measurements. In practice, however, when the number of possible circuit instances for injecting errors is small — such as in the case of — the statistical error due to finite measurement shots can proportionately impact the overall estimation accuracy, scaling with the assigned weight. To reduce this effect, we introduce a key modification to the procedure: the circuit number for each is required to exceed a predefined threshold, such as of the total circuit number. Equivalently, this modification increases the number of measurement shots, thereby reducing statistical errors and improving the accuracy of the estimation.
1-round | 2-round | 3-round | 4-round | |
distance-3 repetition code | ||||
distance-5 repetition code | ||||
distance-7 repetition code | ||||
distance-3 surface code | - | - | - |
We estimate the expectation values, such as and , by averaging measured results across all generated circuit instances with the respective weights. Specifically, we denote as the corresponding outcome with errors are injected into the circuit, with representing the circuit index and the index of measurement shot. The expectation value is then calculated as
(S19) |
where represents the number of measurement shots for each circuit instance, denotes the number of chosen circuits with errors injected, and is the respective weight. The value of for experiments of repetition code and surface code is .
S5 More experimental details
Standard error of the measured expectations. As mentioned in Section S4, measurement shots are taken for each circuit. It allows us to directly calculate the standard deviation of (see Eq. S19) without requiring additional experimental effort. First, we calculate the average expectation value for each shot index as
(S20) |
where the sum is taken over the error count and circuit instance . Next, the unbiased standard deviation of can be obtained by
(S21) |
We note that the standard errors are relatively small for both the repetition code and the surface code, making them difficult to visualize clearly in the main figures. See Fig. S for an enlarged view of the standard errors for the one-round repetition code.

Uncorrected expectation values for the repetition code. In Fig. S, we observe that the uncorrected expectation values for different repetition code distances nearly overlap. The slight differences observed for large noise scaling factors are mainly due to the randomness of the circuit instances, which also influence the simulation results.

Investigating the scalability of ZNE with a fixed error rate per parity-check round. Unlike in the main text, where the unit error probability for each is adjusted to maintain a fixed total error rate (see Table S3), in this study, the unit error probability in each parity-check round is fixed at and then the performance of ZNE is evaluated. As depicted in Fig. S, the relative bias, represented by , consistently remains below 1, confirming the effectiveness of ZNE. The ZNE results demonstrate consistent trends for different code distances and parity-check round numbers, with a larger code distance resulting in a smaller sampling overhead. Additionally, the sampling overhead remains nearly unchanged for different numbers of parity-check rounds. These findings demonstrate the scalability of ZNE for error correction circuits, even with increasing circuit complexity.
Round, | 1 | 2 | 3 | 4 |
---|---|---|---|---|
Unit error probability, () | 13.6 | 9.4 | 7.2 | 5.7 |

Preparation of an arbitrary logical state in the distance-3 surface code. In this work, an arbitrary logical state in the distance-3 surface code is prepared using the circuit illustrated in Fig. S. For specific cases, such as the initialization of states and , the circuit can be further simplified, as described in Ref. [13]. Additionally, the implementation of state initialization in a configuration where two-qubit gates can be performed between nearest-neighbor data qubits is detailed in Ref. [40].

More ZNE results in the distance-3 surface code. The expectation values of the and observables are measured as a function of the noise scaling factor for the states and , respectively (Fig. S). These results are consistent with those obtained for in the main text but specifically address either bit-flip error or phase-flip error. The measured values of the observable, with the initial state set to , are also shown in the figure. All - scatter plots demonstrate the effectiveness of ZNE in the surface code.

Numerical simulations as benchmark. We conduct numerical simulations using two popular open-source frameworks: Stim [36] and Qiskit [42]. For simulations of the repetition code, where the initial preparation of the logical state () is nearly perfect, errors introduced by the gate and measurement imperfections are then modeled using a depolarizing channel and a bit-flip channel, respectively. The error rates for these models are experimentally calibrated (see Fig. S and Table S1). In contrast, for simulations of the surface code, where initial states are imperfectly generated via multiple layers of single-qubit and two-qubit gates, the introduced imperfection is simplified by a single-qubit depolarizing channel uniformly applied to each qubit. The depolarizing rate is calibrated by matching the expectation value of an observable with error injection disabled in the circuit. In this work, is used as the initial state, and as the observable, resulting in a calibrated depolarizing rate of . As shown in Fig. 4 of the main text, this noise model closely aligns with the experimental data.
S6 Protocol
To implement the th-order ZNE, we use data points from experiments, . Here, denotes the expected value of the observable when error probabilities are amplified by a factor of , and corresponds to the case without noise amplification. Using these data points and the fitting formula , we can compute as
(S22) |
where , and is a Vandermonde-like matrix with elements
(S23) |
We characterize the performance of ZNE using two metrics: the bias characterizes the accuracy, and the sampling overhead characterizes the cost. Next, we focus on the overhead . Suppose is the total number of circuit shots used to acquire the data points. To reduce the variance in ZNE, we allocate the shots among the data points according to the importance sampling principle. Specifically, the number of shots used to measure is . The variance in the measurement of is then given by , and the variance of is thus . Here we have assumed that the observable is a Pauli operator. Without ZNE, if the shots are used to measure , the variance is . The sampling overhead reads , which is the ratio of the two variances.
S7 Application to large-scale logical circuits
We take the surface code as an example to illustrate that our method can be applied to logical circuits at large scale. Assume we have a logic target circuit consisting of logical gates. Similar to physical operations (see Sec. S1), the th logical gate reads
(S24) |
where represents the logical error rate of the gate. We remark that the logical error rate depends on the physical error rate and, therefore, is a function of the noise amplification factor .
The expected value of the observable computed with the circuit is
(S25) | |||||
where
(S26) |
is the ideal result,
(S27) |
is the result when the th logical gate has errors, and the remainder term has the upper bound
(S28) | |||||
where is the total error rate.
Now, we approximate each logical error rate with a polynomial, i.e.
(S29) |
and denotes the error in the approximation. With this approximation, we can re-express the expected value as
(S30) | |||||
where
(S31) |
and
(S32) |
Substituting the expression of into the th-order extrapolation formula Eq. (S22), we obtain the bias
(S33) |
where
(S34) |
and
(S35) |
are upper bounds of contributions by and terms, respectively. The first term is the error due to the second-order contribution of logical error rates, i.e.
(S36) |
If we choose a sufficiently large code distance such that is much smaller than one, the first term is much smaller than the raw bias before error mitigation, which has the upper bound
(S37) |
The second term is the error due to approximating logical error rates with polynomials. In the limit that the physical error rate approaches zero, the logical error rate behaves as [43, 44], suggesting that the extrapolation is sufficient in the low physical error rate regime. When is finite, the logical error rate deviates from , and we have to adapt the extrapolation function accordingly. Next, we use numerical calculations to estimate the bias in the polynomial extrapolation.

To estimate the bias and cost numerically, we consider a quantum memory circuit consisting of applying idle gates on a logical qubit and take the logical error rate per gate reported in Ref. [27]. We choose the observable as , such that the expectation value is affected by both logical and errors. Let be the sum of logical and error rates. The observable expectation value is taken as . Then, we apply ZNE to , and the results are shown in Fig. S.
Although the above numerical results are obtained from quantum memory circuits, we conjecture that general computing circuits yield similar results. Memory circuits are composed of repeatedly applying parity-check measurements on a fixed lattice. Practical protocols of quantum computing in surface codes include, for instance, braiding transformation [15] and lattice surgery [41], in which circuits are also composed of repeatedly applying parity-check measurements, however, on a lattice that deforms between certain parity-check cycles. Because the dominant operations are the same in memory circuits and such computing circuits, they should yield similar behavior of logical error rates (as functions of physical error rates and the code distance). In addition to braiding transformation and lattice surgery, universal quantum computing in surface codes also requires magic state injection and distillation [45, 46, 15]. Magic state errors decrease with the level of distillation. As long as the distillation level is adequately high, the post-distillation magic state errors are dominated by logical errors in lattice surgery operations, and the contribution of raw magic state errors (errors in injected magic states before distillation) is negligible; otherwise, we may need to modify the extrapolation function to include the impact of raw magic state errors: suppose the distilled magic state error rate is , where is an integer depending on the distillation level, we can add the term to the extrapolation function.
So far, we have only considered polynomial extrapolation functions. The use of other extrapolation functions could improve the performance. In ZNE on NISQ circuit, it has been shown experimentally that the exponential extrapolation outperforms polynomial extrapolation [47, 8]. For example, the logical error rate function reported in Ref. [27] is a candidate worth exploring. Given such candidate functions, we can benchmark and verify them through Clifford circuits [23, 24]: we can simulate the Clifford circuits on a classical computer to obtain the ideal result , such that we can evaluate the bias as well as the cost. In this way, we can compare the candidate the functions and identify the suitable ones.