The learnability of Pauli noise

Chen, Senrui; Liu, Yunchao; Otten, Matthew; Seif, Alireza; Fefferman, Bill; Jiang, Liang

doi:10.1038/s41467-022-35759-4

Download PDF

Article
Open access
Published: 04 January 2023

The learnability of Pauli noise

Nature Communications volume 14, Article number: 52 (2023) Cite this article

3672 Accesses
3 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Recently, several quantum benchmarking algorithms have been developed to characterize noisy quantum gates on today’s quantum devices. A fundamental issue in benchmarking is that not everything about quantum noise is learnable due to the existence of gauge freedom, leaving open the question what information is learnable and what is not, which is unclear even for a single CNOT gate. Here we give a precise characterization of the learnability of Pauli noise channels attached to Clifford gates using graph theoretical tools. Our results reveal the optimality of cycle benchmarking in the sense that it can extract all learnable information about Pauli noise. We experimentally demonstrate noise characterization of IBM’s CNOT gate up to 2 unlearnable degrees of freedom, for which we obtain bounds using physical constraints. In addition, we show that an attempt to extract unlearnable information by ignoring state preparation noise yields unphysical estimates, which is used to lower bound the state preparation noise.

Benchmarking universal quantum gates via channel spectrum

Article Open access 21 September 2023

A new class of efficient randomized benchmarking protocols

Article Open access 21 August 2019

6-qubit optimal Clifford circuits

Article Open access 05 July 2022

Introduction

Characterizing quantum noise is an essential step in the development of quantum hardware^1,2. Remarkably, despite recent progress in both gate-level and scalable noise characterization methods^{3,4,5,6,7,8,9,10,11,12,13,14,15,16}, the full characterization of the noise channel of a single CNOT/CZ gate remains infeasible. This is unlikely to be caused by limitations of existing benchmarking algorithms. Instead, it is believed to be related to the fundamental question of what information about a quantum system can be learned, in a setting where initial states, gates, and measurements are all subject to unknown quantum noise. It is well-known that some information about quantum noise can be learned (such as the gate fidelity learned by randomized benchmarking^3,4,5,6,7 or cycle benchmarking⁹), but not everything can be learned (due to the gauge freedom in gate set tomography^17,18,19). The boundary of learnability of quantum noise – a precise understanding of what information is learnable and what is not, still remains an open question.

Recently, there has been an interest in formulating noise characterization as learning unknown gate-dependent Pauli noise channels^9,11. This is motivated by randomized compiling, a technique that has been proposed to suppress coherent errors via inserting random Pauli gates^20,21. As an added benefit, randomized compiling twirls the gate-dependent CPTP noise channel into Pauli noise, thus reducing the number of parameters to be learned. Note that the twirled Pauli noise channel corresponds to the diagonal of the process matrix of the CPTP map, so Pauli noise learning is a necessary step for characterizing the CPTP map, regardless of whether randomized compiling is performed.

However, even under this simplified setting of Pauli noise learning, all prior experimental attempts can only partially characterize the noise channel of a single CNOT/CZ gate^21,22,23, which only has 15 degrees of freedom. A natural question is whether this limitation is caused by the fundamental unlearnability of the noise channel, and if so, which part of the noise channel and how many degrees of freedom among the 15 are unlearnable?

In this paper, we give a precise characterization of what information in the Pauli noise channel attached to Clifford gates is learnable, in a way that is robust against state preparation and measurement (SPAM) noise. We develop a systematic method for characterizing learnable degrees of freedom of a Clifford gate set using notions from algebraic graph theory and show that learnable information exactly corresponds to the cycle space of the Pauli pattern transfer graph, while unlearnable information exactly corresponds to the cut space. This characterization can be used to write down a list of linear functions of the noise model that corresponds to all independent learnable degrees of freedom. As an example, we show that the Pauli noise channel of an arbitrary 2-qubit Clifford gate has at most 2 unlearnable degrees of freedom. We perform an experimental characterization of a CNOT gate on IBM Quantum hardware²⁴ up to 2 unlearnable degrees of freedom. Although the unlearnable information cannot be estimated with high precision, we can determine a feasible region of those freedoms using the constraint that the noise model must be physical (i.e., all Pauli error rates are nonnegative).

A corollary of our result is that cycle benchmarking is optimal in the setting we consider, in the sense that it can learn all the information that is learnable. This reveals a fundamental fact about noise benchmarking, namely that cycle benchmarking – the idea of repeatedly applying the same gate sequence interleaved by single qubit gates, is the “right” algorithm for benchmarking Clifford gates, because of the fact that learnable information forms a cycle space. As an interesting side remark, the term “cycle” in cycle benchmarking originally refers to parallel gates applied in a clock cycle. Here we show that the term can also be understood in a graph-theoretical context.

In addition, we also explore ways to overcome the unlearnability barrier. It has been recognized that the unlearnability does not apply if the initial state ${\left|0\right\rangle }^{\otimes n}$ can be prepared perfectly^15,23, and it has been suggested that state preparation noise could be much smaller than gate and/or measurement noise in practice^25,26,27, which would make gate noise fully learnable up to small error. We develop an algorithm based on cycle benchmarking that fully learns gate-dependent Pauli noise channel assuming perfect initial state preparation, and experimentally demonstrate the method on IBM’s CNOT gate. Based on the experiment data, we conclude that this assumption is unlikely to be correct in our experiment as it gives unphysical estimates that are outside the feasible region we determined. Furthermore, we use the data to obtain a lower bound on the state preparation noise and conclude that it has the same order of magnitude as gate noise on the device we used. Therefore, the issue of unlearnability is a practically relevant concern, for which the noise on initial states is an important factor that cannot be neglected on current quantum hardware.

Results

Theory of learnability

We start by considering the learnability of the Pauli noise channel of a single n-qubit Clifford gate. A Pauli channel can be written as

$$\Lambda (\cdot )=\mathop{\sum}\limits_{a\in {{\mathsf{P}}}^{n}}{p}_{a}{P}_{a}(\cdot ){P}_{a},$$

(1)

where {p_a} is a probability distribution on Pⁿ = {I, X, Y, Z}ⁿ. The goal is to learn this distribution, which has 4ⁿ − 1 degrees of freedom. Considering Λ as a linear map, its eigenvectors exactly correspond to all n-qubit Pauli operators, as

$$\Lambda ({P}_{a})={\lambda }_{a}{P}_{a},\quad \forall a\in {{\mathsf{P}}}^{n}$$

(2)

where ${\lambda }_{a}={\sum }_{b\in {{\mathsf{P}}}^{n}}{p}_{b}{(-1)}^{\langle a,b\rangle }$ is the Pauli fidelity associated with the Pauli operator P_a. Therefore Λ is a linear map with known eigenvectors and unknown eigenvalues, so a natural way to learn Λ is to first learn all the Pauli fidelities λ_a, and then reconstruct the Pauli errors via ${p}_{a}=\frac{1}{{4}^{n}}{\sum }_{b\in {{\mathsf{P}}}^{n}}{\lambda }_{b}{(-1)}^{\langle a,b\rangle }$.

The convenience of working with Pauli fidelities is further demonstrated by the fact that some Pauli fidelities can be directly learned by cycle benchmarking, even with noisy state preparation and measurement. For example, consider the CNOT gate which maps the Pauli operator IX to itself. Figure 1(a) shows the cycle benchmarking circuit. Imagine that we put the Pauli operator IX after the left red box and evolve it with the circuit, then the evolved operator (before the right red box) equals ${\lambda }_{IX}^{3}\cdot IX$, up to a ± sign (which comes from the random Pauli gates and can always be accounted for during post-processing). Here we use the convention that the noise channel happens before each CNOT gate. In experiments, we prepare a + 1 eigenstate of IX (such as $\left |+\right\rangle \left |+\right\rangle$), measure the expectation value of IX at the end, and average over random Pauli twirling sequences. These SPAM operations are noisy and are represented as the red boxes. It is shown⁹ that the measured expectation value equals

$${\mathbb{E}}\langle IX\rangle={A}_{IX}\cdot {\lambda }_{IX}^{d}$$

(3)

where the expectation is over random Pauli twirling gates and randomness of quantum measurement, and A_IX depends on SPAM noise but is independent of circuit depth d. From this λ_IX can be learned by estimating the observable IX at several different depths and perform a curve fitting.

**Fig. 1: Cycle benchmarking for learning the Pauli noise channel of a CNOT gate.**

The Pauli operator IX is special as it is invariant under CNOT. Consider another example: CNOT maps XZ to YY and vice versa. Consider Fig. 1(b) where we insert additional layers of single-qubit Clifford gates $\sqrt{Z}\otimes \sqrt{X}$ that also maps XZ to YY and vice versa (up to a minus sign that can always be accounted for during post-processing). After XZ picks up a coefficient λ_XZ in front of the CNOT gate, it gets mapped to λ_XZ ⋅ YY by CNOT but then rotated back to λ_XZ ⋅ XZ by $\sqrt{Z}\otimes \sqrt{X}$. Following the same argument we conclude that both λ_XZ and λ_YY are learnable. For simplicity here we make an assumption that single qubit gates are noiseless, motivated by the fact that single qubit gates are 1-2 magnitudes less noisy than 2-qubit gates on today’s quantum hardware²⁴. In practice, it is a standard assumption to model noise on single-qubit gates as gate-independent (e.g.²³), and our noise characterization result can be interpreted as the noise channel induced by a dressed cycle which consists of a CNOT gate and two single-qubit gates²⁰.

The main challenge comes with the next example: CNOT maps IZ to ZZ and vice versa. By directly applying cycle benchmarking as in Fig. 1(a) (with even depth d) we obtain

$${\mathbb{E}}\langle IZ\rangle={A}_{IZ}\cdot {\lambda }_{IZ}{\lambda }_{ZZ}{\lambda }_{IZ}{\lambda }_{ZZ}\cdots={A}_{IZ}{\left({\lambda }_{IZ}{\lambda }_{ZZ}\right)}^{d/2},$$

(4)

and curve fitting gives $\sqrt{{\lambda }_{IZ}{\lambda }_{ZZ}}$ (similar results have been obtained in^9,21,22,23). To learn λ_IZ, we may consider applying the same technique in Fig. 1(b). However, the problem is that once IZ gets mapped to ZZ, it cannot be rotated back to IZ because I is invariant under single qubit unitary gates. The main difference between this example and previous examples is that here the Pauli weight pattern (an n-bit binary string with 0 indicating identity and 1 indicating non-identity) changes from 01 to 11, thus making the single qubit rotation tool inapplicable.

In fact we can go on to prove that λ_IZ (as well as λ_ZZ) is unlearnable. Here unlearnable means that there exists two noise models such that the parameter λ_IZ is different, but the two noise models are indistinguishable by any quantum experiment, meaning that any quantum experiment generates exactly the same output statistics with the two noise models. The result also generalizes to arbitrary n-qubit Clifford gates.

Theorem 1

Given an n-qubit Clifford gate ${{{{{{{\mathcal{G}}}}}}}}$ and an n-qubit Pauli operator P_a, the Pauli fidelity λ_a of the noise channel attached to ${{{{{{{\mathcal{G}}}}}}}}$ is learnable if and only if ${{{{{{{\rm{pt}}}}}}}}({{{{{{{\mathcal{G}}}}}}}}({P}_{a}))={{{{{{{\rm{pt}}}}}}}}({P}_{a})$. Here pt denotes the Pauli weight pattern.

The “if” part follows directly from cycle benchmarking as discussed above. For the “only if” part, when ${{{{{{{\rm{pt}}}}}}}}({{{{{{{\mathcal{G}}}}}}}}({P}_{a}))\ne {{{{{{{\rm{pt}}}}}}}}({P}_{a})$, we construct a gauge transformation to prove the unlearnability of λ_a, following ideas from gate set tomography^17,18,19. A gauge transformation is an invertible linear map ${{{{{{{\mathcal{M}}}}}}}}$ that converts a noise model (initial states ρ_i, POVM operators E_j, noisy gates G_k) to a new noise model as

$${\rho }_{i}\mapsto {{{{{{{\mathcal{M}}}}}}}}({\rho }_{i}),\quad {E}_{j}\mapsto {({{{{{{{{\mathcal{M}}}}}}}}}^{-1})}^{{{{\dagger}}} }({E}_{j}),\quad {G}_{k}\mapsto {{{{{{{\mathcal{M}}}}}}}}\circ {G}_{k}\circ {{{{{{{{\mathcal{M}}}}}}}}}^{-1},$$

(5)

with the constraint that the new noise model is physical. Note that the old and new noise models are indistinguishable by definition. To construct such a gauge transformation, as ${{{{{{{\rm{pt}}}}}}}}({{{{{{{\mathcal{G}}}}}}}}({P}_{a}))\, \ne \,{{{{{{{\rm{pt}}}}}}}}({P}_{a})$, there exists a bit on which the two Pauli weight patterns differ. We then define ${{{{{{{\mathcal{M}}}}}}}}$ as a single-qubit depolarizing noise channel on the corresponding qubit. In this way we can show that the old and new noise models assign different values to λ_a, which means λ_a is unlearnable. This proof naturally implies that using other noisy gates from the gate set (that are subject to different unknown noise channels) does not change the learnability of Pauli fidelities. More details of the proof are given in Supplementary Section II B. As a side remark, it is known that under the stronger assumption of gate-independent noise (where different multi-qubit gates are assumed to have the same noise channel), the noise channel is fully learnable^28,29,30.

Theorem 1 provides a simple condition for determining the learnability of individual Pauli fidelities, but it is not sufficient for characterizing the learnability of joint functions of different Pauli fidelities. In the CNOT example, we know that both λ_IZ and λ_ZZ are unlearnable, but we also know that their product λ_IZλ_ZZ is learnable. This means that there is only one unlearnable degree of freedom in the two parameters {λ_IZ, λ_ZZ}. In the following we show how to determine learnable and unlearnable degrees of freedom of Pauli noise, and also generalize the discussion from a single gate to a gate set.

We start by defining learnable information. Consider a Clifford gate set with m gates, where we model each gate as an n-qubit gate associated with an n-qubit Pauli noise channel. This model is applicable to both individual gates (e.g. a 2-qubit system where each 2-qubit gate is implemented by a different physical process and subject to a different noise channel) as well as parallel applications of gates (e.g. an n-qubit system where each “gate” in the gate set is implemented by a layer of 2-qubit gates; the n-qubit noise channel models the crosstalk among the 2-qubit gates). The goal is to characterize the learnable degrees of freedom among the m ⋅ 4ⁿ parameters.

Recall that the output of cycle benchmarking is a product of Pauli fidelities (including SPAM noise). We further show that without loss of generality this is the only type of information that we need to obtain from quantum experiments for the purpose of noise learning. This is because in general the output probability of any quantum experiment can be expressed as a sum of products of Pauli fidelities, and each individual product can be learned by cycle benchmarking (Supplementary Section IV). We therefore consider learning functions of the noise model that can be expressed as a product of Pauli fidelities (also see below Eq. (7) for a related discussion). This can be reduced to considering functions of the form $f={\sum }_{a,{{{{{{{\mathcal{G}}}}}}}}}{v}_{a}^{{{{{{{{\mathcal{G}}}}}}}}}\cdot {l}_{a}^{{{{{{{{\mathcal{G}}}}}}}}}$, where ${l}_{a}^{{{{{{{{\mathcal{G}}}}}}}}}:\!\!=\log {\lambda }_{a}^{{{{{{{{\mathcal{G}}}}}}}}}$ is the log Pauli fidelity, ${v}_{a}^{{{{{{{{\mathcal{G}}}}}}}}}\in {\mathbb{R}}$, and the superscript ${{{{{{{\mathcal{G}}}}}}}}$ denotes the corresponding Clifford gate. In the CNOT example l_IZ + l_ZZ is a learnable function. The idea of learning log Pauli fidelities in benchmarking has also been considered in^15,31. The advantage of considering log Pauli fidelities here is that the set of all learnable functions f forms a vector space. Therefore to characterize all independent learnable degrees of freedom, we only need to determine a basis of the vector space.

Recall that the reason that l_IZ + l_ZZ is learnable in the CNOT example is because the path of Pauli operator in the cycle benchmarking circuit forms a cycle IZ → ZZ → IZ → ⋯ , and the product of Pauli fidelities along the cycle (λ_IZλ_ZZ) can be learned via curve fitting. In general, as we can also insert single qubit Clifford gates in between, we do not need to differentiate between X, Y, Z. We therefore consider the pattern transfer graph associated with a Clifford gate set where vertices corresponds to binary Pauli weight patterns and each edge is labeled by the Pauli fidelity of the incoming Pauli operator. The graph has 2ⁿ vertices and m ⋅ 4ⁿ directed edges. They can also be merged to form the pattern transfer graph of the gate set {CNOT, SWAP}. Figure 2 shows the pattern transfer graph of CNOT, SWAP, and the gate set of {CNOT, SWAP}. Consider an arbitrary cycle in the pattern transfer graph C = (e₁, …, e_k) where each edge e_i is associated with some Pauli fidelity λ_i. Following Fig. 1(b), a cycle benchmarking circuit can be constructed which learns the product of the Pauli fidelites along the cycle, or equivalently the function ${f}_{C}:\!\!={\sum }_{{e}_{i}\in C}\log {\lambda }_{i}$ can be learned. This implies that the set of functions defined by linear combination of cycles $\{{\sum }_{C\in {{{{{{\mathrm{cycles}}}}}}}}{\alpha }_{C}{f}_{C}:{\alpha }_{C}\in {\mathbb{R}}\}$ are learnable. In the following we show that this in fact corresponds to all learnable information about Pauli noise.

**Fig. 2: Pattern transfer graph of CNOT, SWAP, and a gate set consisting of CNOT and SWAP.**

We label the edges of the pattern transfer graph as e₁, …, e_M where M = m ⋅ 4ⁿ and each edge e_i is a variable that represents some log Pauli fidelity. The goal is to characterize the learnability of linear functions of the edge variables $f=\mathop{\sum }\nolimits_{i=1}^{M}{v}_{i}{e}_{i}$, ${v}_{i}\in {\mathbb{R}}$. The set of linear functions can be equivalently understood as a vector space of dimension M, called the edge space of the graph, where f corresponds to a vector (v₁, …, v_M) and we think of e₁, …, e_M as the standard basis. Following the above discussion, the cycle space of the graph is defined as span{∑_e∈Ce: C is a cycle}, which is a subspace of edge space. We also define another subspace, the cut space, as ${{{{{{{\rm{span}}}}}}}}\{{\sum }_{e\in C}{(-1)}^{e\,{{{{{{\mathrm{from}}}}}}}\,{V}_{1}\,{{{{{{\mathrm{to}}}}}}}\,{V}_{2}}e:C\,{{{{{{\mathrm{is}}}}}}\; {{{{{\mathrm{a}}}}}}\; {{{{{\mathrm{cut}}}}}} \; {{{{{\mathrm{between}}}}}} \; {{{{{\mathrm{a}}}}}} \; {{{{{\mathrm{partition}}}}}} \; {{{{{\mathrm{of}}}}}} \; {{{{{\mathrm{vertices}}}}}}}\,{V}_{1},\, {V}_{2}\}$. It is known that the edge space is the orthogonal direct sum of cycle space and cut space for any graph³². Interestingly, we show that the complementarity between cycle and cut space happens to be the dividing line that determines the learnability of Pauli noise.

Theorem 2

The vector space of learnable functions of the Pauli noise channels associated with an n-qubit Clifford gate set is equivalent to the cycle space of the pattern transfer graph. In other words,

$${{{{{{\mathrm{All}}}}}}\; {{{{{\mathrm{information}}}}}}}\, \equiv \,{{{{{{\mathrm{Edge}}}}}} \; {{{{{\mathrm{space}}}}}}}\,,\\ \,{{{{{{\mathrm{Learnable}}}}}} \; {{{{{\mathrm{information}}}}}}} \equiv \,{{{{{{\mathrm{Cycle}}}}}} \; {{{{{\mathrm{space}}}}}}}\,,\\ \,{{{{{{\mathrm{Unlearnable}}}}}} \; {{{{{\mathrm{information}}}}}}} \equiv \,{{{{{{\mathrm{Cut}}}}}} \; {{{{{\mathrm{space}}}}}}}.$$

(6)

This implies that the number of unlearnable degrees of freedom equals 2ⁿ − c, where c is the number of connected components of the pattern transfer graph.

The learnability of cycle space follows from cycle benchmarking as discussed above. To prove the unlearnability of cut space, we use a similar argument as in Theorem 1 and show that a gauge transformation can be constructed for each cut in the pattern transfer graph. By linearity, this implies that any vector in the cut space corresponds to a gauge transformation. By definition, a learnable function must be orthogonal to all such vectors and thus orthogonal to the entire cut space. More details of the proof are given in Supplementary Section II C.

It is a well-known fact in graph theory that the cycle space of a directed graph G = (V, E) has dimension ∣E∣ − ∣V∣ + c while the cut space has dimension ∣V∣ − c, where c ≥ 1 is the number of connected components in G³² (a (weakly) connected component is a maximal subgraph in which every vertex is reachable from every other vertex via an undirected path). Theorem 2 implies that among the m ⋅ 4ⁿ degrees of freedom of the Pauli noise associated with a Clifford gate set, there are 2ⁿ − c unlearnable degrees of freedom. This shows that while the number of unlearnable degrees of freedom can be exponentially large, they only occupy an exponentially small fraction of the entire space. In addition, a cycle and cut basis can be efficiently determined for a given graph, though in our case this takes exponential time because the pattern transfer graph itself is exponentially large. However, computing the cycle/cut basis is not the bottleneck as the information to be learned also grows exponentially with the number of qubits. For small system sizes such as 2-qubit Clifford gates, we can write down a cycle basis as shown in Table 1(a) for the CNOT and SWAP gates, which represents all learnable information about these gates. The CNOT gate has 2 unlearnable degrees of freedom while the SWAP gate has 1 unlearnable degree of freedom. As the pattern transfer graph has at least 2 connected components, we conclude that the Pauli noise channel of a 2-qubit Clifford gate has at most 2 unlearnable degrees of freedom. Note that when treating {CNOT, SWAP} together as a gate set, there are only 2 unlearnable degrees of freedom according to Theorem 2 instead of 2 + 1 = 3, because there is one additional learnable degree of freedom (such as ${l}_{IZ}^{{{{{{{{\rm{CNOT}}}}}}}}}+{l}_{XX}^{{{{{{{{\rm{CNOT}}}}}}}}}+{l}_{XI}^{{{{{{{{\rm{SWAP}}}}}}}}}$) that is a joint function of the two gates.

Table 1 A complete basis for the learnable linear functions of log Pauli fidelities and Pauli error rates for a single CNOT/SWAP gate

Full size table

Finally, the learnability of Pauli errors can be determined by the learnability of Pauli fidelities according to the Walsh-Hadamard transform ${p}_{a}=\frac{1}{{4}^{n}}{\sum }_{b\in {{\mathsf{P}}}^{n}}{\lambda }_{b}{(-1)}^{\langle a,b\rangle }$. An issue here is that Pauli errors are linear functions of {λ_b} instead of $\{\log {\lambda }_{b}\}$. Here we make a standard assumption in the literature^9,10 that the total Pauli error is sufficiently small. In this case all individual Pauli errors are close to 0 while all individual Pauli fidelities are close to 1. Therefore the Pauli errors can be estimated via

$${p}_{a}=\frac{1}{{4}^{n}}\mathop{\sum}\limits_{b\in {{\mathsf{P}}}^{n}}{\lambda }_{b}{(-1)}^{\langle a,b\rangle }\approx \frac{1}{{4}^{n}}\mathop{\sum}\limits_{b\in {{\mathsf{P}}}^{n}}{(-1)}^{\langle a,b\rangle }\left(1+\log {\lambda }_{b}\right),$$

(7)

which means that their learnability can be determined by Theorem 2. In fact it has been suggested³¹ that any function of Pauli fidelities can be estimated in this way (as a linear function of log Pauli fidelities) up to a first-order approximation, which means that the learnability of any function of Pauli fidelities can be determined by Theorem 2. In Table 1 (c) we show the learnable Pauli errors for CNOT and SWAP, where “learnable” is in an approximate sense up to Eq. (7). Interestingly, for these two gates, the learnable functions of Pauli errors have the same form as the cycle basis, i.e. the cycle space is invariant under Walsh-Hadamard transform. We calculate the learnable Pauli errors for up to 4-qubit random Clifford gates and this seems to be true in general. We leave a rigorous investigation into this phenomenon for future work.

Experiments on IBM Quantum hardware

We demonstrate our theory on IBM quantum hardware²⁴ using a minimal example – characterizing the noise channel of a CNOT gate. In our experiments both the gate noise and SPAM noise are twirled into Pauli noise using randomized compiling. In the following we show how to extract all learnable information of Pauli noise SPAM-robustly, and also attempt to estimate the unlearnable degrees of freedom by making additional assumptions.

First, we conduct two types of cycle benchmarking (CB) experiments, the standard CB and CB with interleaving single-qubit gates (called interleaved CB), as shown in Fig. 1. The results are shown in Fig. 3. Here a set of two Pauli labels in the x-axis (e.g., {IZ, ZZ}) corresponds to the geometric mean of the Pauli fidelity (e.g., $\sqrt{{\lambda }_{IZ}{\lambda }_{ZZ}}$). Comparing to Table 1, we see that all learnable information of Pauli fidelities (including learnable individual and 2-product) are successfully extracted. Also note from Fig. 3 that the two types of CB experiments give consistent estimates, in terms of both the process fidelity and individual Pauli fidelities (e.g., $\sqrt{{\lambda }_{XZ}{\lambda }_{YY}}$ estimated from standard CB is consistent with λ_XZ and λ_YY from interleaved CB).

**Fig. 3: Estimates of Pauli fidelities of IBM’s CNOT gate via standard CB (left) and CB with interleaved gates (right), using circuits shown in Fig. 1.**

We have shown that all 13 learnable degrees of freedom (excluding the trivial λ_II = 1) are extracted in Fig. 3 by comparing with Table 1, and there remain 2 unlearnable degrees of freedom. We can bound the feasible region of the 2 unlearnable degrees of freedom using physical constraints, i.e., the reconstructed Pauli noise channel must be completely positive. This is equivalent to requiring p_a ≥ 0 for all Pauli error rates p_a. We choose λ_XX and λ_ZZ as a representation of the unlearnable degrees of freedom, and plot the calculated feasible region in Fig. 4(a), which happens to be a rectangular area. We also calculate the feasible region for each unlearnable Pauli fidelity and Pauli error rate, which are presented in Fig. 4(b), (c). In particular, we choose two extreme points (blue and green dots in Fig. 4(a)) in the feasible region and plot the corresponding noise model in Fig. 4(b), (c). Note that the (approximately) learnable Pauli error rates (on the left of the red vertical dashed line) are nearly invariant under change of gauge degrees of freedom, but they can be estimated to be negative due to statistical fluctuation. Thus, when we calculate the physical constraints, we only require those unlearnable Pauli error rates (on the right of the red vertical dashed line) to be non-negative.

**Fig. 4: Feasible region of the learned Pauli noise model, using data from Fig. 3.**

Next, we explore an approach to estimate the unlearnable information with additional assumptions. Suppose that one can prepare ${\left|0\right\rangle }^{\otimes n}$ perfectly. Since we assume noiseless single-qubit gates, this means we can prepare a set of perfect tomographically complete states $\{\left|0/1\right\rangle,\left|\pm \right\rangle,\left|\pm i\right\rangle \}$. In this case, all the unlearnable degrees of freedom become learnable, as one can first perform a measurement device tomography, and then directly estimate the process matrix of a noisy gate with measurement error mitigated²⁵. Following this general idea, we propose a variant of cycle benchmarking for Pauli noise characterization, which we call intercept CB as it uses the information of intercept in a standard cycle benchmarking protocol. Given an n-qubit Clifford gate ${{{{{{{\mathcal{G}}}}}}}}$, let m₀ be the smallest positive integer such that ${{{{{{{{\mathcal{G}}}}}}}}}^{{m}_{0}}={{{{{{{\mathcal{I}}}}}}}}$. For any Pauli fidelity λ_a (regardless of whether learnable or not according to Theorem 1), consider the following two CB experiments using the standard circuit as in Fig. 1(a). First, prepare an eigenstate of P_a, run CB with depth lm₀ + 1 for some non-negative integer l, and estimate the expectation value of ${P}_{b}:\!\!={{{{{{{\mathcal{G}}}}}}}}({P}_{a})$. The result equals

$${\mathbb{E}}{\langle {P}_{b}\rangle }_{l{m}_{0}+1}={\lambda }_{{P}_{a}}^{S}{\lambda }_{{P}_{b}}^{M}{\lambda }_{a}{\left(\mathop{\prod }\limits_{k=1}^{{m}_{0}}{\lambda }_{{{{{{{{{\mathcal{G}}}}}}}}}^{k}({P}_{a})}\right)}^{l},$$

(8)

where ${\lambda }_{{P}_{a/b}}^{S/M}$ is the Pauli fidelity of the state preparation and measurement noise channel, respectively (earlier we have absorbed these two coefficients into a single coefficient A for simplicity). Second, prepare an eigenstate of P_b, run CB with depth lm₀, and estimate the expectation value of P_b. The result equals

$${\mathbb{E}}{\langle {P}_{b}\rangle }_{l{m}_{0}}={\lambda }_{{P}_{b}}^{S}{\lambda }_{{P}_{b}}^{M}{\left(\mathop{\prod }\limits_{k=1}^{{m}_{0}}{\lambda }_{{{{{{{{{\mathcal{G}}}}}}}}}^{k}({P}_{a})}\right)}^{l}.$$

(9)

By fitting both ${\mathbb{E}}{\langle {P}_{b}\rangle }_{l{m}_{0}+1}$ and ${\mathbb{E}}{\langle {P}_{b}\rangle }_{l{m}_{0}}$ as exponential decays in l, extracting the intercepts (function values at l = 0), and taking the ratio, we obtain an estimator ${\widehat{\lambda }}_{a}^{\,{{{{{{\mathrm{ICB}}}}}}}}$ that is asymptotically unbiased to ${\lambda }_{a}\cdot {\lambda }_{{P}_{a}}^{S}/{\lambda }_{{P}_{b}}^{S}$. This estimator is robust against measurement noise. Note that ${\lambda }_{{P}_{a}}^{S}={\lambda }_{{P}_{b}}^{S}=1$ if we assume perfect initial state preparation, and in this case the above shows that λ_a is learnable, and thus the entire Pauli noise channel is learnable. We note that, instead of fitting an exponential decay in l, one could in principle just take l = 0 and estimate the ratio of ${\mathbb{E}}{\langle {P}_{b}\rangle }_{0}$ and ${\mathbb{E}}{\langle {P}_{b}\rangle }_{1}$, which also yields a consistent estimate for ${\lambda }_{a}\cdot {\lambda }_{{P}_{a}}^{S}/{\lambda }_{{P}_{b}}^{S}$. If one has already obtained all the learnable information from previous experiments, this could be a more efficient approach. However, if one has not done those experiments, the intercept CB with multiple depths can estimate the intercept (unlearnable information) and slope (learnable information) simultaneously, which is more sample efficient.

We numerically simulate intercept CB for characterizing the CNOT gate under different state preparation (SP) and measurement (M) noise. As shown in Fig. 5, this method yields relatively precise estimate when there is only measurement noise even if the noise is orders of magnitude stronger than the gate noise, but will have large deviation from the true noise model even under small state preparation noise. We refer the reader to Supplementary Section III for more details about the numerical simulation.

**Fig. 5: Simulation of intercept CB on CNOT under different SPAM noise rate.**

Finally, we experimentally implement intercept CB to estimate λ_XX and λ_ZZ, which are the two unlearnable degrees of freedom of CNOT, allowing us to determine all the Pauli fidelities and Pauli error rates. One challenge in interpreting the results is that we do not know in general whether the low SP noise assumption holds, therefore it is unclear if the learned results should be trusted. However, for the estimate to be correct, it should at least lie in the physically feasible region we obtained earlier in Fig. 4. In Fig. 6, we present our experimental results of intercept CB. It turns out that certain Pauli fidelities are far away from the physical region by several standard deviations. This gives strong evidence that the low SP noise assumption was not true on the platform we used.

**Fig. 6: The learned Pauli noise model using intercept CB.**

The data collected here can further be used to give a lower bound for the SP noise. Suppose we obtain the physical region of λ_a to be $[{\widehat{\lambda }}_{a,\min },\, {\widehat{\lambda }}_{a,\max }]$. Combining with the expression of intercept CB, we have

$${\widehat{\lambda }}_{a}^{{{\,{{{{{\rm{ICB}}}}}}}}}/{\widehat{\lambda }}_{a,\max }\le {\lambda }_{{P}_{a}}^{S}/{\lambda }_{{P}_{b}}^{S}\le {\widehat{\lambda }}_{a}^{{{\,{{{{{\rm{ICB}}}}}}}}}/{\widehat{\lambda }}_{a,\min }.$$

(10)

Applying this to the data of IZ and ZZ in Fig. 6(a), we have ${\lambda }_{IZ}^{S}/{\lambda }_{ZZ}^{S}\le 0.9879(23)$. If we make a physical assumption that the state preparation noise is a random bit-flip during the qubit initialization, one can conclude the bit-flip rate on the first qubit is lower bounded by 0.61(12)%. One can in principle bound the bit-flip rate on the second qubit by looking at ${\lambda }_{XX}^{S}/{\lambda }_{XI}^{S}$. Unfortunately, our estimate of ${\lambda }_{XX}^{S}$ from intercept CB falls in the physical region within one standard deviation, so there is no nontrivial lower bound. One could expect obtaining a useful lower bound by looking at a CNOT gate with reversed control and target. The lower bound of SP noise obtained here is completely independent of the measurement noise and does not suffer from the issue of gauge freedom¹⁹, as long as all of our noise assumptions are valid, i.e., there is no significant contribution from time non-stationary, non-Markovian, or single-qubit gate-dependent noise.

Discussion

We have shown how to characterize the learnability of Pauli noise of Clifford gates and discussed a method to extract unlearnable information by assuming perfect initial state preparation. It is also interesting to consider other physically motivated assumptions on the noise model to avoid unlearnability. For example, we can write down a parameterization of the noise model based on the underlying physical mechanism which may have fewer than 4ⁿ parameters. The main issue here is that these assumptions are highly platform-dependent and should be decided case-by-case. Moreover, it is unclear to what extent should the learned results be trusted when additional assumptions are made, since in general we cannot test whether the assumptions hold due to unlearnability.

Another direction to overcome the unlearnability is to change the model of quantum experiments. Here we have been working with the standard model as in gate set tomography, where a quantum measurement decoheres the system and only outputs classical information. However, some platforms might support quantum non-demolition (QND) measurements, and in this case measurements can be applied repeatedly, which could potentially allow more information to be learned³³.

Recently, ref. 30 considered similar issues of noise learnability. They studied a different Pauli noise model with perfect initial state $\left|0\right\rangle$, perfect computational basis measurement, and noisy single qubit gates, and showed the existence of unlearnable information. In contrast, here we focus on the learnability of Pauli noise of multi-qubit Clifford gates assuming perfect single-qubit gates (with noisy SPAM), and in practice we make the standard assumption that noise on single-qubit gates is gate-independent (e.g.²³), in which case our noise learning results are interpreted as characterizing a dressed cycle.

This work leaves open the question of noise learnability for non-Clifford gates. An issue here is that randomized compiling is not known to work with non-Clifford gates in general, so it is unclear if the general CPTP noise learnability problem can be reduced to Pauli noise. Recent work¹⁴ shows that random quantum circuits can effectively twirl the CPTP noise channel into Pauli noise and can be used to learn the total Pauli error. The question of whether more information can be learned still remains open.

Another issue to address is the scalability in noise learning. It is impossible to estimate all learnable degrees of freedom efficiently as there are exponentially many of them (an exponential lower bound on the sample complexity is shown in¹⁶). One way to avoid the exponential scaling issue is to assume the noise model has certain special structure (such as sparsity or low-weight) such that the noise model only has polynomially many parameters^10,11,22,34. It is an interesting open direction to study the characterization of learnability under these assumptions, and we give some related discussions in Supplementary Section II D.

Data availability

The data generated in this study is available at https://github.com/csenrui/Pauli_Learnability.

Code availability

The code that supports the findings of this study is available at https://github.com/csenrui/Pauli_Learnability.

References

Eisert, J. et al. Quantum certification and benchmarking. Nat. Rev. Phys. 2, 382–390 (2020).
Article Google Scholar
Preskill, J. Quantum computing in the NISQ era and beyond. Quantum 2, 79 (2018).
Article Google Scholar
Emerson, J., Alicki, R. & Życzkowski, K. Scalable noise estimation with random unitary operators. J. Optics B: Quant. Semiclassical Opt. 7, S347–S352 (2005).
Article ADS MathSciNet Google Scholar
Knill, E. et al. Randomized benchmarking of quantum gates. Phys. Rev. A 77, 012307 (2008).
Article ADS Google Scholar
Dankert, C., Cleve, R., Emerson, J. & Livine, E. Exact and approximate unitary 2-designs and their application to fidelity estimation. Phys. Rev. A 80, 012304 (2009).
Article ADS Google Scholar
Magesan, E., Gambetta, J. M. & Emerson, J. Scalable and robust randomized benchmarking of quantum processes. Phys. Rev. Lett. 106, 180504 (2011).
Article ADS Google Scholar
Magesan, E., Gambetta, J. M. & Emerson, J. Characterizing quantum gates via randomized benchmarking. Phys. Rev. A 85, 042311 (2012).
Article ADS Google Scholar
Helsen, J., Roth, I., Onorati, E., Werner, A. H. & Eisert, J. A general framework for randomized benchmarking. arXiv preprint arXiv:2010.07974 (2020).
Erhard, A. et al. Characterizing large-scale quantum computers via cycle benchmarking. Nat. Commun. 10, 5347 (2019).
Article ADS Google Scholar
Flammia, S. T. & Wallman, J. J. Efficient estimation of pauli channels. ACM Transactions on Quantum Computing 1 (2020). https://doi.org/10.1145/3408039.
Harper, R., Flammia, S. T. & Wallman, J. J. Efficient learning of quantum noise. Nat. Phys. 16, 1184–1188 (2020).
Article CAS Google Scholar
Harper, R., Yu, W. & Flammia, S. T. Fast estimation of sparse quantum noise. PRX Quantum 2, 010322 (2021).
Article Google Scholar
Flammia, S. T. & O’Donnell, R. Pauli error estimation via population recovery. Quantum 5, 549 (2021).
Article Google Scholar
Liu, Y., Otten, M., Bassirianjahromi, R., Jiang, L. & Fefferman, B. Benchmarking near-term quantum computers via random circuit sampling (2021). 2105.05232.
Flammia, S. T. Averaged circuit eigenvalue sampling (2021). 2108.05803.
Chen, S., Zhou, S., Seif, A. & Jiang, L. Quantum advantages for pauli channel estimation. Phys. Rev. A 105, 032435 (2022).
Article ADS MathSciNet CAS Google Scholar
Merkel, S. T. et al. Self-consistent quantum process tomography. Phys. Rev. A 87, 062119 (2013).
Article ADS Google Scholar
Blume-Kohout, R. et al. Robust, self-consistent, closed-form tomography of quantum logic gates on a trapped ion qubit (2013). 1310.4492.
Nielsen, E. et al. Gate Set Tomography. Quantum 5, 557 (2021).
Article Google Scholar
Wallman, J. J. & Emerson, J. Noise tailoring for scalable quantum computation via randomized compiling. Phys. Rev. A 94, 052325 (2016).
Article ADS Google Scholar
Hashim, A. et al. Randomized compiling for scalable quantum computing on a noisy superconducting quantum processor. Phys. Rev. X 11, 041039 (2021).
CAS Google Scholar
Berg, E. v. d., Minev, Z. K., Kandala, A. & Temme, K. Probabilistic error cancellation with sparse pauli-lindblad models on noisy quantum processors (2022). 2201.09866.
Ferracin, S. et al. Efficiently improving the performance of noisy quantum computers (2022). 2201.10672.
IBM Quantum. https://quantum-computing.ibm.com/services?services=systems (2022).
Maciejewski, F. B., Zimborás, Z. & Oszmaniec, M. Mitigation of readout noise in near-term quantum devices by classical post-processing based on detector tomography. Quantum 4, 257 (2020).
Article Google Scholar
Bravyi, S., Sheldon, S., Kandala, A., Mckay, D. C. & Gambetta, J. M. Mitigating measurement errors in multiqubit experiments. Phys. Rev. A 103, 042605 (2021).
Article ADS CAS Google Scholar
Ferracin, S., Merkel, S. T., McKay, D. & Datta, A. Experimental accreditation of outputs of noisy quantum computers. Phys. Rev. A 104, 042603 (2021).
Article ADS CAS Google Scholar
Kimmel, S., da Silva, M. P., Ryan, C. A., Johnson, B. R. & Ohki, T. Robust extraction of tomographic information via randomized benchmarking. Phys. Rev. X 4, 011050 (2014).
Google Scholar
Helsen, J. et al. Estimating gate-set properties from random sequences (2021). 2110.13178.
Huang, H.-Y., Flammia, S. T. & Preskill, J. Foundations for learning from noisy quantum experiments (2022). 2204.13691.
Nielsen, E., Young, K. & Blume-Kohout, R. First-order gauge-invariant error rates in quantum processors. Bulletin of the American Physical Society (2022).
Bollobás, B.Modern graph theory, vol. 184 (Springer Science & Business Media, 1998).
Laflamme, R., Lin, J. & Mor, T. Algorithmic cooling for resolving state preparation and measurement errors in quantum computing. arXiv preprint arXiv:2203.08114 (2022).
Harper, R., Yu, W. & Flammia, S. T. Fast estimation of sparse quantum noise. PRX Quantum 2, 010322 (2021).
Article Google Scholar

Download references

Acknowledgements

We thank Ewout van den Berg, Arnaud Carignan-Dugas, Robert Huang, Kristan Temme and Pei Zeng for helpful discussions. S.C. and L.J. acknowledge support from the ARO (W911NF-18-1-0020, W911NF-18-1-0212), ARO MURI (W911NF-16-1-0349, W911NF-21-1-0325), AFOSR MURI (FA9550-19-1-0399, FA9550-21-1-0209), AFRL (FA8649-21-P-0781), DoE Q-NEXT, NSF (OMA-1936118, EEC-1941583, OMA-2137642), NTT Research, and the Packard Foundation (2020-71479). Y.L. was supported by DOE NQISRC QSA grant #FP00010905, Vannevar Bush faculty fellowship N00014-17-1-3025, MURI Grant FA9550-18-1-0161 and NSF award DMR-1747426. A.S. is supported by a Chicago Prize Postdoctoral Fellowship in Theoretical Quantum Science. B.F. acknowledges support from AFOSR (YIP number FA9550-18-1-0148 and FA9550-21-1-0008). This material is based upon work partially supported by the National Science Foundation under Grant CCF-2044923 (CAREER) and by the U.S. Department of Energy, Office of Science, National Quantum Information Science Research Centers. This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.

Author information

These authors contributed equally: Senrui Chen, Yunchao Liu.

Authors and Affiliations

Pritzker School of Molecular Engineering, University of Chicago, Chicago, IL, 60637, USA
Senrui Chen, Alireza Seif & Liang Jiang
Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA, 94720, USA
Yunchao Liu
HRL Laboratories, LLC, 3011 Malibu Canyon Rd., Malibu, CA, 90265, USA
Matthew Otten
Department of Computer Science, University of Chicago, Chicago, IL, 60637, USA
Bill Fefferman

Authors

Senrui Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yunchao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Otten
View author publications
You can also search for this author in PubMed Google Scholar
Alireza Seif
View author publications
You can also search for this author in PubMed Google Scholar
Bill Fefferman
View author publications
You can also search for this author in PubMed Google Scholar
Liang Jiang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.C. and Y.L. developed the theory and performed the experiments. B.F. and L.J. supervised the project. S.C., Y.L., M.O., A.S., B.F., and L.J. contributed important ideas during initial discussions and contributed to writing the manuscript.

Corresponding authors

Correspondence to Senrui Chen, Yunchao Liu or Liang Jiang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chen, S., Liu, Y., Otten, M. et al. The learnability of Pauli noise. Nat Commun 14, 52 (2023). https://doi.org/10.1038/s41467-022-35759-4

Download citation

Received: 04 July 2022
Accepted: 23 December 2022
Published: 04 January 2023
DOI: https://doi.org/10.1038/s41467-022-35759-4

This article is cited by

Approaches to Constrained Quantum Approximate Optimization
- Zain H. Saleem
- Teague Tomesh
- Martin Suchara
SN Computer Science (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.