## Abstract

Self-testing represents the strongest form of certification of a quantum system. Here, we theoretically and experimentally investigate self-testing of nonprojective quantum measurements. That is, how can one certify, from observed data only, that an uncharacterized measurement device implements a desired nonprojective positive-operator valued measure (POVM). We consider a prepare-and-measure scenario with a bound on the Hilbert space dimension and develop methods for (i) robustly self-testing extremal qubit POVMs and (ii) certifying that an uncharacterized qubit measurement is nonprojective. Our methods are robust to noise and thus applicable in practice, as we demonstrate in a photonic experiment. Specifically, we show that our experimental data imply that the implemented measurements are very close to certain ideal three- and four-outcome qubit POVMs and hence non-projective. In the latter case, the data certify a genuine four-outcome qubit POVM. Our results open interesting perspective for semi–device-independent certification of quantum devices.

## INTRODUCTION

Measurements in quantum theory were initially represented by complete sets of orthogonal projectors on a Hilbert space. Such measurements are standard in a multitude of applications. Nevertheless, in a modern understanding of quantum theory, measurements are described by positive-operator valued measures (POVMs), i.e., a set of positive semi-definite operators summing to identity. POVMs are the most general notion of a quantum measurement; all projective measurements are POVMs, but not all POVMs need be projective.

Nonprojective measurements are widely useful in both conceptual and applied aspects of quantum theory, as well as in quantum information processing. In several practically motivated tasks, they present concrete advantages over projective measurements. Nonprojective measurements enhance estimation and tomography of quantum states (*1*, *2*), as well as entanglement detection (*3*) and unambiguous state discrimination of nonorthogonal states (*4*, *5*). They have also found applications in quantum cryptography (*6*, *7*) and randomness generation (*8*). In addition, nonprojective measurements can be used to maximally violate particular Bell inequalities (*9*) (assuming a bound on the Hilbert space dimension), a fact that has been applied to improve randomness extraction beyond what is achievable with projective measurements (*10*, *11*).

In view of their diverse and growing applicability, it is important to develop tools for certifying and characterizing nonprojective measurements under minimal assumptions. The strongest possible form of certification involves a “black-box” scenario, where the quantum devices are a priori uncharacterized. Astonishingly, it is possible in certain cases to completely characterize both the quantum state and the measurements based only on observed data, which is referred to as “self-testing” (*12*). A well-known example is that the maximal violation of the Clauser-Horne-Shimony-Holt Bell inequality (*13*) implies (self-tests) a maximally entangled two-qubit state and pairs of anticommuting local projective measurements (*14*–*16*). Self-testing can also be made robust to noise (*17*, *18*).

However, for the purpose of characterizing nonprojective measurements in the black-box scenario, methods based on Bell inequalities encounter a challenge. Because of Neumark’s theorem, every nonprojective measurement can be recast as a projective measurement in a larger Hilbert space. That is, any nonprojective measurement on a given system is equivalent to projective measurement applied to the joint state of the system and an ancilla of a suitable dimension [see, e.g., (*19*)]. Since one usually considers no restriction on Hilbert space dimension in the Bell scenario, it is nontrivial to characterize a nonprojective measurement based on a Bell inequality. While this is possible in theory (in the absence of noise) (*10*), it appears challenging in the more realistic scenario where the experiment features imperfections. To the best of our knowledge, robust self-testing methods for nonprojective measurements in Bell scenarios have not yet been developed. A possible way to circumvent the problem is to consider a Bell scenario with quantum systems of bounded Hilbert space dimension. In particular, Gómez *et al.* (*11*) and Gómez *et al.* (*20*) recently reported the experimental certification of a nonprojective measurement in a Bell experiment assuming qubits. However, these experiments do not represent self-tests, as they certify the nonprojective character of a measurement, but not how it relates to a specific target POVM.

Here, we investigate the problem of self-testing nonprojective measurements under the assumption of bounded Hilbert space dimension. We follow a different approach, by considering a prepare-and-measure scenario instead of a Bell scenario. First, this scenario offers a natural framework for certifying and characterizing nonprojective measurements. The reason is that, as argued above, the notion of nonprojectiveness almost inherently involves a notion of Hilbert spaces of fixed dimension. Then, the prepare-and-measure scenario is arguably the simplest scenario in which the problem can be studied without further assumptions. Second, the prepare-and-measure scenario offers a very significant practical advantage as compared to Bell experiments. The reason is that there is no need to involve distant observers and entangled states. This makes prepare-and-measure scenarios simpler to implement (*21*–*26*). Moreover, prepare-and-measure scenarios are easier to analyze theoretically, which allows us to develop self-testing methods that are versatile and highly robust to noise. Third, the assumption of a dimension bound is reasonable for characterization schemes. This is due to the fact that characterization schemes are not adversarial; i.e., they do not involve malicious devices. The experimenter typically knows which degrees of freedom are relevant; for example, the polarization of photons. However, every experiment is subject to unavoidable errors due, e.g., to technical noise and alignment errors. Characterization of quantum devices in this realistic setting is well captured by our assumption of a dimension bound.

In the first part of the paper, we present methods for characterizing nonprojective measurements. First, we present a method for self-testing a targeted nonprojective measurement in noiseless scenarios. Second, since noiseless statistics never occur in practice, we present methods for inferring a lower bound on the closeness of the uncharacterized measurement and a given target POVM, based on the observed noisy statistics; specifically, we lower-bound the worst-case fidelity between the real measurement and the ideal target one. Third, we introduce a method for determining whether the observed statistics could have arisen from some (unknown) projective measurements. If not, the measurement is certified as nonprojective. These methods have twofold relevance. On the one hand, they enable foundational insights into physical inference of nonprojective measurements in a semi–device-independent setting. On the other hand, they provide tools for assessing and certifying the quality of an experimental setup. We demonstrate the practicality of these self-testing methods in two experiments. In the first, we target a symmetric informationally complete (SIC) qubit POVM and demonstrate an estimated 98% worst-case fidelity. In addition, our data certify a genuine four-outcome qubit POVM. In the second experiment, we target a symmetric three-outcome qubit POVM and certify a worst-case fidelity of at least 96%. Last, we discuss some open questions.

## THE SELF-TESTING PROBLEM, THE SCENARIO, AND OVERVIEW OF RESULTS

Self-testing is the task of characterizing a quantum system based only on observed data. In other words, it is about gaining knowledge of the physical properties of initially unknown states and/or measurements present in an experiment by studying the correlations observed in the laboratory.

In this work, we focus on prepare-and-measure scenarios. They differentiate themselves from Bell scenarios in two important ways. First, prepare-and-measure scenarios involve communicating observers and thus no space-like separation. Second, they do not involve entanglement, whereas Bell scenarios do. Prepare-and-measure scenarios can generally be modeled by two separated parties, Alice and Bob, who receive random inputs *x* and *y*, respectively. Alice prepares and sends a quantum state ρ* _{x}* to Bob who performs a measurement

*y*with outcome

*b*, represented by a POVM

This generates a probability distribution

To make the problem nontrivial, an assumption on Alice’s preparations is required; otherwise, Alice could simply send *x* to Bob and any probability distribution *P*(*b*∣*x*, *y*) would be achievable. The assumption we consider in this work is that Alice’s preparations, i.e., the set of states ρ* _{x}*, can be represented in Hilbert space of given dimension

*d*. By choosing

*d*< ∣

*x*∣, we prevent Alice from communicating all information about her input

*x*to Bob. There exist distributions obtained from quantum systems of a dimension

*d*that cannot be simulated classically [see, e.g., (

*27*)]. That is, no strategy in which Alice communicates a classical

*d*-valued message to Bob can possibly reproduce the observed data. Such distributions that cannot be classically simulated are candidates for self-testing considerations.

The problem of self-testing consists in characterizing the set of states {ρ* _{x}*} and/or the set of measurements

*P*(

*b*∣

*x*,

*y*). This characterization can usually be done only up to a unitary transformation and possibly a relabeling. In a recent work (

*28*), methods were presented for self-testing sets of pure quantum states and sets of projective measurements in the qubit case. These were subsequently extended to higher dimensional systems in (

*29*,

*30*).

Formally, a self-test can be made via a witness, which is a linear function of the probability distribution *P*(*b* ∣ *x*, *y*)* _{xyb}* are real coefficients. Moreover, given a witness, one can determine its maximal witness value

*P*(

*b*∣

*x*,

*y*) leads to

*d*, quantum systems outperform classical ones; if not, several strategies would generally be compatible with the data [see (

*21*,

*21*,

*27*) for examples of such witnesses]. In the “Self-testing nonprojective measurements: Noiseless case” section, we present a method for constructing witnesses whose maximal value can self-test a targeted nonprojective qubit measurement M

^{target}.

Next, we turn to robust self-tests, i.e., self-tests that can be applied even when the statistics is not ideal, causing the witness value to be less than

Whereas robust self-testing represents a quantitative physical inference, it is also relevant to consider a more qualitative inference. On the basis of the witnesses we develop for self-testing, we show how to certify that the uncharacterized measurement is nonprojective. In the “Certification methods for nonprojective measurements” section, we determine the largest value of our witness that is compatible with qubit projective measurements. When observing a larger value, the nonprojective character of the measurement is certified. In a similar spirit, we determine a bound on our witness above which a genuine four-outcome (nonprojective) qubit measurement is certified.

An overview of all the self-testing methods developed in this work is illustrated in Fig. 1. The methods will be applied in the “Qubit SIC-POVM” section to self-test particularly relevant nonprojective qubit measurements. For these examples, we will demonstrate the usefulness of our methods by implementing them in a photonic experiment. Specifically, our experimental data imply that the implemented measurements are very close to certain ideal three- and four-outcome qubit POVMs and hence are nonprojective. In the latter case, the data certify a genuine four-outcome qubit POVM.

## RESULTS

This section presents how to certify and characterize nonprojective measurements in prepare-and-measure scenarios with both noiseless and noisy statistics. The focus will be on qubit systems. Therefore, we begin by summarizing the properties of qubit POVMs.

A POVM with *O* outcomes is a set of operators *E _{i}* ≥ 0 and that

*E*can be represented on the Bloch sphere as

_{i}*≥ 0, and*

_{i}The set of POVMs is convex, and a POVM is called extremal if it cannot be decomposed as a convex mixture of other POVMs. For qubits, extremal POVMs have either *O* = 2,3,4 outcomes (*31*). In the case *O* = 2, extremal POVMs are simply projective, whereas for *O* = 3 and *O* = 4, they are nonprojective; an extremal three-outcome qubit POVM has three unit Bloch vectors in a plane, and an extremal four-outcome qubit POVM has four unit Bloch vectors of which no choice of three are in the same plane (*31*). An extremal qubit POVM is therefore characterized by its Bloch vectors. As the statistics of nonextremal POVMs can always be simulated by stochastically implementing extremal POVMs, it is clear that only extremal POVMs can be self-tested.

### Self-testing nonprojective measurements: Noiseless case

Consider a target extremal nonprojective qubit POVM M^{target}, with *O* = 3 or *O* = 4 outcomes, for which we associate the outcome *b* to the unit Bloch vector ^{target}. The method consists of two steps summarized in Fig. 2.

Step 1. First, we construct a simpler witness *O* preparations; i.e., Alice has *O* inputs. Bob receives an input *y* = 1, …, *Y* and provides a binary outcome. The goal of this simpler witness is to self-test a particular relation among the prepared states ∣ψ* _{x}*〉. Specifically, we would like to certify that their unit Bloch vectors

^{target}; i.e.,

*x*= 1, …,

*O*. Let us define

*c*chosen such that the maximal value

_{xyb}*〉} (up to a global unitary and relabelings). In general, we believe that it is always possible to find such a self-test by considering enough inputs for Bob, corresponding to well-chosen projective measurements, and suitable coefficients*

_{x}*c*[see (

_{xyb}*28*) for examples]. Furthermore, note that one could also, in principle, have more than

*O*preparations for Alice and then self-test that

*O*of them have the desired relation to M

^{target}. In addition, we remark that the construction of an adequate witness

Step 2. We construct our final witness A from **povm**. This setting corresponds to a measurement with *O* outcomes. Since the intention is to self-test the measurement corresponding to this setting as M^{target}, we associate the setting **povm** to *O* outcomes. We define*k*. A maximal witness value **povm** corresponds to M^{target} (up to a unitary and relabelings). This is because a maximal witness value implies that (i) the set of prepared states {∣ψ* _{x}*〉} have Bloch vectors anti-aligned with those of M

^{target}and (ii)

*P*(

*b*=

*x*∣

*x*,

**povm**) = 0 for all

*x*; hence, the Bloch vectors of the setting

**povm**are of unit length and aligned with those of M

^{target}. Moreover, as a qubit POVM is characterized by its Bloch vectors, we see that M

^{target}is the only POVM that can attain the maximal witness value

^{target}.

In the “Qubit SIC-POVM” section, we will apply this method to self-test symmetric qubit POVMs with three and four outcomes.

### Robust self-testing of nonprojective measurements

No experiment can achieve the noiseless conditions needed to obtain exactly a maximal value of A. Therefore, it is paramount to discuss the case when a nonmaximal value of A is observed. We will show that, in this case, one can nevertheless make a statement about how close the uncharacterized measurement E performed in the laboratory (corresponding to the setting **povm**) is to the target POVM M^{target}.

To address this question, we must first define a measure of closeness between two measurements. A natural and frequently used distance measure in quantum information is the fidelity, *F*, between two operators. We consider a measure of closeness amounting to the best possible weighted average fidelity between the extremal qubit target POVM elements *M*^{target} = {*M _{i}*} and the actual POVM elements E = {

*E*}. That is, we allow for a quantum extraction channel Λ to be applied to the actual POVM. The set of allowed extraction channels is the set of unital channels in the relevant Hilbert space dimension. This is understood from the fact that the extraction channel must map

_{i}*O*-outcome POVMs to

*O*-outcome POVMs in the given Hilbert space dimension. Because of linearity, this implies that the channel is unital. Conversely, since every channel preserves positivity, every unital channel in the relevant Hilbert space dimension maps POVMs to POVMs. We look for the best possible extraction channel. We thus define the quantity

Since the target measurement is extremal, the POVM elements are proportional to rank-one projectors; *M _{i}* ∝

*P*. Because of Eq. 4, we can write

_{i}*P*. Hence,

_{i}*M*and Λ[

_{i}*E*] are both proportional to the same rank-one projector. Since a POVM with Bloch vectors of unit length is fully characterized, i.e., all coefficients λ

_{i}*are fixed by the conditions (Eq. 5), this implies that*

_{i}*M*= Λ[

_{i}*E*]. Thus, a maximal fidelity of

_{i}*F*= 1 is uniquely achieved when the actual POVM is equal to the target measurement.

In general, a nonmaximal value of the witness A can arise from many different possible choices of states and measurements. We denote by *O*-outcome POVMs that are compatible with a given observed value A. Our goal is now to find a lower-bound on the average fidelity *F* that holds for every measurement

Calculating this quantity, or even lower-bounding it, is typically a nontrivial problem even in the simplest case. We proceed with presenting two methods for this task.

We remark that the definition (Eq. 8), given for qubits, could potentially be extended to higher-dimensional systems (replacing the factor 1/2 by 1/*d*). This could work for POVMs where all elements are proportional to rank-one projectors. However, the latter are only a strict subset of general extremal POVMs. Finding a more general figure of merit is thus an interesting open question.

* Robust self-testing with the swap method*. A lower-bound on the worst-case average fidelity can be obtained via semidefinite programming (

*32*). The method combines the so-called swap method (

*33*,

*34*), introduced for self-testing in the Bell scenario, and the hierarchy of dimensionally bounded quantum correlations (

*35*). Such adaptations of the swap method to prepare-and-measure scenarios were introduced in (

*28*) to self-test pure state and projective measurements. In section S1, we outline the details of how the swap method is adapted to robustly self-test nonprojective measurements. This method benefits from being applicable in a variety of scenarios and for returning rigorous lower bounds on F. Nevertheless, it suffers from two drawbacks. First, the method only overcomes the fact that self-tests are valid up to a global unitary, but not that they may be valid up to relabelings. Thus, it is only useful for target measurements that are self-tested up to a unitary. Second, while rarely producing tight bounds on F, the computational requirements scale rapidly with the number of inputs, the number of outputs, and the chosen level of the hierarchy. In the “Qubit SIC-POVM” section, we will show that the method can be efficiently applied for robustly self-testing a three-outcome qubit POVM.

* Numerically approximating robust self-testing*. To also address cases in which self-tests are valid up to both a unitary transformation and relabelings, we can estimate F based on random sampling. The approximation method benefits from being straightforward and broadly useful, while it suffers from the fact that it merely estimates the value of F instead of providing a strict lower bound. The key feature is that the minimization appearing in Eq. 9 is replaced by a minimization taken over data obtained from many random samples of the setting

**povm**. We detail this method in section S2 and apply it to an example in the “Qubit SIC-POVM” section.

### Certification methods for nonprojective measurements

Whereas robust self-testing considers quantitative aspects of physical inference from noisy data, it is important to also consider the qualitative inference. An important qualitative statement is to prove that the uncharacterized measurement is nonprojective or, more generally, that it cannot be simulated by projective measurements. It is known that when POVMs are sufficiently noisy, they become perfectly simulable via projective measurements (*19*, *36*, *37*). The witnesses we construct can address this question. We will see that whenever the observed value of the witness A is sufficiently large, one can certify that the setting **povm** necessarily corresponds to some nonprojective measurement and could not have been simulated via projective measurements. Specifically, we derive an upper bound on A for projective measurements (or convex combination of them). The violation of such a bound thus certifies a nonprojective measurement or, more precisely, a genuine three-outcome (or four-outcome) POVM. At the end of this subsection, we also show how to certify a genuine four-outcome POVM.

A projective qubit measurement has binary outcomes and can therefore be represented by an observable *M* ≡ *M*_{0} − *M*_{1}, where *M _{i}* is the measurement operator corresponding to outcome

*i*= 0,1. Let us consider the case where the

*O*-outcome measurement

**povm**is projective. One may assign two outcomes to rank-one projectors and the rest to trivial zero operators. Note that it is enough here to consider these cases, as the witness A is linear in terms of the measurement operators. Projectors can thus be assigned in three (

*O*= 3) or six (

*O*= 4) different ways, of which the optimal instance must be chosen. Let the outcomes in the optimal instance be

*o*

_{0∣povm}and

*o*

_{1∣povm}and associate the observable

*C*(

*k*) is a constant and

*M*

_{1}, …,

*M*

_{Y + 1}}. Note that

*y*but on the collection of observables. Using the Cauchy-Schwarz inequality for operators, we obtain

Because of projectivity, we have *t*, which is a weighted sum of scalar products of the Bloch vectors of the observables. Consequently, to bound A under all projective measurements, we have

Thus, B(*k*) bounds the value of A for projective measurements. The evaluation of this bound only depends on Bob’s Bloch vectors and is further simplified by their parameterization in terms of two angles. The effort needed to evaluate the bound depends on the chosen prepare-and-measure scenario. Typically, considering scenarios with some symmetry properties is beneficial.

Moreover, when targeting a four-outcome qubit POVM, we consider also a finer form of qualitative characterization by considering whether A can be simulated by the setting **povm** being some three-outcome POVM. If not, the measurement is certified as a genuine four-outcome measurement. This amounts to bounding the value of A achievable under any two- or three-outcome qubit POVM and then observing a violation of that bound. For this purpose, one may use the hierarchy of dimensionally bounded quantum correlations (*35*), which can be used to upper-bound A under three-outcome POVMs. Since the hierarchy is built on projective measurements, one must embed Alice’s preparations in a larger Hilbert space with the dimension chosen such that three-outcome POVMs can be recast as projective measurement following Neumark’s theorem. To obtain tight bounds, one may need a reasonably high hierarchy level, which can be efficiently implemented using the methods of (*30*).

Next, in the “Qubit SIC-POVM” section, we will apply the outlined methods to specific nonprojective measurements and experimentally demonstrate the certification of both nonprojective and genuine four-outcome measurements.

### Relevant examples and their experimental realization

In the above, we have discussed methods for self-testing a target nonprojective measurement. Here, we put these methods in practice in a photonic experiment. We implement three- and four-outcome symmetric qubit POVMs, with Bloch vectors forming a star (trine-POVM) and a tetrahedron (SIC-POVM), respectively. In the first case, we certify a nonprojective measurement and apply our methods for robust self-testing, demonstrating worst-case average fidelity of at least 96% compared to an ideal trine-POVM. In the second case, we certify a genuine four-outcome qubit POVM and demonstrate worst-case average fidelity of approximately 98% with respect to an ideal SIC-POVM. We consider each example separately by first applying the methods of Results to obtain adequate witnesses and then present the corresponding experimental realization. The setup common to both experiments is presented in Materials and Methods.

* Qubit SIC-POVM*. We begin by illustrating the self-testing methods for a frequently used nonprojective measurement, namely, the qubit SIC-POVM, which we denote

*= 1/4. Such a regular tetrahedron construction can be achieved via two different labelings of the four outcomes that are not equivalent under unitary transformations. Up to a unitary transformation, each such SIC-POVM can be written with Bloch vectors*

_{b}* Noiseless self-test*. We find a prepare-and-measure scenario for self-testing

*x*∈ {1,2,3,4}, and Bob has three binary-outcome measurements,

*y*∈ {1,2,3}. The witness is chosen as

*S*

_{1, y}= [0,0,0],

*S*

_{2, y}= [0,1,1],

*S*

_{3, y}= [1,0,1], and

*S*

_{4, y}= [1,1,0]. The maximal value,

*, σ*

_{x}*, and σ*

_{y}*. In section S3, we prove the maximal witness value and show that it self-tests that Alice’s preparations indeed must form a regular tetrahedron on the Bloch sphere. By step 2 in the “Self-testing nonprojective measurements: Noiseless case” section, we supply Bob with an additional four-outcome measurement*

_{z}**povm**and consider the modified witness

We note that there also exist other prepare-and-measure scenarios fulfilling the requirements of step 1. For example, one may achieve the desired self-test using the so-called 3 → 1 random access code whose self-testing properties were considered in (*28*). However, this prepare-and-measure scenario requires more preparations than the one presented here.

* Robust self-test*. Next, we consider the worst-case fidelity (given in Eq. 9) of the measurement corresponding to the setting

**povm**with

^{5}optimal pairs

**povm**. The evaluation was done for

*k*= 1/5 (which, as will soon be shown, turns out to be the most noise-resilient choice of

*k*). We see that the minimal sampled fidelity as a function of

* Certifying nonprojective and genuine four-outcome POVMs*. Last, we derive a tight bound valid for all qubit projective measurements on the value of

**povm**be the outcomes

*b*= 1,2. Hence, we define the observable

*C*(

*k*) = (1 − 2

*k*)/2 and

*r*,

*s*≥ 0, with equality if and only if

*r*=

*s*. Apply this inequality twice to the expression (Eq. 12), first to the two terms associated to

*x*= 0,1, and then to the two terms associated to

*x*= 2,3. After a simple optimization over

*r*= 3 + 144

_{k}*k*

^{2}. This bound is valid for a particular value of

*x*. To hold for all projective measurements, we simply maximize

*f*(

_{k}*x*) over

*x*. This requires only an optimization in a single real variable

*x*∈ [ − 1,1], which is straightforward. The optimal choice is denoted

*x**. Setting B(

*k*) =

*f*(

_{k}*x**), we have

*k*∈ {1/100,2/100, …,1} by numerically optimizing

*k*) up to machine precision with a projective measurement.

Furthermore, we have also considered bounding *35*), we have embedded the qubit preparations into a three-dimensional Hilbert space and optimized *k* ∈ {1/100,2/100…,1}, and each upper bound was saturated up to numerical precision using lower bounds numerically obtained via semidefinite programs.

To study the robustness of both the nonprojective and the genuine four-outcome certification, we have considered the critical visibility of the system needed when exposed to noise. This is modeled by the preparations taking the form ρ* _{x}*(

*v*) =

*v*ρ

*+ (1 −*

_{x}*v*)ρ

_{noise}, where

*v*∈ [0,1] is the visibility and ρ

_{noise}is some arbitrary qubit state. A straightforward calculation shows that the critical visibility for violating some given bound B is

_{noise}. We have applied this to

*k*), corresponding to the bounds on projective and three-outcome measurements, respectively. The corresponding critical visibilities are plotted in section S5. In both cases, we find that the largest amount of noise is tolerated for

*k*= 1/5, corresponding to

*v*

_{crit}= 0.970 and

*v*

_{crit}= 0.990, respectively.

* Experimental result*. Wave-plate settings for Alice’s prepared states in Eq. 13 and Bob’s measurements σ

*, σ*

_{x}*, σ*

_{y}*, and the four-outcome SIC-POVM anti-aligned to the vectors in Eq. 13, are reported in section S5. In section S5, we also report a state tomography of Alice’s preparations.*

_{z}Optimally choosing *k* = 1/5, the measured value of the witness as compared to the relevant bounds is

The statistical error originates from Poissonian statistics, and the systematic error originates from the precision of the wave-plate settings. More details about the errors are discussed in section S5.

We observe a substantial violation of both the projective measurement and the three-outcome measurement bounds. Thus, we can certify that Bob’s measurement **povm** is a genuine four-outcome qubit POVM. Furthermore, as illustrated by the results in Fig. 3, we certify approximately a 98% worst-case fidelity with the qubit SIC-POVM.

* Qubit trine-POVM*. We consider a second example in which the target POVM is the so-called trine-POVM. This measurement has three outcomes, and its Bloch vectors form an equilateral triangle on a disk of the Bloch sphere, with λ

*= 1/3. The Bloch vectors are hence defined by*

_{l}* Noiseless self-test*. We introduce a prepare-and-measure scenario in which Alice has three inputs

*x*∈ {1,2,3}, and Bob has two binary-outcome measurements labeled by

*y*∈ {1,2}, and consider the witness

*T*

_{x,1}= [1,1, − 1] and

**povm**for Bob and consider the witness

*k*> 0. Then,

**povm**as the trine-POVM up to a unitary.

* Robust self-test*. We now turn to considering its robust self-testing properties, i.e., lower-bounding the worst-case fidelity of the unknown measurement (setting

**povm**) with the target measurement for a given value of

Last, we have also self-tested the trine-POVM in a different prepare-and-measure scenario (see section S3). In section S4, we use this prepare-and-measure scenario to derive a tight bound on projective measurements by evaluating the right-hand side of Eq. 12.

* Experimental realization*. The witness in Eq. 21 is maximized if Alice’s three Bloch vectors point to the vertices of an equilateral triangle on a disk of the Bloch sphere. We take that disk to be the

*xz*plane, taking

*, σ*

_{z}*, and the three-outcome POVM with vectors anti-aligned to Alice’s states. See section S5 for state tomography of Alice’s preparations. In contrast to the previous experiment, output 2 of Bob’s measurement station only consists of one detector (D3) and no wave plate or polarizing beamsplitter (PBS) (see Fig. 5). The wave-plate settings corresponding to the above states and measurements are reported in section S5.*

_{x}With the said settings, we have obtained the experimentally measured value of *k*. Since we aim to demonstrate a large worst-case fidelity with the trine-POVM, we have computed the lower bound on *k* and found that choosing *k* = 1 leads to the optimal result. The corresponding experimentally measured witness is

This data point and its relation to the worst-case fidelity of the laboratory measurement with the targeted POVM are depicted in Fig. 4. From

Furthermore, we have considered the possibility of the experimental data certifying a nonprojective qubit measurement. However, to this end, we found that another choice of *k* is optimal with respect to the witness value that is achievable under projective measurements. We found that the optimal choice is *k* ≈ 4.5. The corresponding experimentally measured value becomes

## DISCUSSION

We investigated the problem of self-testing nonprojective measurements. We argued that a prepare-and-measure scenario with an upper bound on the Hilbert space dimension represents a natural framework for investigating this problem. We considered both the qualitative certification of a measurement being nonprojective and/or genuine four-outcome, as well as a quantitative characterization in terms of worst-case fidelity to a given target POVM. We demonstrate the practical relevance of these methods in two experiments in which we both certify a genuine four-outcome POVM and infer a high worst-case fidelity with respect to target symmetric qubit POVMs.

It would be interesting to overcome the limitation of the swap method and develop a rigorous robust self-testing method for general four-outcome qubit POVMs. Also extending these methods to high-dimensional POVMs would be relevant since there exist extremal nonprojective measurements that feature the same number of outcomes as projective measurements (contrary to the qubit case). Moreover, it would be interesting to investigate self-testing of nonprojective measurements using different assumptions as in our work. One could consider for instance prepare-and-measure scenarios with a bound on the entropy (*38*), the overlap between the prepared states (*8*), or their mean energy (*39*). Last, one may ask whether it would be possible to robustly self-test a nonprojective measurement in the fully device-independent case, i.e., returning to the Bell scenario without any assumption on the dimension.

## MATERIALS AND METHODS

In the experiment, the qubit states are encoded in the polarization degree of freedom of a single photon, with the convention of ∣*H*〉 ≡ ∣0〉 and ∣*V*〉 ≡ ∣1〉. The setup is depicted in Fig. 5.

Alice’s station includes a heralded single-photon source where femtosecond laser pulses at 390 nm are converted into pairs of photons at 780 nm, through type I spontaneous parametric down-conversion in two orthogonally oriented beta-barium borate crystals. Photon pairs go through 3-nm spectral filters and are then coupled into two single-mode fibers for spatial mode filtering. The idler photon is sent to the trigger avalanche photodiode (APD) detector (T) and heralds the presence of a signal photon. The latter is then emitted again into free space and undergoes Alice’s state preparation, consisting of a fixed linear polarizer, a λ/4 wave plate [or quarter–wave plate (QWP)], and a λ/2 wave plate [or half–wave plate (HWP)].

Upon preparing the required qubit state, Alice forwards the signal photon to Bob’s measurement station, where it goes through a double-path Sagnac interferometer, each path of which contains an HWP. The interferometer mixes the polarization degree of freedom with path, effectively enabling Bob to perform either projective or nonprojective measurements in the original polarization Hilbert space where the qubit was prepared, thanks to the two polarization analyzers at the outputs. Each of these consists of a phase plate, an HWP, and (in output 1) a QWP, a polarizing beam splitter and two single-photon detectors. Outputs from all detectors (T and D1 to D4) are sent to a coincidence unit connected to a computer.

All measurements were performed with heralded photon rates of approximately 1 × 10^{4} counts per second, while each setting was measured for 500 s. We have made an assumption of fair sampling, i.e., that the detection events are representative of the total number of signal photons. This assumption is reasonable for tasks that do not include a notion of an adversary. The quality of state preparation and measurement can be estimated by preparing states ∣*H*〉, * _{z}*, σ

*, and σ*

_{x}*, respectively. The three visibilities obtained in our setup with this characterization measurement were*

_{y}While the almost optimal

*Note added.* During the completion of this manuscript, we became aware of an independent work (*40*) discussing the certification of qubit POVMs.

## SUPPLEMENTARY MATERIALS

Supplementary material for this article is available at http://advances.sciencemag.org/cgi/content/full/6/16/eaaw6664/DC1

This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license, which permits use, distribution, and reproduction in any medium, so long as the resultant use is **not** for commercial advantage and provided the original work is properly cited.

## REFERENCES AND NOTES

**Acknowledgments:**We thank J. Kaniewski for insightful comments.

**Funding:**This work was supported by the Swiss National Science Foundation (starting grant DIAQ, NCCR-QSIT), the Swedish Research Council, and Knut and Alice Wallenberg Foundation. T.V. was supported by the National Research, Development and Innovation Office NKFIH (grant nos. K111734 and KH125096).

**Author contributions:**A.T. and T.V. proposed the basic concept. A.T., T.V., and N.B. developed the theory. M.S. performed the experiments and the data analysis supported by M.B. All authors discussed the results and participated in the writing of the manuscript.

**Competing interests:**The authors declare that they have no competing interests.

**Data and materials availability:**All data needed to evaluate the conclusions in the paper are present in the paper and/or the Supplementary Materials. Additional data related to this paper may be requested from the authors.

- Copyright © 2020 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC).