Quantum Tomography under Prior Information
Teiko Heinosaari, Luca Mazzarella, Michael M. Wolf
Introduction
Quantum tomography aims at identifying states of a quantum system. In order to achieve this aim, measurement data is often supplemented by prior information. In this work we consider cases where prior information effectively reduces the dimensionality, i.e., the number of parameters which are necessary to characterize the state of a system. Physically, one may think of scenarios of interferometry, process tomography or parameter estimation, where one prepares the initial state which then evolves depending on a certain number of unknown parameters before one measures the final system. Effective reductions of the number of parameters can also be due to a constraining symmetry or fixed energy or particle number.
We are interested in determining the minimal number of measurement outcomes or measurement settings which are required for identifying a state taken from such a reduced set. Clearly, if states in the considered set are parameterized by a certain number of independent real parameters, then we need at least this number of measurement outcomes or binary measurements in order to pinpoint the state. As an example take the manifold of pure states in a -dimensional Hilbert space. Their description requires real parameters, as opposed to real parameters needed to describe an arbitrary density matrix. So if we want to determine a pure state by a single measurement with outcomes, seems necessary and appears to be achievable. The question about the smallest possible in this particular example has been addressed in a number of publications , but the answer has remained somewhat elusive, so far.
A related problem has been addressed based on compressed sensing ideas, where it has been shown that for density matrices of rank , binary measurements are sufficient in order to identify the state with high probability. In this light we emphasize that our focus lies on schemes which identify the unknown state unambiguously and deterministically. In this work we disregard questions regarding the robustness of the schemes, the complexity of the necessary post-processing of the measurement data or the ability to verify the assumed prior information.
Outline. Typically, when we want to know the minimal number of measurement outcomes related to some subset of states, we need to independently consider upper and lower bounds. In Section 3 we concentrate on upper bounds, while in Section 4 we provide some methods to obtain lower bounds. The upper bounds are of geometric or algebraic nature and often come with concrete constructions. The lower bounds are mainly topological in nature. They are based on the observation that any measurement which is informationally complete when supplemented by prior information is a mapping into the space of measurement outcomes which preserves topological invariants. In the case of pure states the upper and lower bounds essentially match which enables us for instance to show that for a minimal informational complete measurement up to an additive logarithmic correction. This exemplifies our general finding that, loosely speaking, on the one hand topological obstruction can force to be twice the dimension of a considered manifold, whereas on the other hand geometric reasoning allows us to show that such an overhead is essentially always sufficient.
Notation. In this paper is a fixed finite dimensional complex Hilbert space. We denote by the set of linear operators on and we will write for the operator norm which coincides with the largest singular value of . A positive operator having trace one is a density operator, also referred to as state, and we denote by the set of all states on .
Quantum tomography prerequisites
In this section we will introduce and analyze the linear algebra framework for quantum tomography. Later, in Section 4, we will then add a topological perspective.
The goal of quantum tomography is to identify an unknown quantum state from the statistics of measurement outcomes. We will consider tomographic schemes where a fixed (as opposed to an adaptive) measurement setting is chosen, and we will refrain from considering errors for instance caused by finite statistics. In particular, in the following “ measurement ”always refers to a full statistical experiment rather than to a single-shot experiment.
Our analysis deals with several different albeit closely related approaches: 1) single measurements with many outcomes; 2) several measurements with possibly fewer outcomes; 3) several measurements where only expectation values are considered.
For a POVM and a state (density matrix) , we denote by the corresponding probability distribution of measurement outcomes. It is given by the formula
Let be a subset of density matrices. A POVM is called informationally complete w.r.t. if is injective on , i.e.,
In a more general scheme one may perform several measurements and use all of their measurement outcome statistics. Let be a finite collection of POVMs. We denote by the linear span of the set
where is the number of measurement outcomes of , is the number of measurement outcomes of , and so forth. In particular,
Every operator can be written as a linear combination of two selfadjoint operators,
which belong to . If is selfadjoint, then can be written as a linear combination of two positive operators,
For any POVM , the generated operator system cannot have more linearly independent elements than has outcomes. This implies the last claim. ∎
We denote by the orthogonal complement of an operator system with respect to the Hilbert-Schmidt inner product. In particular, for a POVM we have
The complement of is related to the informational completeness of in the following simple way.
Let be an operator system and let be a set of states. Then a POVM satisfying is informationally complete w.r.t. iff
In other words, for any pair of states the following are equivalent:
,
.
For all states , we have:
Let be a subset of states. A collection of selfadjoint operators is called informationally complete w.r.t. if
for all .
We are interested in questions of the type: what is the minimal number of
needed to have an informationally complete measurement scheme w.r.t. to a given premise ?
Let . The following are equivalent.
There exists a POVM with outcomes and is informationally complete w.r.t. .
There exists a set of selfadjoint operators and is informationally complete w.r.t. .
As a consequence of Proposition 3 we only need to calculate the minimal number of either selfadjoint operators or POVM elements.
(Minimal informationally complete measurements) Let be any subset of states. We denote by
- the minimal number of selfadjoint operators which are informationally complete w.r.t. .
- the minimal number of positive operators determining a POVM which is informationally complete w.r.t. (i.e., the number of POVM elements subtracted by one).
The remaining part of this work is devoted to deriving upper and lower bounds on either for various concrete subsets or depending on specific properties of – in particular its dimensionality and topology.
Upper bounds
As the first class of examples, we consider the subset of all states whose rank is bounded by a number where . How many measurement outcomes suffice in order to identify a state taken from ?
If , then there exists a POVM which is informationally complete w.r.t. and has outcomes. Therefore,
We will construct a subspace of matrices with the properties that
The main part of our construction follows . The following fact will be needed. Let be a totally nonsingular -matrix with positive entries. (Recall that a matrix is called totally nonsingular if all of its minors are nonzero.) For instance, a Vandermonde matrix of the form
We will now define a set of matrices that span the sought subspace . By the th diagonal of a matrix we mean the elements with . (In other words, we label the diagonals from the lower left corner upwards. The main diagonal is then the th diagonal.)
For each satisfying , we build up matrices as follows. We choose columns from a totally nonsingular -matrix and we put them to the th diagonal and ’s elsewhere, hence obtaining linearly independent matrices. Any matrix which is a linear combination of these matrices has at least nonzero elements on the th diagonal. Since all the matrix elements not in the th diagonal are , we see that the has rank at least .
In total, we have built up linearly independent matrices. The previously mentioned subspace is the linear span of these matrices. The properties (a)-(c) are immediate consequences of the construction. To verify (d), suppose that . Let be the largest such that the th diagonal of contains nonzero elements. Then the th diagonal contains actually nonzero elements. The submatrix having those nonzero elements in its main diagonal is lower triangular, therefore has nonzero determinant. It follows that the rank of is at least . ∎
Let us remark that a recently introduced method based on compressed sensing uses outcomes to identify a rank- state with high probability . The number of outcomes given in Theorem 1 therefore beats the compressed sensing approach. The latter, however, might be advantageous regarding the simplicity of the classical post-processing, robustness and verifiability of the assumption.
2. Pure states
We will now have a closer look at the set of pure states. For this case, Theorem 1 implies that . We will first provide some simple arguments showing that indeed for . In general, we will see later, based on topological reasoning, that the leading order is the best possible.
We remark that Flammia et al. proved that a POVM which is informationally complete w.r.t. the set of pure states has at least outcomes . They also constructed a POVM with elements capable of distinguishing almost all (but not all) pairs of pure states. A similar construction was given by Finkelstein .
For a POVM , the following conditions are equivalent:
is not informationally complete w.r.t. the set of pure states.
From Proposition 4 we conclude the following simple characterization.
It is an immediate consequence of Corollary 1 that in the qubit case (i.e. ) a POVM is informationally complete w.r.t. pure states if and only if . Therefore, in the qubit case informational completeness for pure states implies informational completeness for all states. One can also easily see this by a direct inspection of the Bloch sphere.
contains rank-2 matrices but no rank-2 selfadjoint matrix.
As an application of Corollary 1, we can easily characterize all POVMs in dimension which are informationally complete w.r.t. the set of pure states.
(i.e. is informationally complete w.r.t. all states).
In particular, a minimal POVM which is informationally complete w.r.t. pure states has outcomes.
We conclude that if is informationally complete for the set of all pure states, then either or . In the latter case is informationally complete for all states. If , then and is thus generated by a single selfadjoint operator . From Cor. 1 follows that is informationally complete w.r.t. all pure states exactly when is invertible. ∎
The characterization given in Proposition 5 helps to illustrate one possible drawback of POVMs which are merely informationally complete w.r.t. to a subset of states: even if a POVM can distinguish any pair of distinct pure states, it may not be capable of distinguishing pure states from mixed states. In other words, we may not be able to verify the premise from the measurement outcome statistics.
We already noted that Theorem 1 leads to the general upper bound albeit without yielding an explicit set of operators. Now, we give another proof of this result via a simple direct construction. The construction is inspired by a topological embedding given by James in . Consider two types of matrices and which we label by integers and respectively. The ’s are taken to be such that \big{(}X_{\alpha}\big{)}_{kl}=\delta_{k+l,\alpha+1}, i.e., there are ’s along the ’th anti-diagonal and zeroes elsewhere. The ’s are similarly defined with nonzero entries solely along the anti-diagonals, in this case \big{(}Y_{\beta}\big{)}_{kl}=0 unless . The entries are chosen such that the matrices are anti-symmetric with entries below the diagonal. The constructed matrices thus have the following structure:
where we replaced for all and exploited that is upper triangular and that for all . Together with the hypothesis in Eq.(3) and the assumption that \big{(}C_{\gamma}\big{)}_{n,m+1}\neq 0 this implies indeed that .
Note that for and . Hence, if for every , then for every . The remaining condition holds due to . Therefore it follows from Eq.(3) that the set is informationally complete w.r.t. the set of all pure states. ∎
It is possible to slightly improve the upper bound at the cost of having to deal with a more complicated set of operators introduced in the work of Milgram :
For odd this upper bound can be worse than the previously derived , but for most dimensions it is below . Notice that satisfies . We will see later that this improvement is, in fact, nearly optimal. With this refined upper bound we can calculate the exact value of for small dimensions in Sec. 4.2.
The construction of the operators is based on the work of Milgram . We will only argue why this corresponds to a proper measurement scheme rather than reproducing the rather cumbersome construction.
The set of selfadjoint matrices therefore leads to a measurement scheme which is informationally complete w.r.t. the set of pure states. ∎
3. Generic bounds depending on fractal dimension
For instance, if is a smooth manifold of real dimension , then .
Let be a closed subset of the set of density matrices with Minkowski dimension . Then almost any (in the Lebesgue measure sense) collection of selfadjoint operators is informationally complete w.r.t. . In particular, .
In principle, this bound can be refined to , where is the Hausdorff dimension of the set . This bound is generally better since , but it may be more difficult to handle. We also mention that for the inverse mappings Hölder continuity can be proven and the respective constants can be bounded .
Let us apply Theorem 4 to the case of pure states. This is a smooth manifold of real dimension , hence . Thus, almost any collection of selfadjoint operators is informationally complete w.r.t. pure states.
Lower bounds
Let be a POVM with outcomes. This induces a map
If is a closed surface without boundary, then . Moreover, if is non-orientable in addition, then .
The first three matrices in (8) give rise to measurement results which form the Roman surface displayed in Fig.2. The failure of informational completeness (due to disregarding the necessary fourth measurement) is reflected by self-intersections of the surface.
2. Measurements as diffeomorphisms
Manifolds of interest in quantum tomography often have a differentiable structure – they are smooth manifolds. In such a case we may resort to differential topology, which imposes more restrictive conditions on the existence of smooth embeddings. Before we apply these conditions to the concrete cases of pure states and states with general rank constraints we provide some general background.
will play an important role in the following.
To show that is a smooth embedding we need to prove (i) that it is smooth, which follows from linearity, (ii) that it is a topological embedding, which follows from the assumed injectivity and Prop. 6 and (iii) that it has an injective derivative everywhere.
The inclusion holds for all if is the complex Grassmannian manifold understood as the submanifold in the space of selfadjoint matrices which consists of all orthogonal projections of rank .
is an element of and in fact, such derivatives span the entire tangent space
In order to see this we have to show that they span a vector space which has the same dimension as the manifold (for which ). To this end, note that there is a one-to-one relation between commutators and block off-diagonal matrices in the sense that we can always write
In a suitable basis any element is such that
since Eq.(11) allows us to work with the singular values of by transforming with appropriate unitaries and . Setting equal to the operator norm of we complete the proof if we show that every matrix of the form with is a difference of two projections. This can seen to be true by taking the difference of two pure qubit states whose Bloch vectors are parameterized by . ∎
where denotes the number of ’s in the binary expansion of and is the real dimension of the manifold.
If we now combine this non-embedding result with the upper bounds discussed in Subsection 3.2, we obtain a fairly comprehensive picture on the minimal number of measurements required to identify an element of the set of pure states. The bounds for the minimal number for the dimensions are summarized in the table below.
For all dimensions we can say that the difference between the upper and lower bounds is at most , and that the minimal number differs from the best affine upper bound by at most . The lower and upper bounds up to are presented in Fig.4.
Summary
How many measurement outcomes (i.e., POVM elements) are minimally needed in order to identify all quantum states from a given set? We have shown on the one hand that if the set is a manifold, then topological obstructions can increase the number of required measurements by a factor of two over the dimensionality of the manifold. On the other hand we have seen that this factor of two is sufficient even in a more general context where the considered set is not necessarily a manifold and its dimensionality is understood as its Minkowski dimension.
We have considered two different types of examples: 2-manifolds, where (non-)orientability plays an important role, and Grassmannian manifolds, which contain the set of all pure states as a particular instance.
For the latter case we have shown that upper and lower bounds—both obtained via topological embeddings—essentially match. In fact, they never differ by more than . To be more precise, their difference is strictly less than the number of ones appearing in the binary expansion of .
Points which are beyond the present work, albeit obviously of practical relevance, are the inversion algorithm and issues of robustness and certifiability. For instance, we do not know whether in the pure state case can be achieved in a way such that (i) the inversion is algorithmically efficient and (ii) the validity of the assumption behind the prior information is certifiable.
Acknowledgements
We thank Claudio Carmeli for pointing out an error in the earlier version of Theorem 3.
MMW was supported by the EU-STREP projects COQUIT and QUEVADIS and the Alfried Krupp von Bohlen und Halbach-Stiftung. TH was supported by the Emil Aaltonen Foundation, Alfred Kordelin Foundation and Academy of Finland (grant no. 1381359). TH and MMW are both grateful for the hospitality and the inspiring working environment at the Institut Mittag-Leffler in Stockholm, where this work was partly done.