The problem of mutually unbiased bases in dimension 6

Philippe Jaming, Mate Matolcsi, Peter Mora

Introduction

This paper is based on the talk given by the second author at the International Conference on Design Theory and Applications, NUI, Galway, July 1-3, 2009.

The notion of mutually unbiased bases (MUBs) constitutes a basic concept of Quantum Information Theory and plays an essential role in quantum-tomography , quantum cryptography , the mean king problem as well as in constructions of teleportation and dense coding schemes .

One reason for the slow progress is that mutually unbiased bases are naturally related to complex Hadamard matrices. Indeed, if the bases B0,,Bm\mathcal{B}_{0},\ldots,\mathcal{B}_{m} are mutually unbiased we may identify each Bl={e1(l),,ed(l)}\mathcal{B}_{l}=\{\mathbf{e}_{1}^{(l)},\ldots,\mathbf{e}_{d}^{(l)}\} with the unitary matrix

In particular, the existence of 4 MUBs B0,B1,B2,B3\mathcal{B}_{0},\mathcal{B}_{1},\mathcal{B}_{2},\mathcal{B}_{3} in dimension 6 is equivalent to the existence of 3 mutually unbiased Hadamard matrices H1,H2,H3.H_{1},H_{2},H_{3}. Therefore, in an attempt to prove that no collection of 4 MUBs exist in dimension 6 it is enough to prove that no triplet of mutually unbiased Hadamard matrices exist. This will be the core of our argument in this note.

A complete classification of complex Hadamard matrices, however, is only available up to dimension 5 (see ) which allows for a complete classification of MUBs (see ). The classification in dimension 6 is still out of reach despite recent efforts . This is one of the reasons for Problem 1.1 to be difficult.

In this paper we outline a discretization approach that is likely to lead to the proof of Conjecture 1.2 in the near future. Once all the ideas are properly implemented in a computer code, an exhaustive search will be carried out to prove Conjecture 1.2. We will include all the basic definitions and ideas as well as some preliminary results. We remark here that a partial result of this approach has already been completed: in we assumed that the first Hadamard matrix H1H_{1} comes from the two-parameter Fourier family of complex Hadamard matrices, and we proved by discretization and an exhaustive computer search that in such a case a MUB-quartet I,H1,H2,H3I,H_{1},H_{2},H_{3} cannot exist. In this paper, however, we tackle the general case, so that we must consider H1H_{1} as any complex Hadamard matrix of dimension 6. This complicates matters quite considerably as the number of cases to check after the discretization increases by orders of magnitude. As an optimistic note let us recall here that the non-existence of a projective plane of order 10 was also proved by an exhaustive computer search .

Discretization

After multiplying rows and columns by appropriate scalars if necessary, we can assume that all coordinates of the first row and column of AA are 1’s, and all coordinates of the first column of all other matrices are 1’s (i.e. we assume that all vectors in the bases A,B,CA,B,C have first coordinate 1, and the first vector in basis AA is an all 1’s vector). All the other coordinates in the matrices are complex numbers of modulus 1, i.e. they are of the form e2πiρe^{2\pi i\rho} with ρ[0,1)\rho\in[0,1). We will use a discretization approach. Let NN be a positive integer, called the discretization parameter. We partition the interval [0,1)[0,1) into NN sub-intervals I0(N),I1(N),,IN1(N)I_{0}^{(N)},I_{1}^{(N)},\ldots,I_{N-1}^{(N)} of equal length, i.e. Ij(N)=[j/N,(j+1)/N)I_{j}^{(N)}=[j/N,(j+1)/N). (Other partitions are also possible, but this seems most convenient for programming.) Now, any entry e2πiρe^{2\pi i\rho} in any of the matrices A,B,CA,B,C will be represented by the integer jj if ρIj(N)\rho\in I_{j}^{(N)} (note that 0jN10\leq j\leq N-1). This means: whenever we see an entry jj somewhere in a matrix then we conclude that the original phase ρ\rho must lie somewhere in the interval Ij(N)I_{j}^{(N)}. We have no more and no less information than this. We also agree that the first coordinate of each row will be represented by , keeping in mind that it represents exactly 1, without error (and not the interval I0(N)I_{0}^{(N)}).

In short: we will exclusively be dealing with row vectors of the form

There are altogether N5N^{5} vectors of the form (1).

Also, there is a natural ordering among these vectors: uvu\leq v if and only if it is so in lexicographical order. We will use this ordering throughout this note.

We will say that a vector uu of the form (1) belongs to ORTNORT_{N} if there exist ϕkIjk\phi_{k}\in I_{j_{k}} such that 1+k=15e2iπϕk=01+\sum_{k=1}^{5}e^{2i\pi\phi_{k}}=0.

This is too crude, but we can iterate it to the “children” of uu. Namely, assume that the numbers ϕk\phi_{k} exist as in Definition 3.2. For each interval IjkI_{j_{k}} the value ϕk\phi_{k} must lie in either the left or the right half of IjkI_{j_{k}}. There are 32 choices, according to whether we consider the left or the right half of each interval IjkI_{j_{k}}. These choices are called the 32 “children” of uu. Clearly, at least one of these children needs to satisfy (2) with 5π2N\frac{5\pi}{2N} on the right hand side (and its own midpoints substituted to the left hand side, of course, instead of rjkr_{j_{k}}). If none of the children satisfy this, then uu can be discarded. Of course we iterate this to grandchildren, and so on, down to 7-8 generations. The vector uu survives this test if it has at least one surviving descendant in each generation.

The set ORTNORT_{N} is clearly invariant under permutations of the last 5 coordinates j1,j2,j3,j4,j5j_{1},j_{2},j_{3},j_{4},j_{5}. Therefore it makes sense to introduce the set ORTN,monORT_{N,mon} of vectors in ORTNORT_{N} with monotonically increasing coordinates. To save time, in the actual computer code we first find the vectors of ORTN,monORT_{N,mon} by the method above, and then we permute the last 5 coordinates to arrive at the set ORTNORT_{N}.

There exists also an improved error bound (see Lemma 3.2 in ). It is somewhat slower to check by computer and it is reasonable to believe that we arrive at the same set ORTNORT_{N} by applying either error bounds.

We have implemented a computer code for selecting the set ORTNORT_{N}. For example, for N=17N=17 we have ORTN=58450|ORT_{N}|=58450, for N=19N=19, ORTN=82630|ORT_{N}|=82630, and for N=53N=53, ORTN=1875110|ORT_{N}|=1875110. Experience shows that the set ORTNORT_{N} is unexpectedly large if NN is divisible by 2 or 3. Therefore, we have mainly restricted our attention to NN being a prime.

The optimal choice of NN seems to be crucial for the success of the project. Clearly, if NN is too small then the error bounds are not good enough and we will not reach a contradiction in the forthcoming steps (see Section 4 below). However, if NN is too large then the size of the sets ORTNORT_{N} and correspondingly HADNHAD_{N} will be far too large to be manageable. At present we believe that the optimal choice of NN is around N50N\approx 50.

We will say that the vectors u=(0,j1,j2,j3,j4,j5)u=(0,j_{1},j_{2},j_{3},j_{4},j_{5}) and v=(0,m1,m2,m3,m4,m5)v=(0,m_{1},m_{2},m_{3},m_{4},m_{5}) are NN-orthogonal if there exist numbers ϕk\phi_{k} and ψk\psi_{k} in the intervals IjkI_{j_{k}} and ImkI_{m_{k}}, such that 1+k=15e2iπ(ϕkψk)=01+\sum_{k=1}^{5}e^{2i\pi(\phi_{k}-\psi_{k})}=0.

This property is clearly shift-invariant in the sense that it only depends on the values (j1m1,j5m5)(j_{1}-m_{1},\dots j_{5}-m_{5}) modulo NN. We can therefore take m1==m5=0m_{1}=\dots=m_{5}=0 and correspondingly v0=(0,0,0)v_{0}=(0,0\ldots,0), (where the last 5 coordinates represent the interval I0I_{0}, of course) and define the set ORTeps,NORT_{eps,N} as the set of vectors of the form (1) which are NN-orthogonal to v0v_{0}. (The notation ORTeps,NORT_{eps,N} indicates that the vector v0v_{0} contains an “epsilon” of error, because the last 5 coordinates represent the interval I0I_{0} and not the exact number 1.) With this notation the shift-invariance means that uu and vv will be NN-orthogonal if and only if the vector (j1m1,j5m5)(mod N)(j_{1}-m_{1},\dots j_{5}-m_{5})(mod\ N) is in ORTeps,NORT_{eps,N}.

Having constructed the set ORTNORT_{N} previously, it is now easy to obtain ORTeps,NORT_{eps,N}. Indeed, by definition a vector u=(0,j1,j5)u=(0,j_{1},\dots j_{5}) can only be NN-orthogonal to v0v_{0} if there exist numbers ϕk\phi_{k} in the intervals IjkI_{j_{k}} and ψk\psi_{k} in [0,1N)[0,\frac{1}{N}), such that 1+k=15e2iπ(ϕkψk)=01+\sum_{k=1}^{5}e^{2i\pi(\phi_{k}-\psi_{k})}=0. But then the numbers ϕkψk\phi_{k}-\psi_{k} must fall in the intervals IjkϵkI_{j_{k}-\epsilon_{k}} where ϵk\epsilon_{k} is either 0 or 1, and hence the vector uϵ=(0,j1ϵ1,,j5ϵ5)u_{\epsilon}=(0,j_{1}-\epsilon_{1},\ldots,j_{5}-\epsilon_{5}) is in ORTNORT_{N}.

Therefore, ORTeps,NORT_{eps,N} will consist of all the vectors of the form uϵ=(0,j1+ϵ1,,j5+ϵ5)u^{\epsilon}=(0,j_{1}+\epsilon_{1},\ldots,j_{5}+\epsilon_{5}), where ϵk\epsilon_{k} is 0 or 1, and the vector (0,j1,,j5)(0,j_{1},\ldots,j_{5}) is in ORTN.ORT_{N}.

Each uORTNu\in ORT_{N} gives rise to 32 different uϵu^{\epsilon} above. One could therefore expect that the size of ORTeps,NORT_{eps,N} will be nearly 32 times the size of ORTNORT_{N}. This is not so, however, because there will be many coincidences. Experience shows that the size of ORTeps,NORT_{eps,N} is approximately 4 times the size of ORTNORT_{N}, regardless of the value of NN.

– each row and column must come from ORTNORT_{N}.

– each row (resp. column) must be lexicographically larger than any previous rows (resp. columns). In particular, the entries of the second row and column are monotonically increasing, i.e. they belong to ORTN,monORT_{N,mon}.

– the second column must be lexicographically larger than or equal to the second row.

– each row (resp. column) must be NN-orthogonal to any previous rows (resp. columns). This is equivalent to the fact that the pairwise differences of the rows (resp. columns) modulo NN must be contained in ORTeps,NORT_{eps,N}.

– each row (resp. column) must be compatible with the already existing entries of the matrix (e.g. when we fit in the fourth row, then its first 3 coordinates are already fixed because the first three columns of the matrix have already been filled out previously).

We will say that a vector u=(0,j1,j2,j3,j4,j5)u=(0,j_{1},j_{2},j_{3},j_{4},j_{5}) belongs to the set UBNUB_{N} if there exist ϕkIjk\phi_{k}\in I_{j_{k}} such that 1+k=15e2iπϕk=6|1+\sum_{k=1}^{5}e^{2i\pi\phi_{k}}|=\sqrt{6}. We will say that uu belongs to UBN,monUB_{N,mon} if the coordinates of uu are monotonically increasing.

The set UBNUB_{N} can be constructed in a similar way as ORTNORT_{N}. With rjkr_{j_{k}} denoting the midpoint of the interval IjkI_{j_{k}} the trivial estimate gives

This is too crude, of course, and the descendants of uu need to be checked for some 7-8 generations.

Once again, the set UBNUB_{N} is invariant under the permutation of the last 5 coordinates j1,j2,j3,j4,j5j_{1},j_{2},j_{3},j_{4},j_{5}. Therefore, in practice, we first check monotonically increasing vectors only, and obtain UBN,monUB_{N,mon}. Then we permute the coordinates to obtain UBNUB_{N}.

The set UBNUB_{N} is much larger than ORTNORT_{N}. This can be expected because orthogonality of complex vectors induces two conditions (the real part and imaginary part both being zero) while unbiasedness only induces one condition.

We have implemented a code for listing the set of vectors UBNUB_{N}. For example, for N=17N=17 we have UBN=479340|UB_{N}|=479340, while for N=19N=19, UBN=764060|UB_{N}|=764060.

We will also need a set UBeps,NUB_{eps,N} which is analogous to ORTeps,NORT_{eps,N}.

We will say that the vectors u=(0,j1,j2,j3,j4,j5)u=(0,j_{1},j_{2},j_{3},j_{4},j_{5}) and v=(0,m1,m2,m3,m4,m5)v=(0,m_{1},m_{2},m_{3},m_{4},m_{5}) are NN-unbiased if there exist numbers ϕk\phi_{k} and ψk\psi_{k} in the intervals IjkI_{j_{k}} and ImkI_{m_{k}}, such that 1+k=15e2iπ(ϕkψk)=6|1+\sum_{k=1}^{5}e^{2i\pi(\phi_{k}-\psi_{k})}|=\sqrt{6}.

This property is again shift-invariant in the sense that it only depends on the values (j1m1,j5m5)(j_{1}-m_{1},\dots j_{5}-m_{5}) modulo NN. We can therefore take m1==m5=0m_{1}=\dots=m_{5}=0 and correspondingly v0=(0,0,0)v_{0}=(0,0\ldots,0), (where the last 5 coordinates represent the interval I0I_{0}, of course) and define the set UBeps,NUB_{eps,N} as the set of vectors of the form (1) which are NN-unbiased to v0v_{0}. With this notation the shift-invariance means that uu and vv will be NN-unbiased if and only if the vector (j1m1,j5m5)(mod N)(j_{1}-m_{1},\dots j_{5}-m_{5})(mod\ N) is in UBeps,NUB_{eps,N}.

Finally, we remark that the entire discretization procedure described above has already been completed in in the restricted setting when AA is assumed to belong to the Fourier family F(a,b)F(a,b) of complex Hadamard matrices.

[Theorem 1.4 in ] None of the pairs \bigl{(}Id,F(a,b)\bigr{)} of mutually unbiased orthonormal bases can be extended to a quartet \bigl{(}Id,F(a,b),B,C\bigr{)} of mutually unbiased orthonormal bases.

References