Sinkhorn normal form for unitary matrices

Martin Idel, Michael M. Wolf

Introduction

For every n×nn\times n matrix AA with positive entries there exist two diagonal matrices L, RL,~{}R such that LARLAR is doubly stochastic, i.e. the entries of each column and row sum up to one. This result was first obtained by Sinkhorn [Sin64], who also gave an algorithm how to compute LL and RR by iterated left and right multiplication of diagonal matrices.

Recently, De Vos and De Baerdemacker studied the same problem for unitary matrices [DVB14a]. They conjectured that for every n×nn\times n unitary UU there exist two unitary diagonal matrices L,RL,R such that LURLUR has all row and column sums equal to one. To support their conjecture, they construct an algorithm similar to the iteration procedure for matrices with positive entries from [Sin64, SK67]. They also provide numerical evidence that the algorithm always converges to a unitary matrix with row and column sums equal to one.

The goal of this paper is to prove the conjecture of De Vos and De Baerdemacker that such a normal form always exists by reformulating the problem in terms of symplectic topology. It turns out that the reformulated problem is a special case of the Arnold (sometimes Arnold-Givental) conjecture on the intersection of Lagrangian submanifolds [MS98], which was solved for this case in [BEP04, Cho04]. More precisely, in section 2 we show:

For every unitary matrix UU(n)U\in U(n) there exist two diagonal unitary matrices L,RU(n)L,R\in U(n) such that A:=LURA:=LUR satisfies jAji=jAij=1\sum_{j}A_{ji}=\sum_{j}A_{ij}=1 for all i=1,ni=1,\ldots n.

For a given unitary UU(n)U\in U(n) the triple (L,R,A)(L,R,A) is certainly not unique, since multiplying LL by a global phase and RR by its inverse does not change LARLAR. Hence, it makes sense to consider the decomposition U=eiφLARU=e^{i\varphi}L^{\prime}AR^{\prime}, where L,RL^{\prime},R^{\prime} are unitary diagonal such that L11=R11=1L^{\prime}_{11}=R^{\prime}_{11}=1 and φ[0,2π)\varphi\in[0,2\pi). In particular, for U(2)U(2), a simple complete solution was given in [DVB14a] from which one can see that for every non-diagonal matrix, there are only two different AA such that eiφLAR=Ue^{i\varphi}LAR=U. For n>2n>2 the picture is less clear and the reformulation in terms of symplectic topology appears to give further insight into the freedom of the decomposition.

In addition to the Sinkhorn-type normal form above, in section 3 we give several reformulations that might be interesting for applications, for instance regarding the decomposition of general 2n2n-port linear optics devices into canonical multiports and phase shifters.

Sinkhorn-type normal form

Likewise, since Ae=Ae\overline{A}e=Ae and AA is unitary, we obtain

so that columns and rows of AA sum up to one.

The definition is slightly different from the one in [BEP04], where the authors only consider nonempty open sets such that the restriction of ω\omega to these sets is exact. However, they prove that the torus TnT^{n} is displaceable in the above definition, if and only if there exists an open neighborhood VTn\mathcal{V}\supset T^{n} such that ωV\omega|_{\mathcal{V}} is exact and V\mathcal{V} is displaceable. With this we can state the final and crucial ingredient in the proof of the normal form:

Because every unitary matrix defines a Hamiltonian isotopy (see proposition 5 in the appendix), the theorem tells us in particular TnUTnT^{n}\cap UT^{n}\neq\emptyset for all unitaries UU(n)U\in U(n) so that together with lemma 1 this proves the sought normal form:

For every unitary matrix UU(n)U\in U(n) there exist two diagonal unitary matrices L,RU(n)L,R\in U(n) such that A:=LURA:=LUR fulfills jAji=jAij=1\sum_{j}A_{ji}=\sum_{j}A_{ij}=1 for all i=1,ni=1,\ldots n.

Equivalent normal forms for unitary matrices

Now let AU(n)A\in U(n) be such that Ae=ATe=eAe=A^{T}e=e. Then FnAFne0=e0F_{n}^{\dagger}AF_{n}e_{0}=e_{0} and similarly, (FnAFn)Te0=FnATFne0=e0(F_{n}^{\dagger}AF_{n})^{T}e_{0}=F_{n}A^{T}F_{n}^{\dagger}e_{0}=e_{0}, which shows that

This decomposition has an immediate application in quantum optics, where any n×nn\times n unitary corresponds to a passive transformation on nn modes or a 2n2n-multiport. In this scenario a diagonal unitary corresponds to a set of phase shifters, which are applied to the modes individually and the discrete Fourier transformation is known as canonical 2n2n-multiport [MMW+95], which may be implemented by a symmetric fibre coupler. The structure of the corresponding decomposition is graphically depicted in Figure 1.

Let us finally discuss the question of uniqueness of these decompositions and to this end come back to the original normal form

where D1,D2D_{1},D_{2} are unitary diagonal with (Di)11=1(D_{i})_{11}=1 and AA has row and column sums equal to 1. Counting parameters, using that the matrices AA are isomorphic to U(n1)U(n-1) as proven above, we have:

parameters (c.f. [DVB14a]). Hence, the number of parameters matches exactly the dimension of U(n)U(n). Given a unitary U=eiφD1AD2U=e^{i\varphi}D_{1}AD_{2} as above, this means that it might be reasonable to expect only a discrete set of different decompositions or at least a discrete set of AA that UU can be scaled to. The exact number of different AA can easily be seen to be two for the case n=2n=2 (c.f. [DVB14a]), but already for n=3n=3 and n=4n=4, there is only a conjectured bound (6 and 20, c.f. [Shc13]).

In [Cho04] it is proven that if TnT^{n} and UTnUT^{n} intersect transversally, their number of distinct intersection points must be at least 2n2^{n}, which follows from general results in Floer-homology theory when applied to Lagrangian intersection theory. Since transversality is a generic property for intersections, one might therefore conjecture that for a generic unitary UU(n)U\in U(n) [Cho04] implies a lower bound 2n12^{n-1} on the number of different normal forms. However, it is not true that we always have a discrete number of decompositions or (in contrast to the 2×22\times 2 case) at least a discrete number of AA such that AA has row and column-sums equal to one and eiφLAR=Ue^{i\varphi}LAR=U. A counterexample is given by the Fourier transform in 4×44\times 4 dimensions, where we have for any φ[0,2π)\varphi\in[0,2\pi):We thank the anonymous referee for providing this counterexample.

After completion of this document, we learned that part of this section, in particular corollary 1 were independently found in [DVB14b].

Conclusion

We have studied a variant of a Sinkhorn type normal form for unitary matrices. Its existence was conjectured in [DVB14a] and we give a nonconstructive proof. This means in particular that the question, whether the algorithm presented in [DVB14a] always converges for any set of starting conditions, remains open. Also, it would be nice to have an elementary proof of the fact that for any unitary matrix UU we have TnUTnT^{n}\cap UT^{n}\neq\emptyset. The decomposition is in not unique: We provided an example where, contrary to the 2×22\times 2-case, there is a one-parameter set of AA as well as LL and RR, such that LAR=ULAR=U. We suggested an argument that the number of different decompositions, if it is discrete, might grow exponentially. However this lower bound relies on a lower bound on Lagrangian intersections which holds only for transversal intersections.

We thank Michael Keyl for many helpful comments on the parts involving symplectic topology. M. Idel is supported by the Studienstiftung des deutschen Volkes. M. Wolf acknowledges support from the CHIST-ERA/BMBF project CQC.

References

Appendix A Symplectic Preliminaries

This section introduces the definitions and results from symplectic topology beyond the first chapters of [MS98] needed to understand the basic reductions of the proof of theorem 1 in [BEP04].

Let (M,ω)(\mathcal{M},\omega) be a closed symplectic manifold. If the manifold is simply connected (i.e. every loop is contractible)

In principle, the result also holds for arbitrary symplectic manifolds. One has to be more careful with non-compactly supported functions, but we can safely ignore these subtleties, since our manifold of interest will be closed.

Furthermore, let us recall that a Lagrangian submanifold L\mathcal{L} of a 2n2n-dimensional symplectic manifold (M,ω)(\mathcal{M},\omega) is a smooth nn-dimensional submanifold of M\mathcal{M} such that

A.2 The Clifford-torus as a Lagrangian submanifold

We now study the Clifford torus as a special case of the Lagrangian submanifold of interest for our result.

The next step is to show non-degeneracy. For this, note that Φω(X,Y)=0 Y\Phi^{*}\omega(X,Y)=0~{}\forall Y if and only if dΦX=0d\Phi X=0 pointwise, since ω\omega is non-degenerate. But dΦX=0d\Phi X=0 implies in particular dπX=0d\pi X=0 and hence, ωFS\omega_{FS} as defined above is a non-degenerate 2-form.

since dd commutes with pullbacks and ω\omega is closed. Since this holds on any patch UiU_{i}, dωFS=0d\omega_{FS}=0 globally.

Then Tπ(p)TnT_{\pi(p)}T^{n} will be spanned by dπXπ(p)id\pi X^{i}_{\pi(p)}.