On biunimodular vectors for unitary matrices

Hartmut Führ, Ziemowit Rzeszotnik

Note

As we have learned from the referees, the existence of biunimodular vectors for an arbitrary unitary matrix has been recently proved in this journal by Idel and Wolf (see ) basing on the results of in symplectic geometry due to Biran, Entov and Polterovich. In this paper we provide an independent account of this existence phenomenon and push a bit further the theory of biunimodular vectors.

Outline

Background

The paper explores the notion of a biunimodular vector that traces back to Gauss. The classic example of such vectors are Gauss sequences given for k=0,1,,n1k=0,1,\dots,n-1 as

Following Haagerup () and Saffari () we attribute the starting point of the general theory of biunimodular vectors to Per Enflo. In 1983 he has asked whether for prime nn the only unimodular vectors of length nn whose Fourier transform is also unimodular are Gauss sequences. While the answer is affirmative when n5n\leq 5, a computer search done by Björck for n=7n=7 provided a surprising counterexample

with θ=arccos(34)\theta=\arccos(-\frac{3}{4}) yielding eiθ=34+i74e^{i\theta}=-\frac{3}{4}+i\frac{\sqrt{7}}{4}. These findings, published in 1985 (), have been generalized in and later the term “bi-unimodular sequence” has been coined by Björck and Saffari to describe a unimodular finite vector, whose Fourier transform is unimodular as well (see ).

The Björck sequences provided in are biunimodular vectors of a prime length pp. For p1(mod4)p\equiv-1\pmod{4} their coefficients are either 11 or eiθe^{i\theta} with θ=arccos1p1+p\theta=\arccos\frac{1-p}{1+p}. For p1(mod4)p\equiv 1\pmod{4} their coefficients are either 11, eiηe^{i\eta} or eiηe^{-i\eta} with η=arccos1p+1\eta=\arccos\frac{1}{\sqrt{p}+1}. The importance of Björck sequences has been underlined in , where it was shown that for these sequences (contrary to Gauss sequences) the discrete narrow band ambiguity function has an optimal bound. However, Gauss and Björck sequences provide only a glimpse at the structure of biunimodular vectors for FnF_{n}. Even in the case p=7p=7 there is another such vector found by Björck and Fröberg that is related neither to a Gauss nor to a Björck sequence (see or ).

Moreover, in Björck and Saffari proved that if nn is divisible by a square, then there are infinitely many (continuum) biunimodular vectors with the leading entry 1, and they conjectured that for all square-free nn the number of such biunimodular vectors is finite.

An easy observation is that a unimodular vector u=(u0,u1,,un1)u=(u_{0},u_{1},\dots,u_{n-1}) is biunimodular for FnF_{n} if and only if its cyclic translations are orthogonal. This orthogonality relation is called zero autocorrelation and means that

where the index k+ik+i is taken (modn)(\bmod n).

The idea of Björck to search for biunimodular vectors uu for FnF_{n} relies on setting xk=uk+1ukx_{k}=\frac{u_{k+1}}{u_{k}}, where k=0,1,,n1k=0,1,\dots,n-1 with un=u0u_{n}=u_{0} and transforming (1) to the following set of equations:

The unimodular solutions of the above set of equations are in one-to-one correspondence to biunimodular vectors for FnF_{n} (with the leading entry 1). The general complex solutions of (2) are called cyclic nn-roots and became a research area of independent interest. In the mid-1980s Arnborg has asked Davenport to establish whether the set of cyclic 6-roots is finite, leading to the popularisation of the cyclic nn-roots problem in . Backelin proved in that for nn divisible by a square the number of cyclic nn-roots is infinite. For 2n82\leq n\leq 8 all cyclic nn-roots were found by Björck and Fröberg (see ) by using Gröbner basis techniques to solve the system (2). Faugère has found cyclic 9 and 10 roots by improving the search for the Gröbner basis (see and ). These results have been confirmed and extended by polyhedral homotopy continuation methods applied for solving (2) as a benchmark problem. The numbers of cyclic nn-roots for a square-free nn between 2 and 14 is listed below.For 2n112\leq n\leq 11 the amount of roots has been verified by various methods, for n=13n=13 and 1414 the number of cyclic nn-roots is claimed only by Li and Tsai (see ). Moreover, they count only isolated roots, so the number of cyclic 14-roots still has to be verified as finite or not.

The table also contains information about the number of unimodular cyclic nn-roots that yields the amount of biunimodular vectors for FnF_{n}. The numbers up to n=7n=7 were given by Björck, Fröberg and Haagerup, who inspected the obtained cyclic nn-roots to check for unimodular solutions, establishing the real benchmark for solving the cyclic nn-root problem (see and ). The number of biunimodular vectors for n=13n=13 has been obtained by Gabidulin and Shorin in .

Basing on the number of cyclic pp-roots for prime pp up to 7 Fröberg concluded that the amount of cyclic pp-roots should be (2p2p1){2p-2\choose p-1}. This conjecture has been confirmed by Haagerup in under the condition that the roots are counted with multiplicities.

(Haagerup) For prime pp the number of cyclic pp-roots counted with multiplicities is (2p2p1){2p-2\choose p-1}.

The paper of Haagerup contains an elegant direct argument showing that the number of cyclic pp-roots is finite for prime pp, confirming Conjecture 1.1 in the prime case. His more advanced reasoning allows to count the cyclic pp-roots with multiplicities, imposing a question whether for all primes cyclic pp-roots have multiplicity 1 and providing an additional motivation to verify the number of distinct cyclic 13-roots.

A slightly different approach towards finding biunimodular vectors for FnF_{n} has been taken by Gabidulin and Shorin. In they multiplied the system (1) by the product k=0n1uk\prod_{k=0}^{n-1}u_{k} to obtain a system of equations that has been treated with Gröbner basis techniques as well, providing the mentioned number of biunimodular vectors for F13F_{13}.

The theory of biunimodular vectors may be hard to follow due to their popularity stemming from applications in several fields of signal processing. This popularity can be measured by the amount of various names given to the same objects by many researchers working within the area. Gauss sequences are often called Zadoff, Chu or Wiener sequences. Unimodular vectors are described as constant amplitude, phase-shift keyed or polyphase sequences. Vectors whose Fourier transform is unimodular are named as perfect, having optimum correlation or zero autocorrelation. Therefore, biunimodular vectors are studied as CAZAC (constant amplitude zero autocorrelation) sequences, PSK (phase-shift keyed) perfect sequences, polyphase sequences with optimum correlation and so on (see by Benedetto et al. for a better account of this nomenclature and a list of relevant papers that can be extended by adding an early reference ).

Since full understanding of biunimodular vectors would allow to resolve the circulant Hadamard matrix conjecture, and is just a tip of the iceberg being a classification of all Hadamard matrices, the complexity of the topic is apparent.

In it has been proved that biunimodular vectors exist for the Fourier transform on any finite abelian group. This raises the question of existence of such vectors for an arbitrary unitary matrix and motivates the following

In the following sections we explain that the existence of biunimodular vectors for an arbitrary unitary matrix allows to understand the structure of unitary matrices in fairly simple terms.

Synthesis of U​(n)𝑈𝑛U(n)

Unitary n×nn\times n matrices form the unitary group U(n)U(n) and are a basic notion in mathematics, physics, chemistry and engeneering. These matrices can be obtained in various standard ways: Gram-Schmidt process, Householder reflections, Hurwitz parametrization, Cayley transform and via Hermitian matrices using the matrix exponential. In 1952 Murnaghan described a convenient parametrization of the unitary group (see ).Another such parametrization has been provided in 1982 by Diţă (see ). This has been followed by the work of Reck et al. who in 1994 proved that any unitary matrix can be obtained as a sequence of consecutive beam splitter transformations, that can be conducted experimentally in the laboratory using optical devices (). In the current century the research on finding the ways to generate unitary matrices has intensified. Nemoto described the special unitary group SU(n)SU(n) in basing on an earlier result of Rowe et al. Tilma and Sudarshan provided an Euler angle parametrization for SU(n)SU(n) and U(n)U(n) (see ). Meanwhile Diţă in has shown another description of U(n)U(n). A recursive parametrization of unitary matrices has been given by Jarlskog in and a composite parameterization was done in (see and for more details on these developments).

and run the following recursive procedure using the Fourier transform (Fl)j,k=1le2πijkl(F_{l})_{j,k}=\frac{1}{\sqrt{l}}e^{-2\pi i\frac{jk}{l}} and its conjugate transpose (Fl)j,k=1le2πijkl(F_{l}^{*})_{j,k}=\frac{1}{\sqrt{l}}e^{2\pi i\frac{jk}{l}} .

The initial unitary matrix A1A_{1} is given as [a11][a_{11}]. And, in general, for 2ln2\leq l\leq n

The above procedure is not one-to-one, meaning that different sets of parameters may give the same matrix. As to the question if it is onto, the simplicity of the above scheme can raise some doubts whether it can reveal the structure of an arbitrary unitary matrix. Nevertheless, in the next section we shall prove that all unitary matrices can be obtained in this way, provided that all of them possess biunimodular vectors.

Analysis of U​(n)𝑈𝑛U(n)

A unitary matrix AU(n)A\in U(n) has a biunimodular vector vv if and only if

The description of the group Fix(1){\rm Fix}(\mathbf{1}) is related to the group Fix(e){\rm Fix}(\mathbf{e}). Indeed, since Fne=1n1F_{n}\mathbf{e}=\frac{1}{\sqrt{n}}\mathbf{1}, it is easy to see that MFix(1)M\in{\rm Fix}(\mathbf{1}) iff FnMFnFix(e)F_{n}^{*}MF_{n}\in{\rm Fix}(\mathbf{e}).

Therefore, we see that AA has a biunimodular vector vv iff FnDwADvFnFix(e)F_{n}^{*}D_{w}^{*}AD_{v}F_{n}\in{\rm Fix}(\mathbf{e}). Due to the obvious description of Fix(e){\rm Fix}(\mathbf{e}) the proposition follows. \Box

The Fourier transform FnF_{n} enters the above scheme as a generic unitary matrix with the property that Fne=1n1F_{n}\mathbf{e}=\frac{1}{\sqrt{n}}\mathbf{1}. We need to clarify that, in the above proposition, FnF_{n} can be replaced by any unitary matrix FF such that Fe=1n1F\mathbf{e}=\frac{1}{\sqrt{n}}\mathbf{1}. However, clearly the easiest description of such FF is given by F=FnMF=F_{n}M with MFix(e)M\in{\rm Fix}(\mathbf{e}).

On the other hand, as pointed out by the referees, one has the following crucial

(Idel, Wolf + Biran, Entov, Polterovich) Every unitary matrix has a biunimodular vector.

In the following section we shall examine the existence of biunimodular vectors from a few different angles.

Biunimodular problem

The biunimodular problem is to find (or prove the existence of) a biunimodular vector for a given unitary matrix. With the notation explained below we can restate this task in equivalent terms.

Let AU(n)A\in U(n). The matrix AA has a biunimodular vector if and only if either of the following holds

ADFix(1)D1A\in\mathcal{D}\cdot{\rm Fix}(\mathbf{1})\cdot\mathcal{D}_{1}.

ADE(1)DA^{\prime}\in\mathcal{D^{\prime}}\cdot{\rm E}(\mathbf{1})\cdot\mathcal{D^{\prime}}.

(a’) Here, A=λAA^{\prime}=\lambda A with λn=det(A)\lambda^{n}=\overline{\det(A)}, so ASU(n)A^{\prime}\in SU(n). D=DSU(n)\mathcal{D^{\prime}}=\mathcal{D}\cap SU(n) and E(1){\rm E}(\mathbf{1}) denotes the subgroup of SU(n)SU(n) that has 1\mathbf{1} as an eigenvector. Clearly, (a’) is a mere translation of (a) into the special unitary setting.

proving that A1n\|A\|_{\infty\to 1}\leq n. Thus, if AA has a biunimodular vector vv, then Av1=nv\|Av\|_{1}=n\|v\|_{\infty} showing that A1=n\|A\|_{\infty\to 1}=n.

Part (a) of Theorem 4.1 is equivalent to saying that ADFix(1)DA\in\mathcal{D}\cdot{\rm Fix}(\mathbf{1})\cdot\mathcal{D}. Thus, Theorem 3.4 is equivalent to the decomposition U(n)=DFix(1)DU(n)=\mathcal{D}\cdot{\rm Fix}(\mathbf{1})\cdot\mathcal{D}. Unfortunately, there is no general theory explaining when two subgroups HH, HH^{\prime} of a Lie group GG yield the decomposition G=HHHG=H\cdot H^{\prime}\cdot H. Moreover, the Cartan decomposition and Kostant decomposition of any semisimple Lie group G, that both are given in this form, indicate potential difficulties in developing such a general explanation.

For part (a’) included in the above characterization, we would like to notice that D\mathcal{D^{\prime}} is a maximal torus subgroup of a compact simple Lie group SU(n)SU(n). Thus, Theorem 3.4 is equivalent to the following maximal torus decomposition:

with dim(SU(n))=dim(E(1))+2dim(D)\dim(SU(n))=\dim({\rm E}(\mathbf{1}))+2\dim(\mathcal{D^{\prime}}). This allows to conjecture, or ask, whether every compact simple Lie group GG admits a maximal torus decomposition G=THTG=T\cdot H\cdot T, with a maximal torus TT of GG and a subgroup HGH\subseteq G such that dimG=dimH+2dimT\dim G=\dim H+2\dim T.

Regarding part (b), as in , an interpolation argument can be used to conclude that A1=n\|A\|_{\infty\to 1}=n is in turn equivalent to saying that Apq=n1q1p\|A\|_{p\to q}=n^{\frac{1}{q}-\frac{1}{p}} in the region 1q2p1\leq q\leq 2\leq p\leq\infty. This further amplifies the interest in biunimodular vectors, that are precisely those vectors, where the norm Apq=n1q1p\|A\|_{p\to q}=n^{\frac{1}{q}-\frac{1}{p}} is attained, in the indicated range of pp and qq.

Part (d) of the characterization provides a deceptively simple geometric interpretation of the problem: Describe the intersection of two tori. This geometric angle has been exploited to prove Theorem 3.4.

With the above characterization we close the motivational part of the paper and present our findings on the biunimodular problem.

U​(2)𝑈2U(2)

Obtaining biunimodular vectors of a given unitary 2×22\times 2 matrix is an easy task.

Every matrix in U(2)U(2) has a biunimodular vector.

Proof. Every matrix AU(2)A\in U(2) can be written as

where α=argx\alpha=\arg x and β=argy\beta=\arg y. \Box

As we indicated in Section 2, the existence of biunimodular vectors for every matrix in U(2)U(2) allows for writing an alternative formula describing U(2)U(2).

Proof. Let AU(2)A\in U(2). Since AA has a biunimodular vector, by Theorem 4.1 we have that ADFix(1)D1A\in\mathcal{D}\cdot{\rm Fix}(\mathbf{1})\cdot\mathcal{D}_{1}. Clearly, in this case

U​(3)𝑈3U(3)

Some elementary calculations allow to conclude that all orthogonal matrices in O(3)O(3) have a biunimodular vector. Another easy case is given in the following.

For AU(3)A\in U(3) with at least one zero entry we shall explain how to exhibit its biunimodular vectors. By interchanging rows and columns of AA we can assume that there is a zero entry in the top right corner of AA. Moreover, we can multiply rows and columns of AA by unimodular constants in such a way that the outcome shall have only real entries. Therefore, we can assume that AA is given as

Moreover, if AA has only one zero entry, then there are no other biunimodular vectors for AA. It is also clear, that when AA has at least two zero entries, then at least one of its entries has modulus 1 leading to a trivialization of AA and a continuum of biunimodular vectors.

Whenever we talk about the number of biunimodular vectors for a given unitary matrix, we concentrate only on vectors with the leading entry equal to 1, as it was indicated in Remark 3.2. The above example explains the structure and the number of biunimodular vectors for AU(3)A\in U(3) in the case when AA has at least one zero entry. In the case when all entries of AU(3)A\in U(3) are nonzero, we have only found examples of matrices with the number of biunimodular vectors equal to 4,5 or 6.

In order to exhibit the method allowing to count biunimodular vectors of a given matrix AU(3)A\in U(3), let us consider a vector u=uxy=(1,eix,eiy)u=u_{xy}=(1,e^{ix},e^{iy}) and the square Q=[π,π]2Q=[-\pi,\pi]^{2}. The idea is to look at three regions Rj={(x,y)Q:Au(j)1}R_{j}=\{(x,y)\in Q:|Au(j)|\geq 1\}, j=1,2,3j=1,2,3 and check the intersection of their boundaries.

For the Fourier case A=F3A=F_{3} the corresponding three regions are given in Fig.1. The points where the boundaries of these regions intersect correspond to biunimodular vectors of AA. From the unitarity of AA it follows that every intersection point of two such boundaries belongs to the boundary of the third region as well, allowing for a visual count of biunimodular vectors.

The next figure shows these regions for three exemplary unitary matrices providing the mentioned count of biunimodular vectors (conducted and plotted with Mathematica).

In general, since AA is unitary these closed regions cover QQ, and therefore, any of these regions must intersect with at least one of the two other regions. Unfortunately, it is hard to argue that at least one pair of the regions intersects in such a way that their boundaries also intersect, closing a natural way to prove Theorem 6.3. This obstacle boils down to the lack of proof for a classification of the regions RjR_{j}, that essentially can be simplified to

A working proof that all AU(3)A\in U(3) have biunimodular vectors, that we give below, relies on findings of Section 8, that are included in Appendix A.

Every matrix in U(3)U(3) has a biunimodular vector.

Proof. In Appendix A (Observation A.3) we prove that

Thus, in order to prove that every matrix in U(3)U(3) has a biunimodular vector, it is enough to show it for the matrix T(α,β,γ,z)T_{(\alpha,\beta,\gamma,z)}.

Then, the second entry of vv is given as v2=uzcosβeimwsinβv_{2}=uz\cos\beta-e^{im}w\sin\beta, where u=eixcosγ+eiysinγu=e^{ix}\cos\gamma+e^{iy}\sin\gamma. And v3=uzsinβ+eimwcosβv_{3}=uz\sin\beta+e^{im}w\cos\beta. In short,

with j=1,2j=1,2 and concentrate on the final equation sinαwj(m)cosα=eimwj(m)\sin\alpha-w_{j}(m)\cos\alpha=e^{im}w_{j}(m).

so sinαeix0wj0(m0)cosα=eimeix0wj0(m0)\sin\alpha-e^{ix_{0}}w_{j_{0}}(m_{0})\cos\alpha=e^{im}e^{ix_{0}}w_{j_{0}}(m_{0}), leading to the conclusion that m0,j0m_{0},j_{0} and x0x_{0} solve (8) by setting (eix,eiy)=eix0(1,(1)j0ieiφm0)(e^{ix},e^{iy})=e^{ix_{0}}(1,(-1)^{j_{0}}ie^{i\varphi_{m_{0}}}).

Thus, we are left with solving (9). Here, it is important to notice that the biunimodular vectors (1,ieiφm)(1,ie^{i\varphi_{m}}) and (1,ieiφm)(1,-ie^{i\varphi_{m}}) are orthogonal. Thus, if we define fj(m)=wj(m)2f_{j}(m)=|w_{j}(m)|^{2} for j=1,2j=1,2 and m[0,2π]m\in[0,2\pi], then f1(m)+f2(m)=2(sinγ,cosγ)22=2f_{1}(m)+f_{2}(m)=2\|(-\sin\gamma,\cos\gamma)\|_{2}^{2}=2. Moreover, there is an m1[0,2π]m_{1}\in[0,2\pi] such that eim1=ze^{im_{1}}=z, and then φm1=0\varphi_{m_{1}}=0 by (6), so f1(m1)=f2(m1)=1f_{1}(m_{1})=f_{2}(m_{1})=1. Finally, let us notice that for g(m)=sinαeim+cosα2g(m)=\left|\frac{\sin\alpha}{e^{im}+\cos\alpha}\right|^{2} one has infg=(sinα1+cosα)21\inf g=\left(\frac{|\sin\alpha|}{1+|\cos\alpha|}\right)^{2}\leq 1 and supg=(sinα1cosα)21\sup g=\left(\frac{|\sin\alpha|}{1-|\cos\alpha|}\right)^{2}\geq 1.

This allows to finish the proof. Indeed, if f1>gf_{1}>g and f2>gf_{2}>g, then 1>g1>g contradicting supg1\sup g\geq 1. Similarly, if f1<gf_{1}<g and f2<gf_{2}<g, then 1<g1<g contradicting infg1\inf g\leq 1. Thus, there is an m2[0,2π]m_{2}\in[0,2\pi] such that g(m2)g(m_{2}) is between f1(m2)f_{1}(m_{2}) and f2(m2)f_{2}(m_{2}). However, gg can not be strictly between f1f_{1} and f2f_{2} on [0,2π][0,2\pi], because f1(m1)=f2(m1)f_{1}(m_{1})=f_{2}(m_{1}). Therefore, there is an m0[0,2π]m_{0}\in[0,2\pi] such that either g(m0)=f1(m0)g(m_{0})=f_{1}(m_{0}) or g(m0)=f2(m0)g(m_{0})=f_{2}(m_{0}). \Box

As in the U(2)U(2) case, the existence of biunimodular vectors for all members of U(3)U(3) allows to write a formula for an arbitrary matrix in U(3)U(3).

Proof. Let AU(3)A\in U(3). By Theorems 6.3 and 4.1 we have that ADFix(1)D1A\in\mathcal{D}\cdot{\rm Fix}(\mathbf{1})\cdot\mathcal{D}_{1}. Moreover, in this case Fix(1)=F3Fix((1,0,0))F3{\rm Fix}(\mathbf{1})=F_{3}\cdot{\rm Fix}((1,0,0))\cdot F_{3}^{*}, where F3F_{3} is the Fourier transform and

due to the description of U(2)U(2) provided in Corollary 5.2. Since F3=13[1111ωω1ωω]F_{3}=\frac{1}{\sqrt{3}}\begin{bmatrix}1&1&1\\ 1&\overline{\omega}&\omega\\ 1&\omega&\overline{\omega}\end{bmatrix}we can finish the proof. \Box

Moreover, as a byproduct from the proof of Theorem 6.3 we obtain the following description of U(3)U(3).

Proof. This follows from Observation A.3 used in the proof of Theorem 6.3 and the matrix factorization:

that leads to the provided description of U(3)U(3) via easy matrix manipulations. \Box

The above description of U(3)U(3) can be compared with the standard Euler-Tait-Bryan decomposition of O(3)O(3):

U​(4)𝑈4U(4) and beyond

The above observation allows to construct matrices in U(n)U(n) with all nonzero entries having a continuum of biunimodular vectors, whenever n4n\geq 4. It also allows for a further appreciation of Haagerup’s result stated in Theorem 1.2. It is not clear, however, whether for all unitary matrices with a continuum of biunimodular vectors one can find vectors uu and ww as in Observation 7.1. Even if this would be true, pointing to a conclusion that for any unitary matrix the set of its biunimodular vectors is a (finite) union of tori, it still would not tell, whether the set can be empty or not, underscoring the importance of Theorem 3.4.

In the next section, we provide an alternative way of building U(2n)U(2^{n}), that is based solely on the biunimodular structure of U(2)U(2). After that, we conduct the study of biunimodular vectors in the general U(n)U(n) case from the Lie groups perspective. Finally, we present a numerical treatment of the biunimodular problem with certain applications to the classical Fourier case.

In this section we show, that a further generalization of the biunimodular problem allows to build all matrices in U(2n)U(2n) from matrices in U(n)U(n).

In order to achieve this goal we employ the formula for U(2)U(2) given in Section 5:

where II stands for the identity matrix in U(n)U(n).

Before we prove the theorem, we need a slim primary on block matrices. Let M(n)M(n) denote the set of all n×nn\times n matrices with complex entries. An arbitrary matrix in M(2n)M(2n) can be written in a block form as [ABCD]\begin{bmatrix}A&B\\ C&D\end{bmatrix} with A,B,C,DM(n)A,B,C,D\in M(n).

Clearly, we have that [ABCD][ABCD]=[AA+BCAB+BDCA+DCCB+DD]\begin{bmatrix}A&B\\ C&D\end{bmatrix}\begin{bmatrix}A^{\prime}&B^{\prime}\\ C^{\prime}&D^{\prime}\end{bmatrix}=\begin{bmatrix}AA^{\prime}+BC^{\prime}&AB^{\prime}+BD^{\prime}\\ CA^{\prime}+DC^{\prime}&CB^{\prime}+DD^{\prime}\end{bmatrix} and [ABCD]=[ACBD]\begin{bmatrix}A&B\\ C&D\end{bmatrix}^{*}=\begin{bmatrix}A^{*}&C^{*}\\ B^{*}&D^{*}\end{bmatrix}. Moreover, every 2n×n2n\times n matrix with complex entries can be written as [XY]\begin{bmatrix}X\\ Y\end{bmatrix} with X,YM(n)X,Y\in M(n) and [ABCD][XY]=[AX+BYCX+DY]\begin{bmatrix}A&B\\ C&D\end{bmatrix}\begin{bmatrix}X\\ Y\end{bmatrix}=\begin{bmatrix}AX+BY\\ CX+DY\end{bmatrix}.

Let 12[ABCD]U(2n)\frac{1}{\sqrt{2}}\begin{bmatrix}A&B\\ C&D\end{bmatrix}\in U(2n) with AU(n)A\in U(n) and B,C,DM(n)B,C,D\in M(n). Then B,C,DU(n)B,C,D\in U(n).

Let U=12[ABCD]U=\frac{1}{\sqrt{2}}\begin{bmatrix}A&B\\ C&D\end{bmatrix} and IU(n)I\in U(n) be the identity matrix. Since UU=[I00I]UU^{*}=\begin{bmatrix}I&0\\ 0&I\end{bmatrix} we get that BB=IBB^{*}=I. Considering UU=[I00I]U^{*}U=\begin{bmatrix}I&0\\ 0&I\end{bmatrix} yields that CC=IC^{*}C=I and DD=ID^{*}D=I. ∎

We can recognize the task as a special sort of a biunimodular problem. For an arbitrary unitary matrix UU(2n)U\in U(2n) we need to find a unitary matrix CU(n)C\in U(n) such that U[IC]U\begin{bmatrix}I\\ C^{*}\end{bmatrix} is “unimodular”, in the sense that

In order to solve the biunimodular problem (12) we write U=[XYXY]U=\begin{bmatrix}X&Y\\ X^{\prime}&Y^{\prime}\end{bmatrix} with X,Y,X,YM(n)X,Y,X^{\prime},Y^{\prime}\in M(n) and apply the singular value decomposition (SVD) to XX and YY.

Recall that for an arbitrary matrix MM(n)M\in M(n) there are unitary matrices W,WU(n)W,W^{\prime}\in U(n) and a diagonal matrix with non-negative entries ΣM(n)\Sigma\in M(n) such that M=WΣWM=W^{\prime}\Sigma W^{*}. This leads to the polar decomposition M=SMUMM=S_{M}U_{M} with SM=WΣ(W)S_{M}=W^{\prime}\Sigma(W^{\prime})^{*} - a positive semi-definite matrix and a unitary matrix UM=WWU(n)U_{M}=W^{\prime}W^{*}\in U(n).

In short, we use polar decomposition X=SXUXX=S_{X}U_{X}, Y=SYUYY=S_{Y}U_{Y} to write

with A,BU(n)A,B\in U(n) as desired, so C=iUYUXC^{*}=iU_{Y}^{*}U_{X} solves problem (12). To see this, we check that A=(SX+iSY)UXA=(S_{X}+iS_{Y})U_{X} and AA=SX2+SY2+i(SYSXSXSY)AA^{*}=S_{X}^{2}+S_{Y}^{2}+i(S_{Y}S_{X}-S_{X}S_{Y}). Since UU=[I00I]UU^{*}=\begin{bmatrix}I&0\\ 0&I\end{bmatrix} we get that SX2+SY2=IS_{X}^{2}+S_{Y}^{2}=I, so SX2S_{X}^{2} and SY2S_{Y}^{2} commute. For positive semi-definite matrices SXS_{X} and SYS_{Y} this means that they commute as well and, therefore, AU(n)A\in U(n).

To finish the proof we make the final observation that AU(n)A\in U(n) already implies that BU(n)B\in U(n). Indeed, we can extend the matrix 12[IC]\frac{1}{\sqrt{2}}\begin{bmatrix}I\\ C^{*}\end{bmatrix} to a unitary matrix 12[ICCI]U(2n)\frac{1}{\sqrt{2}}\begin{bmatrix}I&C\\ C^{*}&-I\end{bmatrix}\in U(2n) and see that 12U[ICCI]=12[AXBY]U(2n)\frac{1}{\sqrt{2}}U\begin{bmatrix}I&C\\ C^{*}&-I\end{bmatrix}=\frac{1}{\sqrt{2}}\begin{bmatrix}A&X^{\prime\prime}\\ B&Y^{\prime\prime}\end{bmatrix}\in U(2n). Since AA is unitary, Lemma 8.2 assures that BU(n)B\in U(n).

U​(n)𝑈𝑛U(n)

Moreover, let us make the following formal definition:

Clearly, Bn\mathcal{B}_{n} is the set of n×nn\times n unitary matrices that possess biunimodular vectors and Theorem 3.4 asserts that Bn=U(n)\mathcal{B}_{n}=U(n). While it would be hard to reproduce this result, one can check what information about Bn\mathcal{B}_{n} and Bi(A){\hbox{Bi}}(A) can be drawn by using basic tools of Lie groups theory.

containing the entire information concerning the existence of biunimodular vectors.

Indeed, the matrices possessing biunimodular vectors are given as the image of Φ\Phi, what already provides some information on Bn\mathcal{B}_{n}.

Bn=DFix(1)D1=Φ(D×Fix(1)×D1)\mathcal{B}_{n}=\mathcal{D}\cdot{\rm Fix}(\mathbf{1})\cdot\mathcal{D}_{1}=\Phi(\mathcal{D}\times{\rm Fix}(\mathbf{1})\times\mathcal{D}_{1}). In particular, Bn\mathcal{B}_{n} is a closed, pathwise connected subset of U(n)U(n).

Proof. As we mentioned before, the equality Bn=DFix(1)D1\mathcal{B}_{n}=\mathcal{D}\cdot{\rm Fix}(\mathbf{1})\cdot\mathcal{D}_{1} follows immediately from Theorem 4.1(a). Thus, by definition, Bn=Φ(D×Fix(1)×D1)\mathcal{B}_{n}=\Phi(\mathcal{D}\times{\rm Fix}(\mathbf{1})\times\mathcal{D}_{1}). Since the subgroups D,Fix(1),D1\mathcal{D},\,{\rm Fix}(\mathbf{1}),\,\mathcal{D}_{1} are compact and pathwise connected, Bn\mathcal{B}_{n}, which is the continuous image of their cartesian product, has the same properties. \Box

Moreover, the preimage Φ1(A)\Phi^{-1}(A) bijectively corresponds to Bi(A){\hbox{Bi}}(A).

There is a continuous bijection between Bi(A){\hbox{Bi}}(A) and Φ1(A)\Phi^{-1}(A).

U(n)U(n) has (real) dimension n2n^{2}, whereas Fix(1)U(n1){\rm Fix}(\mathbf{1})\cong U(n-1) is of dimension (n1)2(n-1)^{2}. D\mathcal{D} has dimension nn and D1\mathcal{D}_{1} has dimension n1n-1, hence Φ\Phi is a mapping between two manifolds of (real) dimension n2n^{2}. Clearly, Φ\Phi is smooth.

In order to proceed with the calculations we provide some basic facts from the theory of Lie groups and establish the necessary notation.

Since multiplication (on the left or on the right) with a group element gg is a diffeomorphism on HH, that is a matrix group, one obtains that

We let u,d,d1,f\mathfrak{u},\mathfrak{d},\mathfrak{d}_{1},\mathfrak{f} denote the Lie algebras of U(n),D,D1,Fix(1)U(n),\mathcal{D},\mathcal{D}_{1},{\rm Fix}(\mathbf{1}), respectively. Then

d\mathfrak{d} consists of all diagonal matrices with purely imaginary entries, and d1d\mathfrak{d}_{1}\subset\mathfrak{d} is the subspace of matrices with vanishing upper left corner. Differentiating the equality exp(tX)1=1\exp(tX)\mathbf{1}=\mathbf{1} at t=0t=0 yields the following description of f\mathfrak{f}:

i.e. the elements of f\mathfrak{f} are characterized by the fact that the sum over rows are zero. We also note the simple fact that df={0}\mathfrak{d}\cap\mathfrak{f}=\{0\}.

To calculate the Jacobian of Φ:D×Fix(1)×D1U(n)\Phi:\mathcal{D}\times{\rm Fix}(\mathbf{1})\times\mathcal{D}_{1}\to U(n) we shall use the following

Let H1,H2H_{1},\,H_{2} be closed subgroups of U(n)U(n) and m:H1×H2U(n)m:H_{1}\times H_{2}\to U(n), m(g,h)=ghm(g,h)=g\cdot h. Then, for all g,hU(n)g,h\in U(n), the Jacobian of mm at (g,h)(g,h) is given by

Proof. Let γi:(ϵ,ϵ)Hi\gamma_{i}:(-\epsilon,\epsilon)\to H_{i} denote smooth curves with

By a standard reasoning, the product rule for scalar-valued functions easily extends to matrix-valued functions implying that

Using this lemma and the chain rule, we obtain a rather transparent description of the Jacobian dΦ(D1,S,D2)d\Phi(D_{1},S,D_{2}):

Let (D1,S,D2)D×Fix(1)×D1(D_{1},S,D_{2})\in\mathcal{D}\times{\rm Fix}(\mathbf{1})\times\mathcal{D}_{1}. Then

applying the previous lemma twice and using the chain rule yields

Recall that we are particularly interested in the rank of dΦd\Phi, that is, the real dimension of the image of dΦd\Phi. The following lemma translates this to a more tractable problem in linear algebra:

For all (any) (D1,D2)D×D1(D_{1},D_{2})\in\mathcal{D}\times\mathcal{D}_{1} the Jacobian dΦ(D1,S,D2)d\Phi(D_{1},S,D_{2}) has rank mm.

Proof. For the equivalence (a) \Leftrightarrow (b) we observe that Φ(CD1,S,D2C)=CΦ(D1,S,D2)C\Phi(CD_{1},S,D_{2}C^{\prime})=C\Phi(D_{1},S,D_{2})C^{\prime} for all CDC\in\mathcal{D}, and CD1C^{\prime}\in\mathcal{D}_{1}. Thus, by the chain rule, dΦ(CD1,S,D2C)=CdΦ(D1,S,D2)Cd\Phi(CD_{1},S,D_{2}C^{\prime})=Cd\Phi(D_{1},S,D_{2})C^{\prime}, what yields the equivalence.

To see that (b) \Leftrightarrow (c) we note, that by the previous lemma, dΦ(I,S,I)(X,Y,Z)=XS+Y+SZd\Phi(I,S,I)(X,Y^{\prime},Z)=XS+Y^{\prime}+SZ, for all Xd,YTS(Fix(1)),Zd1X\in\mathfrak{d},Y^{\prime}\in T_{S}({\rm Fix}(\mathbf{1})),Z\in\mathfrak{d}_{1}. Since SS is invertible, the rank of dΦ(I,S,I)d\Phi(I,S,I) equals the rank of RSdΦ(I,S,I)R_{S^{*}}\circ d\Phi(I,S,I), where RSR_{S^{*}} denotes right multiplication with SS^{*}. Clearly, RSdΦ(I,S,I)(X,Y,Z)=X+YS+SZSR_{S^{*}}\circ d\Phi(I,S,I)(X,Y^{\prime},Z)=X+Y^{\prime}S^{*}+SZS^{*} and this map has the same rank as the map CSC_{S} from (c), since Y=YSfY=Y^{\prime}S^{*}\in\mathfrak{f} iff YTS(Fix(1))Y^{\prime}\in T_{S}({\rm Fix}(\mathbf{1})). \Box

Proof. Let CS:d×f×d1uC_{S}:\mathfrak{d}\times\mathfrak{f}\times\mathfrak{d}_{1}\to\mathfrak{u} denote the linear map from Lemma 9.6. Let n=d+fu\mathfrak{n}=\mathfrak{d}+\mathfrak{f}\subset\mathfrak{u}. Then df={0}\mathfrak{d}\cap\mathfrak{f}=\{0\} implies that the restriction of CSC_{S} to d×f\mathfrak{d}\times\mathfrak{f} is an isomorphism onto n\mathfrak{n}. This allows for the first step towards (13) by considering the quotient map

By Lemma 9.6 and a rank formula for the quotient map we obtain that

Thus, we have established that rank(CS)=rank(Im(S)){\rm rank}(\overline{C_{S}})={\rm rank}({\rm Im}(S)), so equation (13) follows from (14).

Thus, Lemma 9.7 implies that rank(dΦ(I,S,I))=n2{\rm rank}(d\Phi(I,S,I))=n^{2}. \Box

With the above lemma we can draw the mentioned conclusion regarding the structure of Bn\mathcal{B}_{n}.

Proof. By Proposition 9.1 Bn=Φ(D×Fix(1)×D1)\mathcal{B}_{n}=\Phi(\mathcal{D}\times{\rm Fix}(\mathbf{1})\times\mathcal{D}_{1}) and Φ\Phi is a map between groups of dimension n2n^{2}. Lemma 9.8 guarantees an existence of a point in D×Fix(1)×D1\mathcal{D}\times{\rm Fix}(\mathbf{1})\times\mathcal{D}_{1}, where the Jacobian of Φ\Phi has the full rank. Therefore, by the inverse function theorem, Φ\Phi is a diffeomorphism around this point. \Box

Bn=U(n)\mathcal{B}_{n}=U(n) if and only if BnBnBn\mathcal{B}_{n}\cdot\mathcal{B}_{n}\subset\mathcal{B}_{n}.

Proof. Clearly, ABnA\in\mathcal{B}_{n} iff A1BnA^{-1}\in\mathcal{B}_{n}. Therefore, Bn\mathcal{B}_{n} is a symmetric subset of U(n)U(n) that, by the above theorem, has a nonempty open interior. If, in addition, it is closed under multiplication, then Bn\mathcal{B}_{n} must be an open subgroup. Since U(n)U(n) is connected, it follows that Bn=U(n)\mathcal{B}_{n}=U(n). The converse is clear. \Box

The results of this subsection can be compared with a theorem of De Vos and De Baerdemacker based on Hurwitz parametrization and stating that (Bn)n1=U(n)(\mathcal{B}_{n})^{n-1}=U(n).

3 The structure of Bi​(A)Bi𝐴{\hbox{Bi}}(A)

Regarding the structure of Bi(A){\hbox{Bi}}(A) it is crucial to recall Conjecture 1.1. In light of the hypothesis imposed by Björck and Saffari, the main issue is to establish the cardinality of Bi(A){\hbox{Bi}}(A). Therefore, there is a merit in proving the following.

For almost all AU(n)A\in U(n) the set Bi(A){\hbox{Bi}}(A) is finite.

Proof. This follows from Proposition 9.2 and Sard’s theorem. By Proposition 9.2 we can replace Bi(A){\hbox{Bi}}(A) by Φ1(A)\Phi^{-1}(A). Clearly, we can concentrate on the case Φ1(A)\Phi^{-1}(A)\not=\emptyset. Recall that the regular points of a smooth map are those for which the rank of the Jacobian is full; any point that is not regular is called critical. A regular value is a value, such that all the points in the preimage are regular. If AA is a regular value of Φ\Phi, then Φ1(A)\Phi^{-1}(A) must be a discrete subset of the compact space D×Fix(1)×D1\mathcal{D}\times{\rm Fix}(\mathbf{1})\times\mathcal{D}_{1}, hence finite. Thus, if Φ1(A)\Phi^{-1}(A) is infinite, then AA must be the image of a critical point of Φ\Phi. However, Sard’s theorem states that the images of the critical points constitute a set of the Haar measure zero, what concludes the proof.

In order to achieve a deeper understanding of the set Bi(A){\hbox{Bi}}(A) for a given AU(n)A\in U(n), we consider the phasing manifold of AA

introduced in , and the stabilizer group DA={(E1,E2)D×D1:E1AE2=A}\mathcal{D}_{A}=\{(E_{1},E_{2})\in\mathcal{D}\times\mathcal{D}_{1}:E_{1}AE_{2}=A\}.

By Proposition 9.2 the next result establishes a bijection

that can be explained in the following way. Intuitively, finding all biunimodular vectors for AA amounts to finding all SFix(1)S\in{\rm Fix}(\mathbf{1}) for which there exists a pair (D1,D2)D×D1(D_{1},D_{2})\in\mathcal{D}\times\mathcal{D}_{1} such that A=D1SD2A=D_{1}SD_{2} (i.e. S=D11AD21MAS=D_{1}^{-1}AD_{2}^{-1}\in\mathcal{M}_{A}) and, after that, finding other such pairs for SS by considering the set {(D1E1,D2E2):(E1,E2)DA}\{(D_{1}E_{1},D_{2}E_{2}):(E_{1},E_{2})\in\mathcal{D}_{A}\}. We shall prove that this simple scheme yields all members of Bi(A){\hbox{Bi}}(A) in a one-to one fashion.

There is a bijection Ψ:(MAFix(1))×DAΦ1(A)\Psi:\left(\mathcal{M}_{A}\cap{\rm Fix}(\mathbf{1})\right)\times\mathcal{D}_{A}\to\Phi^{-1}(A).

Proof. We can pick a map ψ:MAFix(1)D×D1\psi:\mathcal{M}_{A}\cap{\rm Fix}(\mathbf{1})\to\mathcal{D}\times\mathcal{D}_{1}, ψ(S)=(ψ1(S),ψ2(S))\psi(S)=(\psi_{1}(S),\psi_{2}(S)), such that ψ1(S)Sψ2(S)=A\psi_{1}(S)\,S\,\psi_{2}(S)=A. Indeed, by definition, for every SMAFix(1)S\in\mathcal{M}_{A}\cap{\rm Fix}(\mathbf{1}) there is (D1,D2)D×D1(D_{1},D_{2})\in\mathcal{D}\times\mathcal{D}_{1}, such that S=D1AD2S=D_{1}AD_{2}, thus we can pick one of such pairs and set (ψ1(S),ψ2(S))=(D11,D21)(\psi_{1}(S),\psi_{2}(S))=(D_{1}^{-1},D_{2}^{-1}).

Then, the map Ψ:(MAFix(1))×DAΦ1(A)\Psi:\left(\mathcal{M}_{A}\cap{\rm Fix}(\mathbf{1})\right)\times\mathcal{D}_{A}\to\Phi^{-1}(A) is given by

To see that the image of Ψ\Psi is contained in Φ1(A)\Phi^{-1}(A), it is enough to note that

The map is one-to-one, since Ψ(S,(E1,E2))=Ψ(S,(E1,E2))\Psi\left(S,(E_{1},E_{2})\right)=\Psi\left(S^{\prime},(E^{\prime}_{1},E^{\prime}_{2})\right) implies that S=SS=S^{\prime} and this, in turn, implies that Ei=EiE_{i}=E_{i}^{\prime}, for i=1,2i=1,2. To check that the map is onto, we take an arbitrary element (G1,S,G2)Φ1(A)(G_{1},S,G_{2})\in\Phi^{-1}(A) and easily verify that Ψ(S,(E1,E2))=(G1,S,G2)\Psi\left(S,(E_{1},E_{2})\right)=(G_{1},S,G_{2}) with Ei=Gi(ψi(S))1E_{i}=G_{i}\,(\psi_{i}(S))^{-1}, for i=1,2i=1,2. Since G1(ψ1(S))1A(ψ2(S))1G2=G1SG2=AG_{1}(\psi_{1}(S))^{-1}A\,(\psi_{2}(S))^{-1}G_{2}=G_{1}SG_{2}=A, we get that (E1,E2)DA(E_{1},E_{2})\in\mathcal{D}_{A} and end the proof. \Box

Since in the above theorem it is hard to guarantee the continuity of ψ\psi, we can not conclude that the bijection Ψ\Psi is continuous. We mention this, because otherwise, by Proposition 9.2 the set Bi(A){\hbox{Bi}}(A) would be homeomorphic to (MAFix(1))×DA(\mathcal{M}_{A}\cap{\rm Fix}(\mathbf{1}))\times\mathcal{D}_{A}.

With Theorem 9.12, we can concentrate on the set (MAFix(1))×DA\left(\mathcal{M}_{A}\cap{\rm Fix}(\mathbf{1})\right)\times\mathcal{D}_{A}, in order to investigate the cardinality of Bi(A){\hbox{Bi}}(A). For that, we need a better understanding of the phasing manifold MA\mathcal{M}_{A}, for a given AU(n)A\in U(n). This can be obtained by defining the mapping

and noting that MA=ΦA(D×D1)\mathcal{M}_{A}=\Phi_{A}(\mathcal{D}\times\mathcal{D}_{1}). This setup allows for making some useful observations.

For every AU(n)A\in U(n) the set MA\mathcal{M}_{A} is the orbit of AA under the left action of D×D1\mathcal{D}\times\mathcal{D}_{1} on U(n)U(n) given by ((D1,D2),A)D1AD2\left((D_{1},D_{2}),A\right)\mapsto D_{1}AD_{2}. In particular, MA\mathcal{M}_{A} is a closed submanifold of U(n)U(n) with MA(D×D1)/DA\mathcal{M}_{A}\simeq\left(\mathcal{D}\times\mathcal{D}_{1}\right)/\mathcal{D}_{A}, so dim(MA)+dim(DA)=2n1{\rm dim}(\mathcal{M}_{A})+{\rm dim}(\mathcal{D}_{A})=2n-1.

Proof. Note that the map ((D1,D2),A)D1AD2\left((D_{1},D_{2}),A\right)\mapsto D_{1}AD_{2} indeed defines an action of the product group, since the diagonal group is commutative; and clearly MA\mathcal{M}_{A} is the orbit of AA. The remaining statements follow from this: ΦA\Phi_{A} is just the quotient map with respect to this action, what implies that the differential of ΦA\Phi_{A} has constant rank; and this in turn allows to define a manifold structure on MA\mathcal{M}_{A}. In addition, MA\mathcal{M}_{A} is compact, hence closed, and ΦA\Phi_{A} induces a continuous bijection (D×D1)/DAMA\left(\mathcal{D}\times\mathcal{D}_{1}\right)/\mathcal{D}_{A}\to\mathcal{M}_{A} between compact Hausdorff spaces, which then has to be a homeomorphism. Therefore, dim(MA)+dim(DA)=dim(D×D1)=2n1{\rm dim}(\mathcal{M}_{A})+{\rm dim}(\mathcal{D}_{A})={\rm dim}(\mathcal{D}\times\mathcal{D}_{1})=2n-1. \Box

dim(MA)=2n1    dim(DA)=0    DAisfinite.{\rm dim}(\mathcal{M}_{A})=2n-1\iff{\rm dim}(\mathcal{D}_{A})=0\iff\mathcal{D}_{A}{\rm\,\,\,is\,\,finite.}

The first equivalence follows immediately from the dimension formula given in the above lemma. In the second equivalence one implication is obvious. For the other implication we note that, if dim(DA)=0{\rm dim}(\mathcal{D}_{A})=0 then DA\mathcal{D}_{A} must be a discrete subgroup of the compact group D×D1\mathcal{D}\times\mathcal{D}_{1}, so it must be finite. ∎

At this stage we can provide the following explanation on the cardinality Bi(A)|{\hbox{Bi}}(A)| of Bi(A){\hbox{Bi}}(A).

Obviously, for n=1n=1 we have Bi(A)=1|{\hbox{Bi}}(A)|=1. When n=2n=2, then Bi(A)|{\hbox{Bi}}(A)| equals to 2 or to the continuum c\mathfrak{c}. For n=3n=3 we have only found examples where Bi(A)|{\hbox{Bi}}(A)| equals to 4, 5, 6 or c\mathfrak{c}. By Theorem 9.12, Bi(A)|{\hbox{Bi}}(A)| is equal to the cardinality of the set (MAFix(1))×DA\left(\mathcal{M}_{A}\cap{\rm Fix}(\mathbf{1})\right)\times\mathcal{D}_{A}. Therefore, to see the general picture, we employ the above corollary together with Theorem 3.4 and get the following:

If dim(MA)<2n1{\rm dim}(\mathcal{M}_{A})<2n-1 then Bi(A)=c|{\hbox{Bi}}(A)|=\mathfrak{c}.

If dim(MA)=2n1{\rm dim}(\mathcal{M}_{A})=2n-1 then

Bi(A){\hbox{Bi}}(A) is infinite but countable (no examples) or

Regarding the case dim(MA)<2n1{\rm dim}(\mathcal{M}_{A})<2n-1, it is clear that for the identity matrix II (and any diagonal matrix) we have that MI=D\mathcal{M}_{I}=\mathcal{D}, so dim(MI)=n{\rm dim}(\mathcal{M}_{I})=n. Moreover, there are examples of matrices AU(n)A\in U(n) with dim(MA){\rm dim}(\mathcal{M}_{A}) in the range from nn to 2n12n-1.

We would like to end this section by getting closer to understanding the classical Fourier case. As it turns out, for the Fourier matrix FnF_{n} the stabilizer DFn\mathcal{D}_{F_{n}} is trivial.

The above lemma together with Theorem 9.12 and Lemma 9.14 entail the following information on the set of all biunimodular vectors in the classical Fourier case.

The above theorem indicates the resilience of Conjecture 1.1. Another approach towards this conjecture has been designed by Gabidulin. Unfortunately, even in the prime case, where some details are provided, he does not explain how to exclude that Bi(Fn){\hbox{Bi}}(F_{n}) is infinite but countable. Therefore, the only accessible confirmation of the conjecture in the prime case is .

Numerical Results

so that v=sign(v)vv={\hbox{sign}}(v)|v|. Since (15) holds for all vv with v1|v|\leq 1, we can safely define sign(v){\hbox{sign}}(v) as zero on the complement of the support of vv.

In this way we rediscover a phase retrieval method that has a pretty long history. To make it short, following Jaming we recall that as early as 1932, Pauli was asking whether the information on the moduli of a wave function and its Fourier transform allows to recover the phase of the wave function (i.e. Having ψ|\psi| and Fψ|{\cal F}\psi| can we recover ψ\psi?). In practice, the phase retrieval problem is a serious obstacle for analysing complex valued signals, since only partial information is given on ψ|\psi| (see by Candès et al. or and for recent developments on this notorious problem). Nevertheless, under original Pauli constrains, a method for recovering the phase of a signal has been provided in 1972 by Gerchberg and Saxton (see ). Gerchberg-Saxton algorithm has several variants, and the one that is relevant to us takes the following form. Let ψ=g|\psi|=g and Fψ=h|{\cal F}\psi|=h. In order to recover ψ\psi define

and starting with an f0f_{0} such that f0=g|f_{0}|=g, run fj=Pghj(f0)f_{j}=P_{gh}^{j}(f_{0}) until it converges to a solution ψ\psi^{\prime}.Unfortunately, ψ\psi^{\prime} is not unique (it depends on the choice of f0f_{0}), so it is only a candidate for ψ\psi (see Subsection 10.3). This algorithm can be viewed as an alternating projection method for finding an intersection of two subsets of a Hilbert space, that was discovered by von Neumann in 1933 (), in the case when the subsets are closed subspaces. This view is justified by writing PghP_{gh} as PGPHP_{G}\circ P_{H}, where PGf=gsign1(f)P_{G}f=g\,{\hbox{sign}}_{1}(f) and PHf=F(hsign1(Ff)))P_{H}f={\cal F}^{*}(h\,{\hbox{sign}}_{1}({\cal F}f))) are projections on certain subsets of a Hilbert space.

instead of PAP^{\prime}_{A}, we shall quickly confine our search to the range where both these operators agree. Thus, we shall refer to this method as “alternating projection algorithm” anyway.

so Av12|Av|\geq\frac{1}{2} for δ18\delta\leq\frac{1}{8}. Moreover, for such δ\delta we get

so A(sign(Av))12|A^{*}({\hbox{sign}}(Av))|\geq\frac{1}{2} as well. This allows to draw the following conclusions.

If the stopping condition (\refdelta0)(\ref{delta0}) is satisfied, then 1AVJ2<2δ\|\mathbf{1}-|AV_{J}|\|_{2}<\sqrt{2\delta}. If the initial condition

Vj:=PAjV0=(PA)jV0V_{j}:=P_{A}^{j}V_{0}=(P^{\prime}_{A})^{j}V_{0},

due to an easy induction argument. Moreover, assuming (19), it can be proved that

AVj+11=AVj1\|AV_{j+1}\|_{1}=\|AV_{j}\|_{1} iff Vj+1=VjV_{j+1}=V_{j},

limjVj+1Vj=0\lim_{j\to\infty}\|V_{j+1}-V_{j}\|_{\infty}=0.

Where the former follows from (15) and (16), while the proof of the latter is a bit lenghtly, so we include it in Appendix B containing a detailed study of the algorithm.

This brings us to the main point of the analysis of the algorithm. If the stopping condition (18) holds with δ18\delta\leq\frac{1}{8}, then the initial condition (19) holds with V0V_{0} replaced by VJV_{J}. Thus, the stopping condition (18), with δ\delta numerically near to zero, gives VJV_{J} that is close to a fix point of PAP_{A}. If L:=limjAVj1=nL:=\lim_{j\to\infty}\|AV_{j}\|_{1}=n, then VJV_{J} is close to a biunimodular vector of AA, however, it is not clear how close. If L<nL<n, then VJV_{J} is close (in a similar fashion) to a fix point of PAP_{A}, that may be far away from biunimodular vectors. Moreover, it is impossible to establish numerically that L=nL=n. Hence, we need to settle for δ\delta-near biunimodular vectors given as

and stress again, that some of these vectors can be far away from biunimodular vectors, even if δ\delta is nearly zero.

In short, if the algorithm returns vectors that are “biunimodular” in practice, there is no guarantee that they are close to the theoretical biunimodular vectors.

Despite this uncertainty, it turns out, that finding near biunimodular vectors for every unitary matrix is sufficient to conclude, that all such matrices have biunimodular vectors. In the following proposition we prove even more.

then every unitary matrix has a biunimodular vector.

belonging to U(n)U(n) with n=Nmn=Nm, has the norm AN1=N(mϵ)\|A_{N}\|_{\infty\to 1}=N(m-\epsilon). Thus, if (20) holds for ANA_{N}, then we get that

leading to a contradiction ϵmnα1\frac{\epsilon}{m}\leq n^{\alpha-1}, since nα10n^{\alpha-1}\to 0 as NN\to\infty. \Box

In the next subsection we check the performance of the alternating projection algorithm for unitary matrices up to dimension 100. The final subsection contains results of the application of this algorithm to the Fourier case.

2 Effectiveness of the algorithm

In order to measure the performance of the alternating projection algorithm we recall, that for δ>0\delta>0 and AU(n)A\in U(n) we have defined the set

of δ\delta-near biunimodular vectors of AA. For a given unitary matrix AA and a prescribed parameter δ\delta the algorithm is supposed to return a member of Biδ(A){\hbox{Bi}}_{\delta}(A). The effectiveness of the algorithm can be measured by considering the set Bn,δ\mathcal{B}_{n,\delta}, that is, the set of these matrices in U(n)U(n), for which the algorithm returns the desired outcome.

The aim of this subsection is to provide numerical evidence that the Haar measure of Bn,δ\mathcal{B}_{n,\delta} (that we denote by μ(Bn,δ)\mu(\mathcal{B}_{n,\delta})) is very close to one. For this purpose we proceeded as follows: For each dimension under consideration, we drew a fixed number KK of random unitary matrices with Haar measure used as underlying probability measure. To achieve this, we implemented the method described in . We then ran the alternating projection algorithm on the matrix with randomly chosen starting vectors, in order to produce near biunimodular vectors. The search was terminated either if one such vector was found, or until a maximal number of starting points was exceeded. The parameters used in this procedure were as follows:

Maximal number of starting points: M=103M=10^{3};

Maximal number of iterations: L=104L=10^{4}.

We implemented the described algorithms in MATLAB, and ran them on a stand-alone personal computerThe CPU of the machine was a 6 core AMD FX 6100 processor with 8 MB level 1 cache, running at 3.3. GHz, together with 8 GB RAM. The operating system was SUSE linux 12.2 for 64-bit PCs..

The results of our numerical experiment are documented in Table 1. Two facts seem particularly striking to us: First of all, the algorithm found a δ\delta-near biunimodular vector for each of the 10410^{4} random matrices in each dimension considered. This translates to an estimate of μ(Bn,δ)\mu(\mathcal{B}_{n,\delta}), which holds at least with very high likelihood: Our numerical experiment amounts to performing a Bernoulli experiment based on repeated independent samples of a Haar-distributed random matrix AA in U(n)U(n), and a positive outcome of the experiment (a near biunimodular vector being found) implies ABn,δA\in\mathcal{B}_{n,\delta}. Thus the probability of a positive outcome for a single event is given by pμ(Bn,δ)p\leq\mu(\mathcal{B}_{n,\delta}). Assuming that μ(Bn,δ)1103=0.999\mu(\mathcal{B}_{n,\delta})\leq 1-10^{-3}=0.999, the probability of obtaining a positive result 10410^{4} times in a row will thus be 0.9991044.5173105\leq 0.999^{10^{4}}\approx 4.5173\cdot 10^{-5}. In other words, our experiments provide very convincing evidence that μ(Bn,δ)>0.999\mu(\mathcal{B}_{n,\delta})>0.999.

Another striking phenomenon is illustrated by the 33rd and 44th column of Table 1, highlighting the effectiveness of the search algorithm. For small to medium dimensions and in the average case, the algorithm needs only very few starting points to find a near biunimodular vector, but even the maximal number of such points stays well away from the maximal number of 10310^{3} that was allowed. As the dimension increases, several effects conspire to increase the algorithm complexity: The rate of convergence in the alternating projection algorithm slows down, as witnessed by the increasing percentage of cases where the maximal number of iterations is reached before the (nδ)(n-\delta)-threshold is passed (not shown in the table). Also, one may expect that the chances of randomly picking starting points for which the alternating projection algorithm converges sufficiently quickly also diminish with increasing dimension. Both effects increase the number of starting points needed in the search. In addition, the costs of applying the matrix-by-vector multiplication also can be expected to contribute to a nonlinear increase of the computational load. It seems that the overall result is an exponential increase of runtime: A log-linear fit for the runtimes for n10n\geq 10 revealed a behaviour like O(e0.05n)O(e^{0.05n}).

3 Biunimodular vectors for Fourier matrices

When looking for elements of Bi(Fn){\hbox{Bi}}(F_{n}) for a Fourier matrix FnF_{n}, it is advantageous to take into account various operations that preserve this set:

Circular shifts by k=0,1,,n1k=0,1,\ldots,n-1. I.e., if (u0,,un1)Bi(Fn)(u_{0},\ldots,u_{n-1})\in{\hbox{Bi}}(F_{n}), then the vector (uk,uk+1,,un1,u0,,uk1)(u_{k},u_{k+1},\ldots,u_{n-1},u_{0},\ldots,u_{k-1}) is biunimodular.

Modulation by k=0,1,,n1k=0,1,\ldots,n-1: If (u0,,un1)Bi(Fn)(u_{0},\ldots,u_{n-1})\in{\hbox{Bi}}(F_{n}), then the vector (u0,u1exp(2πik/n),,un1exp(2πik(n1)/n))(u_{0},u_{1}\exp(2\pi ik/n),\ldots,u_{n-1}\exp(2\pi ik(n-1)/n)) is biunimodular.

Dilation by k=0,1,,n1k=0,1,\ldots,n-1 coprime with nn: If (u0,,un1)Bi(Fn)(u_{0},\ldots,u_{n-1})\in{\hbox{Bi}}(F_{n}), then (y0,,yn1)(y_{0},\ldots,y_{n-1}) with yj=ujkmodny_{j}=u_{jk\mod n} is biunimodular.

Conjugation: If (u0,,un1)Bi(Fn)(u_{0},\ldots,u_{n-1})\in{\hbox{Bi}}(F_{n}), then its conjugate (u0,,un1)Bi(Fn)(\overline{u}_{0},\ldots,\overline{u}_{n-1})\in{\hbox{Bi}}(F_{n}).

Fourier transform: If uBi(Fn)u\in{\hbox{Bi}}(F_{n}), then FnuF_{n}u is biunimodular as well.

These operations induce the action of a finite group GnG_{n} of size 4n2φ(n)4n^{2}\varphi(n) on Bi(Fn){\hbox{Bi}}(F_{n}), where φ(n)\varphi(n) is Euler’s totient function.

The procedure to compute near biunimodular vectors was applied to all square-free numbers between 2 and 15. In the following presentation of numerical results, we say that an orbit found by our algorithm is close to one from the literature, if the minimal distance (measured in 2\|\cdot\|_{2}) between them is <105<10^{-5}. We observed that each time an orbit found by alternating projections was close to an orbit from the literature, the orbit cardinalities matched as well.

n=2n=2: The algorithm precisely found the one orbit consisting of the two vectors (1,±i)(1,\pm i).

n=3n=3: The algorithm found one orbit of length 6, which was close to the orbit of the gaussian vector (1,ω2,1)(1,\omega^{2},1), where ω=exp(2πi/3)\omega=\exp(2\pi i/3).

n=5n=5: The algorithm found two orbits, each of length 10. Each orbit was close to one of the orbits represented by the gaussian vectors (1,ω,ω4,ω4,ω)(1,\omega,\omega^{4},\omega^{4},\omega) and (1,ω2,ω3,ω3,ω2)(1,\omega^{2},\omega^{3},\omega^{3},\omega^{2}), where ω=exp(2πi/5)\omega=\exp(2\pi i/5).

n=6n=6: The algorithm found two orbits, one of length 12 and one of length 36. The orbit of length 12 was close to the gaussian vector, the orbit of length 36 was close to the vector described in [34, formula (3.20)].

n=7n=7: The algorithm found three orbits, with lengths 42,19642,196 and 294294, yielding 532 solutions in total. The orbit of length 4242 is close to the gaussian solution, whereas the orbit of length 196 is close to the Björck sequence (1,1,1,eiθ,1,eiθ,eiθ)(1,1,1,e^{i\theta},1,e^{i\theta},e^{i\theta}) mentioned in the introduction. The orbit of length 294 was close to the solution given in [34, formula (3.24)].

n=10n=10: Here we obtained 10 orbits, with orbit lengths 20 (2 orbits), 200 (3 orbits), 400 (4 orbits), and 800 (1 orbit), resulting in a total of 3040 solutions. For n=10n=10, the literature does not provide a complete list of solutions.

n=11n=11: Here 12 orbits were found, with lengths 110 (1), 484 (1), 1210 (7), 2420 (2) and 4840 (1), resulting in a total number of 18744 vectors. Among these were the gaussians, all contained in the orbit of length 110. For n=11n=11, the literature does not provide a complete list of solutions.

n=13n=13: The algorithm found 20 orbits, with lengths 78 (2), 338 (2), 1014 (2), 1352 (2), 2028 (7), and 4056 (5), yielding 40040 solutions in total. Gabidulin and Shorin exhibited 25 orbits for n=13n=13, with 53222 solutions overall. Each of the orbits found via alternating projections was close to precisely one orbit representative from the list presented by Gabidulin and Shorin. The following orbits given in were not found by the algorithm: An orbit with 1014 elements, corresponding to the case e=6e=6 in [30, Subsection 7.4], two orbits with 2028 solutions, corresponding to lines one and seven in the table in [30, Subsection 7.4], and the first two orbits listed for the case e=12e=12, each having 4056 elements.

n=14n=14: The algorithm found 39 orbits, with lengths 84 (1), 196 (2), 392 (1), 588 (3), 784 (2), 1176 (12), 2352 (13), 4704 (5), resulting in a total of 72408 vectors. Among these were the gaussians, all contained in the orbit of length 84. For n=14n=14, the literature does not provide a complete list of solutions.

n=15n=15: The algorithm produced 46 orbits, with lengths 900 (2), 1800 (21), 3600 (20), 7200 (3), resulting in a total of 133200 vectors. The gaussian vectors are contained in two orbits of length 60, which were not found by the algorithm. For n=15n=15, the literature does not provide a complete list of solutions.

Appendix A

As we have already explained, Theorem 8.1 allows for an easy construction of all unitary matrices in the dyadic case U(2n)U(2^{n}). Recall, that every such matrix can be obtained from four matrices in U(2n1)U(2^{n-1}). For n=1n=1 the procedure yields U(2)U(2) via (10). Here, we show how the procedure works in the case n=2n=2, to get a closed formula for U(4)U(4).

Theorem 8.1 asserts that every unitary matrix UU(4)U\in U(4) with entries (U)k,l=uk,l(U)_{k,l}=u_{k,l}, where k,l=1,2,3,4k,l=1,2,3,4, can be obtained as

This gives the following description of U(4)U(4):

By setting u11=1u_{11}=1 in the above formula for U(4)U(4) we will find a description of U(3)U(3), that is used in Theorem 6.3 to prove the existence of a biunimodular vector for every matrix in U(3)U(3). This shall be executed as a series of observations.

First, we set u11=1u_{11}=1 in the above formula for U(4)U(4).

u11:=12(14a1a3(1a0)z2(1z0)+12a1(1+a0)(1+12z1(1+z0)))=1u_{11}:={\scriptstyle\frac{1}{2}}({\scriptstyle\frac{1}{4}}a_{1}a_{3}(1-a_{0})z_{2}(1-z_{0})+{\scriptstyle\frac{1}{2}}a_{1}(1+a_{0})(1+{\scriptstyle\frac{1}{2}}z_{1}(1+z_{0})))=1 iff a0=a1=z0=z1=1a_{0}=a_{1}=z_{0}=z_{1}=1.

Proof. Clearly, u11=12R,e+Ku_{11}=\frac{1}{2}\langle R,e+\overline{K}\rangle, where e=(1,0)e=(1,0), R=12(a1(1+a0),a1a3(1a0))R=\frac{1}{2}(a_{1}(1+a_{0}),a_{1}a_{3}(1-a_{0})) denotes the the first row of AA from (21) and K=12(z1(1+z0),z2(1z0))K=\frac{1}{2}(z_{1}(1+z_{0}),z_{2}(1-z_{0})) denotes the first collumn of ZZ from therein. Thus, R2=K2=1\|R\|_{2}=\|K\|_{2}=1.

Since u11u_{11} is an entry of a unitary matrix, we know that u111|u_{11}|\leq 1. This inequality can be proved directly in the following way

Then, we set a0=a1=z0=z1=1a_{0}=a_{1}=z_{0}=z_{1}=1 in the formula for U(4)U(4), to get a corresponding formula for U(3)U(3) given by the simplified entries uk,lu_{k,l} with k,l=2,3,4k,l=2,3,4. By pulling out some factors from these entries we obtain the following matrix: D1U(a,b,c,z)D2D_{1}U_{(a,b,c,z)}D_{2}, where D1D_{1} is the diagonal matrix D1=diag(a2a3,b1b3,b2b3)D_{1}={\hbox{diag}}(a_{2}a_{3},b_{1}b_{3},b_{2}b_{3}), D2=diag(1,c2,c3)D_{2}={\hbox{diag}}(1,c_{2},c_{3}) and

with a=z2z3a=z_{2}z_{3}, b=b4b=b_{4}, c=c4c=c_{4} and z=c1c2b3z=c_{1}\overline{c_{2}b_{3}}. This proves the following description of U(3)U(3)

By writing a=ei2αa=e^{i2\alpha}, b=ei2βb=e^{i2\beta}, c=ei2γc=e^{i2\gamma} we get that U(a,b,c,z)=D1T(α,β,γ,z)D2U(a,b,c,z)=D_{1}^{\prime}T_{(\alpha,\beta,\gamma,z)}D_{2}^{\prime}, where D1=diag(eiα,ei(α+β),iei(α+β))D_{1}^{\prime}={\hbox{diag}}(e^{i\alpha},e^{i(\alpha+\beta)},-ie^{i(\alpha+\beta)}), D2=diag(1,eiγ,ieiγ)D_{2}^{\prime}={\hbox{diag}}(1,e^{i\gamma},-ie^{i\gamma}) and

providing a parallel description of U(3)U(3):

Appendix B

In order to conduct a more detailed study of the alternating projection algorithm we begin by proving the following

1Av2<2δ\|\mathbf{1}-|Av|\|_{2}<\sqrt{2\delta},

Av12|Av|\geq\frac{1}{2} and A(sign(Av))12|A^{*}({\hbox{sign}}(Av))|\geq\frac{1}{2},

APAv1=Av1\|AP_{A}v\|_{1}=\|Av\|_{1} iff PAv=vP_{A}v=v,

PAvv2n(APAv1Av1)12\|P_{A}v-v\|_{\infty}\leq 2\sqrt{n}(\|AP_{A}v\|_{1}-\|Av\|_{1})^{\frac{1}{2}}.

To see that Av|Av| is close to 1=(1,1,,1)\mathbf{1}=(1,1,\dots,1) we use v2=n\|v\|_{2}=\sqrt{n} and observe that

proving (a). This already assures that Av12|Av|\geq\frac{1}{2} for δ18\delta\leq\frac{1}{8} (otherwise 141Av22<2δ\frac{1}{4}\leq\|\mathbf{1}-|Av|\|_{2}^{2}<2\delta, forcing δ>18\delta>\frac{1}{8}).

Since Av>0|Av|>0, we easily see that sign(Av)Av2=1Av2<2δ\|{\hbox{sign}}(Av)-Av\|_{2}=\|\mathbf{1}-|Av|\|_{2}<\sqrt{2\delta}. Thus,

Therefore, by the same token as before, δ18\delta\leq\frac{1}{8} implies that A(sign(Av))12|A^{*}({\hbox{sign}}(Av))|\geq\frac{1}{2}.

To prove (c) we note that, by (15) and (16),

To show (d) we remark, that |A^{*}({\hbox{sign}}(Av))|\geq\mbox{\small\frac{1}{2}} together with A(sign(Av))1n\|A^{*}({\hbox{sign}}(Av))\|_{1}\leq n and (22) allows to apply Lemma B.4 below, proving (d). \Box

To show Lemma B.4 we start with an elementary observation.

Proof. Since 1ϵa=1a|1-\epsilon-a|=1-|a|, we get that (1ϵ)2Re(a)=(1ϵ)21+2a(1-\epsilon)2{\hbox{Re}}(a)=(1-\epsilon)^{2}-1+2|a|, so

what is clear for a12|a|\geq\frac{1}{2}, and follows from ϵ2a\epsilon\leq 2|a| in the case when a12|a|\leq\frac{1}{2}. \Box

The next lemma can be treated as a solution to a certain random walk problem on the complex plane.

Since for u:=vwu:=v\overline{w} we have u,1=v,w>0\langle u,\mathbf{1}\rangle=\langle v,w\rangle>0 and u1=w1\|u\|_{1}=\|w\|_{1}, the above estimate and Lemma B.3 yield (24) immediately. \Box

Thus, to facilitate the further discussion of the algorithm, we will concentrate on the starting vectors V0V_{0} satisfying the initial condition

Under this assumption we can gather our findings in the following

AVj1AVj+11n\|AV_{j}\|_{1}\leq\|AV_{j+1}\|_{1}\leq n,

AVj+11=AVj1\|AV_{j+1}\|_{1}=\|AV_{j}\|_{1} iff Vj+1=VjV_{j+1}=V_{j},

Vj+1Vj2n(AVj+11AVj1)12\|V_{j+1}-V_{j}\|_{\infty}\leq 2\sqrt{n}(\|AV_{j+1}\|_{1}-\|AV_{j}\|_{1})^{\frac{1}{2}} ,

limjVj+1Vj=0\lim_{j\to\infty}\|V_{j+1}-V_{j}\|_{\infty}=0.

Since Bi(A)SA{\hbox{Bi}}(A)\subset S_{A}, the finiteness of SAS_{A} is very hard to establish and, in the Fourier case, brings us back to Conjecture 1.1.

Acknowledgements

Part of this work was carried during several visits of HF to Wrocław, and two visits of ZR to Aachen. We would like to thank our respective hosting organizations. We also thank the referees for providing essential information that made the current understanding of biunimodular vectors more complete.

References