A Parallel Approximation Algorithm for Positive Semidefinite Programming
Rahul Jain, Penghui Yao
Introduction
In this work we consider the class of positive semidefinite programs. A positive semidefinite program can be expressed in the following standard form (we use symbols to also represent Löwner order).
Our algorithm is inspired by the algorithm used by Luby and Nisan to solve positive linear programs. Positive linear programs can be considered as a special case of positive semidefinite programs in which the matrices used in the description of the program are all pairwise commuting. Our algorithm (and the algorithm in ) is based on the ’multiplicative weights update’ (MWU) method. This is a powerful technique for ’experts learning’ and finds its origins in various fields including learning theory, game theory, and optimization. The algorithms used in are based on its matrix variant the ’matrix multiplicative weights update’ method. The algorithm of Luby and Nisan proceeds in phases, where in each phase the large eigenvalues of (s represent the candidate dual variables at time ) are sought to be brought below a threshold determined for that phase. The primal variable at time step is chosen to be the projection onto the large eigenvalues (above the threshold) eigenspace of . Using the sum of the primal variables generated so far, the dual variables are updated using the MWU method. A suitable scaling parameter is chosen during this update, which is small enough so that the good properties needed in the analysis of MWU are preserved and at the same time is large enough so that there is reasonable progress in bringing down the large eigenvalues.
Due to the non-commutative nature of the matrices involved in our case, our algorithm primarily deviates from that of in how the threshold is determined inside each phase. The problem that is faced is roughly as follows. Since ’s could be non-commuting, when s are scaled down, the sum of the large eigenvalues of may not come down and this scaling may just move the large eigenvalues eigenspace. Therefore a suitable extra condition needs to be ensured while choosing the threshold. Due to this, our analysis also primarily deviates from in bounding the number of time steps required in any phase and is significantly more involved. The analysis requires us to study the relationship between the large eigenvalues eigenspaces before and after scaling (say and ). For this purpose we consider the decomposition of the underlying space into one and two-dimensional subspaces which are invariant under the actions of both and (projections onto and respectively) and this helps the analysis significantly. Such decomposition has been quite useful in earlier works as well for example in quantum walk and quantum complexity theory .
We present the algorithm in the next section and its analysis, both optimality and the running time, in the subsequent section. Due to space constraints we move some proofs to the Appendix.
Algorithm
Given the positive semidefinite program as above, we first show in Appendix A that without loss of generality can be in the following special form.
Analysis
For all of this section, let . In the following we assume that is sufficiently large and is sufficiently small.
In this section we present the analysis assuming that all the operations performed by the algorithm are perfect. We claim, without going into further details, that similar analysis can be performed while taking into account the accuracy loss due to the actual operations of the algorithm in the limited running time.
For all , satisfies the conditions and in Step (3d) in the Algorithm.
Follows since ∎
Following lemma shows that for any time , is not much larger than .
For all ,
Fix any . As , the loop at Step 3(c) runs at most times. Hence
Following lemma shows that as increases, there is a reduction in the trace of the dual variable in terms of the trace of the primal variable.
For all we have,
Following lemma relates the trace of with the trace of and .
We can now finally bound the trace of in terms of the trace of .
and are feasible for the and respectively and
It is easily verified that and are feasible for and respectively. From Lemma 5 we have,
Since we have
2 Time complexity
Let us first introduce some notation. Let be a Hermitian matrix and be a real number. Let
denote the projector onto the space spanned by the eigenvectors of with eigenvalues at least . Let be shorthand for .
denote the sum of eigenvalues of at least . Thus . Let be shorthand for .
denote the k-th largest eigenvalue of .
.
for any two vectors we say majorizes , denoted , iff and for any we have, .
For Hermitian matrices and , implies for all . Thus for any real number .
Let be an Hermitian matrix and be a family of mutually orthogonal projections. Then
For any two projectors and , there exits an orthogonal decomposition of the underlying vector space into one dimensional and two dimensional subspaces that are invariant under both and . Moreover, inside each two-dimensional subspace, and are rank-one projectors.
Let be the final value of . Then .
Hence . Therefore . ∎
For any fixed , the number of iterations of the algorithm is at most . Hence combined with Lemma 10, the total number of iterations of the algorithm is at most
Fix . Assume that the Algorithm has reached step for this fixed , times. As argued in the proof of Lemma 4, whenever Algorithm reaches step , . Thus there exists a value between and such that at least times.
From Lemma 4 we get that the sum of the eigenvalues above , is at most at the beginning of this phase. Whenever in this phase, using Fact 7, we conclude that the eigenvalues of above do not increase. Whenever in this phase, using Lemma 12, we conclude that the eigenvalues of above reduce by a factor of . This can be seen by letting in Lemma 12 to be and to be . Now condition of the Algorithm gives condition of Lemma 12. Condition of Lemma 12 can also be seen to be satisfied (using Lemma 3) and condition of Lemma 12 is false due to condition of the Algorithm. This implies condition of Lemma 12 must also be false which gives us the desired conclusion.
Therefore the eigenvalues of above (in particular above ) will vanish before , times. Hence must decrease before the Algorithm has reached step , times. ∎
Following is a key lemma. It states that for two positive semidefinite matrices , if has good weight in the large (above ) eigenvalues space of and if the sum of large (above ) eigenvalues of is pretty much the same as for , then the sum of eigenvalues of , slightly below should be a constant fraction larger than the sum above .
Let . Let be two positive semidefinite matrices satisfying
In order to prove this Lemma we will need to first show a few other Lemmas. By Fact 9, and decompose the underlying space as follows,
Above for each , is either one-dimensional or two-dimensional subspace, invariant for both and and inside at least one of and survives. is the subspace where both and vanish. We identify the subspace and the projector onto itself. For any matrix , define to be . We can see that both the projectors and are decomposed into the direct sum of one-dimensional projectors as follows.
For any , and . That is, the eigenspace of with eigenvalues at least , is exactly the restriction of to and similarly for .
Let and , then
is the spectral decomposition of . As and , we have and , and hence ∎
We prove (6) and (7) and (8) follow similarly.
From (1), for all , . Combined with (8), we have
From (10) (since for all ),
Note that for any , . Otherwise, either or and neither of these can happen in (from definitions of and ).
The following lemma states that for each , the second eigenvalue of is close to . Its proof involves some direct calculations and due to space constraint we move it to Appendix B.
Let and be positive semidefinite matrices satisfying
Then
We can finally prove Lemma 12. By Fact 7, . Let , and . Then
According to the decomposition in Fact 9, Lemma 14 and the remarks below it, and
The RHS of both the equations are equal by Lemma 14. Therefore,
Let , then
Therefore from previous three inequalities,
Note that , therefore from Remark 2.,
Penghui Yao would like to thank Attila Pereszlnyi and Huangjun Zhu for helpful discussions.
References
Appendix A Transforming to special form
Let us consider an instance of a positive semidefinite program as follows.
We show how to transform the primal problem to the special form and a similar transformation can be applied to dual problem. First observe that if for some , , the corresponding constraint in primal problem is trivial and can be removed. Similarly if for some , the support of is not contained in the support of , then must be and can be removed. Therefore we can assume w.l.o.g. that for all and the support of is contained in the support of . Hence w.l.o.g we can take the support of as the whole space, in other words, is invertible. For all define . Consider the normalized Primal problem.
The next step to transforming the problem is to limit the range of eigenvalues of s. Let .
Let be the spectral decomposition of . Define for all and ,
Define Consider the transformed Primal problem .
Transformed Primal problem
Any feasible solution to is also a feasible solution to .
Follows immediately from the fact that .
Fix . Assume that there exists such that . Then, from Claim 18
Note that for all , the ratio between the largest eigenvalue and the smallest nonzero eigenvalue of is at most .
Finally, we get the special form Primal problem as follows. Let and for all define . Consider,
Appendix B Deferred Proofs
is the eigenvector of with eigenvalue . Hence . Note that , because Consider