On largest volume simplices and sub-determinants

Marco Di Summa, Friedrich Eisenbrand, Yuri Faenza, Carsten Moldenhauer

Introduction

Many techniques in convex geometry begin with approximating a geometric shape by a simpler one. The maximum volume ellipsoid, or John ellipsoid (see, e.g., ), for example, is a prominent such simplification with many applications in discrete and continuous optimization.

Simplices are, next to ellipsoids, among the most primitive convex sets. We are interested here in the problem of approximating a given VV-polytope by a contained simplex of largest volume. More precisely, we investigate the approximability and hardness of the following problem.

We assume here, without loss of generality, that the convex hull of the points a1,,ana_{1},\dots,a_{n} is full-dimensional. As is the case for ellipsoids, the largest volume simplex in a convex body has attracted a lot of attention in the computer science and optimization literature, see, e.g., .

where AA is the matrix with columns v1v0,v2v0,,vdv0v_{1}-v_{0},v_{2}-v_{0},\dots,v_{d}-v_{0}. Thus MVS can be reduced to nn instances of the problem of finding the largest absolute value of a d×dd\times d subdeterminant of a d×(n1)d\times(n-1) matrix. This motivates the second problem that is central to our study.

Here a basis of a d×nd\times n matrix AA is a maximal subset of the column indices B{1,,n}B\subseteq\{1,\dots,n\} such that the corresponding columns are linearly independent, and ABA_{B} is the matrix consisting of the columns indexed by BB. Khachiyan has shown that there exists a ((1+ε)d)(d1)/2((1+\varepsilon)\cdot d)^{(d-1)/2} approximation algorithm for MSD, and thus also for MVS, with running time polynomial in n,dn,d and 1/ε1/\varepsilon.

We show that there exists an algorithm for MVS, with running time polynomial in n,dn,d and 1/ε1/\varepsilon, that computes a simplex Σ\Sigma with

We show that there exists a constant c>1c>1 such that MVS cannot be approximated within a factor of cdc^{d}, unless P=NPP=NP. This improves the previous best 1.091.09-inapproximability of Packer .

Our hardness result (ii) holds for instances with n=Θ(d)n=\Theta(d). A recent sampling technique immediately yields a cˉd\bar{c}\,^{d}-approximation for such instances (for another constant cˉ\bar{c}), showing that the hardness is essentially tight in this case.

These results (i), (ii) and (iii) also hold for MSD.

The literature on topics related to MVS and MSD is extensive. In order to put our results in perspective, we provide an overview of a selection of related papers.

Variants where the dimension of the solution can be restricted, e.g. finding a maximum volume jj-dimensional simplex, have been considered for MVS and MSD .

Hardness of approximation

Packer has shown that MVS is inapproximable within a constant factor smaller than 1.091.09. This implies the same hardness for MSD, which was shown earlier to be NP-hard by Papadimitriou .

In both cases, the authors show that there exist a constant c>1c>1 and a function j(d)j(d) such that it is NP-hard to approximate the respective problem with factor less than cj(d)c^{j(d)}. Here, j(d)j(d) is linear in dd, but with constant dependence strictly less than one, thus these results do not cover the case j=dj=d.

Subdeterminants in optimization and combinatorics

The subdeterminant lower bound for hereditary discrepancy is maxkΔkk\max_{k}\sqrt[k]{\Delta_{k}}. Recently Matousek has shown that the subdeterminant bound is tight up to a polynomial factor in logd\log d and logn\log n. In a recent series of papers by Nikolov et al. it was shown how to approximate the hereditary discrepancy, and thus the subdeterminant bound maxkΔkk\max_{k}\sqrt[k]{\Delta_{k}}, within a polynomial factor in logd\log d and logn\log n. Our result provides a O(logd)O(\log d)-approximation to Δdd\sqrt[d]{\Delta_{d}}. It is an interesting problem whether a polynomial time approximation algorithm for the subdeterminant bound with a guarantee that is polynomial in logd\log d exists.

A tight analysis of Khachiyan’s algorithm

We now come to the main algorithmic result of our paper and show the following theorem. Recall that ABA_{B} is the matrix corresponding to the columns of AA indexed by BB. We denote the set of bases of AA by B\mathscr{B}.

The algorithm runs in time polynomial in n,dn,d, and 1/ε1/\varepsilon. Thus MSD and MVS can be approximated within the factor above.

The largest d×dd\times d subdeterminant of AA, in absolute value, corresponds to the largest volume simplex in A\mathscr{A} with one vertex being the origin. The algorithm begins by rounding the polytope A\mathscr{A}.

Khachiyan showed that, if K\mathcal{K} is a symmetric VV-polytope that is explicitly given by its vertices, then it is possible to compute an ellipsoid E\mathscr{E} such that EK(1+ε)d E\mathscr{E}\subseteq\mathcal{K}\subseteq\sqrt{(1+\varepsilon)d}\ \mathscr{E} in time polynomial in the number of vertices of K\mathcal{K}, dd and 1/ε1/\varepsilon. This applies to the polytope A\mathscr{A}.

holds. From there, the algorithm proceeds in a greedy fashion, see Figure 1.

Note that v1,,vdv_{1},\dots,v_{d} correspond to the Gram-Schmidt orthogonalization of the vectors returned by the greedy procedure. Thus, after rounding, the absolute value of the determinant induced by the chosen vectors is v1vd\|v_{1}\|\dots\|v_{d}\|.

We now review Khachiyan’s analysis showing that his algorithm is a factor ((1+ε)d)(d1)/2((1+\varepsilon)d)^{(d-1)/2} approximation algorithm for MSD. Let us denote the Euclidean lengths of the picked vectors viv_{i} by ρi=vi\rho_{i}=\lVert v_{i}\rVert for i=1,,di=1,\dots,d, and define Δmax=maxBBdet(AB)\Delta_{\max}=\max_{B\in\mathscr{B}}|\det(A_{B})|. The claimed bound follows from the following two facts:

There is a polynomial-time ((1+ε)d)(d1)/2((1+\varepsilon)d)^{(d-1)/2}-approximation algorithm for problems MSD and MVS.

2 Improving the analysis

We will now show that the approximation factor can in fact be bounded by (eln((1+ε)d))d/2(e\ln((1+\varepsilon)d))^{d/2}. To do so, we significantly tighten the upper bound (i) presented in the analysis above. Key to this improvement is the following observation.

Each viv_{i} is on a principal axis of E\mathscr{E}.

None of the vectors viv_{i} is contained in the interior of E\mathscr{E}.

There exists α>0\alpha>0 such that each aia_{i} is contained in αE\alpha\cdot\mathscr{E}.

The theorem follows from the existence of an ellipsoid E\mathscr{E} satisfying the conditions of Lemma 3 with α=eln((1+ε)d)\alpha=\sqrt{e\ln((1+\varepsilon)d)}.

The vectors v1,,vdv_{1},\dots,v_{d} are pairwise orthogonal due to the projection in Step 2. Recall that determinants are invariant under rotation and note that the algorithm’s execution is also invariant under rotation. Therefore, by suitably rotating the input, we can assume without loss of generality that v1,,vdv_{1},\dots,v_{d} are the vectors ρ1e1,,ρded\rho_{1}\cdot e_{1},\dots,\rho_{d}\cdot e_{d}, where eie_{i} is the ii-th unit vector.

Next, let us group the elements of the decreasing sequence ρ1,,ρn\rho_{1},\dots,\rho_{n} as follows. Let GjG_{j} be the set of indices ii such that ρ1exp((j1)/2)ρi>ρ1exp(j/2)\frac{\rho_{1}}{\exp((j-1)/2)}\geq\rho_{i}>\frac{\rho_{1}}{\exp(j/2)}. Let tt denote the number of such groups. Since ρ1/ρd(1+ε)d\rho_{1}/\rho_{d}\leq\sqrt{(1+\varepsilon)d}, we have tlogexp(1/2)(1+ε)d=ln((1+ε)d)t\leq\log_{\exp(1/2)}\sqrt{(1+\varepsilon)d}=\ln((1+\varepsilon)d). Assume that all groups GjG_{j} are non-empty (discarding empty groups will decrease the number of groups and subsequently yield an improved analysis). Let rjr_{j} be the largest element of GjG_{j} and note that rj/ρier_{j}/\rho_{i}\leq\sqrt{e} for all ρiGj\rho_{i}\in G_{j}.

We are done once we have shown that the ellipsoid E\mathscr{E}, together with α=eln((1+ε)d)\alpha=\sqrt{e\ln((1+\varepsilon)d)}, satisfies the conditions (a)–(c) from Lemma 3.

3 Tightness of the analysis

We now provide a family of instances of increasing dimension where Khachiyan’s algorithm achieves a ratio of (αln(d))d/2(\alpha\ln(d))^{d/2}, where α0.748\alpha\geq 0.748 tends to 11 as dd grows. Thus, we basically match the upper bound on the approximation ratio given in the previous section.

Let d4d\geqslant 4 be a power of two. The instances are matrices with dd rows of the form

Here, DD is a d×dd\times d diagonal matrix of the form

Thus the polytope that is generated by the columns of the matrix (1) and their negatives is “round”, in the sense that it contains the unit ball and it is contained in the unit ball scaled by d\sqrt{d}. We will show below that Khachiyan’s algorithm will output a solution of value at most cdetDc\cdot|\det D|. This implies the claim, as

Then using x1ln(x)x-1\geq\ln(x), we deduce that the approximation ratio is

Let ww be any column vector of c21cDH\frac{\sqrt{c^{2}-1}}{c}DH. The squared norm of the projection of ww into the orthogonal complement of the first ii column vectors of DD (where i{0,,d1}i\in\{0,\dots,d-1\}) is

The last term is the squared norm of the (i+1)(i+1)-th column vector of DD. Thus Khachiyan’s algorithm outputs the first d1d-1 columns of DD, and in the last step a column of norm at most cc from EE. The determinant of the column vector selected by Khachiyan’s algorithm is then at most c×detDc\times|\det D|, as claimed above.

Hardness

We now consider the hardness of approximating MVS and MSD. As mentioned in Section 1.1, the best inapproximability result was due to Packer , who proved that MVS cannot be approximated with a factor better than 1.091.09, unless P=NPP=NP. In this section we provide a drastic improvement showing that it is NP-hard to approximate MVS and MSD with a factor cdc^{d}, where c>1c>1 is an explicit constant. In particular, we show that the result holds for instances where n=Θ(d)n=\Theta(d). We will also conclude that the hardness result is best possible for such instances.

Our argument is based on the connection between MSD and the following problem.

Given a simple undirected graph, find a maximum family of vertex-disjoint odd cycles.

The overall strategy for proving hardness is the following. We first build on a hardness result by Berman and Karpinski on stable sets in 33-regular graphs and show that OCP is NP-Hard to approximate with a factor cˉ{\bar{c}}, where cˉ>1\bar{c}>1 is an explicit constant. Our second step is to reduce OCP to MSD, using the construction seen above. Hence the constant inapproximability for OCP leads to a cdc^{d}-inapproximability for MSD. Last, we reduce MSD to MVS.

Let us remark that OCP is NP-hard even when restricted to planar graphs and, in that case, allows for constant factor approximations . Following the hardness for packing the maximum number of disjoint cycles by Friggstad and Salavatipour , we can deduce for OCP a constant hardness under P\neqNP and a hardness of O(log12εn)O\left(\log^{\frac{1}{2}-\varepsilon}n\right) unless NPZPTIME(npolylog(n))\text{NP}\subseteq\text{ZPTIME}\left(n^{\text{polylog}(n)}\right), where nn is the number of nodes of the graph. This result relies on the PCP-theorem. Our construction below yields a weaker hardness for OCP, but it does not rely on the PCP-theorem, it leads to an improved hardness result for the vertex-disjoint triangle packing problem, and it is much simpler to use for constructing a hardness for MSD subsequently. In particular, it enables us to easily calculate the explicit constants in the hardness for MSD.

On the positive side, Kawarabayashi and Reed showed that for general graphs OCP can be approximated within a factor of n\sqrt{n}.

We now describe our inapproximability result for OCP. We require a result of Berman and Karpinski for maximum stable set on 3-regular graphs. Given a system of 2n2n linear equations modulo 2 with 3 variables per equation, Håstad showed that it is NP-hard to distinguish between instances where there exists a solution satisfying (1ε)2n(1-\varepsilon)2n equations and instances where no solution satisfies more than (1+ε)n(1+\varepsilon)n equations, for any arbitrarily small ε>0\varepsilon>0. Building on this result, Berman and Karpinski gave a polynomial time construction of a 3-regular graph on 176n176n vertices that translates nn satisfied equations to a maximum stable set of size at most 97n97n and 2n2n satisfied equations to a maximum stable set of size at least 98n98n. Thus, it is NP-Hard to detect if a 33-regular graph on 176n176n vertices has a stable set of size at least (1ε)98n(1-\varepsilon)98n, or at most (1+ε)97n(1+\varepsilon)97n, for each ε>0\varepsilon>0.

Let G=(V,E)G=(V,E) be the graph constructed in . Intuitively, we would like to construct a new graph HH such that every vertex in GG corresponds to a triangle in HH and that a stable set in GG also corresponds to a packing of triangles in HH. A first candidate for such a graph HH would be the line graph of GG (recall that GG is 33-regular), but in the line graph we might also create triangles that do not correspond to vertices in GG.

We solve this issue by slightly changing GG. Subdivide every edge in GG twice, i.e., substitute an edge {u,v}\{u,v\} by a path Puv=u,p1,p2,vP_{uv}=u,p_{1},p_{2},v. Let GG^{\prime} be the obtained graph. Since GG has 176n176n vertices, hence 32176n=264n\frac{3}{2}176n=264n edges, GG^{\prime} has 176n+2264n=704n176n+2\cdot 264n=704n vertices and 3264n=792n3\cdot 264n=792n edges. Note that GG^{\prime} is triangle-free.

We now prove that every subdivision of an edge in GG augments the stable sets by exactly one vertex. Consider a stable set SS in GG. By choosing p2p_{2} when uSu\in S or p1p_{1} otherwise we obtain an induced stable set in GG^{\prime} of size S+264n|S|+264n. Conversely, let SS^{\prime} be a stable set in GG^{\prime}. Modify SS^{\prime} such that for each PuvP_{uv} not both uu and vv are in SS^{\prime}: if both are in SS^{\prime}, then the stable set S{v}{p2}S^{\prime}\setminus\{v\}\cup\{p_{2}\} has the same cardinality and only includes one of {u,v}\{u,v\}. Then we can obtain a stable set in GG of size at most S264n|S^{\prime}|-264n. In particular, a stable set of size 97n97n (resp., 98n98n) in GG translates into a stable set of size 97n+264n=361n97n+264n=361n (resp., 362n362n) in GG^{\prime}.

Now let HH be constructed as follows: starting from the line graph of GG^{\prime}, for each PuvP_{uv} add two vertices and connect them as to obtain the graph HuvH_{uv} depicted in Figure 2. The number of vertices in HH is 792n+2264n=1320n792n+2\cdot 264n=1320n and the number of edges is 12(4792n+22264n)=2112n\frac{1}{2}(4\cdot 792n+2\cdot 2\cdot 264n)=2112n, as every vertex belonging to the line graph of GG^{\prime} has degree four and the two additional vertices in each HuvH_{uv} have degree two.

As GG^{\prime} is triangle-free, there is a one-to-one correspondence between triangles in HH and vertices in GG^{\prime}. Moreover, two vertices in GG^{\prime} are adjacent if and only if their corresponding triangles in HH have a common vertex. Thus a maximum stable set of size 361n361n (resp., 362n362n) in GG^{\prime} translates into a maximum number of 361n361n (resp., 362n362n) vertex-disjoint triangles in HH. It is known that finding the maximum number of vertex-disjoint triangles in a graph is APX-hard . However, no explicit lower bound was known.

It is NP-hard to approximate the maximum number of vertex-disjoint triangles in a graph with a factor of (362361ε)\left(\frac{362}{361}-\varepsilon\right) for arbitrarily small constant ε>0\varepsilon>0. The result holds even for graphs with maximum degree four.

Now, consider OCP in HH, i.e., finding the maximum number of vertex-disjoint odd cycles. We prove that there is always an optimal solution consisting of triangles only. Assume the contrary, i.e., the optimal solution includes a cycle CC of length at least 55. We distinguish three cases. First, CC does not contain any vertex other than those of type xx and yy (see Figure 2). Then, CC must be a triangle. Second, CC is fully contained in some HuvH_{uv}. One easily checks that also in this case CC must be a triangle. Third, CC is not contained in any HuvH_{uv}. Then CC has to pass through the node zz of some HuvH_{uv} and hence the solution does not include any triangle in such HuvH_{uv}. Substitute CC by any of the two triangles in HuvH_{uv} to obtain another optimal solution to OCP. Hence there is an optimal solution to OCP in HH only containing triangles. Therefore we have the following hardness result.

It is NP-hard to approximate OCP with a factor of (362361ε)\left(\frac{362}{361}-\varepsilon\right) for arbitrarily small constant ε>0\varepsilon>0. The result holds even for graphs with maximum degree four.

2 From OCP to MSD and MVS

Consider the node-edge incidence matrix AA of HH: this is a 1320n×2112n1320n\times 2112n matrix. From what was argued above, the maximum number of vertex-disjoint odd cycles of 361n361n (resp., 362n362n) translates into subdeterminants of size 2361n2^{361n} (resp., 2362n2^{362n}). This allows us to prove a hardness of 2n2^{n} when the dimension is d=1320nd=1320n.

It is NP-hard to approximate MSD with a factor of (21/1320ε)d\left(2^{1/1320}-\varepsilon\right)^{d} for arbitrarily small constant ε>0\varepsilon>0. The hardness even holds when restricted to node-edge incidence matrices of graphs with maximum degree four.

where ε=723ε\varepsilon^{\prime}=723\varepsilon and ε=21/1320(12ε1320)>0\varepsilon^{\prime\prime}=2^{1/1320}\left(1-2^{-\frac{\varepsilon^{\prime}}{1320}}\right)>0. ∎

We now derive a similar inapproximability result for MVS.

It is NP-hard to approximate MVS with a factor of (21/1320ε)d\left(2^{1/1320}-\varepsilon\right)^{d} for arbitrarily small constant ε>0\varepsilon>0.

where SS^{\prime} is the simplex among S1,,Sd+1S_{1},\dots,S_{d+1} of maximum volume. Note moreover that the submatrix AA^{\prime} associated to the non-zero vertices of SS^{\prime} is a feasible solution to the original MSD problem. We deduce

Hence, we can output AA^{\prime} and obtain the required approximation. ∎

For every ε>0\varepsilon>0, it is NP-Hard to approximate

the maximum number of vertex-disjoint triangles in a graph and OCP within a factor of (285284ε)\left(\frac{285}{284}-\varepsilon\right);

MSD and MVS with a factor of (21/1012ε)d\left(2^{1/1012}-\varepsilon\right)^{d}.

3 Tightness for instances with n=O​(d)𝑛𝑂𝑑n=O(d)

The claimed result then follows by repeated sampling and picking the largest basis.

If n=O(d)n=O(d), then there exists a randomized algorithm for MSD with approximation ratio cˉd\bar{c}^{d} for some constant cˉ\bar{c} that depends on the constant in the OO-notation.

References