Polynomial integrality gaps for strong SDP relaxations of Densest k-subgraph

Aditya Bhaskara, Moses Charikar, Venkatesan Guruswami, Aravindan Vijayaraghavan, Yuan Zhou

Introduction

The densest kk-subgraph problem takes as input a graph G(V,E)G(V,E) on nn vertices and a parameter kk, and asks for a subgraph of GG on at most kk vertices having the maximum number of edges. While it is a fundamental graph optimization problem and arises in several applications (community detection in social networks, identifying protein families and molecular complexes in protein-protein interaction networks, etc), there is a huge gap between the best approximation algorithm and the known inapproximability results. The current best approximation algorithm due to [BCC+10] gives O(n1/4+ε)O(n^{1/4+\varepsilon})-factor approximation algorithm which runs in time nO(1/ε)n^{O(1/\varepsilon)} for any constant ε>0\varepsilon>0. On the inapproximability side, [Fei02] initially showed a small constant factor inapproximability for Densest kk-subgraph using the random 3-SAT assumption. [Kho04] used quasi-random PCPs to rule out a PTAS. More recently, [RS10, AAM+10] used more non-standard assumptions to rule out any constant factor approximation algorithms.

While only constant factor approximations have been ruled out, it is commonly believed that Densest kk-subgraph is much harder to approximate even on average (for a natural distribution on hard instances). Recently, average-case hardness assumptions based on the hardness of “planted” versions of Densest kk-subgraph were used for public key cryptography [ABW08] and in showing that financial derivates can be fraudulently priced without detection [ABBG10]. Given the interest in Densest kk-subgraph from both the algorithms and the complexity point of view, developing a better understanding of the problem is an important challenge for the field.

In this work, we study lift-and-project relaxations for Densest kk-subgraph. Lift-and-project methods are systematic iterative procedures to obtain sequences of increasingly stronger mathematical programming relaxations for an integer optimization problem (e.g. Lovász-Schrijver [LS91], Sherali-Adams [SA90] and Lasserre [Las01]. See the survey by Laurent [Lau03] for a comparison). Typically, the relaxation obtained after rr levels of these strengthenings can be solved in nO(r)n^{O(r)} time. A number of recent papers have studied the strength and limitations of such relaxations as a basis for designing approximation algorithms for various problems [ABLT06, CMM09, Chl06, CS06, dVM07, GMPT07, GMT09, KS09, RS09, Sch08, STT07a, STT07b, Tul09] (see the recent survey by Chlamtac and Tulsiani [CT11]). In most cases of approximation algorithms that use strengthened LP and SDP relaxations, such relaxations can be obtained from a few levels of such lift-and-project procedures. In fact, the O(n1/4+ε)O(n^{1/4+\varepsilon}) approximation algorithm of [BCC+10] for Densest kk-subgraph uses a linear programming relaxation which is weaker than that obtained from O(1/ε)O(1/\varepsilon) levels of the Sherali-Adams hierarchy.[BCC+10] also gives a purely combinatorial algorithm that does not use a linear program. [BCC+10] also show that the integrality gap becomes O(n1/4ε)O(n^{1/4-\varepsilon}) after nO(ε)n^{O(\varepsilon)} levels of the Sherali Adams LP hierarchy.

Our gap instances are actually (Erdös - Renyi) random graph instances G(n,p)\mathcal{G}(n,p) and random bipartite graphs under a special distribution – hence, we show that natural distributions of instances are integrality gap instances with high probability.

We note that prior results exhibiting gap instances for lift-and-project relaxations do so for problems that are already known to be hard to approximate under some suitable assumption; based on this hardness result, one would expect lift-and-project relaxations to have an integrality gap that matches the inapproximability factor.

Our gap constructions for Densest kk-subgraph in this paper are a rare exception to this trend, as the integrality gaps we show are substantially stronger than the (very weak) hardness bounds known for the problem. In fact, we are only aware of the following examples where a polynomial-round Lasserre integrality gap stronger than the corresponding NP-hardness result is known : Max KK-CSP, KK-coloring [Tul09], Balanced Separator and Uniform Sparsest Cut [GSZ11]. In the first two cases, NP-hardness results that are not that far from the gaps are known [ST00, Kho02] and for Max KK-CSP a matching Unique-Games hardness is also known [ST09]. For the other two problems, constant factor integrality gaps were shown for linear number of rounds of Lasserre hierarchy [GSZ11]. Again, while these problems are not known to be APX-hard, under the conjectured intractability of Small Set Expansion, they are known to be hard to approximate within any constant factor.

In the absence of inapproximability results for Densest kk-subgraph, our results show that beating a factor of nΩ(1)n^{\Omega(1)} is a barrier for even the most powerful SDPs, and in fact even beating the best known n1/4n^{1/4} factor is a barrier for current techniques. These results are perhaps indicative of the hardness of approximating Densest kk-subgraph within nΩ(1)n^{\Omega(1)} factors.

A problem related to Densest kk-subgraph is the Small Set Expansion (SSE) problem, which has received a lot of recent attention due to strong connections to the Unique Games conjecture [RS10]. One way to state the SSE conjecture [RS10] (which is known to imply the Unique Games conjecture) is as follows: for all ϵ>0\epsilon>0, there exists δ,D\delta,D (think of DD as a constant), such that the following problem is not polynomial-time solvable:

Given a DD-regular instance G(V,E)G(V,E) with k=δnk=\delta n, the Gap-SSE problem is to distinguish between the following two cases.

Yes case. There exists a subgraph of kk vertices with average degree at least (1ε)D(1-\varepsilon)D.

No case. All subgraphs of kk vertices have average degree at most εD\varepsilon D.

Clearly, Densest kk-subgraph is hard to approximate within any constant factor, assuming the Small Set Expansion conjecture. On the other hand, our results indicate that approximating Densest kk-subgraph even within a polynomial factor may be a harder problem than Unique Games or Small Set Expansion, because these problems were recently shown to be solvable using nϵΩ(1)n^{\epsilon^{\Omega(1)}} rounds of the Lasserre hierarchy, where ϵ\epsilon is the completeness parameter in Unique Games and Small Set Expansion [BRS11, GS11].

Preliminaries

We introduce some notation which will be used throughout the paper. G=(V,E)G=(V,E) refers to a graph which is an instance of the Densest kk-subgraph problem on nn vertices, and kk refers to the size of the subgraph we are required to output. For an induced subgraph HGH\subseteq G, we denote by d(H)d(H) the average degree (or density of HH). For a vertex vv in subgraph HH, we will denote by ΓH(v)\Gamma_{H}(v) the set of neighbors of vv in HH (the suffix will be dropped when H=GH=G).

The phrase “with high probability” will mean: with probability 11p(n)1-\frac{1}{p(n)}, for any polynomial p(n)p(n).

2 The relaxation hierarchies for Densest k𝑘k-subgraph.

We will be concerned with the SDP relaxations derived from the Sherali-Adams and Lasserre hierarchies for Densest kk-subgraph. As in other lift-and-project schemes, a feasible solution to rr levels of these hierarchies satisfies the condition that for any set of rr vertices, it defines a valid distribution over integral solutions for these vertices – in particular, the integrality gap becomes 11 after nn levels. Further, the relaxations given by rr levels of the Sherali-Adams and Lasserre hierarchies can be solved in nO(r)n^{O(r)} time. We are interested in the integrality gap of rr levels of these relaxations for Densest kk-subgraph. Refer to [CT11] for a more comprehensive comparison of these relaxation hierarchies.

The Sherali-Adams hierarchy starts with a simple LP relaxation of a {0,1}\{0,1\} integer program, and obtains a sequence of successively tighter relaxations with more levels. The natural LP relaxation for Densest kk-subgraph (LP1 in Figure 1) [SW98, FS97] has variables {xi}\{x_{i}\} to denote if vertex ii belongs to the solution, and edge variables {xij}(i,j)E(G)\{x_{ij}\}_{\begin{subarray}{c}(i,j)\in E(G)\end{subarray}} to denote if both i,ji,j are in the subgraph. This LP has an integrality gap of Ω(nk)\Omega(\frac{n}{k}) ([FKP01, FS97]).

For our integrality gaps, we will in fact start with a stronger basic (first-level) linear program (LP2 in Figure 1) which is equivalent upto a factor of 22 (see [BCC+10]). Intuitively, it tries to find a kk-subgraph HH where the minimum degree dHd_{H} is maximized. An LP hierarchy obtained from this min. degree LP (LP2) was in fact used by [BCC+10] to obtain their approximation algorithm. While the program as stated is not linear, we guess the degree dd and consider the feasibility linear program that is obtained.

Let us consider strengthening this LP by considering rr levels of the Sherali-Adams hierarchy (SArSA_{r}, shown in Figure 2). In the lifted LP, the variable xSx_{S} is supposed to capture whether every vertex in SS belongs to the chosen kk-subgraph (i.e., xS=iSxix_{S}=\prod_{i\in S}x_{i}). Further if we take two sets S,SS,S^{\prime} of r\leq r vertices, the local distributions induced by a feasible solution (using the inclusion-exclusion constraints), agree on the variables in the intersection SSS\cap S^{\prime}. We follow the notation established in [CT11] while defining the hierarchy.

2.2 The mixed hierarchy (Sherali-Adams + SDP).

The mixed hierarchy (also refered to as SA+) imposes an additional SDP constraint on top of the Sherali-Adams LP relaxation. In particular, it asks for the values xijx_{ij} to come from vector inner products i.e. the matrix X=(xij)X=(x_{ij}) is p.s.d. Most known algorithms which proceed by rounding a relaxation obtained from an SDP hierarchy [Chl06, CS06, BRS11] work with this mixed hierarchy [GS11] is an exception and seems to need a relaxation given by the Lasserre hierarchy..[RS09, KS09] and [GMT09] considered this hierarchy and obtained integrality gaps for Unique Games and approximation-resistant CSPs.

One level of the mixed hierarchy for Densest kk-subgraph gives the SDP relaxation introduced in [FS97, SW98]. [BCC+10] show that the mixed hierarchy performs better than log-density based arguments (which are captured by just the LP hierarchy) in a planted model.In particular, the problem of detecting if dense kk-subgraph is planted in a random graph or not, in the parameter range D<n1/2D<n^{1/2}. It is interesting in this light to obtain integrality gaps for mixed hierarchy.

2.3 The Lasserre hierarchy.

The Lasserre hierarchy produces a sequence of SDP relaxations which are stronger than the Sherali-Adams and the mixed hierarchies. As in [CT11], the rr-level Lasserre SDP for Densest kk-subgraph introduces a vector US{\bm{U}}_{S} for each subset SVS\subseteq V with Sr|S|\leq r (Figure 3).

The intended solution sets US=U{\bm{U}}_{S}={\bm{U}}_{\emptyset} if every vertex in SS belongs to the densest kk-subgraph, and US=0{\bm{U}}_{S}=\bf{0} otherwise. The vector lengths US2\left\lVert{\bm{U}}_{S}\right\rVert^{2} correspond to valid LP values xSx_{S} for the Sherali-Adams relaxation presented above.

As in Section 2.2.1, we can write an SDP which tries to find the kk-subgraph of largest induced minimum degree dd. This can be captured by the SDP constraint (analogous to (2))

However, we show in Section 4.3 that our integrality gaps also hold for the Lasserre hierarchy defined by this SDP. We refer to the SDP with constraint (4) as the Min degree Lasserre SDP .

Integrality Gap for the Sherali-Adams hierarchy

In what follows LL will denote the number of levels of the hierarchy we will consider.

Let Llogn10loglognL\leq\frac{\log n}{10\log\log n}. The integrality gap of SAL\text{SA}_{L} is at least \Omega\big{(}\frac{n^{1/4}}{L\log^{2}n}\big{)}.

To prove Theorem 3.1, we present instances GG where the relaxation has a solution with value d=Ω(n1/4/L)d=\Omega(n^{1/4}/L), while the integer optimum, i.e., the largest density of a kk-subgraph in GG is only O(log2n)O(\log^{2}n). It will be notationally convenient to construct gaps for L/2L/2 levels.

We in fact give a distribution over instances, and prove that the desired gap holds with high probability. The instances we consider are G(n,p)\mathcal{G}(n,p) random graphs with p=n1/2lognp=n^{-1/2}\log n (thus the expected degree of each vertex is D=n1/2lognD=n^{1/2}\log n). The parameter kk is chosen to be n1/2n^{1/2}. An easy calculation shows that in any kk subgraph, the density (and hence the min-degree) is at most O(log2n)O(\log^{2}n) (see full version or [BCC+10, FKP01]). The meat of the argument is thus to show that there exists an LP solution to SAL/2\text{SA}_{L/2} (Equations (1)-(3)) of value d=Ω(n1/4/L)d=\Omega(n^{1/4}/L) even for LL of the order logn/loglogn\log n/\log\log n.

The following are the properties of the distribution G(n,p)\mathcal{G}(n,p) (with above parameters) we will truly be using [see Section A for proofs]. Any graph with these properties admits the solution to SAL/2\text{SA}_{L/2} which we describe.

Every vertex has degree between (n1/2logn)/2(n^{1/2}\log n)/2 and 2n1/2logn2n^{1/2}\log n.

Any two vertices i,ji,j have at least one common neighbor and has at most O(log2n)O(\log^{2}n) common neighbours.

2 Feasible solution.

Before formally giving the xSx_{S} values, we give intuition as to what they ought to be. First, we start out setting xi=n1/2x_{i}=n^{-1/2} (equal for all vertices, since ixik=n1/2\sum_{i}x_{i}\leq k=n^{1/2} and no vertex is special). Next, suppose SVS\subset V with iSi\in S and think of dn1/4d\approx n^{1/4}. Now (2) implies that jΓ(i)xSjn1/4xS\sum_{j\in\Gamma(i)}x_{S\cup j}\geq n^{1/4}x_{S}. Further from (1), we obtain jVxSjn1/2xS\sum_{j\in V}x_{S\cup j}\leq n^{1/2}x_{S}. Thus we conclude that xSjx_{S\cup j} must be roughly n1/4xSn^{-1/4}x_{S} for jΓ(S)j\in\Gamma(S), while for j∉Γ(S)j\not\in\Gamma(S), it should be only n1/2xSn^{-1/2}x_{S}. Now consider TVT\subset V which span a tree: we could imagine starting with one vertex and adding vertices one by one (each added vertex is a neighbour of the previous ones), and thus conclude that xTx_{T} is roughly n(T+1)/4n^{-(|T|+1)/4} (since xi=n1/2x_{i}=n^{-1/2} to begin with). Now let SS be an arbitrary set of vertices and consider a tree TST\supseteq S: by monotonicity (a corollary of (3)), xSxTx_{S}\geq x_{T}, and since this is true for every such TT, we need to set xSx_{S} to be at least n(st(S)+1)/4n^{-(\mathsf{st}(S)+1)/4}, where st(S)\mathsf{st}(S) is the number of vertices (size) in the minimum Steiner tree of SS.

These, with additional ‘dampening’ factors (LL-terms), are precisely the values we will set. More precisely we consider the solution

where st(S)\mathsf{st}(S), as above, is the size of the minimum Steiner tree of SS. Thus for instance xi=n1/2/Lx_{i}=n^{-1/2}/L, while x{i,j}=1/(n3/4L2)x_{\{i,j\}}=1/(n^{3/4}L^{2}) when (i,j)E(i,j)\in E and 1/(nL2)1/(nL^{2}) otherwise (the latter is because there is a path of length-2 between any i,jGi,j\in G with high probability).

Let us fix Llogn/(10loglogn)L\leq\log n/(10\log\log n). We now show that the LP solution presented above is feasible for SAL/2\text{SA}_{L/2} with high probability. The following lemma is useful in simplifying the analysis: it implies that we need to only consider T=T=\emptyset while showing that the LP solution satisfies constraints (1) and (2). This is where the ’dampening’ factors come into play.

Let S,TS,T be disjoint subsets of VV of size at most tt and xSx_{S} be the solution described above. Then

One property of the assignment (5) is that xSixS/Lx_{S\cup i}\leq x_{S}/L for i∉Si\not\in S. Further all the xSx_{S} are 0\geq 0, and thus in the sum above, the term corresponding to JTJ\subseteq T contibutes positively when J|J| is even and negatively otherwise. Hence,

A similar proof shows the upper bound, since the xS{i}x_{S\cup\{i\}} terms for iTi\in T dominate the contributions of xSJx_{S\cup J} for J>1|J|>1. ∎

In checking feasibility, it suffices to check (1) and (2) with T=T=\emptyset.

Lemma 3.2 allows us to ‘remove’ the JT\sum_{J\subseteq T} on both sides of the equations (and set T=T=\emptyset) by losing a factor of 2. Since we allow constant slack, the claim follows. ∎

We refer to the constraints (1) and (2) as the size and the density constraints respectively, because the former says that we should pick only a kk-subgraph, and the latter says the minimum degree (density) is at least dd. The assignment we described allows us to prove the density constraint easily.

(Density Constraint) The xSx_{S} described above satisfy constraints (2).

Let SVS\subset V and iSi\in S. We need to check that jΓ(i)xSjn1/4LxS\sum_{j\in\Gamma(i)}x_{S\cup j}\geq\frac{n^{1/4}}{L}\cdot x_{S}. It is easy to see that for every jΓ(i)j\in\Gamma(i), st(Sj)st(S)+1\mathsf{st}(S\cup j)\leq\mathsf{st}(S)+1, and thus xSjn1/4LxSx_{S\cup j}\geq\frac{n^{-1/4}}{L}\cdot x_{S} (the LL term is due to the dependence on S|S| in (5)). Since there are at least n1/2logn/2n^{1/2}\log n/2 terms in the LHS, the inequality follows. ∎

3 The Size Constraint and Minimum Steiner trees in 𝒢​(n,p)𝒢𝑛𝑝\mathcal{G}(n,p).

By the above corollary, it suffices to check (noting k=n1/2k=n^{1/2}) that

We show this by proving that st(Si)st(S)+2\mathsf{st}(S\cup i)\geq\mathsf{st}(S)+2 for most iVi\in V, in particular we bound the number of exceptions (lemmas below state the precise bounds). This then implies that (6) holds.

We start with some basic facts (and notation) about Minimum Steiner trees (minST) of S(V)S(\subset V) in G(n,p)\mathcal{G}(n,p), with our parameters. We will refer to the vertices in SS as the terminals, and the rest of the vertices in a minST as the non-terminals. First, the minST must have all its leaves to be terminals. Further, since every two vertices in GG have a path of length two, we must have st(S)2S1\mathsf{st}(S)\leq 2|S|-1 for all SS. This helps us bound the number of tree structures the minST of SS can have. We define this formally.

Given SVS\subset V, a tree structure for SS is a tree TT along with a mapping g:V(S)V(T)g:V(S)\rightarrow V(T) which is one-one (not necessarily onto). The vertices in TT without an inverse image in SS are called internal vertices and the rest are also called fixed vertices. A tree structure for SS is valid if it is possible to ‘fill in’ the internal vertices with distinct vertices from VV such that all the edges in the tree are also present in GG. [The relation to Steiner trees is apparent – the internal vertices are the Steiner vertices]. Given an internal vertex in TT, the vertices of GG which take that position in some valid ‘filling in’ are called the set of candidates for that position.

Before we get to the lemmas, we note that the number of tree structures for SS of size 2S\leq 2|S| is at most (2S)2S(2|S|)^{2|S|} (this is just by a naïve bound using the number of trees). Let us now bound the number of iVi\in V for which st(Si)st(S)+1\mathsf{st}(S\cup i)\leq\mathsf{st}(S)+1.

Let SVS\subset V and TT be a tree structure for a min Steiner tree of SS (so the leaves of TT are elements of SS). Then the number of candidates for each of the positions in TT is at most (logn)2S(\log n)^{2|S|}.

The proof is by induction on the size of SS. The base case S=1|S|=1 is trivial. Assume the result for all tree structures of sets of size S1\leq|S|-1. Now consider SS. We may assume that TT has at least one non-terminal, as otherwise there is nothing to prove.

First, note that there exists a vertex uTu\in T which is adjacent to at most one non-leaf vertex in TT. This is because deleting all the leaves in TT gives a tree (which is not empty as there is at least one non-terminal in TT), and a leaf in this tree our required uu. If uu is a terminal, we could remove the leaves attached to uu (thus obtaining a subset SS^{\prime} of the terminals), and the remaining tree structure would be a valid min Steiner tree for SS^{\prime}. Further, the set of non-terminals is precisely the same, and thus the inductive hypothesis implies the claim for SS. Thus suppose uu is a non-terminal.

If degree(u)>2(u)>2, then there are at least two leaves attached to uu, thus the number of candidates for uu is only log2n\log^{2}n. Consider one candidate xx for uu. Let TT^{\prime} be the tree obtained by removing all the leaves attached to uu (thus uu is now a leaf), and SS^{\prime} be SxS\cup x minus the set of leaves attached to uu. Now TT^{\prime} is a min Steiner tree structure for SS^{\prime} (otherwise we can obtain a smaller tree for SS). Thus by the inductive hypothesis, the number of candidates for any internal vertex in TT^{\prime} is at most (logn)2S2(\log n)^{2|S|-2}. Since there are only log2n\log^{2}n of the xx’s, it follows that the total #(candidates) for an internal vertex is at most (logn)2S(\log n)^{2|S|}.

This completes the proof, by induction. ∎

Let SVS\subset V. There are at most (2Slogn)2S(2|S|\log n)^{2|S|} vertices ii such that st(Si)=st(S)\mathsf{st}(S\cup i)=\mathsf{st}(S).

Each such ii must be the internal vertex of some min Steiner tree for SS, and there are at most (2S)2S(2|S|)^{2|S|} tree structures. Lemma 3.5 now implies the claim. ∎

Let SVS\subset V. There are at most (4Slogn)4S×n1/2(4|S|\log n)^{4|S|}\times n^{1/2} vertices ii such that st(Si)=st(S)+1\mathsf{st}(S\cup i)=\mathsf{st}(S)+1.

Let ii be such a vertex. First, note that if there exists a min Steiner tree for SiS\cup i with ii as a leaf, we are done. This is because removing ii gives a min Steiner tree for SS, and thus ii is a neighbour of an internal vertex in a min Steiner tree for SS. Thus by Corollary 3.6 there are only (2Slogn)2S×(2n1/2logn)(2|S|\log n)^{2|S|}\times(2n^{1/2}\log n) such ii.

Thus suppose that the min Steiner tree for SiS\cup i has ii as an internal vertex. We will prove the bound as follows: we consider a tree structure TT of size st(S)+1\mathsf{st}(S)+1 with leaves being terminals from SS; then we show that the number of candidates for any fixed position in TT is at most (2Slogn)2Sn1/2(2|S|\log n)^{2|S|}n^{1/2}. This suffices, because the number of choices of tree structures adds an additional factor of (2S)2S(2|S|)^{2|S|}.

Let us consider a structure TT as above, and a position uu. Since uu is not a leaf, it has degree at least 22. Let the degree be dd, and let T1,,TdT_{1},\dots,T_{d} be the subtrees of TT formed by removing uu (see figure …). Now if for some ii, TiT_{i} is the min Steiner tree for the terminals in TiT_{i}, we are done, because then, each candidate for uu must be neighbour of an internal vertex in the tree, and by Corollary 3.6 there are only n×(2Slogn)2S\sqrt{n}\times(2|S|\log n)^{2|S|} candidates. Thus for each ii, TiT_{i} must have a strictly smaller tree TiT_{i}^{\prime}. Let the vertex in T1T_{1} connected to uu be called b1b_{1}. Now construct a new tree as follows: leave T1T_{1} intact, and replace T2,,TdT_{2},\dots,T_{d} by T2,,TdT_{2}^{\prime},\dots,T_{d}^{\prime}; connect b1b_{1} to T2,,TdT_{2}^{\prime},\dots,T_{d}^{\prime} using paths of length 22. The number of edges in the new tree is now at most Td(d1)+2(d1)|T|-d-(d-1)+2(d-1). The first term is the original cost, followed by removal of uu, followed by the decrease by using TiT_{i}^{\prime} as opposed to TiT_{i}, followed by the cost of adding length-2 paths.

Thus the new tree has cost at most T1|T|-1, and thus it is optimal for SS! Further, uu is adjacent to b1b_{1} which is an internal vertex, and thus the number of candidates is bounded by the desired quantity. ∎

Consider the sum iVxSi\sum_{i\in V}x_{S\cup i}. Corollary 3.6 implies that there are at most (Llogn)L(L\log n)^{L} terms which contribute a value xS/Lx_{S}/L. Lemma 3.7 implies that there are at most n1/2(2Llogn)2Ln^{1/2}\cdot(2L\log n)^{2L} terms which contribute a value xS/(n1/4L)x_{S}/(n^{1/4}L). Thus if we pick (2Llogn)2L<n1/4(2L\log n)^{2L}<n^{1/4}, we have the bound that the sum is at most n1/2xSn^{1/2}x_{S}, as desired.

Thus we have verified each of the constraints (1)-(3). This completes the proof of Theorem 3.1.

4 Gaps for the mixed hierarchy (SA+).

Consider the relaxation SAtSA_{t} described in (1)-(2), along with the constraint: Z=(xij)1i,jn0Z=(x_{ij})_{1\leq i,j\leq n}\succeq 0. The solution considered earlier (Equation (5)) turns out to also satisfy this PSD condition with high probability. The entries of ZZ are

where AA is the adjacency matrix of GG. Now AA is a G(n,p)\mathcal{G}(n,p) matrix with p=n1/2lognp=n^{-1/2}\log n. Thus the least eigenvalue is at least 2np(1p)-2\sqrt{np(1-p)} with high probability (by the Semicircle law). This is at least 4n1/4(logn)1/2-4n^{1/4}(\log n)^{1/2}. Thus we have A+4n1/4lognI0A+4n^{1/4}\sqrt{\log n}I\succeq 0. Using the fact that J0J\succeq 0, we obtain that Z0Z\succeq 0.

We conjecture that even LnεL\approx n^{\varepsilon} levels does not reduce the integrality gap substantially. We need a different approach (involving a better argument for bounding the number of trees) to extend the arguments above to this range of LL.

Integrality Gap for the Lasserre hierarchy

In this section, we show a gap instance with arbitrary large constant ratio for linear-round Lasserre relaxation, and a gap instance with nεn^{\varepsilon} ratio for n1O(ε)n^{1-O(\varepsilon)}-round Lasserre relaxation (Theorem 4.7). We also aim at maximizing the ratio of a polynomial-round Lasserre gap instance, getting a ratio of Ω(n2/53ε)\Omega(n^{2/53-\varepsilon}) (Theorem 4.8).

Our construction is based on a variant of Tulsiani’s gap instance for Max KK-CSP [Tul09] – we extend the parameter range of Tulsiani’s instance. Then we convert the Max KK-CSP instance to a constraint-variable graph and duplicate the variable vertices, which is our gap instance for Densest kk-subgraph. Note that the gap for Max KK-CSP problem is indeed a set of random instances. The vector solution from Lasserre gap for Max KK-CSP will help us exhibit a good Lasserre vector solution for Densest kk-subgraph. We finally use the structure of random instances of Max KK-CSP to show the soundness holds with high probability.

Now, let us proceed to the first step, the gap instance for Max KK-CSP.

We start by defining the Max KK-CSP problem.

The following theorem is an extension of the main theorem in [Tul09], showing that polynomial-round Lasserre relaxation cannot refute random Max KK-CSP with high probability.

V(S1,α1),V(S2,α2)0\langle{\bm{V}}_{(S_{1},\alpha_{1})},{\bm{V}}_{(S_{2},\alpha_{2})}\rangle\geq 0 for all S1,S2,α1,α2S_{1},S_{2},\alpha_{1},\alpha_{2};

V(S1,α1),V(S2,α2)=0\langle{\bm{V}}_{(S_{1},\alpha_{1})},{\bm{V}}_{(S_{2},\alpha_{2})}\rangle=0 if α1(S1S2)α2(S1S2)\alpha_{1}(S_{1}\cap S_{2})\neq\alpha_{2}(S_{1}\cap S_{2});

V(S1,α1),V(S2,α2)=V(S3,α3),V(S4,α4)\langle{\bm{V}}_{(S_{1},\alpha_{1})},{\bm{V}}_{(S_{2},\alpha_{2})}\rangle=\langle{\bm{V}}_{(S_{3},\alpha_{3})},{\bm{V}}_{(S_{4},\alpha_{4})}\rangle for all S1S2=S3S4S_{1}\cup S_{2}=S_{3}\cup S_{4} and α1α2=α3α4\alpha_{1}\circ\alpha_{2}=\alpha_{3}\circ\alpha_{4};

Recall that Tulsiani showed that, if the constraint-variable graph of a Max KK-CSP(C)(C) instance has very high left-expansion, then the Lasserre SDP admits a perfect solution for it. Formally, the following lemma is (implicitly) shown in [Tul09].

Given a Max KK-CSP(C)(C) instance, if every set of constraints of cardinality srs\leq r involves more than (Kδ)s(K-\delta)s variables (where 2δ2\delta is the distance of the dual code of CC), and if 4δK4\delta\leq K, then there is a perfect solution for the SDP relaxation obtained by r/16r/16 rounds of the Lasserre hierarchy.

Hence, we only need to prove the following lemma which shows that the constraint-variable graph still has very high left-expansion, even when a constraint might involve superconstant many variables (i.e. the left degree might be superconstant).

Given β,η,K\beta,\eta,K as in Theorem 4.2, with probability 1o(1)1-o(1), for all 2sηn2\leq s\leq\eta n, every set of ss constraints involves more than (Kδ)s(K-\delta)s variables.

A similar lemma can be found in [Tul09] (Lemma A.1), which only deals with constant KK. We need a more refined argument for superconstant KK, which is in Section 4.4.

2 The Lasserre gap for Densest k𝑘k-subgraph.

The gap instance is reduced from the gap instance for Max KK-CSP in Theorem 4.2. Let CC be the dual code of a [K,Kt,2δ]q[K,K-t,2\delta]_{q} code as used in Theorem 4.2, where KK is the block length, (Kt)(K-t) is the dimension, and 2δ32\delta\geq 3 is the distance of the code. Such a code has size C=qt|C|=q^{t}, and is very sparse for small enough tt. For 1000<q1000<q and K>q2K>q^{2}, we let β=(40qt+2lnq)/K\beta=(40q^{t+2}\ln q)/K, and do the following reduction.

Given a Max KK-CSP(C)(C) instance Φ\Phi with m=βnm=\beta n constraints and nn variables. Let GΦ=(LΦ,RΦ,EΦ)G_{\Phi}=(L_{\Phi},R_{\Phi},E_{\Phi}) be the bipartite graph with mCm|C| left vertices and nqnq right vertices. For every constraint CiC_{i} and every partial assignment to variables in the corresponding tuple TiT_{i} which satisfies the constraint CiC_{i}, we introduce a left vertex. For every variable xix_{i} and its corresponding assignment, we introduce a right vertex. Formally,

We connect a left vertex (Ci,α)(C_{i},\alpha) and right vertex (xj,α)(x_{j},\alpha^{\prime}) when xjTix_{j}\in T_{i} and α\alpha^{\prime} is consistent with α\alpha, i.e.

Now we define the final graph GΦ=(LΦ,RΦ,EΦ)G_{\Phi}^{\prime}=(L_{\Phi},R_{\Phi}^{\prime},E_{\Phi}^{\prime}) in which we want to find a dense kk-subgraph where k=2mk=2m. We take β\beta copies of the right vertices in RΦR_{\Phi} to get RΦR_{\Phi}^{\prime}. To get EΦE_{\Phi}^{\prime}, we connect a left vertex uLΦu\in L_{\Phi} and a right vertex vRΦv\in R_{\Phi}^{\prime} if uu is connected to vv’s corresponding vertex in RΦR_{\Phi} in EΦE_{\Phi}. The graph GΦG_{\Phi}^{\prime} has N=mC+βnq=O(nq2t+2lnq/K)N=m|C|+\beta nq=O(nq^{2t+2}\ln q/K) vertices.

In our analysis of the reduction, we need a qq-ary linear code C\mathcal{C} that has a small constant distance (but no less than 33), small block length (but more than qq), and very high dimension. Thus, we instantiate the code C\mathcal{C} with Generalized BCH codes given by the following.

For every prime tower qq, and integer 2δ32\delta\geq 3, there are qq-ary linear codes of block length K=q21K=q^{2}-1, dimension (K4δ+3)(K-4\delta+3), and distance at least 2δ2\delta.

We include a simple proof of Lemma 4.6 as follows.

We show the contrapositive statement : the only codeword of weight at most D1D-1 is 0\bm{0}. For every codeword of weight at most D1D-1, suppose the non-zero entries are in the set {ci1,ci2,ci3,,ciD1}\{c_{i_{1}},c_{i_{2}},c_{i_{3}},\cdots,c_{i_{D-1}}\}, we have

Note that the coefficients form a Vandermonde matrix (which has full rank). Therefore we have ci1=ci2=ci3==ciD1=0c_{i_{1}}=c_{i_{2}}=c_{i_{3}}=\cdots=c_{i_{D-1}}=0, i.e. the codeword is 0\bm{0}.

3 Analysis.

We get a family of gap instances GΦG_{\Phi}^{\prime} parameterized by q>1000q>1000 and 2δ32\delta\geq 3 (using Lemma 4.6). We obtain our two main results of this section by picking appropriate parameters for code CC as follows. To get lasserre integrality gaps for N1O(ϵ)N^{1-O(\epsilon)} levels , we show the following by setting the distance 2δ=32\delta=3.

For every 1000<q<Nϵ1000<q<N^{\epsilon} (where ϵ\epsilon is an absolute small constant), there is a gap instance of ratio Ω(q)\Omega(q) for N/qO(1)N/q^{O(1)}-level Lasserre SDP. The same construction also works for the Min degree Lasserre SDP, when q=Ω(logn)q=\Omega(\log n) and q<Nϵq<N^{\epsilon}.

We now aim at getting a gap instance of ratio NϵN^{\epsilon} for polynomial-round Lasserre SDP, where ϵ\epsilon is maximized. By setting q=nγq=n^{\gamma} for some small constant γ>0\gamma>0, the distance 2δ=42\delta=4, and optimizing the other parameters, we obtain the following (refer to section 4.3 for details)

For small enough κ>0\kappa>0, there is a gap instance of ratio N2/53O(κ)N^{2/53-O(\kappa)} for the NκN^{\kappa}-round Min degree Lasserre SDP.

The two theorems follow because of Theorem 4.2, Lemma 4.9, Lemma 4.11 (completeness) and Lemma 4.12 (soundness). In the completeness case, we will use our rr-level Lasserre solution for Max KK-CSP to show that the Lasserre SDP after R=r/KR=r/K levels of the hierarchy has value at least βmK\beta mK. In the soundness case, we show that with probability 1o(1)1-o(1), the graph GΦG_{\Phi}^{\prime} does not have any 2m2m-subgraph of value more than 17/q17/q times the SDP value (Lemma 4.12). Therefore, the graph GΦG_{\Phi}^{\prime} is a gap instance of ratio Ω(q)\Omega(q) for RR-round Lasserre SDP. We proceed by first proving these lemmas.

If the Max KK-CSP(C)(C) instance Φ\Phi admits a perfect solution for rr-round Lasserre SDP relaxation, then the r/Kr/K-round Lasserre SDP relaxation for the Densest kk-subgraph instance GΦG_{\Phi}^{\prime} has a solution of value βmK\beta mK.

For any set SLΦRΦS\subseteq L_{\Phi}\cup R_{\Phi}^{\prime}, suppose the left vertices included in SS are

and the right vertices included in SS are

We have SKr1+r2r|S^{\prime}|\leq Kr_{1}+r_{2}\leq r. If all the partial assignments αi\alpha_{i}’s and αi\alpha^{\prime}_{i}’s are consistent to each other (i.e. there are not two of them assigning the same variable to different values), we can define

and let US=V(S,α){\bm{U}}_{S}={\bm{V}}_{(S^{\prime},\alpha)}, or we let US=0{\bm{U}}_{S}=\bm{0}.

For every SLΦRΦS\subseteq L_{\Phi}\cup R_{\Phi}^{\prime}, we have V(,),US=US2\langle{\bm{V}}_{(\emptyset,\emptyset)},{\bm{U}}_{S}\rangle=\left\lVert{\bm{U}}_{S}\right\rVert^{2}.

If US=V(S,α){\bm{U}}_{S}={\bm{V}}_{(S^{\prime},\alpha)} for some S,αS^{\prime},\alpha, we have V(,),US=V(,),V(S,α)=V(S,α)2=US2\langle{\bm{V}}_{(\emptyset,\emptyset)},{\bm{U}}_{S}\rangle=\langle{\bm{V}}_{(\emptyset,\emptyset)},{\bm{V}}_{(S^{\prime},\alpha)}\rangle=\left\lVert{\bm{V}}_{(S^{\prime},\alpha)}\right\rVert^{2}=\left\lVert{\bm{U}}_{S}\right\rVert^{2}. If US=0{\bm{U}}_{S}=\bm{0}, we have V(,),US=US2=0\langle{\bm{V}}_{(\emptyset,\emptyset)},{\bm{U}}_{S}\rangle=\left\lVert{\bm{U}}_{S}\right\rVert^{2}=0. ∎

We can check that all the Lasserre constraints are satisfied.

For two sets S1,S2S_{1},S_{2}, either at least one of the vectors US1,US2{\bm{U}}_{S_{1}},{\bm{U}}_{S_{2}} is 0\bm{0} (therefore their inner-product is ), or US1=VS1,α1,US2=VS2,α2{\bm{U}}_{S_{1}}={\bm{V}}_{S_{1}^{\prime},\alpha_{1}},{\bm{U}}_{S_{2}}={\bm{V}}_{S_{2}^{\prime},\alpha_{2}} for some S1,S2,α1,α2S_{1}^{\prime},S_{2}^{\prime},\alpha_{1},\alpha_{2} and US1,US2=VS1,α1,VS2,α20\langle{\bm{U}}_{S_{1}},{\bm{U}}_{S_{2}}\rangle=\langle{\bm{V}}_{S_{1}^{\prime},\alpha_{1}},{\bm{V}}_{S_{2}^{\prime},\alpha_{2}}\rangle\geq 0.

For any S1,S2,S3,S4S_{1},S_{2},S_{3},S_{4} such that S1S2=S3S4S_{1}\cup S_{2}=S_{3}\cup S_{4}, either the set of partial assignments in S1S2=S3S4S_{1}\cup S_{2}=S_{3}\cup S_{4} are consistent to each other, in which case we have US1S2=US3S4=VS,α{\bm{U}}_{S_{1}\cup S_{2}}={\bm{U}}_{S_{3}\cup S_{4}}={\bm{V}}_{S,\alpha} where SS is the union of all the variables included in S1S2S_{1}\cup S_{2} and α\alpha is the concatenation of the partial assignments in S1S2S_{1}\cup S_{2}; or we have US1S2=US3S4=0{\bm{U}}_{S_{1}\cup S_{2}}={\bm{U}}_{S_{3}\cup S_{4}}=\bm{0}.

where the third last equality is because of Observation 4.3, and the second last equality is because of Observation 4.10.

Finally, we have U2=V(,)2=1\left\lVert{\bm{U}}_{\emptyset}\right\rVert^{2}=\left\lVert{\bm{V}}_{(\emptyset,\emptyset)}\right\rVert^{2}=1.

Now, we calculate the value of the solution

If we add the constraint (4), we can still get a good SDP solution for the Min degree Lasserre SDP with high probability, as long as qq is superconstant.

For q=Ω(logn)q=\Omega(\log n), with probability 1o(1)1-o(1), this vector solution also satisfies the added constraint (4) with d=βK/2d=\beta K/2, i.e., for every set SS, for each vertex uu, we have

For each left vertex (Ci,α)(C_{i},\alpha), we have

For each right vertex (xj,α)(x_{j},\alpha^{\prime}), we have

where the last equality is because we know that U{(Ci,α)}=0{\bm{U}}_{\{(C_{i},\alpha)\}}=\bm{0} when Ci(α)1C_{i}(\alpha)\neq 1. By the property of Lasserre vectors, we know that for each i[m]i\in[m],

For q=Ω(logn)q=\Omega(\log n), the expected number of constraints containing xjx_{j} is βK=Ω((logn)t+2)=Ω(logn)\beta K=\Omega((\log n)^{t+2})=\Omega(\log n), by our choice of β\beta. Therefore, by Chernoff bound and union bound, with probability 1o(1)1-o(1), for all xjx_{j}, there are at least βK/2\beta K/2 constraints containing xjx_{j}, and for every SS, xjx_{j} and α\alpha^{\prime}, we have

Soundness.

Now, we show that random instances of Max KK-CSP give rise to graphs GφG^{\prime}_{\varphi} whose 2m2m-sized subgraphs have density O(βK/q)O(\beta K/q). Note that the large alphabet size qq allows us to get a much larger gap than we would starting from random AND instances [Fei02]. This allows us some slack in the size of the subgraphs we need to argue about.

For CC the dual of a [K,Kt,2δ]q[K,K-t,2\delta]_{q} code, we prove the following soundness lemma.

When β(40qt+2lnq)/K\beta\geq(40q^{t+2}\ln q)/K, for a random Max KK-CSP(C)(C) instance Φ\Phi, with probability 1o(1)1-o(1), any subgraph of GΦG_{\Phi}^{\prime} obtained by choosing 2m2m left vertices and 2m2m right vertices contains at most 17βmK/q17\beta mK/q edges, and therefore any 2m2m-subgraph of GΦG_{\Phi}^{\prime} contains at most 17βmK/q17\beta mK/q edges.

Note that GφG^{\prime}_{\varphi} was constructed by taking β\beta copies of the right bipartition and replicating the edges. To prove Lemma 4.12, we only need to prove the following lemma.

Suppose that q>1000,K>q2/2,t10q>1000,K>q^{2}/2,t\leq 10. When β(40qt+2lnq)/K\beta\geq(40q^{t+2}\ln q)/K, for a random Max KK-CSP(C)(C) instance Φ\Phi, with probability 1o(1)1-o(1), any subgraph of GΦG_{\Phi} obtained by choosing 2m2m left vertices and 2n2n right vertices contains at most 17mK/q17mK/q edges.

We only need to prove once there is a 2m×2m2m\times 2m subgraph of GΦG_{\Phi}^{\prime} with tt edges, there is a 2m×2n2m\times 2n subgraph of GΦG_{\Phi} with at least t/βt/\beta edges. Fix 2m2m left vertices in GΦG_{\Phi}^{\prime}, to maximize the number of edges in the subgraph, we need to select the 2m2m right vertices with most edges connected to the chosen 2m2m left vertices. Since any two right vertices GΦG_{\Phi}^{\prime} corresponding to the same right vertex in GΦG_{\Phi} have the same set of neighbors, there is an densest 2m×2m2m\times 2m subgraph HH^{\prime} of GΦG_{\Phi}^{\prime} that, for any two such vertices, chooses either both or neither of them. Now we define an subgraph HH of GΦG_{\Phi} that contains the same 2m2m left vertices. It contains a right vertex if any copy of the vertex is contained in HH^{\prime}. HH contains 2m/β=2n2m/\beta=2n vertices, and it is easy to see that there are (at least) t/βt/\beta edges in HH. ∎

We proceed by fixing a set of 2n2n vertices RR on the right. Lemma 4.13 follows from the following claim by a standard union bound over all possible choices of RR.

Recall that GΦ=(LΦ,RΦ,EΦ)G_{\Phi}=(L_{\Phi},R_{\Phi},E_{\Phi}). Suppose that q>1000,K>q2/2,t10q>1000,K>q^{2}/2,t\leq 10. Fix a subset RRΦR\subseteq R_{\Phi} (note that RΦR_{\Phi} is the same for all the instances Φ\Phi of nn variables), R=2n|R|=2n, the probability (over choice of Φ\Phi) that there does not exist a subset LLΦL\subseteq L_{\Phi} of size 2m2m such that the number of edges in the induced subgraph by LRL\cup R is more than 17mK/q17mK/q, is at least 1exp(mK/(10qt+2))1-\exp(-mK/(10q^{t+2})).

Since there are only (qn2n)exp(2n(lnq+1)){qn\choose 2n}\leq\exp(2n(\ln q+1)) choices of RR, by a union bound, with probability at least

there is no 2m×2n2m\times 2n subgraph of GΦG_{\Phi} containing more than 17mK/q17mK/q edges. The probability becomes 1o(1)1-o(1) when β=(40qt+2lnq)/K\beta=(40q^{t+2}\ln q)/K. ∎

(Proof of Claim 4.14) First, we show that with high probability, a constraint CiC_{i} is “poorly satisfied”. That is, none of the left vertices corresponding to a constraint CiC_{i} has more than Ω(K/q)\Omega(K/q) neighbors in RR – this number is roughly 1/q1/q times the corresponding value in completeness case. We prove this in the following two steps.

For a random TT with T=K|T|=K, note that the expected degree E\/[deg(T)]=2K\mathop{\bf E\/}[\deg(T)]=2K. Therefore, by Hoeffding’s inequalities for sampling without replacement (Theorem 1 and Theorem 4 in [Hoe62]), we have

Since there are C=qtq10|C|=q^{t}\leq q^{10} codewords, by a union bound, for K>q2/2K>q^{2}/2 and q>1000q>1000, we have

Now, again, by standard Chernoff bound, we have

By the calculation above we know that with probability at least 1exp(mK/(10qt+2))1-\exp(-mK/(10q^{t+2})), there are at most m/(qC)m/(q\cdot|C|) constraints that are not poorly satisfied.

For each left vertex (Ci,α)LΦ(C_{i},\alpha)\in L_{\Phi}, if CiC_{i} is poorly satisfied, we know there are at most 8K/q8K/q edges from (Ci,α)(C_{i},\alpha) to RR. If CiC_{i} is not poorly satisfied, there are at most KK edges to RΦR_{\Phi} – this upperbound also applies to RR.

Therefore, with probability at least 1exp(mK/(10qt+2))1-\exp(-mK/(10q^{t+2})), any set of 2m2m left vertices has at most 2m8K/q+m/(qC)CK17mK/q2m\cdot 8K/q+m/(q\cdot|C|)\cdot|C|\cdot K\leq 17mK/q edges connected to RR. ∎

We now complete the proofs of the main theorems in this section.

Proof of Theorem 4.7.

By combining Theorem 4.2, Lemma 4.9, Lemma 4.11 (completeness), and Lemma 4.12 (soundness) we see that with probability 1o(1)1-o(1), the graph GΦG^{\prime}_{\Phi} provides a Ω(q)\Omega(q) integrality for the number of levels RR given by

Recall that K=q21K=q^{2}-1. By setting K=q21K=q^{2}-1 and 2δ=32\delta=3, we verify that the theorem holds. ∎

Proof of Theorem 4.8.

Let q=nγq=n^{\gamma}, since N=O(nq2t+2lnq/K)=O(nq2tlnq)N=O(nq^{2t+2}\ln q/K)=O(nq^{2t}\ln q), ratio of the gap due to Lemma 4.9 and Lemma 4.12 is

Note that when 2δ32\delta\geq 3 is fixed, ϵ\epsilon is maximized when γ\gamma is maximized.

The number of rounds (due to Theorem 4.2 and Lemma 4.9) is

For very small κ>0\kappa>0, to get a gap instance for NΩ(κ)N^{\Omega(\kappa)}-round Lasserre, we need

Let γ=1O(κ)10+6.5/(δ1)\gamma=\frac{1-O(\kappa)}{10+6.5/(\delta-1)}, we have

When 2δ=42\delta=4, we get the maximized value ϵ=2/53O(κ)\epsilon=2/53-O(\kappa). ∎

4 Expansion for random Max K𝐾K-CSP instances.

In this section, we prove Lemma 4.5, restated as follows.

Lemma 4.5 (restated). Given β,η,K\beta,\eta,K as in Theorem 4.2, with probability 1o(1)1-o(1), for all 2sηn2\leq s\leq\eta n, every set of ss constraints involves more than (Kδ)s(K-\delta)s variables.

Fix 2sηn2\leq s\leq\eta n, let us upperbound the probability that there is a set of ss constraints containing at most (Kδ)s(K-\delta)s variables. Since there are (βns){\beta n\choose s} such sets, the probability is at most

Fix a set TT of ii variables, let p(s,i)p(s,i) be the number of ss-tuples (T1,T2,,Ts)(T_{1},T_{2},\cdots,T_{s}) where for each 1js1\leq j\leq s, TjT_{j} is a set of KK variables, such that 1jsTj=T\cup_{1\leq j\leq s}T_{j}=T. We have

To upperbound p(i,s)p(i,s), we view the way to enumerating valid (T1,T2,,Ts)(T_{1},T_{2},\cdots,T_{s}) as, to choose a multiset of KsKs variables (each one from TT) so that each element in TT appears at least once in the multiset, then view each element in the multiset as a distinct element, and distribute these KsKs elements to ss sets, in a balanced way. Note that in this way, we are able to enumerate all the valid ss-tuples (although some of them might be enumerated more than once). Since there are at most (Ks1i1)<(Ksi){Ks-1\choose i-1}<{Ks\choose i} valid multisets, we have

Note that when K2s<δnK^{2}s<\delta n and i(Kδ)si\leq(K-\delta)s, we have i<nKs/(n+Ks)i<nKs/(n+Ks) (since iKs(1δ/K)Ks/(1+δ/K)=nKs/(n+δn/K)<nKs/(n+Ks)i\leq Ks(1-\delta/K)\leq Ks/(1+\delta/K)=nKs/(n+\delta n/K)<nKs/(n+Ks)), and therefore

therefore the function (ni)(Ksi){n\choose i}{Ks\choose i} is increasing when i(Kδ)si\leq(K-\delta)s, therefore

for Kn1/2K\leq n^{1/2}, we use the fact that (nK)(nK)K/K!nK/3/((K/e)K(5K))=(en/K)K/(15K){n\choose K}\geq(n-K)^{K}/K!\geq n^{K}/3/((K/e)^{K}\cdot(5\sqrt{K}))=(en/K)^{K}/(15\sqrt{K}) (since by Stirling’s formula, we have K!5K(K/e)KK!\leq 5\sqrt{K}(K/e)^{K}), and again use the fact that 2πK(K/e)KK!5K(K/e)K\sqrt{2\pi K}(K/e)^{K}\leq K!\leq 5\sqrt{K}(K/e)^{K}, we bound the expression above by

For 2sln2n2\leq s\leq\ln^{2}n, since nκ11/(108(βK2δ+0.75)1/(δ1))n^{\kappa-1}\leq 1/(10^{8}\cdot(\beta K^{2\delta+0.75})^{1/(\delta-1)}), we have β2K4δ+1.5/n2(δ1)n(2δ1)κ\beta^{2}K^{4\delta+1.5}/n^{2(\delta-1)}\leq n^{-(2\delta-1)\kappa}, we have

For ln2n<sηn\ln^{2}n<s\leq\eta n, since η1/(108(βK2δ+0.75)1/(δ1))\eta\leq 1/(10^{8}\cdot(\beta K^{2\delta+0.75})^{1/(\delta-1)}), we get η1/(108(βK2δ)1/(δ1))\eta\leq 1/(10^{8}\cdot(\beta K^{2\delta})^{1/(\delta-1)}), and further we have βK2δηδ1δδ/(10015e1+δ/2π)\beta K^{2\delta}\eta^{\delta-1}\leq\delta^{\delta}/(100\cdot 15e^{1+\delta}/\sqrt{2\pi}) for all δ>5/4\delta>5/4. Therefore,

Now, we upperbound probability that there exists a set of constraints of size sηns\leq\eta n involving at most (Kδ)s(K-\delta)s variables by

Conclusion

In this paper, we show integrality gap lower bounds of Ω(n1/4/log3n)\Omega(n^{1/4}/\log^{3}n) for Ω(logn/loglogn)\Omega(\log n/\log\log n) levels of the Sherali-Adams+ SDP relaxation, and Ω(n2/53ϵ)\Omega(n^{2/53-\epsilon}) for nΩ(ϵ)n^{\Omega(\epsilon)} levels of the Lasserre SDP relaxation for the Densest kk-subgraph problem.

The gap instances for SA+ SDP are actually (Erdös-Renyi) random graph instances G(n,p)\mathcal{G}(n,p). We believe these instances should give Ω(n1/4ϵ)\Omega(n^{1/4-\epsilon}) gaps for even stronger relaxations – in particular higher levels of the Sherali-Adams hierarchy, with stronger SDP constraints. The sub-exponential time algorithms for Densest kk-subgraph in [BCC+10] imply that the integrality gap becomes O(n1/4ε)O(n^{1/4-\varepsilon}) after nO(ε)n^{O(\varepsilon)} levels of an LP hierarchy which is weaker than the Sherali-Adams hierarchy. In fact, these sub-exponential time algorithms were inspired by attempts to construct integrality gap lower bounds for many levels (polynomial in nn). It would be interesting to close this gap by obtaining matching integrality gap lower bounds for nΩ(ε)n^{\Omega(\varepsilon)} levels. As a further goal, one might also hope to combine the techniques used in both parts of this paper, to get Ω(n1/4ϵ)\Omega(n^{1/4-\epsilon}) gaps for polynomial levels of the Lasserre hierarchy.

References

Appendix A Random graph properties

We prove that the properties used in our gap construction hold for G(n,p)\mathcal{G}(n,p), with p=n1/2(logn)1/2p=n^{-1/2}(\log n)^{1/2}. These properties are listed in Section 3.1. In what follows fix pp to be the value above. As mentioned in Section 2.1, the phrase “with high probability” (w.h.p.) refers to ‘with probability at least 11q(n)1-\frac{1}{q(n)}’, where q(n)q(n) is an arbitrary polynomial in nn (sometimes there will be a constant depending on the polynomial).

Every vertex of GG has degree between (n1/2logn)/2(n^{1/2}\log n)/2 and 2n1/2logn2n^{1/2}\log n w.h.p.

Let uVu\in V. The degree d(u)d(u) (as a random variable) is the sum of nn i.i.d. Bernoulli random variables each having parameter p=n1/2lognp=n^{-1/2}\log n. The expected value is thus n1/2lognn^{1/2}\log n. This is logn\gg\log n, and thus by Chernoff bounds, the probability that Pr[d(u)n1/2logn>t]et2/4np(1p)<1nq(n)\Pr[|d(u)-n^{1/2}\log n|>t]\leq e^{-t^{2}/4np(1-p)}<\frac{1}{nq(n)}, for any polynomial q(n)q(n). Taking union bound gives the claim. ∎

Every pair of vertices in GG have at most 2log2n2\log^{2}n common neighbours w.h.p.

Let u,vVu,v\in V. Let XiX_{i} be a random variable which is an indicator for iΓ(u)Γ(v)i\in\Gamma(u)\cap\Gamma(v). In G(n,p)\mathcal{G}(n,p), we have E\/Xi=p2=log2n/n\mathop{\bf E\/}{X_{i}}=p^{2}=\log^{2}n/n. Thus E\/Γ(u)Γ(v)=np2=log2n\mathop{\bf E\/}{|\Gamma(u)\cap\Gamma(v)|}=np^{2}=\log^{2}n. Thus the probability that it is >2log2n>2\log^{2}n is at most elog2n/4e^{-\log^{2}n/4}. Taking union bound over all u,vu,v, we obtain that this is smaller than any polynomial. ∎

Every pair of vertices have at least one common neighbour w.h.p.

As above, consider some u,vu,v; we have E\/Γ(u)Γ(v)=np2=log2n\mathop{\bf E\/}{|\Gamma(u)\cap\Gamma(v)|}=np^{2}=\log^{2}n. Thus Pr[ΓuΓ(v)<logn]elog2n/4\Pr[|\Gamma_{u}\cap\Gamma(v)|<\log n]\leq e^{-\log^{2}n/4} (since we can use Chernoff bounds as long as the expectation logn\gg\log n). Taking union bound again implies the result. ∎

No induced subgraph on n1/2n^{1/2} vertices has density >5logn>5\log n w.h.p.

Let SVS\subseteq V of size n1/2n^{1/2}. Then E\/E(S,S)=(n1/22)p=n1/2logn/2\mathop{\bf E\/}{E(S,S)}=\binom{n^{1/2}}{2}\cdot p=n^{1/2}\log n/2. Further the variance of this quantity is (n1/22)p(1p)<n1/2logn\binom{n^{1/2}}{2}p(1-p)<n^{1/2}\log n. Thus by Chernoff bound,

Picking t=4n1/2lognt=4n^{1/2}\log n, the probability upper bound is e4n1/2logne^{-4n^{1/2}\log n}. Thus we can take a union bound over all the (nn1/2)\binom{n}{n^{1/2}} subsets SS. This proves the claim. ∎