Integrality Gaps of Linear and Semi-definite Programming Relaxations for Knapsack

Anna R. Karlin, Claire Mathieu, C. Thach Nguyen

Introduction

Many approximation algorithms work in two phases: first, solve a linear programming (LP) or semi-definite programming (SDP) relaxation; then, round the fractional solution to obtain a feasible integer solution to the original problem. This paradigm is amazingly powerful; in particular, under the unique game conjecture, it yields the best possible ratio for MaxCut{\mathsf{MaxCut}} and a wide variety of other problems, see e.g. .

However, these algorithms have a limitation. Since they are usually analyzed by comparing the value of the output to that of the fractional solution, we cannot generally hope to get a better approximation ratio than the integrality gap of the relaxation. Furthermore, for any given combinatorial optimization problem, there are many possible LP/SDP relaxations, and it is difficult to determine which relaxations have the best integrality gaps.

This has lead to efforts to provide systematic procedures for constructing a sequence of increasingly tight mathematical programming relaxations for 0-1 optimization problems. A number of different procedures of this type have been proposed: by Lovász and Schrijver , Sherali and Adams , Balas, Ceria and Cornuejols , Lasserre and others. While they differ in the details, they all operate in a series of rounds starting from an LP or SDP relaxation, eventually ending with an exact integer formulation. The strengthened relaxation after tt rounds can typically be solved in nO(t)n^{O(t)} time and, roughly, satisfies the property that the values of any tt variables in the original relaxation can be expressed as the projection of a convex combination of integer solutions.

A major line of research in this area has focused on understanding the strengths and limitations of these procedures. Of particular interest to our community is the question of how the integrality gaps for interesting combinatorial optimization problems evolve through a series of rounds of one of these procedures. On the one hand, if the integrality gaps of successive relaxations drop sufficiently fast, there is the potential for an improved approximation algorithm (see for example). On the other hand, a large integrality gap persisting for a large, say logarithmic, number of rounds rules out (unconditionally) a very wide class of efficient approximation algorithms, namely those whose output is analyzed by comparing it to the value of a class of LP/SDP relaxations. This implicitly contains most known sophisticated approximation algorithms for many problems including SparsestCut{\mathsf{SparsestCut}} and MaximumSatisifiability{\mathsf{MaximumSatisifiability}}. Indeed, serveral very strong negative results of this type have been obtained (see and others). These are also viewed as lower bounds of approximability in certain restricted models of computation.

How strong are these restricted models of computation? In other words, how much do lower bounds in these models tell us about the intrinsic hardness of the problems studied? To explore this question, we focus on one problem that is well-known to be “easy” from the viewpoint of approximability: Knapsack{\mathsf{Knapsack}}. We obtain the following results:

We show that an integrality gap close to 2 persists up to a linear number of rounds of Sherali-Adams. (The integrality gap of the natural LP is 2.)

This is interesting since Knapsack{\mathsf{Knapsack}} has a fully polynomial time approximation scheme . This confirms and amplifies what has already been observed in other contexts (e.g ): the Sherali-Adams restricted model of computation has serious weaknesses: a lower bound in this model does not necessarily imply that it is difficult to get a good approximation algorithm.

We show that Lasserre’s hierarchy closes the gap quickly. Specifically, after tt rounds of Laserre, the integrality gap decreases to t/(t1)t/(t-1).

It is known that a few rounds of Lasserre can yield better relaxations. For example, two rounds of Lasserre applied to the MaxCut{\mathsf{MaxCut}} LP yields an SDP that is at least as strong as that used by Goemans and Williamson to get the best known approximation algorithm, and the SDP in which leads to the best known approximation algorithm for SparsestCut{\mathsf{SparsestCut}} can be obtained by three rounds of Lasserre. However, to the best of our knowledge, this is the first positive result that utilizes more than a small constant number of rounds in the Lasserre hierarchy.

Many known approximation algorithms can be recognized in hindsight as starting from a natural relaxation and strengthening it using a couple of levels of lift-and-project. The original hope had been to use lift and project systems as a systematic approach to designing novel algorithms with better approximation ratios. Instead, the last few years have mostly seen the emergence of a multitude of lower bounds. Indeed, lift and project systems have been studied mostly for well known difficult problems: MaxCut{\mathsf{MaxCut}} , SparsestCut{\mathsf{SparsestCut}}, VertexCover{\mathsf{VertexCover}} , HypergraphVertexCover{\mathsf{HypergraphVertexCover}}, TSP{\mathsf{TSP}} , MaximumAcyclicSubgraph{\mathsf{MaximumAcyclicSubgraph}} , CSP{\mathsf{CSP}} , and more.

The Knapsack{\mathsf{Knapsack}} problem has a fully polynomial time approximation scheme . The natural LP relaxation (to be stated in full detail in the next section) has an integrality gap of 2ϵ2-\epsilon . Although we are not aware of previous work on using the lift and project systems for Knapsack{\mathsf{Knapsack}}, the problem of strengthening the LP relaxation via addition of well-chosen inequalities has been much the object of much interest in the past in the mathematical programming community, as stronger LP relaxations are extremely useful to speed up branch-and-bound heuristics. The knapsack polytope was studied in detail by Weismantel . Valid inequalities were studied in . In particular, whenever SS is a minimal set (w.r.to inclusion) that does not fit in the knapsack, then S{j:iS,wjwi}xjS1\sum_{S\cup\{j:\forall i\in S,w_{j}\geq w_{i}\}}x_{j}\leq|S|-1 is a valid inequality. Generalizations and variations were also studied in . Thus, in spite of the existence of a dynamic program to solve the problem, Knapsack{\mathsf{Knapsack}} is fundamental enough that understanding the polytope (and its lifted tightenings) is of intrinsic interest. In , Bienstock formulated LP with arbitrary small integrality gaps for Knapsack{\mathsf{Knapsack}} using “structural disjunctions”, and asked if the popular hierarchies reduce the gap of the Knapsack{\mathsf{Knapsack}} linear program. Our results give a negative answer for Sherali-Adams and a strong affirmative one for Lasserre.

Our results confirm the indication from for example that the Sherali-Adams lift and project is not powerful enough to be an indicator of the hardness of problems. However, it should be noted that if the problem was phrased as a decision problem and the objective function was replaced by an additional constraint of the constraint polytope, then Sherali-Adams would succeed in reducing the integrality gap; thus the choice of the initial LP formulation is critical. On the other hand, little is know about the Lasserre hierarchy, as the first negative results were about kk-CSP . Our positive result leaves open the possibility that the Lasserre hierarchy may have promise as a tool to capture the intrinsic difficulty of problems.

Preliminaries

Our focus in this paper is on the Knapsack{{\mathsf{Knapsack}}} problem. In the Knapsack{{\mathsf{Knapsack}}} problem, we are given a set of nn objects V=[n]V=[n] with sizes c1,c2,cnc_{1},c_{2},\ldots c_{n}, values v1,v2,vnv_{1},v_{2},\ldots v_{n}, and a capacity CC. We assume that for every ii, ciCc_{i}\leq C. The objective is to select a subset of objects of maximum total value such that the total size of the objects selected does not exceed CC.

The standard linear programming (LP) relaxation for Knapsack{{\mathsf{Knapsack}}} is given by:

The intended intepretation of an integral solution of this LP is obvious: xi=1x_{i}=1 means the object ii is selected, and xi=0x_{i}=0 means it is not. The constraint can be written as g(x)=Cicixi0g(x)=C-\sum_{i}c_{i}x_{i}\geq 0.

Let Greedy denote the algorithm that puts objects in the knapsack by order of decreasing ratio vi/civ_{i}/c_{i}, stopping as soon as the next object would exceed the capacity. The following lemma is folklore.

Consider an instance (C,V)(C,V) of Knapsack{{\mathsf{Knapsack}}} and its LP relaxation KK given by (1). Then

2 The Sherali-Adams and Lasserre hierarchies

We next review the lift-and-project hierarchies that we will use in this paper. The descriptions we give here assume that the base program is linear and mostly use the notation given in the survey paper by Laurent . To see that these hierarchies apply at a much greater level of generality we refer the reader to Laurent’s paper .

Let KK be a polytope defined by a set of linear constraints g1,g2,gmg_{1},g_{2},\ldots g_{m}:

We are interested in optimizing a linear objective function ff over the convex hull P=conv(K{0,1}n)P={{\mathsf{conv}}\left(K{\cap}{\left\{0,1\right\}}^{n}\right)} of integral points in KK. Here, PP is the set of convex combinations of all integral solutions of the given combinatorial problem and KK is the set of solutions to its linear relaxation. For example, if KK is defined by (1), then PP is the set of convex combinations of valid integer solutions to Knapsack{{\mathsf{Knapsack}}}.

If all vertices of KK are integral then P=KP=K and we are done. Otherwise, we would like to strengthen the relaxation KK by adding additional valid constraints. The Sherali-Adams (SA) and Lasserre hierarchies are two different systematic ways to construct these additional constraints. In the SA hierarchy, all the constraints added are linear, whereas Lasserre’s hierarchy is stronger and introduces a set of positive semi-definite constraints. However, for consistency, we will describe both hierarchies as requiring certain submatrices to be positive semi-definite (readers who are not familiar with the following formulation of SA are referred to Appendix B for a linear formulation of the hierarchy.)

To this end, we first state some notation. Throughout this paper we will use P(V){{\mathcal{P}\left(V\right)}} to denote the power set of VV, and Pt(V){{\mathcal{P}_{t}\left(V\right)}} to denote the collection of all subsets of VV whose sizes are at most tt. Also, given two sets of coordinates TT and SS, TST\subseteq S and yRSy\in R^{S}, by yT{y|_{T}} we denote the projection of yy onto TT.

Next, we review the definition of the shift operator between two vectors x,yRP(V)x,y\in R^{{{\mathcal{P}\left(V\right)}}}: xyx*y is a vector in RP(V)R^{{{\mathcal{P}\left(V\right)}}} such that

The shift operator is commutative: for any vectors x,y,zRP(V)x,y,z\in R^{{{\mathcal{P}\left(V\right)}}}, we have x(yz)=y(xz)x*(y*z)=y*(x*z).

A polynomial P(x)=IVaIiIxiP(x)=\sum_{I\subseteq V}a_{I}\prod_{i\in I}x_{i} can also be viewed as a vector indexed by subsets of VV. We define the vector PyP*y accordingly: (Py)I=JVaJyIJ.(P*y)_{I}=\sum_{J\subseteq V}a_{J}y_{I{\cup}J}.

Finally, let T\cal T be a collection of subsets of VV and yy be a vector in RTR^{\cal T}. We denote by MT(y)M_{\cal T}(y) the matrix whose rows and colums are indexed by elements of T\cal T such that

We say that a point xnx\in^{n} belongs to the tt-th Sherali-Adams polytope sat(K){{{\mathsf{sa}}^{t}\left(K\right)}} iff there exists a ySAt(K)y\in{{{\mathsf{SA}}^{t}\left(K\right)}} such that y{i}=xiy_{\left\{i\right\}}=x_{i} for all i[n]i\in[n].

We say that a point xnx\in^{n} belongs to the tt-th Lasserre polytope lat(K){{{\mathsf{la}}^{t}\left(K\right)}} if there exists a yLat(K)y\in{{{\mathsf{La}}^{t}\left(K\right)}} such that y{i}=xiy_{\left\{i\right\}}=x_{i} for all iVi\in V.

Note that MP(U)(y)M_{{{\mathcal{P}\left(U\right)}}}(y) has at most 2t2^{t} rows and columns, which is constant for tt constant, whereas MPt(V)(y)M_{{{\mathcal{P}_{t}\left(V\right)}}}(y) has (n+1t+1){n+1\choose t+1} rows and columns.

It is immediate from the definitions that sat+1(K)sat(K){{{\mathsf{sa}}^{t+1}\left(K\right)}}\subseteq{{{\mathsf{sa}}^{t}\left(K\right)}}, and lat+1(K)lat(K){{{\mathsf{la}}^{t+1}\left(K\right)}}\subseteq{{{\mathsf{la}}^{t}\left(K\right)}} for all 1tn11\leq t\leq n-1. Sherali and Adams show that san(K)=P{{{\mathsf{sa}}^{n}\left(K\right)}}=P, and Lasserre show that lan(K)=P{{{\mathsf{la}}^{n}\left(K\right)}}=P. Thus, the sequences

define hierarchies of polytopes that converge to PP. Furthermore, the Lasserre hierarchy is stronger than the Sherali-Adams hierarchy: lan(K)san(K){{{\mathsf{la}}^{n}\left(K\right)}}\subseteq{{{\mathsf{sa}}^{n}\left(K\right)}}. In this paper, we show that for the Knapsack{{\mathsf{Knapsack}}} problem, the Lasserre hierarchy is strictly stronger.

Lower bound for the Sherali-Adams hierarchy for 𝖪𝗇𝖺𝗉𝗌𝖺𝖼𝗄𝖪𝗇𝖺𝗉𝗌𝖺𝖼𝗄{\mathsf{Knapsack}}

In this section, we show that the integrality gap of the tt-th level of the Sherali-Adams hierarchy for Knapsack{{\mathsf{Knapsack}}} is close to 22. This lower bound even holds for the uniform Knapsack{{\mathsf{Knapsack}}} problem, in which vi=ci=1v_{i}=c_{i}=1 for all ii Some people call this problem Unweighted Knapsack or Subset Sum..

For every ϵ,δ>0\epsilon,\delta>0, the integrality gap at the tt-th level of the Sherali-Adams hierarchy for Knapsack{{\mathsf{Knapsack}}} where tδnt\leq\delta n is at least (2ϵ)(1/(1+δ))(2-\epsilon)(1/(1+\delta)).

Proof. (Sketch - for full proof see Appendix A.) Consider the instance KK of Knapsack{{\mathsf{Knapsack}}} with nn objects where ci=vi=1c_{i}=v_{i}=1 for all iVi\in V and capacity C=2(1ϵ)C=2(1-\epsilon). Then the optimal integer value is 1. On the other hand, we claim that the vector yy where y=1y_{\emptyset}=1, y{i}=C/(n+(t1)(1ϵ))y_{\left\{i\right\}}=C/(n+(t-1)(1-\epsilon)) and yI=0y_{I}=0 for all I>1|I|>1 is in SAt(K){{{\mathsf{SA}}^{t}\left(K\right)}}. Thus, the integrality gap of the ttth round of Sherali-Adams is at least Cn/(n+(t1)(1ϵ))Cn/(n+(t-1)(1-\epsilon)), which is at least (2ϵ)(1/(1+δ))(2-\epsilon)(1/(1+\delta)) when tδn.t\leq\delta n. qed

A decomposition theorem for the Lasserre hierarchy

In this section, we develop the machinery we will need for our Lasserre upper bounds. It turns out that it is more convenient to work with families (zX)(z^{X}) of characteristic vectors rather than directly with yy. We begin with some definitions and basic properties.

Let T{\cal T} be a collection of subsets of VV and let yy be a vector indexed by sets of T\cal T. We define the extension of yy to be the vector yy^{\prime}, indexed by all subsets of VV, such that yIy^{\prime}_{I} equals yIy_{I} if ITI\in\cal T and equals otherwise.

Let SS be a subset of VV and XX a subset of SS. We define the characteristic polynomial PXP^{X} of XX with respect to SS as

Let yy^{\prime} be a vector indexed by all subsets of VV. Let SS be a subset of VV and, for each XX subset of SS, let zX=PXyz^{X}=P^{X}*y^{\prime}:

Then y=XSzXy^{\prime}=\sum_{X\subseteq S}z^{X}.

Proof. Fix a subset II of VV. Substituting the definition of zIXz^{X}_{I} in XSzIX\sum_{X\subseteq S}z^{X}_{I}, and changing the index of summation, we get

For AA\neq\emptyset the inner sum is 0, so only the term for A=A=\emptyset, which equals yIy^{\prime}_{I}, remains. qed

Let yy^{\prime} be a vector indexed by all subsets of VV, SS be a subset of VV and XX be a subset of SS. Then

Proof. Let I=IXI^{\prime}=I\setminus X and I=IXI^{\prime\prime}=I{\cap}X. Using the definition of zIXz^{X}_{I} and noticing that XI=XX\cup I^{\prime\prime}=X yields zIX=zIXz^{X}_{I}=z^{X}_{I^{\prime}}. This immediately implies that for IXI\subseteq X, zIX=zXz^{X}_{I}=z^{X}_{\emptyset}.

Finally, consider a set II that intersects SXS\setminus X and let iI(S\X)i\in I{\cap}(S\backslash X). In the definition of zIXz^{X}_{I}, we group the terms of the sum into pairs consisting of JJ such that iJi\notin J and of J{i}J\cup\{i\}. Since I=I{i}I=I\cup\{i\}, we obtain:

Let yy^{\prime} be a vector indexed by all subsets of VV, SS be a subset of VV and XX be a subset of SS. Let wXw^{X} be defined as zX/zXz^{X}/z^{X}_{\emptyset} if zX0z^{X}_{\emptyset}\neq 0 and defined as 0 otherwise. Then, if zX0z^{X}_{\emptyset}\neq 0, then w{i}Xw^{X}_{\{i\}} equals 1 for elements of XX and 0 for elements of SXS\setminus X.

Let SS be an arbitrary subset of VV and T\cal T be a collection of subsets of VV. We say that T\cal T is closed under shifting by S if

The following lemma generalizes Lemma 5 in . It proves that the positive-semidefinite property carries over from yy to (zX)(z^{X}).

Let SS be an arbitrary subset of VV and T\cal T be a collection of subsets of VV that is closed under shifting by SS. Let yy be a vector indexed by sets of T\cal T. Then

Proof. Since MT(y)0M_{\cal T}(y){\succeq 0}, there exist vectors vIv_{I}, ITI\in{\cal T}, such that vI,vJ=yIJ{\langle v_{I},v_{J}\rangle}=y_{I{\cup}J}. Fix a subset XX of SS. For each ITI\in{\cal T}, let

which is well-defined since T\cal T is closed under shifting by SS.

Let I,JTI,J\in\cal T. It is easy to check that wI,wJ=(zX)IJ{\langle w_{I},w_{J}\rangle}=(z^{X})_{I{\cup}J}. Indeed,

by definition of vI,vJv_{I},v_{J} and since T\cal T is closed under shifting by SS (so that this is well-defined). Consider a non-empty subset HH of SXS\setminus X and let iHi\in H. We group the terms of the inner sum into pairs consisting of LL such that iLi\notin L and of L{i}L\cup\{i\}. Since H=H{i}H=H\cup\{i\}, we obtain:

This implies that MT(zX)0M_{\cal T}(z^{X}){\succeq 0}. qed

In the rest of the section, we prove a decomposition theorem for the Lasserre hierarchy, which allows us to “divide” the action of the hierarchy and think of it as using the first few rounds on some subset of variables, and the other rounds on the rest. We will use this theorem to prove that the Lasserre hierarchy closes the gap for the Knapsack{{\mathsf{Knapsack}}} problem in the next section.

Let t>1t>1 and yLat(K)y\in{{{\mathsf{La}}^{t}\left(K\right)}}. Let k<tk<t and SS be a subset of VV and such that

Consider the projection yP2t2k(V){y|_{{{\mathcal{P}_{2t-2k}\left(V\right)}}}} of yy to the coordinates corresponding to subsets of size at most 2t2k2t-2k of VV. Then there exist subsets X1,X2,,XmX_{1},X_{2},\ldots,X_{m} of SS such that yP2t2k(V){y|_{{{\mathcal{P}_{2t-2k}\left(V\right)}}}} is a convex combination of vectors wXiw^{X_{i}} with the following properties:

w^{X_{i}}_{\{j\}}=\left\{\begin{array}[]{ll}1&\text{ if }j\in X_{i}\\ 0&\text{ if }j\in S\setminus X_{i};\end{array}\right.

wXiLatk(K)w^{X_{i}}\in{{{\mathsf{La}}^{t-k}\left(K\right)}}; and

if KiK_{i} is obtained from KK by setting xj=w{j}Xix_{j}=w^{X_{i}}_{\{j\}} for jSj\in S, then wXiP2t2k(V\S)Latk(Ki){w^{X_{i}}|_{{{\mathcal{P}_{2t-2k}\left(V\backslash S\right)}}}}\in{{{\mathsf{La}}^{t-k}\left(K_{i}\right)}}.

To prove Theorem 13, we will need a couple more lemmas. In the first one, using assumption (5), we extend the positive semi-definite properties from yy to yy^{\prime}, and then, using Lemma 12, from yy^{\prime} to zXz^{X}.

where MM is a principal submatrix of MPt(V)(y)M_{{{\mathcal{P}_{t}\left(V\right)}}}(y).Thus M0M{\succeq 0}, and so MT1(y)0M_{{\cal T}_{1}}(y^{\prime}){\succeq 0}.

Observe that T1{\cal T}_{1} is closed under shifting by SS. By definition of zXz^{X} and Lemma 12, we thus get MT1(zX)0M_{{\cal T}_{1}}(z^{X}){\succeq 0}.

Let t,y,S,kt,y,S,k be defined as in Theorem 13, and yy^{\prime} be the extension of yy. Then for any XSX\subseteq S:

If zX=0z^{X}_{\emptyset}=0 then zIX=0z^{X}_{I}=0 for all I2t2k|I|\leq 2t-2k.

Proof. Let T1{\cal T}_{1} be defined as in Lemma 14. By Lemma 14 MT1(zX)0M_{{\cal T}_{1}}(z^{X}){\succeq 0} and zXz^{X}_{\emptyset} is a diagonal element of this matrix, hence zX0z^{X}_{\emptyset}\geq 0.

For the second part, start by considering JVJ\subseteq V of size at most tkt-k. Then JT1J\in{\cal T}_{1}, and so the matrix M{,J}(zX)M_{\left\{\emptyset,J\right\}}(z^{X}) is a principal submatrix of MT1(zX)M_{{\cal T}_{1}}(z^{X}), hence is also positive semidefinite. Since zX=0z^{X}_{\emptyset}=0,

Now consider any IVI\subseteq V such that I2t2k|I|\leq 2t-2k, and write I=I1I2I=I_{1}{\cup}I_{2} where I1tk|I_{1}|\leq t-k and I2tk|I_{2}|\leq t-k. M{I1,I2}(zX)M_{\left\{I_{1},I_{2}\right\}}(z^{X}) is a principal submatrix of MT1(zX)M_{{\cal T}_{1}}(z^{X}), hence is also positive semidefinite. Since zI1X=zI2X=0z^{X}_{I_{1}}=z^{X}_{I_{2}}=0, Since

We now have what we need to prove Theorem 13.

Proof of Theorem 13. By definition, Lemma 8 and the second part of Lemma 15, we have

By Lemma 8 and by definition of yy, we have XSzX=y=1\sum_{X\subseteq S}{z^{X}_{\emptyset}}=y_{\emptyset}=1, and the terms are non-negative by the first part of Lemma 15, so yP2t2k(V){y|_{{{\mathcal{P}_{2t-2k}\left(V\right)}}}} is a convex combination of wXw^{X}’s, as desired.

By Lemma 9, wI{j}Xi=wIXiw^{X_{i}}_{I{\cup}{\left\{j\right\}}}=w^{X_{i}}_{I} for jXij\in X_{i} and wI{j}Xi=0w^{X_{i}}_{I{\cup}{\left\{j\right\}}}=0 for jS\Xij\in S\backslash X_{i}. The claim follows. qed

Upper bound for the Lasserre hierarchy for 𝖪𝗇𝖺𝗉𝗌𝖺𝖼𝗄𝖪𝗇𝖺𝗉𝗌𝖺𝖼𝗄{\mathsf{Knapsack}}

In this section, we use Theorem 13 to prove that for the Knapsack{{\mathsf{Knapsack}}} problem the gap of Lat(K){{{\mathsf{La}}^{t}\left(K\right)}} approaches 11 quickly as tt grows, where KK is the LP relaxation of (1). First, we show that there is a set SS such that every feasible solution in Lat(K){{{\mathsf{La}}^{t}\left(K\right)}} satisfies the condition of the Theorem.

Given an instance (C,V)(C,V) of Knapsack{{\mathsf{Knapsack}}}, Let OPT(C,V)OPT(C,V) denote the value of the optimal integral solution.

Consider an instance (C,V)(C,V) of Knapsack{{\mathsf{Knapsack}}} and its linear programming relaxation KK given by (1). Let t>1t>1 and yLat(K)y\in{{{\mathsf{La}}^{t}\left(K\right)}}. Let k<tk<t and S={iVvi>OPT(C,V)/k}S={\left\{i\in V{\left|v_{i}>{OPT(C,V)}/{k}\right.}\right\}}. Then:

Proof. There are three cases depending on the size of II:

It1|I|\leq t-1. Recall the capacity constraint g(x)=CiVcixi0g(x)=C-\sum_{i\in V}c_{i}x_{i}\geq 0. On the one hand, since MPt1(V)(gy)0M_{{{\mathcal{P}_{t-1}\left(V\right)}}}(g*y){\succeq 0}, the diagonal entry (gy)I(g*y)_{I} must be non-negative. On the other hand, writing out the definition of (gy)I(g*y)_{I} and noting that the coefficients cic_{i} are all non-negative, we infer (gy)ICyI(iIci)yI(g*y)_{I}\leq Cy_{I}-\left(\sum_{i\in I}c_{i}\right)y_{I}. But by assumption, iIci>C\sum_{i\in I}c_{i}>C. Thus we must have yI=0y_{I}=0.

tI2t2t\leq|I|\leq 2t-2. Write I=I1I2=II=I_{1}{\cup}I_{2}=I with I1,I2t1|I_{1}|,|I_{2}|\leq t-1 and I1Sk|I_{1}{\cap}S|\geq k. Then yI1=0y_{I_{1}}=0. Since MPt(y)0M_{{{\mathcal{P}_{t}\left(y\right)}}}{\succeq 0}, its 2-by-2 principal submatrix M{I1,I2}(y)M_{\left\{I_{1},I_{2}\right\}}(y) must also be positive semi-definite.

and it is easy to check that we must then have yI=0y_{I}=0.

2t1I2t2t-1\leq|I|\leq 2t. Write I=I1I2=II=I_{1}{\cup}I_{2}=I with I1,I2t|I_{1}|,|I_{2}|\leq t and I1Sk|I_{1}{\cap}S|\geq k. Then yI1=0y_{I_{1}}=0 since t2t2t\leq 2t-2 for all t2t\geq 2. By the same argument as in the previous case, we must then have yI=0y_{I}=0.

The following theorem shows that the integrality gap of the ttht^{th} level of the Lasserre hierarchy for Knapsack{{\mathsf{Knapsack}}} reduces quickly when tt increases.

Consider an instance (C,V)(C,V) of Knapsack{{\mathsf{Knapsack}}} and its LP relaxation KK given by (1). Let t2t\geq 2. Then

and so the integrality gap at the tt-th level of the Lasserre hierarchy is at most 1+1/(t1)1+1/(t-1).

Proof. Let S={iVvi>OPT(C,V)/(t1)}S={\left\{i\in V{\left|v_{i}>{OPT(C,V)}/{(t-1)}\right.}\right\}}. Let yLat(K)y\in{{{\mathsf{La}}^{t}\left(K\right)}}. If ISt1|I{\cap}S|\geq t-1, then the elements of ISI\cap S have total value greater than OPT(C,V)OPT(C,V), so they must not be able to fit in the knapsack: their total capacity exceeds CC, and so by Lemma 16 we have yI=0y_{I}=0. Thus the condition of Theorem 13 holds for k=t1k=t-1.

Therefore, yP2(V){y|_{{{\mathcal{P}_{2}\left(V\right)}}}} is a convex combination of wXiw^{X_{i}} with XiSX_{i}\subseteq S, thus Value(y)maxiValue(wXi)\text{Value}(y)\leq\max_{i}\text{Value}(w^{X_{i}}). By the first and third properties of the Theorem, we have:

By the nesting property of the Lasserre hierarchy, Lemma 1, and definition of SS,

By the second property of the Theorem, wXiw^{X_{i}} is in Latk(K)K{{{\mathsf{La}}^{t-k}\left(K\right)}}\subseteq K, so it must satisfy the capacity constraint, so iXiciiIciC\sum_{i\in X_{i}}c_{i}\leq\sum_{i\in I}c_{i}\leq C, so XiX_{i} is feasible. Thus:

The first expression in the right hand side is equal to OPT(C,V)OPT(C,V), hence the Theorem. qed

Conclusion

We have shown that for Knapsack{{\mathsf{Knapsack}}}, an integrality gap of 2ϵ2-\epsilon persists up to a linear number of rounds in the Sherali-Adams hierarchy. This broadens the class of problems for which Sherali-Adams is not strong enough to capture the instrinsic difficulity of problems.

On the other hand, our positive result for Lasserre opens the posibility that lower bounds in the Lasserre hierarchy good indicators of the intrinsic dificulty of the problem, thus encourages more investigation on the effect of the hierarchy on “easy” problems (SpanningTree{\mathsf{SpanningTree}}, BinPacking{\mathsf{BinPacking}}, etc.)

Acknowledgement

Clare Mathieu would like to thank Eden Chlamtac for stimulating discussions.

References

Appendix

Appendix A Full proof of Theorem 5

Proof. Let t2t\geq 2. Consider the instance KK of Knapsack{{\mathsf{Knapsack}}} with nn objects where ci=vi=1c_{i}=v_{i}=1 for all iVi\in V and capacity C=2(1ϵ)C=2(1-\epsilon). Let α=C/(n+(t1)(1ϵ))\alpha={C}/({n+(t-1)(1-\epsilon)}) and consider the vector yPt(V)y\in^{{{\mathcal{P}_{t}\left(V\right)}}} defined by

We claim that ySAt(K)y\in{{{\mathsf{SA}}^{t}\left(K\right)}}. Consider any subset UVU\subseteq V such that Ut|U|\leq t. We have

Since Ut<n|U|\leq t<n, Uα1|U|\alpha\leq 1, and it is easy to see that this implies MP1(U)(y)0M_{{{\mathcal{P}_{1}\left(U\right)}}}(y){\succeq 0}, and so MP(U)(y)0M_{{{\mathcal{P}\left(U\right)}}}(y){\succeq 0}.

Next, let g(x)=CiVcixig(x)=C-\sum_{i\in V}c_{i}x_{i} and consider any subset WVW\subseteq V such that Wt1|W|\leq t-1. Again, we have

Since Wt1|W|\leq t-1, by definition of α\alpha we have W(C1)αCnα|W|(C-1)\alpha\leq C-n\alpha, and it is easy to see that this implies MP1(W)(gy)0M_{{{\mathcal{P}_{1}\left(W\right)}}}(g*y){\succeq 0}, and so MP(W)(gy)0M_{{{\mathcal{P}\left(W\right)}}}(g*y){\succeq 0}. Thus ySAt(K)y\in{{{\mathsf{SA}}^{t}\left(K\right)}}.

The integer optimum has value 1, so the integrality gap is at least the value of yy, which is nα=2(1ϵ)/(1+(t1)(12ϵ)/n)n\alpha=2(1-\epsilon)/(1+(t-1)(1-2\epsilon)/n). The supremum over all ϵ\epsilon is 2/(1+(t1)/n)2/(1+(t-1)/n), and the supremum of that over all nn is 2, so the integrality gap is at least 2.

On the other hand, it is well-known that the base linear program KK has value at most 2OPT2OPT (that is an immediate consequence of Lemma 1), hence, by the nesting property, every linear program in the hierarchy has integrality gap exactly equal to 2. qed

Appendix B A linear formulation of the Sherali-Adams hierarchy

If xx is indeed integral, then xik=xix_{i}^{k}=x_{i} for any k1k\geq 1. Thus, the constraint obtained by expanding (6) and replacing xikx_{i}^{k} by xix_{i} holds in PP and can be added to strengthen the relaxation. However, this constraint is not linear. To preserve the linearity of the system, each product iIxi\prod_{i\in I}x_{i} is replaced by a variable yIy_{I}.

In addition, to keep the number of variables from growing exponentially, we restrict ourselves to only variables yIy_{I} such that It|I|\leq t. By this, we “lift” the polytope KK to a polytope SAt(K)Pt(V){{{\mathsf{SA}}^{t}\left(K\right)}}\subseteq^{{{\mathcal{P}_{t}\left(V\right)}}}.

Let KK be a polytope defined as in equation 2. For any 1tn1\leq t\leq n, the tt-th Sherali-Adams lifted polytope SAt(K){{{\mathsf{SA}}^{t}\left(K\right)}} is defined by

expanding the result and replacing each xikx_{i}^{k} by xix_{i}; and

replacing each iSxi\prod_{i\in S}x_{i} by ySy_{S}.

We say that a point xVx\in^{V} belongs to the tt-th Sherali-Adams polytope sat(K){{{\mathsf{sa}}^{t}\left(K\right)}} iff there exists a ySAt(K)y\in{{{\mathsf{SA}}^{t}\left(K\right)}} such that y{i}=xiy_{\left\{i\right\}}=x_{i} for all iVi\in V.

In particular, in the case of Knapsack{{\mathsf{Knapsack}}}, SAt(K){{{\mathsf{SA}}^{t}\left(K\right)}} is the set of all points in Pt(V)^{{{\mathcal{P}_{t}\left(V\right)}}} that satisfy the following constraints for any I,JVI,J\subseteq V such that IJ=I{\cap}J=\emptyset and I+Jt1|I|+|J|\leq t-1:

For a proof that this definition is equivalent to Definition 3, we refer the reader to Laurent’s paper .