AM with Multiple Merlins

Scott Aaronson, Russell Impagliazzo, Dana Moshkovitz

Introduction

The PCP\mathsf{PCP} characterization of NP\mathsf{NP} , with the resulting hardness of approximation results, is one of the great achievements of computational complexity. Leading up to this work was another landmark result, the 1991 theorem of Babai, Fortnow, and Lund that MIP=NEXP\mathsf{MIP}=\mathsf{NEXP}, where MIP\mathsf{MIP} is Multi-Prover Interactive Proofs and NEXP\mathsf{NEXP} is Nondeterministic Exponential Time. Both of these results can be paraphrased as characterizing the hardness of a certain computational problem from game theory: estimating the value of a two-player cooperative game with simultaneous moves. Such games are known in the complexity community as two-prover games, and in the quantum information community as nonlocal games. From now on, we will use the term two-prover games.

finite question sets X,YX,Y and answer sets A,BA,B,

a probability distribution D\mathcal{D} over question pairs (x,y)X×Y\left(x,y\right)\in X\times Y, and

a verification function V:X×Y×A×B[0,1]V:X\times Y\times A\times B\rightarrow\left[0,1\right].In most of the actual games we will consider, VV will take values in {0,1}\left\{0,1\right\} only. However, the possibility of real VV is needed for full generality.

The value of the game, denoted ω(G)\omega\left(G\right), is the maximum, over all pairs of response functions a:XAa:X\rightarrow A and b:XBb:X\rightarrow B, of

The interpretation is this: the game GG involves a verifier/referee Arthur, as well as two cooperating provers Merlin1 and Merlin2, who can agree on a strategy in advance but cannot communicate once the game starts. First Arthur chooses a pair of questions (x,y)\left(x,y\right) from D\mathcal{D}, and sends xx to Merlin1 and yy to Merlin2. The Merlins then send back responses a=a(x)a=a\left(x\right) and b=b(y)b=b\left(y\right) respectively.Because of convexity, we can assume without loss of generality that both Merlins use deterministic strategies. Finally, Arthur declares the Merlins to have “won” with probability equal to V(x,y,a,b)V\left(x,y,a,b\right). Then ω(G)\omega\left(G\right) is just the probability that the Merlins win if they use an optimal strategy.

It is not hard to show that computing the exact value of a two-prover game is NP\mathsf{NP}-hard. The PCP Theorem can be interpreted as saying that even to approximate the value to within an additive constant is also NP\mathsf{NP}-hard. To make this precise, we can define the classes PCP\mathsf{PCP} and MIP\mathsf{MIP} as those decision problems polynomial-time reducible to approximating the value of a two-prover game. The difference between the classes is that for PCP’s, the reduction computes an explicit description of the game, whereas for MIP\mathsf{MIP}, the description is implicit.

To be more precise, we start with a decision problem LL. Given an instance II of LL, a reduction constructs a two-prover game GIG_{I} with the following properties:

(Completeness) If ILI\in L then ω(GP)2/3\omega\left(G_{P}\right)\geq 2/3.

(Soundness) If I∉LI\not\in L then ω(GP)1/3\omega\left(G_{P}\right)\leq 1/3.

(Efficiency) In the “explicit” case, the sets X,Y,A,BX,Y,A,B can be generated in time polynomial in n=In=\left|I\right|, the distribution D\mathcal{D} can be described in polynomial time as the uniform distribution over some subset of X×YX\times Y, and and the verification procedure V(x,y,a,b)V(x,y,a,b) can be generated in polynomial time as a table of size X×Y×A×B\left|X\right|\times\left|Y\right|\times\left|A\right|\times\left|B\right|. In the “implicit” case, X,Y,A,BX,Y,A,B are sets of poly(n)\operatorname*{poly}\left(n\right)-bit strings, D\mathcal{D} can be described as a probabilistic polynomial-time sampling procedure that returns a pair (x,y)X×Y(x,y)\in X\times Y, and the verification function V(x,y,a,b)V(x,y,a,b) can be computed in polynomial time.

The class PCP\mathsf{PCP} then consists of all decision problems that can be reduced explicitly to two-prover games, while MIP\mathsf{MIP} consists of all decision problems that can be reduced implicitly to two-prover games. As frequently happens, switching from explicit to implicit representations causes us to “jump up” in complexity by an exponential. The dual theorems PCP=NP\mathsf{PCP}=\mathsf{NP} and MIP=NEXP\mathsf{MIP}=\mathsf{NEXP} bear out this general pattern.

The hardness of approximating two-prover games can in turn be used to show hardness of approximation for many constraint-satisfaction problems. Better trade-offs in the parameters of the reduction and specific kinds of verification procedure give tighter hardness of approximation results for a wide variety of particular combinatorial optimization problems. So the study of two-prover games did not end with the PCP Theorem.

In this paper, we consider the following restriction of two-prover games:

What if we demand that Arthur’s challenges to Merlin1 and Merlin2 be independent? In other words, what if the distribution D\mathcal{D} is simply the uniform distribution over X×YX\times Y?We could also let D\mathcal{D} be an arbitrary product distribution, but we don’t gain any interesting generality that way: Arthur might as well just send Merlin1 and Merlin2 the uniform random bits he would’ve used to generate xXx\in X and yYy\in Y respectively.

In the PCP literature, two-prover games where D\mathcal{D} is uniform over X×YX\times Y are called “free” games, and have sometimes been studied as an easier-to-analyze special case of general games . Free games are also tightly connected to dense instances of constraint satisfaction problems. In this paper, we consider approximating the values of free games as an interesting computational problem in its own right, and one that has not received explicit attention. As far as we know, we are the first to study the complexity of this problem directly, and to formulate complexity classes of problems reducible to free games.

In more detail, the restriction to free games gives us an analogue of “public-coin” protocols in the single-prover interactive proof setting. This corresponds to the original definition of the class AM\mathsf{AM}, so we use a version of AM\mathsf{AM} notation. We consider AM(2)\mathsf{AM}\left(2\right), or two-prover Arthur-Merlin: the class of all languages that admit two-prover, two-round interactive proof systems, in which Arthur’s challenges to the Merlins are independent, uniformly-random poly(n)\operatorname*{poly}\left(n\right)-bit strings. In other words, AM(2)\mathsf{AM}(2) is the class of problems implicitly reducible to approximating the value of a free game. Clearly

We want to know: what is the true power of AM(2)\mathsf{AM}\left(2\right)? Is it more powerful than single-prover AM\mathsf{AM}? Is it less powerful than MIP\mathsf{MIP}? We will also be interested in the complexity of approximating the values of explicit free games.

As we’ll discuss in Section 4, an additional motivation to study AM(2)\mathsf{AM}\left(2\right) comes from difficult analogous questions about quantum multi-prover proof systems, and specifically about the quantum complexity class QMA(2)\mathsf{QMA}\left(2\right). Our results could shed light on QMA(2)\mathsf{QMA}\left(2\right), by showing how many questions about it get resolved in a simpler “classical model situation.”

Our Results

We have two main sets of results: upper bounds, showing that the value of a free game can be approximated in quasipolynomial time and translating that into complexity class containments; and hardness results, giving almost matching lower bounds for this problem under the Exponential Time Hypothesis (ETH). Thus, assuming only the ETH, we show both that free games are exponentially easier than general two-prover games, and also they still remain nontrivial, out of reach for polynomial-time algorithms.

Let FreeGameε be the problem of approximating the value of a free game to error ±ε\pm\varepsilon:

Given as input a description of a free game G=(X,Y,A,B,V)G=\left(X,Y,A,B,V\right), estimate ω(G)\omega\left(G\right) to within additive error ±ε\pm\varepsilon. (Here nn, the input size, is XYAB\left|X\right|\left|Y\right|\left|A\right|\left|B\right|, and ε\varepsilon is an arbitrarily small constant if not specified explicitly.)

We give a quasipolynomial-time algorithm for FreeGameε:

FreeGameε is solvable in deterministic time nO(ε2logn)n^{O(\varepsilon^{-2}\log n)}.

While this is the first algorithm explicitly for FreeGameε, there is some directly-related algorithmic work. After learning of our results (but before this paper was written), Brandao and Harrow gave an algorithm for FreeGameε with the same running time as ours, but using interestingly different techniques. (Our algorithm is purely combinatorial, whereas theirs uses linear programming relaxation.) Also, Barak et al. gave a quasipolynomial-time approximation algorithm for the related problem of approximating the values of dense CSPs with polynomial-sized alphabets.

In the implicit setting, Theorem 3 implies that AM(2)EXP\mathsf{AM}\left(2\right)\subseteq\mathsf{EXP}, which improves on the trivial upper bound of NEXP\mathsf{NEXP}. However, by building on the result of Barak et al. mentioned above, we are able to prove a stronger result, which completely characterizes AM(2)\mathsf{AM}\left(2\right):

We can even generalize Theorem 4 to handle any polynomial number of Merlins:

AM(k)=AM\mathsf{AM}\left(k\right)=\mathsf{AM} for all k=poly(n)k=\operatorname*{poly}\left(n\right).

Thus, in the complexity class setting, it is really the correlation between queries that makes multiple provers more powerful than a single prover.

2 Hardness Results

Seeing just the above, one might conjecture that the values of free games are approximable in polynomial time. But surprisingly, we give strong evidence that this is not the case.

To show the power of free games, we give a nontrivial reduction from 3Sat to FreeGame. Equivalently, we show that there exists a nontrivial AM(2)\mathsf{AM}\left(2\right) protocol: even if Arthur’s challenges are completely independent, two Merlins can be more helpful to him than one Merlin. In particular, given a 3Sat instance φ\varphi, let the size of φ\varphi be the number of variables plus the number of clauses. Then:

For some constant ε>0\varepsilon>0, there exists a reduction running in time 2O~(n)2^{\widetilde{O}(\sqrt{n})} that maps 3Sat instances of size nn to FreeGameε instances of size 2O~(n)2^{\widetilde{O}(\sqrt{n})} (where the O~\widetilde{O} hides polylogarithmic factors).

In other words, there is a protocol whereby Arthur can check that a 3Sat instance of size nn is satisfiable, by exchanging only O~(n)\widetilde{O}(\sqrt{n}) bits with the Merlins—i.e., sending O~(n)\widetilde{O}(\sqrt{n})-bit challenges and receiving O~(n)\widetilde{O}(\sqrt{n})-bit responses. The protocol has perfect completeness and a 11 vs. 1ε1-\varepsilon completeness/soundness gap, for some fixed constant ε>0\varepsilon>0. Since the first step we use is the PCP Theorem, by composing our main protocol with various PCP constructions, we can get reductions with different quantitative tradeoffs between reduction time, completeness, soundness, and alphabet size.

One corollary of Theorem 6 is that, if FreeGame is in P\mathsf{P}, then 3Sat is in TIME(2O~(n))\mathsf{TIME}(2^{\widetilde{O}(\sqrt{n})}). Since 3Sat is complete under quasilinear-time reductions for NTIME(n)\mathsf{NTIME}(n), the same holds for any problem in nondeterministic linear time. As a second corollary, we get a lower bound on the time to approximate FreeGame assuming the ETH. This lower bound almost matches the upper bounds described in Section 2.1. To be more precise, recall the Exponential Time Hypothesis (ETH) of Impagliazzo and Paturi :

Any deterministic algorithm for 3Sat requires 2Ω(n)2^{\Omega\left(n\right)} time. (There is also the Randomized ETH, which says the same for bounded-error randomized algorithms.)

Assuming the (randomized) ETH, any (randomized) algorithm for FreeGameε requires nΩ~(ε1logn)n^{\widetilde{\Omega}(\varepsilon^{-1}\log n)} time, for all ε1/n\varepsilon\geq 1/n bounded below some constant.

Again, by considering various PCP constructions, we get a variety of hardness results for many interesting versions and ranges of parameters for the FreeGame problem.

We can further reduce FreeGame to the problem of approximating dense CSPs, where an arity kk CSP is considered dense if it contains constraints for a constant fraction of all kk-tuples of variables. We thus get the following hardness result for dense CSPs.

Assuming the ETH, the problem of approximating a dense kk-CSP (constraint satisfaction problem) with a polynomial-size alphabet, to constant additive error, requires nΩ~(logn)n^{\widetilde{\Omega}\left(\log n\right)} time, for any k2k\geq 2.

Corollary 9 almost matches the upper bound of Barak et al. , explaining for the first time why Barak et al. were able to give a quasipolynomial-time algorithm for approximating dense CSPs, but not a polynomial-time one.

As another application of our hardness result for FreeGame, Brandao and Harrow were recently able to use it to prove that approximating the values of certain entangled games requires nΩ~(logn)n^{\widetilde{\Omega}\left(\log n\right)} time, assuming the ETH.See for the precise definition of the entangled games they consider. Briefly, though, the games involve a large number of provers, of whom two are selected at random to receive challenges (the other provers are ignored).

Detailed Overview of Results

We now proceed to more detailed overview of our results and the techniques used to prove them. Here, as in the technical part of the paper, we first describe our hardness results for FreeGame (or equivalently, AM(2)\mathsf{AM}\left(2\right) protocols for 3Sat), and then our approximation algorithms (or equivalently, limitations of AM(k)\mathsf{AM}\left(k\right) protocols).

The idea of our 3Sat protocol is simple. First Arthur transforms the 3Sat instance φ\varphi into a PCP, so that it’s either satisfiable or far from satisfiable. For this to work, we need a highly-efficient PCP theorem, which produces instances of near-linear size. Fortunately, such PCP theorems are now known. Depending on the desired parameters, we will use either the theorem of Dinur (which produces 3Sat instances of size npolylognn\operatorname*{polylog}n with a small constant completeness/soundness gap), or that of Moshkovitz and Raz (which produces 22-CSP instances of size n2(logn)1Ω(1)n\cdot 2^{\left(\log n\right)^{1-\Omega(1)}} with completeness/soundness gap arbitrarily close to 11).

Suppose for now that we use the PCP theorem of Dinur . Then next, Arthur runs a variant of the so-called clause/variable game, which we define below.

Given a 3Sat instance φ\varphi, consisting of nn variables x1,,xnx_{1},\ldots,x_{n} and mm clauses C1,,CmC_{1},\ldots,C_{m}, the clause/variable game GφG_{\varphi} is defined as follows. Arthur chooses an index i[m]i\in\left[m\right] uniformly at random, then chooses j[n]j\in\left[n\right] uniformly at random conditioned on xjx_{j} or xj\urcorner x_{j} appearing in CiC_{i} as a literal. He sends ii to Merlin1 and jj to Merlin2. Arthur accepts if and only if

Merlin1 sends back a satisfying assignment to the variables in CiC_{i}, and

Merlin2 sends back a value for xjx_{j} that agrees with the value sent by Merlin1.

Let SAT(φ)[0,1]\operatorname*{SAT}\left(\varphi\right)\in\left[0,1\right] be the maximum fraction of clauses of φ\varphi that can be simultaneously satisfied. Then clearly the clause/variable game has perfect completeness: that is, if SAT(φ)=1\operatorname*{SAT}\left(\varphi\right)=1 then ω(Gφ)=1\omega\left(G_{\varphi}\right)=1. The following well-known proposition shows that the game also has constant soundness.

If SAT(φ)1ε\operatorname*{SAT}\left(\varphi\right)\leq 1-\varepsilon, then ω(Gφ)1ε/3\omega\left(G_{\varphi}\right)\leq 1-\varepsilon/3.

Proof. Assume without loss of generality that Merlin2 answers according to a particular assignment x=(x1,,xn)x=\left(x_{1},\ldots,x_{n}\right). By hypothesis, xx violates the clause CiC_{i} with probability at least ε\varepsilon over ii. And if xx violates CiC_{i}, then regardless of what Merlin1 does, Arthur rejects with probability at least 1/31/3—since Merlin1’s assignment to CiC_{i} either violates CiC_{i}, or else disagrees with xx (which Arthur detects with probability at least 1/31/3 over the variable sent to Merlin2).

Also, given any two-prover game G=(X,Y,A,B,D,V)G=\left(X,Y,A,B,\mathcal{D},V\right), let GkG^{k} be the kk-fold parallel repetition of GG: that is, the game where Arthur

draws (x1,y1),,(xk,yk)\left(x_{1},y_{1}\right),\ldots,\left(x_{k},y_{k}\right) independently from D\mathcal{D},

sends x1,,xkx_{1},\ldots,x_{k} to Merlin1 and y1,,yky_{1},\ldots,y_{k} to Merlin2,

receives responses a1,,akAa_{1},\ldots,a_{k}\in A from Merlin1 and b1,,bkBb_{1},\ldots,b_{k}\in B from Merlin2, and then

accepts with probability equal to i=1kV(xi,yi,ai,bi)\prod_{i=1}^{k}V\left(x_{i},y_{i},a_{i},b_{i}\right).

Then the famous Parallel Repetition Theorem asserts that ω(Gk)\omega(G^{k}) decreases exponentially with kk:

If ω(G)1ε\omega\left(G\right)\leq 1-\varepsilon, then

Unfortunately, neither the original clause/variable game GφG_{\varphi}, nor its parallel repetition GφkG_{\varphi}^{k}, work in the setting of AM[2]\mathsf{AM}\left[2\right]. For both games rely essentially on correlation between the clause(s) sent to Merlin1 and the variable(s) sent to Merlin2. To eliminate the need for correlation, we use a new form of repetition that we call birthday repetition.

2 Approximation Algorithms for Free Games

Our second set of results aims at showing that a square-root savings in communication, as achieved by our AM(2)\mathsf{AM}\left(2\right) protocol for 3Sat, is the best that any such protocol can provide. More formally, we prove the following set of four interrelated results:

The FreeGameε problem is solvable deterministically in (XA)O(ε2logYB)=nO(ε2logn)\left(\left|X\right|\left|A\right|\right)^{O(\varepsilon^{-2}\log\left|Y\right|\left|B\right|)}=n^{O(\varepsilon^{-2}\log n)} time. (There is also a randomized algorithm that uses XAO(ε2logYB)\left|X\right|\cdot\left|A\right|^{O(\varepsilon^{-2}\log\left|Y\right|\left|B\right|)} time.)

Any AM(2)\mathsf{AM}\left(2\right) protocol involving p(n)p\left(n\right) bits of communication can be simulated in 2O(p(n)2)poly(n)2^{O(p\left(n\right)^{2})}\operatorname*{poly}\left(n\right) time (deterministically, if Arthur’s verification procedure is deterministic, and probabilistically otherwise). So in particular, AM(2)EXP\mathsf{AM}\left(2\right)\subseteq\mathsf{EXP}, improving the trivial upper bound of NEXP\mathsf{NEXP}. (As we point out, a closer analysis improves the upper bound to AM(2)AMNP\mathsf{AM}\left(2\right)\subseteq\mathsf{AM}^{\mathsf{NP}}.)

Assuming the Randomized ETH, any constant-soundness AM(2)\mathsf{AM}\left(2\right) protocol for 3Sat must use Ω(n)\Omega(\sqrt{n}) communication. (In more detail, such a protocol must use Ω(εn)\Omega(\sqrt{\varepsilon n}) communication if its completeness/soundness gap is 11 vs. 1ε1-\varepsilon, and Ω(nlog1/δ)\Omega(\sqrt{n\log 1/\delta}) communication if its gap is 11 vs. δ\delta. Also, if Arthur’s verification procedure is deterministic, then it suffices to assume the standard ETH.)

AM(2)=AM\mathsf{AM}\left(2\right)=\mathsf{AM}. (Of course, this supersedes our AM(2)EXP\mathsf{AM}\left(2\right)\subseteq\mathsf{EXP} and AM(2)AMNP\mathsf{AM}\left(2\right)\subseteq\mathsf{AM}^{\mathsf{NP}} results.)

In Section 7.1, we provide a self-contained proof for result (1), and then use (1) to deduce (2) and (3). The idea of our approximation algorithm is to sample a small random subset SXS\subset X of the questions to Merlin1. We then brute-force search over all possible strategies α:SA\alpha:S\rightarrow A for the questions in SS. For each such strategy α\alpha, we find the optimal response bα:YBb_{\alpha}:Y\rightarrow B of Merlin2 to that α\alpha, and then the optimal response aα:XAa_{\alpha}:X\rightarrow A of Merlin1 to bαb_{\alpha} on his full question set XX. A simple probabilistic analysis then shows that, provided we take S=Ω(ε2logYB)\left|S\right|=\Omega\left(\varepsilon^{-2}\log\left|Y\right|\left|B\right|\right), at least one of these “induced” strategy pairs (aα,bα)\left(a_{\alpha},b_{\alpha}\right) must achieve value within ε\varepsilon of the optimal value ω(G)\omega\left(G\right). Similar ideas have been used before in other approximation algorithms: for example, in that of Lipton, Markakis, and Mehta for finding approximate Nash equilibria.

Once we have an nO(ε2logn)n^{O\left(\varepsilon^{-2}\log n\right)}-time approximation algorithm for FreeGameε, the containment AM(2)EXP\mathsf{AM}\left(2\right)\subseteq\mathsf{EXP} follows almost immediately. We also sketch an improvement to AM(2)AMNP\mathsf{AM}\left(2\right)\subseteq\mathsf{AM}^{\mathsf{NP}}, which is obtained by modifying our approximation algorithm so that it fits into the property-testing framework of Goldreich, Goldwasser, and Ron . As for the optimality of our 3Sat protocol, we simply need to observe that, if we had a protocol that used o(n)o(\sqrt{n}) communication, then it would give rise to a free game GG of size 2o(n)2^{o(\sqrt{n})}, whose value ω(G)\omega\left(G\right) we could estimate in 2o(n)2^{o\left(n\right)} time by using our quasipolynomial-time approximation algorithm. But that would let us decide 3Sat in 2o(n)2^{o\left(n\right)} time, contradicting the Exponential Time Hypothesis.

For result (4), we wish to go further, and show that any two-Merlin protocol can be simulated using one Merlin: that is, AM(2)=AM\mathsf{AM}\left(2\right)=\mathsf{AM}. Here we appeal to a powerful line of earlier work on subsampling for dense CSPs. Specifically, Alon et al. showed in 2002 that, given any kk-ary constraint satisfaction problem φ\varphi over nn Boolean variables, one can estimate the maximum number of constraints in φ\varphi that can be simultaneously satisfied, to within additive error ±ε(nk)\pm\varepsilon\binom{n}{k}, by simply throwing away all the variables except for a random set II of size poly(1/ε)\operatorname*{poly}\left(1/\varepsilon\right), and then using brute-force search to find an optimal assignment to φI\varphi_{I}, the restriction of φ\varphi to II.

To build intuition, it is easy to satisfy φI\varphi_{I} at least as well as we can satisfy φ\varphi, with high probability over II. To do so, simply start with an optimal global assignment xx for φ\varphi; then restrict xx to the variables in II and apply a Chernoff bound. The hard part is to show that φI\varphi_{I} cannot be satisfied much better than the full instance φ\varphi was. Conversely, one needs to show that, given a collection of “local assignments,” involving just poly(1/ε)\operatorname*{poly}\left(1/\varepsilon\right) variables at a time, one can “patch them together” into a global assignment that is almost as good as the local ones.

In later work, Barak et al. proved a more general result, which removed Alon et al.’s assumption that the alphabet is Boolean. Their result lets us approximate the value of any dense kk-CSP φ\varphi over the finite alphabet Σ\Sigma to within additive error ±ε(nk)\pm\varepsilon\binom{n}{k}, by solving a random sub-instance on poly(1/ε)logΣ\operatorname*{poly}\left(1/\varepsilon\right)\cdot\log\left|\Sigma\right| variables.

To see the relevance of this work to free games, we simply need to observe that FreeGame can be directly encoded as a dense CSP. Given a free game G=(X,Y,A,B,V)G=\left(X,Y,A,B,V\right), we can create variables (a(x))xX\left(a\left(x\right)\right)_{x\in X} and (b(y))yY\left(b\left(y\right)\right)_{y\in Y} over the alphabets AA and BB respectively, and then for all (x,y,a,b)X×Y×A×B\left(x,y,a,b\right)\in X\times Y\times A\times B, add a number of constraints setting a(x)=aa\left(x\right)=a and b(y)=bb\left(y\right)=b that is proportional to V(x,y,a,b)V\left(x,y,a,b\right). Once we do this, the result of Barak et al. implies a subsampling theorem for free games—saying that the value of any free game GG can be well-approximated by the value of a logarithmic-sized random subgame. And this, in turn, readily implies that AM(2)=AM\mathsf{AM}\left(2\right)=\mathsf{AM}. For given any AM(2)\mathsf{AM}\left(2\right) protocol, we can simulate the protocol in AM\mathsf{AM} by having Arthur execute the following steps:

Choose random subsets S,TS,T of poly(n)\operatorname*{poly}\left(n\right) questions to Merlin1 and Merlin2 respectively.

Ask a single Merlin to send him responses to all questions in SS and TT.

Check the responses, for all possible question pairs (x,y)S×T\left(x,y\right)\in S\times T.

The soundness of this approach follows from the subsampling theorem, which says that if Merlins had no winning strategy in the original AM(2)\mathsf{AM}\left(2\right) protocol, then with high probability, they have no winning strategy even when restricted to the tiny subset of questions S×TS\times T.

One might ask: if existing results on dense CSPs can be used to show that AM(2)=AM\mathsf{AM}\left(2\right)=\mathsf{AM}, then why do we “reinvent the wheel,” and provide self-contained proofs for weaker results such as AM(2)EXP\mathsf{AM}\left(2\right)\subseteq\mathsf{EXP}? One answer is that the dense CSP results do not give good dependence on the error. For example, those results imply that FreeGameε can be solved in nO(εΛlogn)n^{O(\varepsilon^{-\Lambda}\log n)} time for some large and unspecified constant Λ\Lambda, but not that it can be solved in nO(ε2logn)n^{O(\varepsilon^{-2}\log n)} time. And we actually care about the dependence on ε\varepsilon, for at least two reasons. First, we wish to make an analogy with a recent nO(ε2logn)n^{O(\varepsilon^{-2}\log n)} algorithm for a problem in quantum information theory, due to Brandao, Christandl, and Yard (for details see Section 4). And second, we wish to show that, assuming the ETH, the “obvious” AM(2)\mathsf{AM}\left(2\right) protocol for 3Sat is optimal even in the very low-error and high-error cases. The dense CSP results do not get us close to such a statement, but our algorithm does.

More broadly, appealing to the dense CSP literature feels like overkill if we just want to show (for example) that our 3Sat protocol is optimal, or that the values of free games can be approximated in quasipolynomial time. If we can prove those results in an elementary, self-contained way, then it seems like we should—particularly because our proofs might help to make certain striking techniques from the dense CSP world more accessible than they would be otherwise.

Their algorithm also implies that AM(2)EXP\mathsf{AM}\left(2\right)\subseteq\mathsf{EXP}, and that our 3Sat protocol is essentially optimal assuming the ETH. On the other hand, it seems unlikely that their algorithm can be used to get the containment AM(2)AMNP\mathsf{AM}\left(2\right)\subseteq\mathsf{AM}^{\mathsf{NP}}, let alone AM(2)=AM\mathsf{AM}\left(2\right)=\mathsf{AM}.

3 Generalizing to k𝑘k Merlins

One might wonder whether our limitation theorems for AM(2)\mathsf{AM}\left(2\right) protocols could be evaded by simply adding more Merlins. So for example, even if AM(2)\mathsf{AM}\left(2\right) protocols for 3Sat require Ω(n)\Omega(\sqrt{n}) communication (assuming the ETH), could there be an AM(3)\mathsf{AM}\left(3\right) protocol that used O(n1/3)O(n^{1/3}) communication, an AM(10)\mathsf{AM}\left(10\right) protocol that used O(n1/10)O(n^{1/10}) communication, and so forth? In Sections 7.3 and 7.4, we generalize our limitation theorems to the case of kk Merlins, in order to rule out that possibility. In particular, we give the following extensions of our results from Section 3.2:

There is a deterministic algorithm that, given as input a kk-player free game GG with question sets Y1,,YkY_{1},\ldots,Y_{k} and answer sets B1,,BkB_{1},\ldots,B_{k}, approximates ω(G)\omega\left(G\right) to within ±ε\pm\varepsilon in time

where n=Y1B1YkBkn=\left|Y_{1}\right|\left|B_{1}\right|\cdots\left|Y_{k}\right|\left|B_{k}\right| is the input size. (There is also an alternative algorithm that runs in time nεO(1)lognn^{\varepsilon^{-O\left(1\right)}\log n}, independently of kk.)

AM(k)EXP\mathsf{AM}\left(k\right)\subseteq\mathsf{EXP} for all k=poly(n)k=\operatorname*{poly}\left(n\right). (Indeed, any constant-soundness AM(k)\mathsf{AM}\left(k\right) protocol involving p(n)p\left(n\right) total bits of communication can be simulated in 2O(p(n)2)poly(n)2^{O(p\left(n\right)^{2})}\operatorname*{poly}\left(n\right) randomized time, or 2O(p(n)2)poly(n)2^{O(p\left(n\right)^{2})}\operatorname*{poly}\left(n\right) deterministic time if Arthur’s verification procedure is deterministic.)

Assuming the Randomized ETH, any constant-soundness AM(k)\mathsf{AM}\left(k\right) protocol for 3Sat must use Ω(n)\Omega(\sqrt{n}) total bits of communication, regardless of how large kk is. (If, moreover, Arthur’s verification procedure is deterministic, then it suffices to assume the ordinary ETH.)

AM(k)=AM\mathsf{AM}\left(k\right)=\mathsf{AM} for all k=poly(n)k=\operatorname*{poly}\left(n\right).

We first prove (1’), and then derive (2’) and (3’) as consequences. For (1’), the basic idea is to generalize our approximation algorithm for 22-player free games to kk-player games, by calling the algorithm recursively to “peel off players one at a time.” In other words, we reduce the approximation of a kk-player game to the approximation of a quasipolynomial number of (k1)\left(k-1\right)-player games, and continue recursing until we get down to 11 player. When we do this, we need to control the buildup of error across all kk levels of the recursion, and that is why we get a factor of k2k^{2} in the exponent of the running time. Later, by using the subsampling machinery, we will be able to go back and give an alternative algorithm whose running time depends only on nn, not on kk. And that, in turn, will let us show that assuming the ETH, any AM(k)\mathsf{AM}\left(k\right) protocol for 3Sat must use Ω(n)\Omega(\sqrt{n}) total bits of communication, regardless of kk. (Our first algorithm only implies a lower bound of k+Ω(n/k)=Ω(n1/4)k+\Omega(\sqrt{n}/k)=\Omega(n^{1/4}) on the total communication, assuming the ETH.) The tradeoff is that the running time of the alternative algorithm depends exponentially on εΛ\varepsilon^{-\Lambda} for some large constant Λ\Lambda, rather than on ε2\varepsilon^{-2}.

For (4’), we need to show that the subsampling theorem of Barak et al. continues to give us what we want, so long as k=poly(n)k=\operatorname*{poly}\left(n\right). This boils down to proving a good subsampling theorem for kk-player free games. That is, given any kk-player free game G=(Y1,,Yk,B1,,Bk,V)G=\left(Y_{1},\ldots,Y_{k},B_{1},\ldots,B_{k},V\right) of total size n=Y1B1YkBkn=\left|Y_{1}\right|\left|B_{1}\right|\cdots\left|Y_{k}\right|\left|B_{k}\right|, we need to show that its value ω(G)\omega\left(G\right) can be approximated to within additive error ±ε\pm\varepsilon, by restricting attention to random subsets of questions (SiYi)i[k]\left(S_{i}\subset Y_{i}\right)_{i\in\left[k\right]}, where each SiS_{i} has size εO(1)logn\varepsilon^{-O\left(1\right)}\log n. A direct adaptation of our argument from the k=2k=2 case turns out not to work here (it breaks down when kk is greater than O(logn)O\left(\log n\right)), but we give an alternative encoding of kk-player free games by kk-CSPs that works for all k=poly(n)k=\operatorname*{poly}\left(n\right).

Quantum Motivation

In studying AM(2)\mathsf{AM}\left(2\right), our original motivation was to understand the quantum complexity class QMA(2)\mathsf{QMA}\left(2\right) (i.e., two-prover Quantum Merlin-Arthur). So in this section, we provide some background about QMA(2)\mathsf{QMA}\left(2\right), and explain the tantalizingly close analogy between it and AM(2)\mathsf{AM}\left(2\right). Readers who don’t care about quantum complexity theory can skip this section.

Recall that “ordinary” QMA\mathsf{QMA} is just the quantum analogue of MA\mathsf{MA}:

QMA\mathsf{QMA} is the class of languages L{0,1}L\subseteq\left\{0,1\right\}^{\ast} for which there exists a polynomial-time quantum algorithm QQ such that, for all inputs x{0,1}nx\in\left\{0,1\right\}^{n}:

If xLx\in L then there exists a quantum witness state ϕ\left|\phi\right\rangle, on poly(n)\operatorname*{poly}\left(n\right) qubits, such that Q(x,ϕ)Q\left(x,\left|\phi\right\rangle\right) accepts with probability at least 2/32/3.

If xLx\notin L then Q(x,ϕ)Q\left(x,\left|\phi\right\rangle\right) accepts with probability at most 1/31/3, for all purported witness states ϕ\left|\phi\right\rangle.

A lot is known about QMA\mathsf{QMA}: for example, it has natural complete promise problems, admits amplification, and is contained in PP\mathsf{PP} (see Aharonov and Naveh for a survey).

Now, QMA(k)\mathsf{QMA}\left(k\right) (introduced by Kobayashi et al. ) is just like QMA\mathsf{QMA}, but with kk Merlins who are assumed to be unentangled. Note that, if the Merlins were entangled, then the joint state they sent to Arthur could be arbitrary—so from Arthur’s perspective, there might as well be only one Merlin.For precisely this reason, in the classical case we trivially have MA(k)=MA\mathsf{MA}\left(k\right)=\mathsf{MA} for all k=poly(n)k=\operatorname*{poly}\left(n\right). With QMA(k)\mathsf{QMA}\left(k\right), the hope is that, ironically, Arthur can exploit his knowledge that the messages are unentangled to verify statements that he otherwise could not. More formally:

QMA(k)\mathsf{QMA}\left(k\right) is the class of languages L{0,1}L\subseteq\left\{0,1\right\}^{\ast} for which there exists a polynomial-time quantum algorithm QQ such that, for all inputs x{0,1}nx\in\left\{0,1\right\}^{n}:

If xLx\in L, then there exist quantum witness states ϕ1,,ϕk\left|\phi_{1}\right\rangle,\ldots,\left|\phi_{k}\right\rangle, each on poly(n)\operatorname*{poly}\left(n\right) qubits, such that Q(x,ϕ1ϕk)Q\left(x,\left|\phi_{1}\right\rangle\otimes\cdots\otimes\left|\phi_{k}\right\rangle\right) accepts with probability at least 2/32/3.

If xLx\notin L then Q(x,ϕ1ϕk)Q\left(x,\left|\phi_{1}\right\rangle\otimes\cdots\otimes\left|\phi_{k}\right\rangle\right) accepts with probability at most 1/31/3 for all purported witness states ϕ1,,ϕk\left|\phi_{1}\right\rangle,\ldots,\left|\phi_{k}\right\rangle.

Compared to QMA\mathsf{QMA}, strikingly little is known about QMA(2)\mathsf{QMA}\left(2\right). Clearly

but we do not know any better containments. We do not even have strong evidence that QMA(2)QMA\mathsf{QMA}\left(2\right)\neq\mathsf{QMA}, or at the other extreme that QMA(2)NEXP\mathsf{QMA}\left(2\right)\neq\mathsf{NEXP}. Harrow and Montanaro showed that QMA(2)\mathsf{QMA}\left(2\right) allows exponential amplification of success probabilities, and that QMA(2)=QMA(k)\mathsf{QMA}\left(2\right)=\mathsf{QMA}\left(k\right) for all k3k\geq 3; even these were surprisingly nontrivial results.

Of course, QMA(2)\mathsf{QMA}\left(2\right) would be of limited interest, if we could never actually exploit the promise of unentanglement to do anything new. In 2007, however, Blier and Tapp gave a QMA(2)\mathsf{QMA}\left(2\right) protocol for the NP\mathsf{NP}-complete 3Coloring problem, using two quantum witnesses with only logn\log n qubits each. The catch was that Arthur has only a 1/poly(n)1/\operatorname*{poly}\left(n\right) probability of catching the Merlins if they cheat. Even then, however, any one-prover QMA\mathsf{QMA} protocol with the same parameters would imply NPBQP\mathsf{NP}\subseteq\mathsf{BQP}.

Independently, Aaronson et al. gave a protocol to convince Arthur that a 3Sat instance of size nn is satisfiable, using O~(n)\widetilde{O}(\sqrt{n}) quantum witnesses with logn\log n qubits each. Unlike Blier and Tapp’s protocol, Aaronson et al.’s achieved constant soundness, and that is why it required more communication (O~(n)\widetilde{O}(\sqrt{n}) rather than logn\log n). Shortly afterward, Aaronson et al.’s protocol was improved by Harrow and Montanaro , who showed how to prove 3Sat using two quantum witnesses with O~(n)\widetilde{O}(\sqrt{n}) qubits each; and in a different direction by Chen and Drucker , who showed how to measure each of the O~(n)\widetilde{O}(\sqrt{n}) witnesses separately from the others.It is still not known whether one can combine the Harrow-Montanaro and Chen-Drucker improvements, to get a 3Sat protocol using two witnesses of O~(n)\widetilde{O}(\sqrt{n}) qubits each that are measured separately from each other.

Without going into too much detail, all of these O~(n)\widetilde{O}(\sqrt{n})-qubit protocols for 3Sat ultimately rely on the Birthday Paradox. In particular, they all involve Arthur measuring kk quantum registers with logn\log n qubits each—and if we want constant soundness, then (roughly speaking) we need a constant probability that two or more of Arthur’s measurements will reveal information about the same 3Sat variable xjx_{j}. And that is why we need k=Ω(n)k=\Omega(\sqrt{n}).

It is tempting to speculate that n\sqrt{n} qubits represents some sort of fundamental barrier for multi-prover QMA\mathsf{QMA} protocols: i.e., that assuming we want constant soundness, we can save a quadratic factor in the number of qubits needed to prove 3Sat, but no more than that. Certainly it would be astonishing if 3Sat could be proved (with constant soundness) using two unentangled witnesses with only polylogn\operatorname*{polylog}n qubits each. In that case, “scaling up” by an exponential, we would presumably get that QMA(2)=NEXP\mathsf{QMA}\left(2\right)=\mathsf{NEXP}.

When one thinks about the above questions—or for that matter, almost any questions about QMA(2)\mathsf{QMA}\left(2\right)—one is inevitably led to a computational problem that Harrow and Montanaro called the Best Separable State or BSS problem.

to additive error ±ε\pm\varepsilon. (Here ε\varepsilon is assumed to be an arbitrarily small constant if not specified otherwise.)

is just the largest eigenvalue of AA, which is easy to compute. Indeed, the proof of QMAPP\mathsf{QMA}\subseteq\mathsf{PP} works by reducing the simulation of a QMA\mathsf{QMA} protocol to the computation of λ(A)\lambda\left(A\right), for some exponentially-large Hermitian matrix AA.

By contrast, BSS asks us to maximize uAuu^{{\dagger}}Au only over unit vectors of the form u=vwu=v\otimes w. That is why BSS models the problem of maximizing the verifier’s acceptance probability in a QMA(2)\mathsf{QMA}\left(2\right) protocol, where the maximum is taken over all separable witnesses, of the form ϕ1ϕ2\left|\phi_{1}\right\rangle\otimes\left|\phi_{2}\right\rangle. From this standpoint, the reason why QMA(2)\mathsf{QMA}\left(2\right) is so much harder to understand than QMA\mathsf{QMA}—but also why QMA(2)\mathsf{QMA}\left(2\right) is potentially more powerful—is that (as one can check) BSS is a non-convex optimization problem, which lacks the clean linear-algebraic structure of computing λ(A)\lambda\left(A\right).

Indeed, from the protocol of Blier and Tapp mentioned earlier, it follows immediately that we can reduce 3Coloring to the problem of approximating λsep(A)\lambda_{\operatorname*{sep}}\left(A\right) up to additive error ±1/poly(n)\pm 1/\operatorname*{poly}\left(n\right). Furthermore, since the quantum witnesses in the Blier-Tapp protocol have only logn\log n qubits, the resulting matrix AA will have size 2O(logn)=poly(n)2^{O\left(\log n\right)}=\operatorname*{poly}\left(n\right). Thus:

BSS1/poly(n){}_{1/\operatorname*{poly}\left(n\right)} is NP\mathsf{NP}-hard.

One wants to know: is BSSε still a hard problem even for constant ε\varepsilon? Because it has constant soundness, the protocol of Harrow and Montanaro (building on Aaronson et al. ) lets us reduce 3Sat to the problem of approximating λsep(A)\lambda_{\operatorname*{sep}}\left(A\right) up to constant additive error. Now, since the quantum witnesses in the Harrow-Montanaro protocol have O~(n)\widetilde{O}(\sqrt{n}) qubits, the resulting matrix AA has size 2O~(n)2^{\widetilde{O}(\sqrt{n})}, so we do not get a polynomial-time reduction. We do, however, get something:

If BSS is solvable in t(n)t\left(n\right) time, then 3Sat is solvable in t(2O~(n))t(2^{\widetilde{O}(\sqrt{n})}) time. So in particular, assuming the Exponential Time Hypothesis, BSS requires nΩ(logn)n^{\Omega\left(\log n\right)} deterministic time. (Likewise, assuming the Randomized ETH, BSS requires nΩ(logn)n^{\Omega\left(\log n\right)} randomized time.)

Could we go further than Theorems 17 and 18, and prove that BSSε is NP\mathsf{NP}-hard even for constant ε\varepsilon? Notice that if we could, then “scaling up by an exponential,” we could presumably also show QMA(2)=NEXP\mathsf{QMA}\left(2\right)=\mathsf{NEXP}! If, on the other hand, we believe (as seems plausible) that QMA(2)EXP\mathsf{QMA}\left(2\right)\subseteq\mathsf{EXP}, then we seem forced to believe that BSS is solvable in npolylognn^{\operatorname*{polylog}n} time, even if we have no idea what the algorithm is.Strictly speaking, neither of these implications is a theorem. For example, even if BSSε turned out to be NP\mathsf{NP}-hard for constant ε\varepsilon, it’s possible that one could exploit the special structure of the matrices arising from polynomial-size quantum circuits to show that QMA(2)EXP\mathsf{QMA}\left(2\right)\subseteq\mathsf{EXP}. In practice, however, a “reasonable” proof that BSSε is NP\mathsf{NP}-hard would probably also imply QMA(2)=NEXP\mathsf{QMA}\left(2\right)=\mathsf{NEXP}, and a “reasonable” proof of QMA(2)EXP\mathsf{QMA}\left(2\right)\subseteq\mathsf{EXP} would probably proceed by solving BSSε in quasipolynomial time.

Raising the stakes even further, Barak et al. showed that BSS is intimately related to other problems of great current interest in complexity theory: namely, the Unique Games, Small Set Expansion, and 2-to-4 Norm problems.We refer the reader to for the definitions of these problems, but very briefly: Unique Games (UG) is the problem of deciding whether ω(G)\omega(G) is close to 11 or close to , given as input a description of a two-prover game G=(X,Y,A,B,D,V)G=\left(X,Y,A,B,\mathcal{D},V\right) with the special properties that A=B\left|A\right|=\left|B\right|, and that for every (x,y)X×Y\left(x,y\right)\in X\times Y there exists a permutation πx,y\pi_{x,y} such that V(x,y,a,b)=1V(x,y,a,b)=1 if and only if b=πx,y(a)b=\pi_{x,y}(a). Small Set Expansion (SSE) is the problem of deciding whether a given graph GG is close to or far from an expander graph, if we consider GG’s expansion on “small” subsets of vertices only. 2-to-4 Norm (2-to-4) is the problem, given as input an n×nn\times n matrix AA, of approximating the maximum of Av4\left\|Av\right\|_{4} over all vectors vv such that v2=1\left\|v\right\|_{2}=1. The lattice of known reductions among these problems is as follows:

Now, assuming the ETH, Theorem 18 gives us nΩ(logn)n^{\Omega\left(\log n\right)} hardness for BSS—and as a direct consequence, for 2-to-4 Norm as well. That might not sound like much, but it’s a lot more than we currently know for either Unique Games or Small Set Expansion! So the speculation arises that, if we fully understood BSS, we might be able to apply some of the insights to UG or SSE.

To lay our cards on the table, here is our conjecture about BSS:

BSSε is solvable in deterministic nO(ε2logn)n^{O(\varepsilon^{-2}\log n)} time.

If true, Conjecture 19 readily implies that QMA(2)EXP\mathsf{QMA}\left(2\right)\subseteq\mathsf{EXP}. Since a t(n)t\left(n\right)-time algorithm for BSS can be combined with a q(n)q\left(n\right)-qubit QMA(2)\mathsf{QMA}\left(2\right) protocol for 3Sat to get a t(2O(q(n)))t(2^{O\left(q\left(n\right)\right)})-time algorithm for 3Sat, Conjecture 19 also implies that, assuming the ETH, any QMA(2)\mathsf{QMA}\left(2\right) protocol for 3Sat must use Ω(n)\Omega(\sqrt{n}) qubits.

There has been some progress toward a proof of Conjecture 19. In particular, Brandao, Christandl, and Yard gave an algorithm that solves BSSε in nO(ε2logn)n^{O(\varepsilon^{-2}\log n)} time if A2=O(1)\left\|A\right\|_{2}=O\left(1\right), or alternatively, if AA represents a quantum measurement that can be implemented using LOCC (Local Operations and Classical Communication). This implied, among other things, that QMALOCC(k)=QMA\mathsf{QMA}_{\mathsf{LOCC}}\left(k\right)=\mathsf{QMA} for k=O(1)k=O\left(1\right), where QMALOCC(k)\mathsf{QMA}_{\mathsf{LOCC}}\left(k\right) is the subclass of QMA(k)\mathsf{QMA}\left(k\right) in which Arthur is restricted to LOCC measurements. Brandao et al.’s algorithm uses a technique that quantum information researchers know as symmetric extension, and that theoretical computer scientists know as the Lasserre hierarchy. It is not known whether similar techniques could work for arbitrary QMA(k)\mathsf{QMA}\left(k\right) protocols.

More recently, Brandao and Harrow showed that, assuming the ETH, any so-called BellQMA(k)\mathsf{BellQMA}\left(k\right) protocol for 3Sat—that is, any QMA(k)\mathsf{QMA}\left(k\right) protocol where each of the kk witnesses are measured separately—must use n1/2o(1)n^{1/2-o\left(1\right)} qubits. This lower bound is known to be essentially tight, due to the protocol of Chen and Drucker . The requirement that each witness be measured separately (with the measurement outcomes then combined with classical postprocessing) is even more stringent than the requirement of LOCC. Despite this, the result of Brandao and Harrow did not follow from the earlier result of Brandao, Christandl, and Yard that QMALOCC(k)=QMA\mathsf{QMA}_{\mathsf{LOCC}}\left(k\right)=\mathsf{QMA}, because the latter works only for constant kk.

But what does any of the above have to do with AM(2)\mathsf{AM}\left(2\right)? One way to view this paper’s contribution is as follows: we prove that a “classical analogue” of Conjecture 19 holds. In more detail, we can think of AM(2)\mathsf{AM}\left(2\right) as closely analogous in many ways to QMA(2)\mathsf{QMA}\left(2\right). For both classes, the only obvious lower bound comes from restricting to a single Merlin, while the only obvious upper bound is NEXP\mathsf{NEXP}. For both classes, the difficulty with proving an EXP\mathsf{EXP} upper bound is the requirement that the Merlins can’t communicate, which gives rise to a non-convex optimization problem. For both classes, there exists a protocol for 3Sat that uses logn\log n communication, but that has only a 1/poly(n)1/\operatorname*{poly}\left(n\right) probability of catching cheating Merlins. For both classes, we can improve the 3Sat protocol to have constant soundness, by using a strong PCP theorem together with the Birthday Paradox—but if we do so, then the communication cost increases from logn\log n to O~(n)\widetilde{O}(\sqrt{n}).

Because the analogy runs so deep, it seems of interest to QMA(2)\mathsf{QMA}\left(2\right) researchers to know that:

FreeGameε is solvable in nO(ε2logn)n^{O(\varepsilon^{-2}\log n)} time, as we conjecture that BSSε is.

AM(2)\mathsf{AM}\left(2\right) is contained in EXP\mathsf{EXP}, as we conjecture that QMA(2)\mathsf{QMA}\left(2\right) is.

The O~(n)\widetilde{O}(\sqrt{n})-communication AM(2)\mathsf{AM}\left(2\right) protocol for 3Sat is essentially optimal assuming the ETH, as we conjecture that the corresponding QMA(2)\mathsf{QMA}\left(2\right) protocol is.

Of course, we also show in this paper that AM(2)=AM\mathsf{AM}\left(2\right)=\mathsf{AM}. So pushing the analogy between AM(2)\mathsf{AM}\left(2\right) and QMA(2)\mathsf{QMA}\left(2\right) all the way to the end would lead to the conjecture that QMA(2)=QMA\mathsf{QMA}\left(2\right)=\mathsf{QMA}. We remain agnostic about whether the analogy extends that far!

Preliminaries

Some notation: we use E\operatorname{E} for expectation, [n]\left[n\right] for {1,,n}\left\{1,\ldots,n\right\}, and ([n]k)\binom{\left[n\right]}{k} for the set of subsets of [n]\left[n\right] of size kk. In addition to the notation O~(f(n))\widetilde{O}(f\left(n\right)) for O(f(n)polylogf(n))O(f\left(n\right)\operatorname*{polylog}f\left(n\right)), we also use Ω~(f(n))\widetilde{\Omega}(f\left(n\right)) for Ω(f(n)/polylogf(n))\Omega(f\left(n\right)/\operatorname*{polylog}f\left(n\right)). All logs are base 22 unless specified otherwise.

Sections 1 and 2 have already defined many of the concepts we will need, including two-prover games, free games, the clause/variable game, the birthday repetition, and the FreeGame problem. For completeness, though, we now give the general definition of kk-player free games.

finite question sets Y1,,YkY_{1},\ldots,Y_{k} and answer sets B1,,BkB_{1},\ldots,B_{k}, and

a verification function V:Y1××Yk×B1××Bk[0,1]V:Y_{1}\times\cdots\times Y_{k}\times B_{1}\times\cdots\times B_{k}\rightarrow\left[0,1\right].

The value of the game, denoted ω(G)\omega\left(G\right), is the maximum, over all tuples of response functions (bi:YiBi)i[k]\left(b_{i}:Y_{i}\rightarrow B_{i}\right)_{i\in\left[k\right]} , of

Directly related to kk-player free games is the complexity class AM(k)\mathsf{AM}\left(k\right), which we now formally define.AM(k)\mathsf{AM}\left(k\right) should not be confused with AM[k]\mathsf{AM}\left[k\right], which means AM\mathsf{AM} with a single Merlin but kk rounds of communication. A classic result of Babai and Moran says that AM[k]=AM[2]=AM\mathsf{AM}\left[k\right]=\mathsf{AM}\left[2\right]=\mathsf{AM} for all constants k2k\geq 2. When k=poly(n)k=\operatorname*{poly}\left(n\right), by contrast, such a collapse is not believed to happen, since AM[poly]=IP=PSPACE\mathsf{AM}\left[\operatorname*{poly}\right]=\mathsf{IP}=\mathsf{PSPACE}.

Let kk be a positive integer. Then AM(k)\mathsf{AM}\left(k\right) is the class of languages L{0,1}L\subseteq\left\{0,1\right\}^{\ast} for which there exists a probabilistic polynomial-time verifier VV such that for all nn and all inputs x{0,1}nx\in\left\{0,1\right\}^{n}:

(Completeness) If xLx\in L, then there exist functions b1,,bk:{0,1}poly(n){0,1}poly(n)b_{1},\ldots,b_{k}:\left\{0,1\right\}^{\operatorname*{poly}\left(n\right)}\rightarrow\left\{0,1\right\}^{\operatorname*{poly}\left(n\right)}, depending on xx, such that

(Soundness) If xLx\notin L, then for all such functions b1,,bkb_{1},\ldots,b_{k},

Clearly AM(1)=AM\mathsf{AM}\left(1\right)=\mathsf{AM} and AM(k)AM(k+1)\mathsf{AM}\left(k\right)\subseteq\mathsf{AM}\left(k+1\right) for all kk. We also have AM(k)MIP(k)\mathsf{AM}\left(k\right)\subseteq\mathsf{MIP}\left(k\right), thereby giving the crude upper bound AM(k)NEXP\mathsf{AM}\left(k\right)\subseteq\mathsf{NEXP} (later we will do much better).

We can easily generalize the definition of AM(k)\mathsf{AM}\left(k\right) to AM(k(n))\mathsf{AM}\left(k\left(n\right)\right), for any growth rate k(n)=O(poly(n))k\left(n\right)=O\left(\operatorname*{poly}\left(n\right)\right). Also, let AMp(n)(k)\mathsf{AM}_{p\left(n\right)}\left(k\right) be the variant of AM(k)\mathsf{AM}\left(k\right) where all messages (both the yiy_{i}’s and the bib_{i}’s) are constrained to be p(n)p\left(n\right) bits long.

Given any probabilistic complexity class C\mathcal{C}, one of the first questions we can ask is whether C\mathcal{C} admits amplification of success probabilities—or equivalently, whether C\mathcal{C} is robust under changing its error parameters (such as 1/31/3 and 2/32/3). At least for AM(2)\mathsf{AM}\left(2\right), we are fortunate that a positive answer follows from known results. In particular, building on the Parallel Repetition Theorem (Theorem 12), Rao proved a useful concentration bound for the parallel repetitions of two-prover games:

For all δ>0\delta>0 and all two-prover games G=(X,Y,A,B,D,V)G=\left(X,Y,A,B,\mathcal{D},V\right), if Merlin1 and Merlin2 play the parallel repeated version GNG^{N}, then they can win more than a ω(G)+δ\omega\left(G\right)+\delta fraction of the games with probability at most

Theorem 22 implies that “amplification works” for AM(2)\mathsf{AM}\left(2\right) protocols:

In the definition of AM(2)\mathsf{AM}\left(2\right), replacing the constants (1/3,2/3)\left(1/3,2/3\right) by (a,b)\left(a,b\right) for any constants 0<a<b<10<a<b<1, or indeed by (2p(n),12p(n))\left(2^{-p\left(n\right)},1-2^{-p\left(n\right)}\right) or (1/21/p(n),1/2+1/p(n))\left(1/2-1/p\left(n\right),1/2+1/p\left(n\right)\right) for any polynomial pp, gives rise to the same complexity class.

Proof. Suppose, for example, that we want to amplify (1/3,2/3)\left(1/3,2/3\right) to (2p(n),12p(n))\left(2^{-p\left(n\right)},1-2^{-p\left(n\right)}\right); the other cases are analogous. Given a language LAM(2)L\in\mathsf{AM}\left(2\right) and an input x{0,1}nx\in\left\{0,1\right\}^{n}, the AM(2)\mathsf{AM}\left(2\right) protocol for checking whether xLx\in L can be represented as a free game G=(X,Y,A,B,V)G=\left(X,Y,A,B,V\right), where X=Y=A=B={0,1}q(n)X=Y=A=B=\left\{0,1\right\}^{q\left(n\right)} for some polynomial qq. We have ω(G)2/3\omega\left(G\right)\geq 2/3 if xLx\in L, and ω(G)1/3\omega\left(G\right)\leq 1/3 if xLx\notin L. Now let G1/2NG_{1/2}^{N} be the game where the Merlins play NN parallel instances of the original game GG, and they “win” if and only if they win on at least N/2N/2 instances. If ω(G)2/3\omega\left(G\right)\geq 2/3, then clearly ω(G1/2N)12Ω(N)\omega(G_{1/2}^{N})\geq 1-2^{-\Omega\left(N\right)}—since if the Merlins just play their optimal strategy for GG on each of the NN instances separately, then the number that they win will be concentrated around ω(G)N2N/3\omega\left(G\right)N\geq 2N/3 by a standard Chernoff bound. On the other hand, if ω(G)1/3\omega\left(G\right)\leq 1/3, then Theorem 22 implies that ω(G1/2N)2Ω(N/q(n))\omega(G_{1/2}^{N})\leq 2^{-\Omega\left(N/q\left(n\right)\right)}. So, by simply choosing Np(n)q(n)N\gg p\left(n\right)q\left(n\right) to be a suitably large polynomial, we can ensure that ω(G1/2N)12p(n)\omega(G_{1/2}^{N})\geq 1-2^{-p\left(n\right)} if xLx\in L while ω(G1/2N)2p(n)\omega(G_{1/2}^{N})\leq 2^{-p\left(n\right)} if xLx\notin L.

Note that Proposition 23 can blow up the communication cost by a polynomial factor, because of the dependence of NN on q(n)q\left(n\right) (which derives from the 1/logAB1/\log\left|A\right|\left|B\right| factor in the exponent from Theorem 22). For this reason, Proposition 23 doesn’t directly imply any useful amplification for our O~(n)\widetilde{O}(\sqrt{n})-communication protocol for 3Sat. See Section 6.3 for further discussion of this issue, and for our best current results for the low-error case.

Rao (personal communication) believes that it would be straightforward to generalize Theorem 22 to games with k3k\geq 3 players, as long as the games are free.By contrast, for general games with k3k\geq 3 players, even proving a “standard” parallel repetition theorem is a notorious open problem. If so, then we would also obtain an amplification theorem for AM(k)\mathsf{AM}\left(k\right), for all k=poly(n)k=\operatorname*{poly}\left(n\right). However, this generalization has not yet been worked out explicitly.

One last remark: a classic result about “ordinary” AM\mathsf{AM} (see ) states that any AM\mathsf{AM} protocol can be made to have perfect completeness. In other words, the condition xLPr[V accepts]2/3x\in L\Rightarrow\Pr\left[V\text{ accepts}\right]\geq 2/3 can be strengthened to xLPr[V accepts]=1x\in L\Rightarrow\Pr\left[V\text{ accepts}\right]=1 without loss of generality. Another classic result states that any AM\mathsf{AM} protocol can be made public-coin, meaning that any random bits generated by Arthur are immediately shared with Merlin. In terms of games, the public-coin property would imply in particular that Arthur’s verification function was deterministic: that is, V(x,y,a,b){0,1}V\left(x,y,a,b\right)\in\left\{0,1\right\} for all x,y,a,bx,y,a,b.

Thus, one might wonder whether any AM(k)\mathsf{AM}\left(k\right) protocol can be made perfect-completeness and public-coin as well. Ultimately, affirmative answers to these questions will follow from our result that AM(k)=AM\mathsf{AM}\left(k\right)=\mathsf{AM}, which works regardless of whether the original AM(k)\mathsf{AM}\left(k\right) protocol had perfect completeness or was public-coin. But it would be interesting to find direct proofs of these properties for AM(k)\mathsf{AM}\left(k\right). (It would also be interesting to find a direct proof that AM(k)=AM(2)\mathsf{AM}\left(k\right)=\mathsf{AM}\left(2\right) for all k>2k>2, rather than deducing this as a consequence of AM(k)=AM\mathsf{AM}\left(k\right)=\mathsf{AM}.)

Analysis of the Birthday Game

Our goal in this section is to prove Theorem 6: informally, that AM(2)\mathsf{AM}\left(2\right) protocols for 3Sat can achieve nearly a quadratic savings in communication over the “naïve” bound of nn bits. The section is organized as follows. First, in Section 6.1, we give a “basic” protocol with a 11 vs. 1ϵ1-\epsilon completeness/soundness gap (for some fixed ϵ>0\epsilon>0) and O~(n)\widetilde{O}(\sqrt{n}) communication cost. The protocol is based on the birthday repetition already discussed in Section 3.1; for concreteness, we initially implement the idea using Dinur’s PCP Theorem and the clause/variable game. Next, in Section 6.2, we study the high-error case, showing how a more refined analysis leads to a protocol with a 11 vs. 1ε1-\varepsilon completeness/soundness gap and O(εnpolylogn)O(\sqrt{\varepsilon n}\operatorname*{polylog}n) communication cost. Then in Section 6.3, we switch to the low-error case, using the PCP Theorem of Moshkovitz and Raz to obtain an AM(2)\mathsf{AM}\left(2\right) protocol for 3Sat with a 11 vs. δ\delta completeness/soundness gap and n1/2+o(1)poly(1/δ)n^{1/2+o\left(1\right)}\operatorname*{poly}\left(1/\delta\right) communication cost. Finally, in Section 6.4, we give the implication NTIME[n]AMn1/2+o(1)(2)\mathsf{NTIME}\left[n\right]\subseteq\mathsf{AM}_{n^{1/2+o\left(1\right)}}\left(2\right) and show that this implication is nonrelativizing.

The first step is to state a variant of the PCP Theorem that is strong enough for our purposes.

Given a 3Sat instance φ\varphi of size nn, it is possible in poly(n)\operatorname*{poly}\left(n\right) time to produce a new 3Sat instance ϕ\phi, of size npolylognn\operatorname*{polylog}n, such that:

(Completeness) If SAT(φ)=1\operatorname*{SAT}\left(\varphi\right)=1 then SAT(ϕ)=1\operatorname*{SAT}\left(\phi\right)=1.

(Soundness) If SAT(φ)<1\operatorname*{SAT}\left(\varphi\right)<1 then SAT(ϕ)<1ϵ\operatorname*{SAT}\left(\phi\right)<1-\epsilon, for some constant 0<ϵ<1/80<\epsilon<1/8.

(Balance) Every clause of ϕ\phi involves exactly 33 variables, and every variable of ϕ\phi appears in exactly dd clauses, for some constant dd.It is known that we can assume this balance condition without loss of generality.

The reason why, for now, we use Dinur’s version of the PCP Theorem is that it produces instances of size npolylognn\operatorname*{polylog}n. Later, in Section 6.3, we will switch over to the PCP Theorem of Moshkovitz and Raz , which produces instances of the slightly larger size n2(logn)1Δ=n1+o(1)n\cdot 2^{\left(\log n\right)^{1-\Delta}}=n^{1+o\left(1\right)} (for some constant Δ>0\Delta>0) but achieves sub-constant error. Were we willing to accept a protocol with n2(logn)1Δ\sqrt{n}2^{\left(\log n\right)^{1-\Delta}} communication, we could have used the Moshkovitz-Raz version from the start, but we will try to keep the communication cost down to npolylogn\sqrt{n}\operatorname*{polylog}n for as long as we can.

Let the 3Sat instance ϕ\phi produced by Theorem 24 have NN variables x1,,xNx_{1},\ldots,x_{N} and MM clauses C1,,CMC_{1},\ldots,C_{M}. Also, let GϕG_{\phi} be the clause/variable game for ϕ\phi, as defined in Section 3.1. Then combining Theorem 24 with Proposition 11 yields the following corollary.

If ϕ\phi is unsatisfiable, then ω(Gϕ)<1ϵ/3\omega\left(G_{\phi}\right)<1-\epsilon/3.

Let U\mathcal{U} be the uniform distribution over all input pairs

Let pp be the success probability of that cheating strategy: that is,

Let D\mathcal{D} be the probability distribution over (I,J)\left(I,J\right) pairs induced by the cheating strategy above, if we average over all valid inputs (i,j)\left(i,j\right) to the original clause/variable game. Then let qq be the Merlins’ success probability in the birthday game, if they use their same cheating strategy (a,b)\left(a,b\right), but for (I,J)\left(I,J\right) pairs drawn from D\mathcal{D}, rather than from the uniform distribution U\mathcal{U}:

for any [0,1]\left[0,1\right] random variable ZZ. We upper-bound DU\left\|\mathcal{D}-\mathcal{U}\right\| in the following lemma.

Proof. Let A=(aij){0,1}M×NA=\left(a_{ij}\right)\in\left\{0,1\right\}^{M\times N} be the incidence matrix of the 3Sat instance ϕ\phi. That is, set aij:=1a_{ij}:=1 if the clause CiC_{i} involves the variable xjx_{j}, and aij:=0a_{ij}:=0 otherwise. Note that, by the balance condition, we must have ijaij=3M=dN\sum_{ij}a_{ij}=3M=dN (where dd is the number of clauses that each variable appears in), and

Given any I[M]I\subseteq\left[M\right] and J[N]J\subseteq\left[N\right], define

In the clause/variable game GϕG_{\phi}, Arthur chooses his input (i,j)[M]×[N]\left(i,j\right)\in\left[M\right]\times\left[N\right] uniformly at random subject to aij=1a_{ij}=1. Now consider an (I,J)\left(I,J\right) drawn from D\mathcal{D}. By symmetry, (I,J)\left(I,J\right) is equally likely to have been formed starting from any input (i,j)I×J\left(i,j\right)\in I\times J such that aij=1a_{ij}=1. This means that PrD[(I,J)]\Pr_{\mathcal{D}}\left[\left(I,J\right)\right] must simply be proportional to SIJS_{IJ}, and normalization gives us the rest:

where line (25) used Cauchy-Schwarz. Now,

Here it is convenient to divide the sum into four cases: the case i=ii=i^{\prime} and j=jj=j^{\prime}, the case i=ii=i^{\prime} but jjj\neq j^{\prime}, the case j=jj=j^{\prime} but iii\neq i^{\prime}, and the case iii\neq i^{\prime} and j=jj=j^{\prime}. These cases give us respectively:

where we treated dd as a constant. Therefore

This completes the proof of Theorem 26—showing that if ϕ\phi is unsatisfiable, then

There exists an AM(2)\mathsf{AM}\left(2\right) protocol for 3Sat that uses O~(n)\widetilde{O}(\sqrt{n}) communication, and that has a 11 vs. 1ϵ1-\epsilon completeness/soundness gap for some constant ϵ>0\epsilon>0.

Proof. Given a 3Sat instance φ\varphi of size nn, we apply Theorem 24 to get a PCP ϕ\phi of size N=npolylognN=n\operatorname*{polylog}n. We then consider the birthday game Gϕk×kG_{\phi}^{k\times k}, where k=cNk=c\sqrt{N} for some large constant cc. Clearly, if φ\varphi is satisfiable then ω(Gϕk×k)=1\omega(G_{\phi}^{k\times k})=1, while if φ\varphi is unsatisfiable then ω(Gϕk×k)1ϵ\omega(G_{\phi}^{k\times k})\leq 1-\epsilon for some constant ϵ>0\epsilon>0. The only further observation we need to make is that Arthur can apply his verification function VBDV_{BD} in time polynomial in nn.

Of course, one way to state our AM(2)\mathsf{AM}\left(2\right) protocol is as a reduction: starting with a 3Sat instance φ\varphi of size nn, we produce a free game GG of size 2O~(n)2^{\widetilde{O}\left(\sqrt{n}\right)} in 2O~(n)2^{\widetilde{O}\left(\sqrt{n}\right)} time, such that ω(G)=1\omega\left(G\right)=1 if φ\varphi is satisfiable and ω(G)1ϵ\omega\left(G\right)\leq 1-\epsilon if φ\varphi is unsatisfiable. This immediately implies that, assuming the Exponential Time Hypothesis, there must be some constant ϵ>0\epsilon>0 such that the FreeGameε problem requires nΩ~(logn)n^{\widetilde{\Omega}\left(\log n\right)} time for all εϵ\varepsilon\leq\epsilon.

2 The High-Error Case

Our goal, in this subsection, is to show that if ε=o(1)\varepsilon=o\left(1\right), then deciding whether ω(G)=1\omega\left(G\right)=1 or ω(G)1ε\omega\left(G\right)\leq 1-\varepsilon for a given free game GG requires nΩ~(ε1logn)n^{\widetilde{\Omega}(\varepsilon^{-1}\log n)} time assuming the ETH. (Later, Theorem 40 will give an algorithm that nearly achieves this lower bound.) To prove the ε\varepsilon-dependent hardness result, we first need a simple combinatorial lemma, which can be seen as a generalization of the Birthday Paradox to regular bipartite graphs.

Proof. Given a left-vertex v[M]v\in\left[M\right], let N(v)[N]\mathcal{N}\left(v\right)\subseteq\left[N\right] be the set of right-neighbors of vv; thus N(v)=c\left|\mathcal{N}\left(v\right)\right|=c for all vv. Then by regularity, for any fixed w[N]w\in\left[N\right] we have

Now let AA be the set of left-vertices in HH (thus A=k\left|A\right|=k), and let EE denote the event that there exist two vertices v,vAv,v^{\prime}\in A with a common neighbor. Then by the union bound,

Furthermore, if EE fails, then the left-vertices in HH have ckck distinct neighbors. So by the Bonferroni inequality, which states (as a special case) that

Proof. Reusing notation from Section 6.1 (and in particular, from the proof of Lemma 27), we have

Lemma 30 has the following corollary, which gives a counterpart of Theorem 6 for AM[2]\mathsf{AM}\left[2\right] protocols with less than n\sqrt{n} communication.

For all ε>0\varepsilon>0, there exists an AM(2)\mathsf{AM}\left(2\right) protocol for 3Sat instances of size nn which uses O(εnpolylogn)O(\sqrt{\varepsilon n}\operatorname*{polylog}n) bits of communication, and which has a 11 vs. 1ε1-\varepsilon completeness/soundness gap.

We also get the desired hardness result for FreeGame.

Assuming the ETH, there exists a constant Δ>0\Delta>0 such that FreeGameε requires nΩ~(ε1logn)n^{\widetilde{\Omega}(\varepsilon^{-1}\log n)} deterministic time, for all ε[1/n,Δ]\varepsilon\in\left[1/n,\Delta\right]. (Likewise, FreeGameε requires nΩ~(ε1logn)n^{\widetilde{\Omega}(\varepsilon^{-1}\log n)} randomized time assuming the Randomized ETH.)

Proof. Set Δ:=ϵ/16\Delta:=\epsilon/16, where ϵ\epsilon is the constant from Lemma 30. Fix a function ε=ε(M)[1/M,Δ]\varepsilon=\varepsilon\left(M\right)\in\left[1/M,\Delta\right], and suppose that FreeGameε instances of size MM were solvable in time

We need to show how, using that, we could decide a 3Sat instance φ\varphi of size nn in time 2o(n)2^{o\left(n\right)}, thereby violating the ETH. The first step is to convert φ\varphi into a PCP ϕ\phi of size N=npolylognN=n\operatorname*{polylog}n. Next, we generate the birthday repetition Gϕk×kG_{\phi}^{k\times k}, where k=εNk=\sqrt{\varepsilon N}. (Here we use the assumption ε1/n\varepsilon\geq 1/n to ensure that k1k\geq 1, and we use the assumption εϵ/16\varepsilon\leq\epsilon/16 to ensure that kϵN/4k\leq\sqrt{\epsilon N}/4.) Note that the sizes of Gϕk×kG_{\phi}^{k\times k}’s question and answer sets are M=2klogN=NkM=2^{k\log N}=N^{k}.

If ϕ\phi is satisfiable then ω(Gϕk×k)=1\omega(G_{\phi}^{k\times k})=1, while by Lemma 30, if ϕ\phi is unsatisfiable then

So by approximating ω(Gϕk×k)\omega(G_{\phi}^{k\times k}) to within ±ε\pm\varepsilon, we can distinguish these cases and thereby decide whether φ\varphi was satisfiable. Using our hypothesized algorithm for FreeGameε, this takes time

for some constant RR. Note that if kk is large, then logk\log k is Ω(logN)\Omega\left(\log N\right), while if kk is small, then log(N/k2)\log\left(N/k^{2}\right) is Ω(logN)\Omega\left(\log N\right). Therefore, provided RR is large enough, the denominator will contain enough factors of logN\log N to clear all the logN\log N factors in the numerator, and our algorithm will have running time exp(o(n))\exp\left(o\left(n\right)\right), giving the desired violation of the ETH. This reduction produces a deterministic algorithm if the FreeGame algorithm was deterministic, or randomized if the FreeGame algorithm was randomized.

We conjecture that the bound of Theorem 32 could be improved to nΩ~(ε2logn)n^{\widetilde{\Omega}(\varepsilon^{-2}\log n)}, by considering free games GG with ω(G)1/2\omega\left(G\right)\approx 1/2 rather than ω(G)1\omega\left(G\right)\approx 1. This is a problem that we leave to future work.

3 The Low-Error Case

There is one obvious question that we haven’t yet addressed: can we give an AM(2)\mathsf{AM}\left(2\right) protocol for 3Sat with near-perfect soundness? Or equivalently, given a free game GG and some tiny δ>0\delta>0, can we show that (assuming the ETH) there is no polynomial-time algorithm even to decide whether ω(G)=1\omega\left(G\right)=1 or ω(G)<δ\omega\left(G\right)<\delta? In this section we show that, using high-powered PCP machinery, we can indeed do this, although the result we get is probably not optimal.

Note that Rao proved that, for the special case of projection games, one can dramatically improve the Parallel Repetition Theorem, to show that ω(Gt)ω(G)Ω(t)\omega\left(G^{t}\right)\leq\omega\left(G\right)^{\Omega\left(t\right)} with no dependence on logAB\log\left|A\right|\left|B\right|. Here a projection game is a two-prover game G=(X,Y,A,B,D,V)G=\left(X,Y,A,B,\mathcal{D},V\right) (not necessarily free) with V{0,1}V\in\left\{0,1\right\} such that, for every (x,y)X×Y\left(x,y\right)\in X\times Y in the support of D\mathcal{D} and every aAa\in A, there is a unique bBb\in B such that V(x,y,a,b)=1V\left(x,y,a,b\right)=1. Unfortunately, while the clause/variable game itself is a projection game, its birthday repetition is not.

Recently, Shaltiel proved a “derandomized” version of the Parallel Repetition Theorem for the special case of free games. In particular, given a free game G=(X,Y,A,B,V)G=\left(X,Y,A,B,V\right) with V{0,1}V\in\left\{0,1\right\}, Shaltiel constructs a new free game Gt=(Xt,Yt,At,Bt,Vt)G_{t}=\left(X_{t},Y_{t},A_{t},B_{t},V_{t}\right), which satisfies ω(Gt)ω(G)t\omega\left(G_{t}\right)\leq\omega\left(G\right)^{t}, as well as ω(Gt)=1\omega\left(G_{t}\right)=1 whenever ω(G)=1\omega\left(G\right)=1. Furthermore, the question sets XtX_{t} and YtY_{t} in Shaltiel’s game have size at most (XYAB)O(t)\left(\left|X\right|\left|Y\right|\left|A\right|\left|B\right|\right)^{O\left(t\right)}, which is perfect for our application. Unfortunately, the answer sets AtA_{t} and BtB_{t} have size exp((tlogXYAB)C)\exp(\left(t\log\left|X\right|\left|Y\right|\left|A\right|\left|B\right|\right)^{C}) for some large constant CC, and this once again prevents the desired contradiction with the ETH.

Finally, if we try to apply parallel repetition to the clause/variable game before applying birthday repetition, then the situation is even worse. For even one or two rounds of parallel repetition will blow up the question sets XX and YY so that they no longer have size n1+o(1)n^{1+o\left(1\right)}, meaning that we no longer have any hope of finding a collision using n1/2+o(1)n^{1/2+o\left(1\right)} rounds of birthday repetition.

Currently, then, the best approach we know to the low-error case is simply to choose a PCP that already has low error, and then ensure that birthday repetition does not increase its error much further. In particular, rather than Theorem 24, we can start with the following result of Moshkovitz and Raz :

Given a 3Sat instance φ\varphi of size nn as well as δ>0\delta>0, it is possible in poly(n)\operatorname*{poly}\left(n\right) time to produce a 22-CSP instance ϕ\phi, with n1+o(1)poly(1/δ)n^{1+o\left(1\right)}\operatorname*{poly}\left(1/\delta\right) variables and constraints, and over an alphabet Σ\Sigma of size Σ2poly(1/δ)\left|\Sigma\right|\leq 2^{\operatorname*{poly}\left(1/\delta\right)}, such that:

(Completeness) If SAT(φ)=1\operatorname*{SAT}\left(\varphi\right)=1 then SAT(ϕ)=1\operatorname*{SAT}\left(\phi\right)=1.

(Soundness) If SAT(φ)<1\operatorname*{SAT}\left(\varphi\right)<1 then SAT(ϕ)<δ\operatorname*{SAT}\left(\phi\right)<\delta.

(Balance) The constraint graph of ϕ\phi is bipartite, and every variable appears in exactly dd constraints, for some d=poly(1/δ)d=\operatorname*{poly}\left(1/\delta\right).

Since Theorem 33 outputs a 22-CSP ϕ\phi, we do not even need to consider the clause/variable game. Rather, ϕ\phi directly gives rise to a two-prover game HϕH_{\phi}, in which Arthur chooses a constraint CC of ϕ\phi uniformly at random, sends one of CC’s variables to Merlin1 and the other to Merlin2, gets back their values, and accepts if and only if the values satisfy CC. Clearly, if SAT(φ)=1\operatorname*{SAT}\left(\varphi\right)=1 then ω(Hϕ)=1\omega(H_{\phi})=1, while if SAT(φ)<1\operatorname*{SAT}\left(\varphi\right)<1 then ω(Hϕ)<δ\omega(H_{\phi})<\delta.

So in particular, suppose we set k:=N/δk:=\sqrt{N}/\delta. Then if SAT(φ)<1\operatorname*{SAT}\left(\varphi\right)<1, we find that

Of course, if SAT(φ)=1\operatorname*{SAT}\left(\varphi\right)=1 then ω(Hϕk×k)=1\omega(H_{\phi}^{k\times k})=1. This gives us the following corollary:

For all δ>0\delta>0, there exists an AM(2)\mathsf{AM}\left(2\right) protocol for 3Sat instances of size nn which uses n1/2+o(1)poly(1/δ)n^{1/2+o\left(1\right)}\operatorname*{poly}\left(1/\delta\right) bits of communication, and which has a 11 vs. δ\delta completeness/soundness gap.

We also get the following hardness result for distinguishing ω(G)=1\omega\left(G\right)=1 from ω(G)<δ\omega\left(G\right)<\delta:

Assuming the ETH, any deterministic algorithm to decide whether ω(G)=1\omega\left(G\right)=1 or ω(G)<δ\omega\left(G\right)<\delta, given as input a description of a free game GG of size nn, requires npoly(δ)(logn)1o(1)n^{\operatorname*{poly}\left(\delta\right)\cdot\left(\log n\right)^{1-o\left(1\right)}} time. (Likewise, any randomized algorithm requires npoly(δ)(logn)1o(1)n^{\operatorname*{poly}\left(\delta\right)\cdot\left(\log n\right)^{1-o\left(1\right)}} time assuming the Randomized ETH.)

Proof. Given a free game GG of size MM, suppose we could decide whether ω(G)=1\omega\left(G\right)=1 or ω(G)<δ\omega\left(G\right)<\delta in time Mp(δ)(logM)1ηM^{p\left(\delta\right)\cdot\left(\log M\right)^{1-\eta}}, for some constant η>0\eta>0 and sufficiently large polynomial pp. We need to show how, using that, we could decide a 3Sat instance φ\varphi of size nn in time 2o(n)2^{o\left(n\right)}, thereby violating the ETH. The first step is to convert φ\varphi into a 22-CSP ϕ\phi with N=n1+o(1)poly(1/δ)N=n^{1+o\left(1\right)}\operatorname*{poly}\left(1/\delta\right) variables, using Theorem 33. Observe that the game Hϕ=(X,Y,A,B,V)H_{\phi}=\left(X,Y,A,B,V\right) satisfies X=Y=N\left|X\right|=\left|Y\right|=N and A=B=Σ=2poly(1/δ)\left|A\right|=\left|B\right|=\left|\Sigma\right|=2^{\operatorname*{poly}\left(1/\delta\right)}.

Next, we generate the birthday repetition Hϕk×kH_{\phi}^{k\times k}, where k:=cN/δk:=c\sqrt{N}/\delta for some suitable constant cc. Note that Hϕk×kH_{\phi}^{k\times k} has question sets of size

Thus, we set M:=exp(n1/2+o(1)/poly(δ))M:=\exp\left(n^{1/2+o\left(1\right)}/\operatorname*{poly}\left(\delta\right)\right).

If ϕ\phi is satisfiable then ω(Hϕk×k)=1\omega(H_{\phi}^{k\times k})=1, while if ϕ\phi is unsatisfiable then ω(Hϕk×k)δ\omega(H_{\phi}^{k\times k})\leq\delta by equation (65), provided the constant cc was large enough. So by distinguishing these cases, we can decide whether φ\varphi was satisfiable. Using our hypothesized algorithm, this takes time

provided the polynomial pp was large enough. This gives us our desired violation of the ETH.

We conjecture that, assuming the ETH, distinguishing ω(G)=1\omega\left(G\right)=1 from ω(G)<δ\omega\left(G\right)<\delta for a free game GG should require nΩ(lognlog1/δ)n^{\Omega\left(\frac{\log n}{\log 1/\delta}\right)} time for all δ1/2\delta\leq 1/2, matching an upper bound that we will give in Theorem 40. A first step toward proving this conjecture would be to improve Theorem 33 (the result of Moshkovitz and Raz), so that it gave alphabet size Σpoly(1/δ)\left|\Sigma\right|\leq\operatorname*{poly}\left(1/\delta\right) rather than Σ2poly(1/δ)\left|\Sigma\right|\leq 2^{\operatorname*{poly}\left(1/\delta\right)}. This is a well-known open problem. However, even if that problem were solved, one would also need a more refined analysis of birthday repetition, to eliminate the dependence of kk on 1/δ1/\delta in the proof of Theorem 36.

4 Complexity Consequences

Setting δ:=1/3\delta:=1/3, Theorem 34 finally puts us in a position to say that

where AMn1/2+o(1)(2)\mathsf{AM}_{n^{1/2+o\left(1\right)}}\left(2\right) is defined with a 2/32/3 vs. 1/31/3 completeness/soundness gap, as in Definition 21. If we further combine this with a tight Cook-Levin Theorem (see, e.g., Tourlakis ), showing that every language LNTIME[n]L\in\mathsf{NTIME}\left[n\right] can be efficiently reduced to a set of 3Sat instances of size npolylognn\operatorname*{polylog}n, then we get the following corollary:

NTIME[n]AMn1/2+o(1)(2).\mathsf{NTIME}\left[n\right]\subseteq\mathsf{AM}_{n^{1/2+o\left(1\right)}}\left(2\right).

Let us observe that Corollary 37 is non-relativizing.

There exists an oracle AA relative to which NTIMEA[n]⊄AMn1/2+o(1)A(2)\mathsf{NTIME}^{A}\left[n\right]\not\subset\mathsf{AM}_{n^{1/2+o\left(1\right)}}^{A}\left(2\right).

Proof Sketch. For each nn, the oracle AA encodes a Boolean function An:{0,1}n{0,1}A_{n}:\left\{0,1\right\}^{n}\rightarrow\left\{0,1\right\}, which is either identically or else 11 on exactly one input. Let LAL_{A} be the unary language defined by 0nLA0^{n}\in L_{A} if there exists an x{0,1}nx\in\left\{0,1\right\}^{n} such that An(x)=1A_{n}\left(x\right)=1, and 0nLA0^{n}\notin L_{A} otherwise. Then certainly LANTIMEA[n]L_{A}\in\mathsf{NTIME}^{A}\left[n\right] for all AA. On the other hand, using standard diagonalization techniques, it is not hard to construct AA in such a way that LAAMn1/2+o(1)A(2)L_{A}\notin\mathsf{AM}_{n^{1/2+o\left(1\right)}}^{A}\left(2\right)—or even LAAMn/4A(2)L_{A}\notin\mathsf{AM}_{n/4}^{A}\left(2\right). Intuitively, if the Merlins send only n/4n/4 bits each to Arthur (so n/2n/2 bits in total), then regardless of how those bits depend on their random challenges, with high probability Arthur will still need to query AA on at least 2n/22^{n/2} inputs to confirm that 0nLA0^{n}\in L_{A}. We omit the details, which are similar to those in the paper of Fortnow and Sipser .

We leave as an open problem whether Corollary 37 is algebrizing in the sense of Aaronson and Wigderson .

Limitations of Multi-Prover 𝖠𝖬𝖠𝖬\mathsf{AM}

Our goal in this section is to prove that our 3Sat protocol is essentially optimal assuming the ETH, that AM(k)EXP\mathsf{AM}\left(k\right)\subseteq\mathsf{EXP} for all k=poly(n)k=\operatorname*{poly}\left(n\right), and that AM(k)=AM\mathsf{AM}\left(k\right)=\mathsf{AM} for all k=O(logn)k=O\left(\log n\right).

The section is organized as follows. First, in Section 7.1, we give a quasipolynomial-time algorithm for estimating the value of a 22-player free game. This algorithm implies that AM(2)EXP\mathsf{AM}\left(2\right)\subseteq\mathsf{EXP}, and even (with some more work) that AM(2)AMNP\mathsf{AM}\left(2\right)\subseteq\mathsf{AM}^{\mathsf{NP}}. The algorithm also implies that, if there exists an AM(2)\mathsf{AM}\left(2\right) protocol for 3Sat using o(n)o(\sqrt{n}) communication, then 3Sat is solvable in 2o(n)2^{o\left(n\right)} time. In Section 7.2, we go further, using a result of Barak et al. about subsampling dense CSPs to show that the value of any free game can be approximated by the value of a logarithmic-sized random subgame, and as a consequence, that AM(2)=AM\mathsf{AM}\left(2\right)=\mathsf{AM}. Finally, in Section 7.3, we generalize these results from AM(2)\mathsf{AM}\left(2\right) to AM(k)\mathsf{AM}\left(k\right) for all k=poly(n)k=\operatorname*{poly}\left(n\right).

We now explain how to approximate the value of a free game in quasipolynomial time.

FreeGameε is solvable in time nO(ε2logn)n^{O(\varepsilon^{-2}\log n)}. In more detail, given as input a description of a free game G=(X,Y,A,B,V)G=\left(X,Y,A,B,V\right), there exists a randomized algorithm running in time XAO(ε2logYB)\left|X\right|\cdot\left|A\right|^{O(\varepsilon^{-2}\log\left|Y\right|\left|B\right|)}, which estimates ω(G)\omega\left(G\right) to within additive error ±ε\pm\varepsilon, with at least 2/32/3 success probability. There also exists a deterministic algorithm running in time (XA)O(ε2logYB)\left(\left|X\right|\left|A\right|\right)^{O(\varepsilon^{-2}\log\left|Y\right|\left|B\right|)}.

Proof. The randomized estimation algorithm, call it REst, works as follows. First REst chooses a subset of questions SXS\subseteq X uniformly at random, subject to S=κ\left|S\right|=\kappa where

Next REst loops over all Aκ\left|A\right|^{\kappa} possible settings α:SA\alpha:S\rightarrow A of the answers to the κ\kappa questions in SS. For each such α\alpha, REst does the following:

It computes Merlin2’s “optimal response” bα:YBb_{\alpha}:Y\rightarrow B to α\alpha, supposing that Merlin1 was only asked questions in SS. For each question yYy\in Y, in other words, Est finds a response bα(y)Bb_{\alpha}\left(y\right)\in B that maximizes

It computes Merlin1’s “optimal response” aα:XAa_{\alpha}:X\rightarrow A to bαb_{\alpha}. For each xXx\in X, in other words, REst finds an aα(x)Aa_{\alpha}\left(x\right)\in A that maximizes

It computes the “value” obtained from the setting α\alpha, as follows:

and outputs W+εW+\varepsilon as its estimate for ω(G)\omega\left(G\right).

To prove correctness, we need to argue that, with high probability over the choice of SS, we have

First observe that Wαω(G)W_{\alpha}\leq\omega\left(G\right) for every α\alpha. Therefore Wω(G)W\leq\omega\left(G\right) as well, and W+εω(G)+εW+\varepsilon\leq\omega\left(G\right)+\varepsilon. So it suffices to prove the other direction: that Wω(G)2εW\geq\omega\left(G\right)-2\varepsilon with at least 2/32/3 probability over SS.

Let a:XAa^{\ast}:X\rightarrow A be an optimal strategy for Merlin1 in the game GG: that is, a strategy that, when combined with an optimal response b:YBb^{\ast}:Y\rightarrow B by Merlin2, achieves the value ω(G)\omega\left(G\right). Also, fix a particular question yYy\in Y and answer bBb\in B. Then since the function VV is [0,1]\left[0,1\right]-valued, Hoeffding’s inequality (which also holds in the case of sampling without replacement) gives us

So by the union bound, if we choose SS randomly, then we have

for every yYy\in Y and bBb\in B simultaneously, with probability at least

over SS. So suppose the inequality (78) holds. Let α:SA\alpha^{\ast}:S\rightarrow A be the restriction of the optimal strategy aa^{\ast} to the set SS, and let bα:YBb_{\alpha^{\ast}}:Y\rightarrow B be an optimal response to α\alpha^{\ast}. Then

where lines (83) and (85) used inequality (78).

Finally, to get a deterministic estimation algorithm—call it Est—we simply need to loop over all possible SXS\subseteq X with S=κ\left|S\right|=\kappa, rather than choosing SS randomly. We then output the maximum of W+εW+\varepsilon over all SS as our estimate for ω(G)\omega\left(G\right). This yields a running time of

Let us point out some simple modifications to the algorithm Est from Theorem 39, which can improve its running time of nO(ε2logn)n^{O(\varepsilon^{-2}\log n)} in certain cases.

Given a free game GG of size nn, there is a deterministic algorithm running in nO(ε1logn)n^{O(\varepsilon^{-1}\log n)} time to decide whether ω(G)=1\omega\left(G\right)=1 or ω(G)1ε\omega\left(G\right)\leq 1-\varepsilon (promised that one of those is the case), and there is a deterministic algorithm running in nO(1+lognlog1/δ)n^{O\left(1+\frac{\log n}{\log 1/\delta}\right)} time to decide whether ω(G)=1\omega\left(G\right)=1 or ω(G)<δ\omega\left(G\right)<\delta.

Proof. In both cases, the key observation is that when running Est, we no longer need to estimate the quantity ExX[V(x,y,a(x),b)]\operatorname*{E}_{x\in X}\left[V\left(x,y,a^{\ast}\left(x\right),b\right)\right] to within ±ε\pm\varepsilon, and to pay the 1/ε21/\varepsilon^{2} price that comes from Hoeffding’s inequality for doing so. Instead, for each yYy\in Y and bBb\in B, we simply need to know whether ExX[V(x,y,a(x),b)]\operatorname*{E}_{x\in X}\left[V\left(x,y,a^{\ast}\left(x\right),b\right)\right] is 11 or less than 11. Or equivalently, does there exist a “bad” xXx\in X—one such that V(x,y,a(x),b)<1V\left(x,y,a^{\ast}\left(x\right),b\right)<1? Moreover, we are promised that, if such a bad xx does exist, then at least an ε\varepsilon or a 1δ1-\delta fraction (respectively) of all xx’s are bad. Thus, when choosing the subset of questions SXS\subseteq X, it suffices to ensure that, with nonzero probability over SS, we succeed in sampling one of the bad xx’s for every yYy\in Y and bBb\in B. By the union bound, this means that it suffices if, respectively,

where κ=S\kappa=\left|S\right|. Solving, we get respectively κ=O(ε1logYB)\kappa=O(\varepsilon^{-1}\log\left|Y\right|\left|B\right|) or κ=O(1+logYBlog1/δ)\kappa=O\left(1+\frac{\log\left|Y\right|\left|B\right|}{\log 1/\delta}\right). Now we just need to plug the lower values of κ\kappa into equation (75) from the proof of Theorem 39 to get the improved running times.

The proof of Theorem 39 has a curious property. Namely, we showed that the value ω(G)\omega\left(G\right) can in some sense be well-approximated by restricting attention to a random subset of questions SXS\subseteq X of logarithmic size. However, if GSG_{S} is the subgame obtained from GG by restricting XX to SS, then the proof did not imply that ω(GS)ω(G)\omega(G_{S})\approx\omega\left(G\right)! Using Hoeffding’s inequality, one can easily show that ω(GS)ω(G)ε\omega(G_{S})\geq\omega\left(G\right)-\varepsilon with high probability over the choice of SS. The difficulty comes from the other direction—ironically, the “trivial” direction in the proof of Theorem 39. To get that Wαω(G)W_{\alpha}\leq\omega\left(G\right), we implicitly used the fact that WαW_{\alpha} was the value of a strategy pair (aα,bα)\left(a_{\alpha},b_{\alpha}\right) for the whole game GG, not merely for the subgame GSG_{S}. Therefore, nothing we said implies that ω(GS)ω(G)\omega(G_{S})\leq\omega\left(G\right), or even ω(GS)ω(G)+ε\omega(G_{S})\leq\omega\left(G\right)+\varepsilon. And this makes intuitive sense: if the Merlins know that Merlin1’s question xx will be restricted to a set of logarithmic size, then how do we know they can’t exploit that knowledge to win with greater probability? As we’ll discuss in Section 7.2, it turns out that one can prove the stronger result that ω(GS)ω(G)+ε\omega(G_{S})\leq\omega\left(G\right)+\varepsilon with high probability over SS—and this, in turn, lets one prove that AM(2)=AM\mathsf{AM}\left(2\right)=\mathsf{AM}. But more work (in particular, that of Alon et al. and Barak et al. ) is needed.

Before we discuss that, let us point out some simple corollaries of Theorems 39 and 40.

AM(2)EXP\mathsf{AM}\left(2\right)\subseteq\mathsf{EXP}. (In more detail, we can simulate any AM(2)\mathsf{AM}\left(2\right) protocol that uses p(n)p\left(n\right) communication and r(n)=poly(n)r\left(n\right)=\operatorname*{poly}\left(n\right) auxiliary randomness in 2O(p(n)2)+r(n)poly(n)2^{O(p\left(n\right)^{2})+r(n)}\operatorname*{poly}\left(n\right) deterministic time, or 2O(p(n)2)poly(n)2^{O(p\left(n\right)^{2})}\operatorname*{poly}\left(n\right) randomized time.)

Proof. Let LAM(2)L\in\mathsf{AM}\left(2\right). Then given an input x{0,1}nx\in\left\{0,1\right\}^{n}, the AM(2)\mathsf{AM}\left(2\right) protocol for checking whether xLx\in L can be represented as a free game G=(X,Y,A,B,V)G=\left(X,Y,A,B,V\right), where X=Y=A=B={0,1}p(n)X=Y=A=B=\left\{0,1\right\}^{p\left(n\right)}, and where Arthur’s verification function VV is computable in randomized poly(n)\operatorname*{poly}\left(n\right) time using r(n)r\left(n\right) random bits. Now by Theorem 39, we can estimate ω(G)\omega\left(G\right) to additive error (say) ±1/10\pm 1/10, using (XA)O(logYB)=2O(p(n)2)\left(\left|X\right|\left|A\right|\right)^{O\left(\log\left|Y\right|\left|B\right|\right)}=2^{O(p\left(n\right)^{2})} deterministically-chosen evaluations of VV. Furthermore, each of these VV evaluations can be performed in poly(n)\operatorname*{poly}\left(n\right) steps by a randomized algorithm (including the time needed for amplification to exponentially-small error probability), or in 2r(n)poly(n)2^{r(n)}\operatorname*{poly}\left(n\right) steps by a deterministic algorithm. Finally, estimating ω(G)\omega\left(G\right) lets us decide whether ω(G)2/3\omega\left(G\right)\geq 2/3 or ω(G)1/3\omega\left(G\right)\leq 1/3, and hence whether xLx\in L.

Corollary 41 (and Theorem 40) have the following further consequence:

If 3Sat AMp(n)(2)\in\mathsf{AM}_{p(n)}\left(2\right), then 3Sat TIME[2O(p(n)2)poly(n)]\in\mathsf{TIME}[2^{O(p\left(n\right)^{2})}\operatorname*{poly}\left(n\right)]. So in particular, assuming the Randomized ETH, any AM(2)\mathsf{AM}\left(2\right) protocol for 3Sat with a 11 vs. 1ε1-\varepsilon completeness/soundness gap must use Ω(εn)\Omega(\sqrt{\varepsilon n}) communication. Likewise, assuming the Randomized ETH, any protocol with a 11 vs. δ\delta gap must use Ω(nlog1/δ)\Omega(\sqrt{n\log 1/\delta}) communication provided δ2n\delta\geq 2^{-n}. (If, moreover, Arthur’s verification procedure is deterministic, then it suffices to assume the ordinary ETH.)

Also, a closer examination of the proof of Theorem 39 yields a better upper bound on AM(2)\mathsf{AM}\left(2\right) than EXP\mathsf{EXP}.

AM(2)AMNP\mathsf{AM}\left(2\right)\subseteq\mathsf{AM}^{\mathsf{NP}}.

Proof Sketch. We only sketch the proof, since in any case this result will be superseded by the later result that AM(2)=AM\mathsf{AM}\left(2\right)=\mathsf{AM}.

In the algorithm REst from Theorem 39, the first step has the form of an AM\mathsf{AM} protocol. That is, following Corollary 41, let G=(X,Y,A,B,V)G=\left(X,Y,A,B,V\right) be the free game associated to the AM(2)\mathsf{AM}\left(2\right) protocol we want to simulate, with X=Y=A=B={0,1}poly(n)X=Y=A=B=\left\{0,1\right\}^{\operatorname*{poly}\left(n\right)}. Then in our AMNP\mathsf{AM}^{\mathsf{NP}} simulation, we can have Arthur first choose a subset SXS\subseteq X of size κ=poly(n)\kappa=\operatorname*{poly}\left(n\right) uniformly at random and send SS to Merlin. Next, using κlogA=poly(n)\kappa\log\left|A\right|=\operatorname*{poly}\left(n\right) bits, Merlin can send back a complete description of a response function α:SA\alpha:S\rightarrow A that is claimed to achieve (say) Wα2/3W_{\alpha}\geq 2/3. The question is how to implement the rest of the algorithm—or equivalently, how Arthur can verify that WαW_{\alpha} is large using an NP\mathsf{NP} oracle.

Here the key idea is to use the property-testing paradigm of Goldreich, Goldwasser, and Ron . As it stands, the inner loop of REst requires first computing Merlin2’s optimal response bα:YBb_{\alpha}:Y\rightarrow B to α\alpha, then computing Merlin1’s optimal response aα:XAa_{\alpha}:X\rightarrow A to bαb_{\alpha}, and finally computing the value WαW_{\alpha} achieved by the pair (aα,bα)\left(a_{\alpha},b_{\alpha}\right). In our case, all three of these steps would operate on 2p(n)2^{p\left(n\right)}-sized objects and require 2O(p(n))2^{O\left(p\left(n\right)\right)} time.

Next, Arthur chooses another subset UXU\subseteq X of size m=poly(n)m=\operatorname*{poly}\left(n\right) uniformly at random, and again uses his NP\mathsf{NP} oracle to find a response function γ:UA\gamma:U\rightarrow A that maximizes

Finally, Arthur accepts if and only if maxγWγ1/2\max_{\gamma}W_{\gamma}\geq 1/2.

If ω(G)2/3\omega\left(G\right)\geq 2/3, then certainly Merlin can cause Arthur to accept with high probability in this protocol—for example, by sending Arthur α=α\alpha=\alpha^{\ast}, the restriction of the globally optimal strategy a:XAa^{\ast}:X\rightarrow A to the subset SS. As we saw in the proof of Theorem 39, the Hoeffding inequality and union bound ensure that α\alpha^{\ast} “induces” responses β:TB\beta:T\rightarrow B by Merlin2 that are close to the best possible responses in the full game GG. Furthermore, even if TT is only an O(1ε2log(XA))O(\frac{1}{\varepsilon^{2}}\log\left(\left|X\right|\left|A\right|\right))-sized subset of the full set YY, a second application of the Hoeffding inequality and union bound ensure that β\beta, in turn, induces responses γ:UA\gamma:U\rightarrow A by Merlin1 that are close to the best possible responses. So with high probability over the choices of SS, TT, and UU, the optimal response functions β:TB\beta:T\rightarrow B and γ:UA\gamma:U\rightarrow A will achieve a value of WγW_{\gamma} close to ω(G)\omega\left(G\right).

As usual, the more interesting part is soundness: if ω(G)1/3\omega\left(G\right)\leq 1/3, then why can Merlin not cause Arthur to accept with high probability? The basic answer is that Merlin has to provide α\alpha without knowing TT or UU (which Arthur will only choose later), and without being able to control β\beta or γ\gamma (which are both just solutions to maximization problems, obtained using the NP\mathsf{NP} oracle). As a consequence, one can show that, if maxγWγ1/2\max_{\gamma}W_{\gamma}\geq 1/2 with high probability over SS, TT, and UU, then one can construct a global strategy pair a:XAa:X\rightarrow A, b:YBb:Y\rightarrow B that achieves value close to 1/21/2. We omit the details, which closely follow those in the correctness proofs for property-testing algorithms due to Goldreich, Goldwasser, and Ron .

2 Subsampling for Free Games and 𝖠𝖬​(2)=𝖠𝖬𝖠𝖬2𝖠𝖬\mathsf{AM}\left(2\right)=\mathsf{AM}

In this section, we wish to go further than AM(2)EXP\mathsf{AM}\left(2\right)\subseteq\mathsf{EXP} or AM(2)AMNP\mathsf{AM}\left(2\right)\subseteq\mathsf{AM}^{\mathsf{NP}}, and prove that actually AM(2)=AM\mathsf{AM}\left(2\right)=\mathsf{AM}. For this purpose, given a free game G=(X,Y,A,B,V)G=\left(X,Y,A,B,V\right), we need to show not merely that a near-optimal pair of strategies for GG can be “induced” by examining a small random subgame GSG_{S}, but that ω(GS)\omega\left(G_{S}\right) itself gives a good approximation to ω(G)\omega\left(G\right), with high probability over SS. As we explained in Section 7.1, it is easy to see that ES[ω(GS)]ω(G)\operatorname{E}_{S}\left[\omega\left(G_{S}\right)\right]\geq\omega\left(G\right), since we can start with optimal strategies for GG and then restrict them to GSG_{S}. The hard part is to prove the other direction, that ES[ω(GS)]ω(G)+ε\operatorname{E}_{S}\left[\omega\left(G_{S}\right)\right]\leq\omega\left(G\right)+\varepsilon.

Here it is convenient to appeal to a powerful recent result of Barak et al. , which shows that any dense CSP over a finite alphabet Σ\Sigma can be “subsampled,” generalizing an earlier subsampling result for the Boolean case by Alon et al. .

Let φ\varphi be a kk-CSP, involving nn variables X=(x1,,xn)X=\left(x_{1},\ldots,x_{n}\right) over the finite alphabet Σ\Sigma. Suppose that φ\varphi has “density” α\alpha, in the following sense: for every collection YXY\subset X of k1k-1 variables, φ\varphi contains a [0,1]\left[0,1\right]-valued constraint CC involving the variables Y{xi}Y\cup\left\{x_{i}\right\}, for at least an α\alpha fraction of the remaining variables xiXYx_{i}\in X\setminus Y. Let SAT(φ)[0,1]\operatorname*{SAT}\left(\varphi\right)\in\left[0,1\right] be the value of φ\varphi; that is, the maximum of ECφ[C(X)]\operatorname*{E}_{C\in\varphi}\left[C\left(X\right)\right] over all XΣnX\in\Sigma^{n}. Also, given a subset of variable indices I[n]I\subseteq\left[n\right], let φI\varphi_{I} be the restriction of φ\varphi to the variables in II (and to those constraints that only involve II variables). Then provided we choose II uniformly at random subject to IlogΣαεΛ\left|I\right|\geq\frac{\log\left|\Sigma\right|}{\alpha\varepsilon^{\Lambda}} for some suitable constant Λ\Lambda, we have

As a side note, if the alphabet size Σ\left|\Sigma\right| is constant, and if one does not care about the dependence of I\left|I\right| on ε\varepsilon, then a version of Theorem 44 follows almost immediately from the Szemerédi Regularity Lemma, in its many-colored variant (see for example [29, Theorem 1.18]). However, this is of limited relevance to us, since in our case Σ=poly(n)\left|\Sigma\right|=\operatorname*{poly}\left(n\right).

We now use Theorem 44 to deduce an analogous subsampling theorem for free games.

Given a free game G=(X,Y,A,B,V)G=\left(X,Y,A,B,V\right) and ε>0\varepsilon>0, let κ:=2εΛlog(A+B)\kappa:=2\varepsilon^{-\Lambda}\log\left(\left|A\right|+\left|B\right|\right) (for some suitable constant Λ\Lambda), and assume κX\kappa\leq\left|X\right|. Choose a subset SXS\subseteq X of Merlin1 questions uniformly at random subject to S=κ\left|S\right|=\kappa, and let GSG_{S} be the subgame of GG with Merlin1’s questions restricted to SS. Then

Proof. We define a 22-CSP φ\varphi as follows. Let X:=X×R1X^{\prime}:=X\times R_{1} and Y:=Y×R2Y^{\prime}:=Y\times R_{2}, where R1R_{1} and R2R_{2} are finite sets chosen to ensure that XR1=YR2\left|X\right|\left|R_{1}\right|=\left|Y\right|\left|R_{2}\right|. We think of XX^{\prime} and YY^{\prime} as “augmented” versions of XX and YY respectively, obtained by duplicating variables. Then φ\varphi will have a variable set consisting of a(x,r1)a\left(x,r_{1}\right) for all (x,r1)X\left(x,r_{1}\right)\in X^{\prime} and b(y,r2)b\left(y,r_{2}\right) for all (y,r2)Y\left(y,r_{2}\right)\in Y^{\prime}, and alphabet Σ:=AB\Sigma:=A\cup B. For each (x,r1)X\left(x,r_{1}\right)\in X^{\prime} and (y,r2)Y\left(y,r_{2}\right)\in Y^{\prime}, we will add a [0,1]\left[0,1\right]-valued constraint CC between a(x,r1)a\left(x,r_{1}\right) and b(y,r2)b\left(y,r_{2}\right), which behaves as follows:

If aAa\in A and bBb\in B, then C(a,b)=V(x,y,a,b)C\left(a,b\right)=V\left(x,y,a,b\right).

If aAa\notin A or bBb\notin B, then C(a,b)=0C\left(a,b\right)=0.

In this way, we ensure the following two properties:

SAT(φ)=ω(G)\operatorname*{SAT}\left(\varphi\right)=\omega\left(G\right). Indeed, from any strategy pair (a,b)\left(a,b\right) that achieves value ω\omega in GG, we can construct an assignment to φ\varphi with value at least ω\omega, and conversely. (To see the converse, note that by convexity, if some a(x,r1)a\left(x,r_{1}\right) takes on more than one value as we range over r1R1r_{1}\in R_{1}, then there must be a single value a(x,r1)=a(x)a\left(x,r_{1}\right)=a\left(x\right) that does at least as well as the mean, and likewise for b(y,r2)b\left(y,r_{2}\right).)

φ\varphi has density α=1/2\alpha=1/2 in the sense of Theorem 44. For it includes a constraint between every aa-variable and every bb-variable (although no constraints relating two aa-variables or two bb-variables), and the numbers of aa-variables and bb-variables are equal.

Now suppose we choose a subset II of the variables of φ\varphi uniformly at random, subject to I=κ\left|I\right|=\kappa where κ=2εΛlog(A+B)\kappa=2\varepsilon^{-\Lambda}\log\left(\left|A\right|+\left|B\right|\right). Then by Theorem 44, we have

To complete the proof, we need to show that

We will do so by appealing to the following general principle. Suppose we were to draw II by some random process in which we started with the set of all variables in φ\varphi, then repeatedly discarded variables until we were left with a uniformly-random subset II of size κ\kappa. Suppose, further, that at any point in this process, the distribution over constraints between remaining variables remained uniform over the set of all constraints CφC\in\varphi. Let JJ be a subset of variables obtained by stopping such a process at any intermediate point. Then we must have

The reason is simply that, if we had a collection of partial assignments to the φJ\varphi_{J}’s that achieved expected value ω\omega, then restricting those assignments to the φI\varphi_{I}’s would also achieve expected value ω\omega, by linearity of expectation.

So in particular, suppose we form JJ by choosing discarding all but κ\kappa variables of the form a(x,r1)a\left(x,r_{1}\right), (while keeping all variables of the form b(y,r2)b\left(y,r_{2}\right)). Then since the distribution over constraints remains uniform for this JJ, and since a sequence of further discardings could produce a uniformly-random subset II with I=κ\left|I\right|=\kappa, we have

But EJ[SAT(φJ)]\operatorname*{E}_{J}\left[\operatorname*{SAT}\left(\varphi_{J}\right)\right] is simply ES[ω(GS)]\operatorname*{E}_{S}\left[\omega\left(G_{S}\right)\right], where SXS\subseteq X is a uniformly-random subset of Merlin1 questions of size κ\kappa. This completes the proof.

Theorem 45 has the following easy corollary, which removes the “asymmetry” between the two Merlins.

Given a free game G=(X,Y,A,B,V)G=\left(X,Y,A,B,V\right) and ε>0\varepsilon>0, let κ:=2εΛlog(A+B)\kappa:=2\varepsilon^{-\Lambda}\log\left(\left|A\right|+\left|B\right|\right), and assume κmin{X,Y}\kappa\leq\min\left\{\left|X\right|,\left|Y\right|\right\}. Choose SXS\subseteq X and TYT\subseteq Y uniformly at random and independently, subject to S=T=κ\left|S\right|=\left|T\right|=\kappa. Also, let GS,TG_{S,T} be the subgame of GG with Merlin1’s questions restricted to SS and Merlin2’s restricted to TT. Then

Proof. We simply need to apply Theorem 45 twice in succession, once to reduce Merlin1’s question set, and then a second time to reduce Merlin2’s. The result follows by linearity of expectation.

Using Corollary 46, we now prove that AM(2)=AM\mathsf{AM}\left(2\right)=\mathsf{AM}.

Proof. Let LAM(2)L\in\mathsf{AM}\left(2\right). Then just like in Corollary 41, an AM(2)\mathsf{AM}\left(2\right) protocol for checking whether a string is in LL can be represented as a free game G=(X,Y,A,B,V)G=\left(X,Y,A,B,V\right), where X=Y=A=B={0,1}p(n)X=Y=A=B=\left\{0,1\right\}^{p\left(n\right)} for some polynomial pp, and VV is computable in randomized poly(n)\operatorname*{poly}\left(n\right) time.

Let ε:=1/24\varepsilon:=1/24 and κ:=2εΛ(p(n)+1)\kappa:=2\varepsilon^{-\Lambda}\left(p\left(n\right)+1\right), and suppose we choose SXS\subseteq X and TYT\subseteq Y uniformly at random subject to S=T=κ\left|S\right|=\left|T\right|=\kappa. Then by Corollary 46,

But this immediately gives us our AM\mathsf{AM} simulation, as follows. First Arthur chooses S,T{0,1}p(n)S,T\subseteq\left\{0,1\right\}^{p\left(n\right)} uniformly at random, subject to S=T=κ\left|S\right|=\left|T\right|=\kappa as above. He then sends SS and TT to Merlin, using 2κp(n)=O(p(n)2)2\kappa\cdot p\left(n\right)=O(p\left(n\right)^{2}) bits. Next Merlin replies with a pair of strategies a:SAa:S\rightarrow A and b:TBb:T\rightarrow B, which again takes O(p(n)2)O(p\left(n\right)^{2}) bits. Let

be the subsampled success probability; notice that ωS,Tω(GS,T)\omega_{S,T}\leq\omega\left(G_{S,T}\right) for all S,TS,T. Then finally, if VV is deterministic, then Arthur simply computes ωS,T\omega_{S,T} and accepts if and only if ωS,T1/2\omega_{S,T}\geq 1/2. If VV is randomized, then Arthur instead computes an estimate ω~S,T\widetilde{\omega}_{S,T} such that

and accepts if and only if ω~S,T0.51\widetilde{\omega}_{S,T}\geq 0.51.

We claim, first, that this protocol has completeness error at most exp(κ)\exp\left(-\kappa\right). For we can always consider an optimal pair of strategies a:XAa:X\rightarrow A and b:YBb:Y\rightarrow B for the full protocol, which achieve value

by assumption. Then a standard Chernoff bound implies that ωS,T0.52\omega_{S,T}\geq 0.52, and hence ω~S,T0.51\widetilde{\omega}_{S,T}\geq 0.51, with at least 1exp(κ)1-\exp\left(-\kappa\right) probability over the choice of SS and TT.

We next upper-bound the soundness error. Suppose ω(G)1/3\omega\left(G\right)\leq 1/3; then by equation (98),

as well. So Arthur rejects with constant probability. Of course, we can amplify the completeness/soundness gap further by repeating the protocol.

3 The k𝑘k-Merlin Case

In this section, we generalize our results from AM(2)\mathsf{AM}\left(2\right) to AM(k)\mathsf{AM}\left(k\right) for larger kk. The first step is to generalize Theorem 39, to obtain a nontrivial approximation algorithm for kk-player free games.

Let GG be a kk-player free game, with question sets Y1,,YkY_{1},\ldots,Y_{k} and answer sets B1,,BkB_{1},\ldots,B_{k} (assume Bi2\left|B_{i}\right|\geq 2 for all i[k]i\in\left[k\right]). There exists a deterministic algorithm that approximates ω(G)\omega\left(G\right) to within additive error ±ε\pm\varepsilon, in time

where n=Y1B1YkBkn=\left|Y_{1}\right|\left|B_{1}\right|\cdots\left|Y_{k}\right|\left|B_{k}\right| is the input size.

Proof. The basic idea is to use a recursive generalization, call it Estk, of the (deterministic) approximation algorithm Est from Theorem 39. The recursive version will “peel off the Merlins one at a time.” That is, given a description of a kk-player free game GG as input, Estk will reduce the estimation of ω(G)\omega\left(G\right) to the estimation of ω(G)\omega(G^{\prime}), for a quasipolynomial number of (k1)\left(k-1\right)-player free games GG^{\prime}, each one involving Merlin1 through Merlink-1 only (Merlink’s behavior having already been fixed). Each ω(G)\omega(G^{\prime}) will in turn be estimated by calling Estk-1, and so on until k=1k=1, at which point we can just do a straightforward maximization.

for some suitable constant CC. Then Estk loops over all (Ykκk)\binom{\left|Y_{k}\right|}{\kappa_{k}} subsets of questions SkYkS_{k}\subseteq Y_{k} such that Sk=κk\left|S_{k}\right|=\kappa_{k}, as well as all Bkκk\left|B_{k}\right|^{\kappa_{k}} possible settings αk:SkBk\alpha_{k}:S_{k}\rightarrow B_{k} of the answers to the κk\kappa_{k} questions in SkS_{k}. For each such pair P=(Sk,αk)P=\left(S_{k},\alpha_{k}\right), we define a (k1)\left(k-1\right)-player subgame GPG_{P}, which is played by Merlin1 through Merlink-1, and which has question sets Y1,,Yk1Y_{1},\ldots,Y_{k-1} and answer sets B1,,Bk1B_{1},\ldots,B_{k-1}. The verification function of GPG_{P} is defined as follows:

In other words, GPG_{P} is the same game as GG, except that we assume that Merlink is only asked questions ykSky_{k}\in S_{k}, and that he responds to each with αk(yk)\alpha_{k}\left(y_{k}\right).

Now, for each PP, the algorithm Estk does the following:

If k3k\geq 3, then it calls Estk-1 recursively, in order to find approximately optimal strategies (bP,i:YiBi)i[k1]\left(b_{P,i}:Y_{i}\rightarrow B_{i}\right)_{i\in\left[k-1\right]} for Merlin1 through Merlink-1 in GPG_{P}. Here “approximately optimal” means achieving value at least ω(GP)δ\omega(G_{P})-\delta. Of course, when k=2k=2, the algorithm can simply compute Merlin1’s exactly-optimal response bP,1:Y1B1b_{P,1}:Y_{1}\rightarrow B_{1} by maximizing

for each y1Y1y_{1}\in Y_{1} separately, just like in the two-player algorithm Est.

Given the responses bP,1,,bP,k1b_{P,1},\ldots,b_{P,k-1} of Merlin1 through Merlink-1, the algorithm computes Merlink’s best response bP,k:YkBkb_{P,k}:Y_{k}\rightarrow B_{k} on the full set YkY_{k} by maximizing

for each ykYky_{k}\in Y_{k} separately. It then lets

be the value of the kk-tuple of strategies induced by PP.

Finally, Estk outputs W:=maxPWPW:=\max_{P}W_{P} as its estimate for ω(G)\omega\left(G\right). Note that, in addition to WW, the algorithm also outputs a strategy kk-tuple (bP,1,,bP,k)\left(b_{P,1},\ldots,b_{P,k}\right) that achieves value WW.

terms all get absorbed by asymptotically larger terms. Asymptotically, then,

Since the running time is dominated by evaluations of VV (each of which takes constant time), this also gives the asymptotic running time.

The proof of correctness for Estk follows the same general outline as the proof of the correctness for Est. Once again, since each WPW_{P} is the value achieved by some actual kk-tuple of strategies bP,1,,bP,kb_{P,1},\ldots,b_{P,k} in the full game GG, it is clear that WPω(G)W_{P}\leq\omega\left(G\right) for all PP. The nontrivial part is to show that WPω(G)εW_{P}\geq\omega\left(G\right)-\varepsilon for some P=(Sk,αk)P=\left(S_{k},\alpha_{k}\right).

Combining facts (i) and (ii), we find that, if Estℓ-1 can achieve value at least ω(GP)ϵ\omega\left(G_{P}\right)-\epsilon in GPG_{P}, then Estℓ can achieve value at least ω(GQ)ϵδ\omega\left(G_{Q}\right)-\epsilon-\delta in GQG_{Q}. Intuitively, this is because the errors build up linearly: we incur an error of δ/2\delta/2 when switching from GQG_{Q} to GPG_{P}, then an error of ϵ\epsilon (by hypothesis) when running Estℓ-1 to find strategies for GPG_{P}, and finally another error of δ/2\delta/2 when switching from GPG_{P} back to GQG_{Q}. This completes the induction, and hence the proof that Wω(G)εW\geq\omega\left(G\right)-\varepsilon.

Just like in the k=2k=2 case, we can modify the algorithm Estk so that it chooses the sets SS uniformly at random, rather than looping over all possible SS’s. By doing so, we can get a randomized algorithm that approximates ω(G)\omega\left(G\right) to within additive error ±ε\pm\varepsilon in the slightly better running time

Analogously to Theorem 40, we can also improve the running time of Estk in the case of perfect completeness.

Given a kk-player free game G=(Y1,,Yk,B1,,Bk,V)G=\left(Y_{1},\ldots,Y_{k},B_{1},\ldots,B_{k},V\right), we can decide whether ω(G)=1\omega\left(G\right)=1 or ω(G)<1ε\omega\left(G\right)<1-\varepsilon (promised that one of those is the case) using a deterministic algorithm that runs in time nO(ε1k2logn)n^{O(\varepsilon^{-1}k^{2}\log n)}, where n=Y1B1YkBkn=\left|Y_{1}\right|\left|B_{1}\right|\cdots\left|Y_{k}\right|\left|B_{k}\right| is the input size. (In more detail, in both running time bounds of Theorem 48, we can improve the factor of k2/ε2k^{2}/\varepsilon^{2} in the exponent to k2/εk^{2}/\varepsilon.)

Proof Sketch. As in Theorem 40, the key observation is that, if we only care about distinguishing ω(G)=1\omega\left(G\right)=1 from ω(G)<1ε\omega\left(G\right)<1-\varepsilon, then it suffices to set

Theorem 48 readily implies an upper bound on AM(k)\mathsf{AM}\left(k\right).

AM(k)EXP\mathsf{AM}\left(k\right)\subseteq\mathsf{EXP} for all polynomials k=k(n)k=k\left(n\right).

Proof. Let LAM(k)L\in\mathsf{AM}\left(k\right). Then given an input x{0,1}nx\in\left\{0,1\right\}^{n}, the AM(k)\mathsf{AM}\left(k\right) protocol for checking whether xLx\in L can be represented as a kk-player free game G=((Yi)i[k],(Bi)i[k],V)G=(\left(Y_{i}\right)_{i\in\left[k\right]},\left(B_{i}\right)_{i\in\left[k\right]},V), where Yi=Bi={0,1}p(n)Y_{i}=B_{i}=\left\{0,1\right\}^{p\left(n\right)} for all ii (for some polynomial pp), and where Arthur’s verification function VV is computable in poly(n)\operatorname*{poly}\left(n\right) time using r(n)=poly(n)r\left(n\right)=\operatorname*{poly}\left(n\right) bits of randomness. Now by Theorem 48, we can estimate ω(G)\omega\left(G\right) to additive error (say) ε=1/10\varepsilon=1/10 by a deterministic algorithm that makes

evaluations of VV. Furthermore, each VV evaluation can be performed in deterministic time 2r(n)poly(n)2^{r\left(n\right)}\operatorname*{poly}\left(n\right) (or in randomized time poly(n)\operatorname*{poly}\left(n\right), even allowing for amplification to exponentially small error probability). But this lets us decide whether ω(G)2/3\omega\left(G\right)\geq 2/3 or ω(G)1/3\omega\left(G\right)\leq 1/3, and hence whether xLx\in L.

A second corollary of Theorem 48 is that, assuming the ETH, there is a hard Ω(n1/4)\Omega(n^{1/4}) limit on the amount of communication needed in any constant-soundness AM(k)\mathsf{AM}\left(k\right) protocol for 3Sat, regardless of k=k(n)k=k\left(n\right). Furthermore, if k=no(1)k=n^{o\left(1\right)}, then n1/2o(1)n^{1/2-o\left(1\right)} communication is needed. (Later, in Section 7.4, we will improve this to show that Ω(n)\Omega(\sqrt{n}) communication is needed regardless of kk.)

Assuming the Randomized ETH, any AM(k)\mathsf{AM}\left(k\right) protocol for 3Sat with a 11 vs. 1ε1-\varepsilon completeness/soundness gap must use Ω(k+εn/k)=Ω((εn)1/4)\Omega(k+\sqrt{\varepsilon n}/k)=\Omega(\left(\varepsilon n\right)^{1/4}) bits of communication in total. (Also, if Arthur’s verification procedure is deterministic, then it suffices to assume the ordinary ETH.)

Proof. Assume for simplicity that ε=1/2\varepsilon=1/2. Consider an AM(k)\mathsf{AM}\left(k\right) protocol that uses q(n)q\left(n\right) bits of communication in total. We can assume q(n)kq\left(n\right)\geq k, since otherwise we could eliminate one of the Merlins and reduce to the AM(k1)\mathsf{AM}\left(k-1\right) case. Now suppose that for all i[k]i\in\left[k\right], Arthur sends an sis_{i}-bit message to Merlini and receives a tit_{i}-bit response. Then by Theorem 48, we can simulate the protocol to within constant error by an algorithm that makes

evaluations of Arthur’s verification procedure VV. Furthermore, each VV evaluation can be performed in randomized poly(n)\operatorname*{poly}\left(n\right) time (even allowing for amplification to exponentially small error probability). So if 3Sat requires 2Ω(n)2^{\Omega\left(n\right)} randomized time, then k2q(n)2=Ω(n)k^{2}q(n)^{2}=\Omega(n) and q(n)=Ω(n/k)q\left(n\right)=\Omega(\sqrt{n}/k). Combining with q(n)kq\left(n\right)\geq k then yields q(n)=Ω(n1/4)q\left(n\right)=\Omega(n^{1/4}).

For general ε>0\varepsilon>0, we simply need to use Theorem 49 rather than Theorem 48. For the last part, we note that if VV is deterministic then so is our 3Sat algorithm.

4 Subsampling with k𝑘k Merlins

Finally, let us show that AM(k)=AM\mathsf{AM}\left(k\right)=\mathsf{AM} for all k=poly(n)k=\operatorname*{poly}\left(n\right). The first step is to generalize Theorem 45, the subsampling theorem for 22-player free games, to kk players for arbitrary kk. For technical reasons—related to the definition of “denseness” in the statement of Theorem 44—doing this will require reducing a free game to a kk-CSP in a different way than we did in the proof of Theorem 45.In more detail, suppose we tried to encode a kk-player free game GG as a kk-CSP in the “obvious” way. Then among all possible kk-tuples of variables, the fraction that were related by a nontrivial constraint would decrease like k!/kkekk!/k^{k}\approx e^{-k}, simply because any such kk-tuple must involve exactly one variable for each of the kk players, with no “collisions.” But this, in turn, would mean that we could only get the conclusion AM(k)=AM\mathsf{AM}\left(k\right)=\mathsf{AM} when k=O(logn)k=O\left(\log n\right): for larger kk, our kk-CSP simply wouldn’t be “dense” enough for Theorem 44 to give what we want. To get around this problem, we use a different encoding of GG as a kk-CSP: one in which every variable, individually, involves questions to all kk of the players.

and ε>0\varepsilon>0, let κ:=εΛlog(B1Bk)\kappa:=\varepsilon^{-\Lambda}\log\left(\left|B_{1}\right|\cdots\left|B_{k}\right|\right) (for some suitable constant Λ\Lambda), and assume κmin{Y1,,Yk}\kappa\leq\min\left\{\left|Y_{1}\right|,\ldots,\left|Y_{k}\right|\right\}. For each i[k]i\in\left[k\right], choose a subset SiYiS_{i}\subseteq Y_{i} of Merlini questions uniformly at random subject to Si=κ\left|S_{i}\right|=\kappa, let S:=S1××SkS:=S_{1}\times\cdots\times S_{k}, and let GSG_{S} be the subgame of GG with Merlini’s questions restricted to SiS_{i}. Then

Proof. We define a kk-CSP φ\varphi as follows. Let

Then there is one variable, of the form b(y)B\mathbf{b}\left(\mathbf{y}\right)\in\mathbf{B}, for every kk-tuple yY\mathbf{y}\in\mathbf{Y}. (Thus, an assignment b:YB\mathbf{b}:\mathbf{Y}\rightarrow\mathbf{B} to φ\varphi will be a fairly large object, mapping kk-tuples of questions to kk-tuples of answers.) There is also a [0,1]\left[0,1\right]-valued constraint, CRC_{\mathbf{R}}, for every subset R={y1,,yk}Y\mathbf{R}=\left\{\mathbf{y}_{1},\ldots,\mathbf{y}_{k}\right\}\subseteq\mathbf{Y} of size kk. Let (y)iYi\left(\mathbf{y}\right)_{i}\in Y_{i} denote the ithi^{th} component of the kk-tuple yY\mathbf{y}\in\mathbf{Y}, and likewise let (b)iBi\left(\mathbf{b}\right)_{i}\in B_{i} denote the ithi^{th} component of bB\mathbf{b}\in\mathbf{B}. Then the constraint CRC_{\mathbf{R}} has the following satisfaction value:

where we fix some ordering of the yi\mathbf{y}_{i}’s, like y1<<yk\mathbf{y}_{1}<\cdots<\mathbf{y}_{k}. In words, we can think of CRC_{\mathbf{R}} as an algorithm that first randomly permutes the kk-tuples y1,,yk\mathbf{y}_{1},\ldots,\mathbf{y}_{k} and b(y1),,b(yk)\mathbf{b}\left(\mathbf{y}_{1}\right),\ldots,\mathbf{b}\left(\mathbf{y}_{k}\right), and that then checks “satisfaction of VV along the diagonal”: i.e., does Arthur accept if, for each i[k]i\in\left[k\right], Merlini is asked the ithi^{th} question in yi\mathbf{y}_{i} and responds with the ithi^{th} answer in b(yi)\mathbf{b}\left(\mathbf{y}_{i}\right)?

In this way, we ensure the following four properties:

φ\varphi has density α=1\alpha=1 in the sense of Theorem 44, since it includes a constraint for every possible subset of kk variables.

φ\varphi has alphabet size Σ=B=B1Bk\left|\Sigma\right|=\left|\mathbf{B}\right|=\left|B_{1}\right|\cdots\left|B_{k}\right|.

SAT(φ)ω(G)\operatorname*{SAT}\left(\varphi\right)\geq\omega\left(G\right). To see this: given any strategy (bi:YiBi)i[k]\left(b_{i}:Y_{i}\rightarrow B_{i}\right)_{i\in\left[k\right]} for GG that achieves value ω\omega, we can easily construct an assignment b:YB\mathbf{b}:\mathbf{Y}\rightarrow\mathbf{B} to φ\varphi that achieves value ω\omega, by setting

SAT(φ)ω(G)\operatorname*{SAT}\left(\varphi\right)\leq\omega\left(G\right) (so in fact SAT(φ)=ω(G)\operatorname*{SAT}\left(\varphi\right)=\omega\left(G\right)). To see this: fix any assignment b:YB\mathbf{b}:\mathbf{Y}\rightarrow\mathbf{B}. Then for each i[k]i\in\left[k\right], let Di\mathcal{D}_{i} be the probability distribution over functions bi:YiBib_{i}:Y_{i}\rightarrow B_{i} obtained by first choosing yjYjy_{j}\in Y_{j} uniformly at random for all jij\neq i, and then considering the function bi(yi):=(b(y1,,yk))ib_{i}\left(y_{i}\right):=\left(\mathbf{b}\left(y_{1},\ldots,y_{k}\right)\right)_{i}. Then

Now suppose we choose a random subset IY\mathbf{I}\subseteq\mathbf{Y} of size

and consider a restriction φI\varphi_{\mathbf{I}} of φ\varphi to the subset of variables {b(y)}yI\left\{\mathbf{b}\left(\mathbf{y}\right)\right\}_{\mathbf{y}\in\mathbf{I}}. Then by Theorem 44, together with properties (1) and (2) above, we have

Furthermore, for each i[k]i\in\left[k\right], let SiYiS_{i}\subseteq Y_{i} be chosen uniformly at random subject to Si=κ\left|S_{i}\right|=\kappa, and let Si={yi1,,yiκ}S_{i}=\left\{y_{i1},\ldots,y_{i\kappa}\right\}, fixing a uniformly-random ordering of yi1,,yiκy_{i1},\ldots,y_{i\kappa}. Also let S:=S1××SkS:=S_{1}\times\cdots\times S_{k}. Then for each j[κ]j\in\left[\kappa\right], let yj:=(y1j,,ykj)\mathbf{y}_{j}:=\left(y_{1j},\ldots,y_{kj}\right), and let I:={y1,,yκ}\mathbf{I}:=\left\{\mathbf{y}_{1},\ldots,\mathbf{y}_{\kappa}\right\}. Then reusing the same argument from property (3) above, we have ω(GS)SAT(φI)\omega\left(G_{S}\right)\leq\operatorname*{SAT}\left(\varphi_{\mathbf{I}}\right) for every SS. But the uniform distribution over SS’s (and over the orderings of the elements in each SiS_{i}) induces the uniform distribution over I\mathbf{I}’s. It follows that

Finally, by property (4) we have SAT(φ)ω(G)\operatorname*{SAT}\left(\varphi\right)\leq\omega\left(G\right). Combining, we get

We are now ready to prove that AM(k)=AM\mathsf{AM}\left(k\right)=\mathsf{AM}.

AM(k)=AM\mathsf{AM}\left(k\right)=\mathsf{AM} for all k=poly(n)k=\operatorname*{poly}\left(n\right).

Proof. Let LAM(k)L\in\mathsf{AM}\left(k\right). Then just like in Theorem 47, an AM(k)\mathsf{AM}\left(k\right) protocol for checking whether a string is in LL can be represented as a kk-player free game G=(Y1,,Yk,B1,,Bk,V)G=\left(Y_{1},\ldots,Y_{k},B_{1},\ldots,B_{k},V\right), where Yi=Bi={0,1}p(n)Y_{i}=B_{i}=\left\{0,1\right\}^{p\left(n\right)} for all i[k]i\in\left[k\right] and some polynomial pp, and VV is computable in randomized poly(n)\operatorname*{poly}\left(n\right) time.

Let ε:=112\varepsilon:=\frac{1}{12} and κ:=εΛp(n)\kappa:=\varepsilon^{-\Lambda}p\left(n\right). Suppose we choose SiYiS_{i}\subseteq Y_{i} uniformly at random subject to Si=κ\left|S_{i}\right|=\kappa for all i[k]i\in\left[k\right], then set S:=S1××SkS:=S_{1}\times\cdots\times S_{k}. Then by Theorem 52,

But this immediately gives us our AM\mathsf{AM} simulation, as follows. First Arthur chooses S1,,Sk{0,1}p(n)S_{1},\ldots,S_{k}\subseteq\left\{0,1\right\}^{p\left(n\right)} uniformly at random, subject as above to S1==Sk=κ\left|S_{1}\right|=\cdots=\left|S_{k}\right|=\kappa, and lets S=S1××SkS=S_{1}\times\cdots\times S_{k}. He then sends descriptions of S1,,SkS_{1},\ldots,S_{k} to Merlin, using kκp(n)=O(kp(n)2)k\kappa\cdot p\left(n\right)=O(k\cdot p\left(n\right)^{2}) bits. Next Merlin replies with a kk-tuple of strategies (bi:SiBi)i[k]\left(b_{i}:S_{i}\rightarrow B_{i}\right)_{i\in\left[k\right]}, which again takes O(kp(n)2)O(k\cdot p\left(n\right)^{2}) bits. Let

be the subsampled success probability; notice that ωSω(GS)\omega_{S}\leq\omega\left(G_{S}\right) for all SS. Then finally, Arthur computes an estimate ω~S\widetilde{\omega}_{S} such that

which he can do in randomized poly(n)\operatorname*{poly}\left(n\right) time, and accepts if and only if ω~S0.51\widetilde{\omega}_{S}\geq 0.51. (One small difference from Theorem 47 is that, even if VV is deterministic, in general Arthur will still need to estimate ωS\omega_{S} rather than computing it exactly. The reason is that ωS\omega_{S} is an average of S1Sk=κk\left|S_{1}\right|\cdots\left|S_{k}\right|=\kappa^{k} terms, and κk\kappa^{k} is more than polynomial whenever kk is more than constant.)

The completeness and soundness arguments are precisely the same as in Theorem 47.

Let us also show how, by using Theorem 52, we can go back and tighten Corollaries 50 and 51 from Section 7.3.

Let GG be a kk-player free game, with question sets Y1,,YkY_{1},\ldots,Y_{k} and answer sets B1,,BkB_{1},\ldots,B_{k} (assume Bi2\left|B_{i}\right|\geq 2 for all i[k]i\in\left[k\right]). There exists a deterministic algorithm that approximates ω(G)\omega\left(G\right) to within additive error ±ε\pm\varepsilon, in time

where n=Y1B1YkBkn=\left|Y_{1}\right|\left|B_{1}\right|\cdots\left|Y_{k}\right|\left|B_{k}\right| is the input size.

Proof. Let κ:=εΛlog(B1Bk)\kappa:=\varepsilon^{-\Lambda}\log\left(\left|B_{1}\right|\cdots\left|B_{k}\right|\right). Then we simply need to loop over all possible subsets

such that Si=κ\left|S_{i}\right|=\kappa for all i[k]i\in\left[k\right]. For each one, we compute the value ω(GS)\omega\left(G_{S}\right) via a brute-force search over all possible strategy kk-tuples (bi:SiBi)i[k]\left(b_{i}:S_{i}\rightarrow B_{i}\right)_{i\in\left[k\right]}. Then we output ω~:=ES[ω(GS)]\widetilde{\omega}:=\operatorname*{E}_{S}\left[\omega\left(G_{S}\right)\right] as our estimate for ω(G)\omega\left(G\right).

The correctness of this algorithm—i.e., the fact that ω~ωε\left|\widetilde{\omega}-\omega\right|\leq\varepsilon—follows from Theorem 52. For the running time, note that the number of possible subsets SS is

Also, for each SS, the number of possible strategy kk-tuples is B1κBkκnεO(1)logn\left|B_{1}\right|^{\kappa}\cdots\left|B_{k}\right|^{\kappa}\leq n^{\varepsilon^{-O\left(1\right)}\log n}. Hence the total running time is nεO(1)lognn^{\varepsilon^{-O\left(1\right)}\log n} as well.

Corollary 54, in turn, has the following further corollary.

Assuming the Randomized ETH, any AM(k)\mathsf{AM}\left(k\right) protocol for 3Sat with a constant completeness/soundness gap must use Ω(n)\Omega(\sqrt{n}) bits of communication in total. (Also, if Arthur’s verification procedure is deterministic, then it suffices to assume the ordinary ETH.)

Proof. Suppose there existed an AM(k)\mathsf{AM}\left(k\right) protocol for 3Sat, which used q(n)=nO(1)q\left(n\right)=n^{O\left(1\right)} bits of communication in total, and which had a completeness/soundness gap of, say, 2/32/3 versus 1/31/3 (the exact constants will be irrelevant). Set ε:=1/10\varepsilon:=1/10. Then by Corollary 54, we can approximate the Merlins’ maximum winning probability ω\omega to within ±ε\pm\varepsilon by a deterministic algorithm that makes q(n)εO(1)logq(n)=2O(log2q(n))q\left(n\right)^{\varepsilon^{-O\left(1\right)}\log q\left(n\right)}=2^{O(\log^{2}q\left(n\right))} evaluations of Arthur’s verification function VV. Furthermore, each VV evaluation takes poly(n)\operatorname*{poly}\left(n\right) time by a randomized algorithm if VV is randomized (even counting the time needed to amplify to exp(q(n)2)\exp(-q\left(n\right)^{2}) error probability), or poly(n)\operatorname*{poly}\left(n\right) time by a deterministic algorithm if VV is deterministic. Thus, the algorithm’s total running time is 2O(log2q(n))poly(n)2^{O(\log^{2}q\left(n\right))}\operatorname*{poly}\left(n\right). Moreover, the algorithm lets us decide whether ω2/3\omega\geq 2/3 or ω1/3\omega\leq 1/3, and hence whether our original 3Sat instance was satisfiable. On the other hand, 3Sat must take 2Ω(n)2^{\Omega\left(n\right)} time assuming the ETH. Combining, we obtain q(n)=Ω(n)q\left(n\right)=\Omega(\sqrt{n}).

Conclusions and Open Problems

In this paper, we saw how a deceptively simple problem—understanding the power of AM(2)\mathsf{AM}\left(2\right) protocols, and the complexity of approximating free games—hides a wealth of interesting phenomena. On the one hand, the fact that a two-prover game GG is free leads to a quasipolynomial-time approximation algorithm for ω(G)\omega\left(G\right), and even a proof of AM(2)=AM\mathsf{AM}\left(2\right)=\mathsf{AM}. On the other hand, the fact that the Merlins still can’t communicate leads to quasipolynomial-time hardness (assuming the ETH), and to an O~(n)\widetilde{O}(\sqrt{n})-communication AM(2)\mathsf{AM}\left(2\right) protocol for 3Sat.

While we managed to give nearly-matching upper and lower bounds for the complexity of FreeGame, numerous open problems remain, both about free games themselves, and about the applicability of our techniques to other problems. We now list twelve.

Can we improve our result NTIME[n]AMn1/2+o(1)(2)\mathsf{NTIME}\left[n\right]\subseteq\mathsf{AM}_{n^{1/2+o\left(1\right)}}\left(2\right) to NTIME[n]AMO~(n)(2)\mathsf{NTIME}\left[n\right]\subseteq\mathsf{AM}_{\widetilde{O}(\sqrt{n})}\left(2\right)? This would follow if, for example, we could get the “best of both worlds” between the two PCP theorems of Dinur and Moshkovitz and Raz , and achieve npolylognn\operatorname*{polylog}n size together with a 11 vs. δ\delta completeness/soundness gap.

Assuming the ETH, can we completely close the gap between our nO(ε2logn)n^{O(\varepsilon^{-2}\log n)} upper bound and nΩ~(ε1logn)n^{\widetilde{\Omega}(\varepsilon^{-1}\log n)} lower bound on the complexity of FreeGameε? What is the right dependence on ε\varepsilon? Also, given a PCP ϕ\phi of size NN, is there an AM(2)\mathsf{AM}\left(2\right) protocol for verifying ϕ\phi’s satisfiability that uses O(N)O(\sqrt{N}) communication rather than O(NlogN)O(\sqrt{N}\log N)? (In other words, in our hardness result, can we at least eliminate the log\log factor that comes from the birthday game, if not the log\log or larger factors from the PCP reduction?)

We gave two different algorithms for approximating the value of a kk-player free game with k3k\geq 3: one that took nO(ε2k2logn)n^{O(\varepsilon^{-2}k^{2}\log n)} time (using a recursive reduction to (k1)\left(k-1\right)-player games), and one that took nεO(1)lognn^{\varepsilon^{-O\left(1\right)}\log n} time (using subsampling). Can we get the “best of both worlds,” and give an algorithm that takes nO(ε2logn)n^{O(\varepsilon^{-2}\log n)} time? If so, this would imply that, assuming the ETH, any AM(k)\mathsf{AM}\left(k\right) protocol for 3Sat with a 11 vs. 1ε1-\varepsilon completeness/soundness gap requires Ω(εn)\Omega(\sqrt{\varepsilon n}) total communication, regardless of kk.

Can we generalize the Parallel Repetition Theorem, as well as Rao’s concentration bound (Theorem 22), to kk-player free games for arbitrary kk? This would let us amplify AM(k)\mathsf{AM}\left(k\right) protocols for k>2k>2, though as usual with a polynomial blowup in communication cost.

Is our result that NTIME[n]AMn1/2+o(1)(2)\mathsf{NTIME}\left[n\right]\subseteq\mathsf{AM}_{n^{1/2+o\left(1\right)}}\left(2\right)—that is, the existence of our 3Sat protocol—non-algebrizing in the sense of Aaronson and Wigderson ? (Recall from Proposition 38 that the result is non-relativizing.)

Can we give “direct” proofs that AM(k)=AM(2)\mathsf{AM}\left(k\right)=\mathsf{AM}\left(2\right) for all k>2k>2, and that any AM(k)\mathsf{AM}\left(k\right) protocol can be made public-coin and perfect-completeness (where “direct” means, without using the full power of AM(k)=AM\mathsf{AM}\left(k\right)=\mathsf{AM})?

How far can we improve our approximation algorithms for free games, if we assume that the game is also a projection game or a unique game? Conversely, what hardness results can we prove under those restrictions?

Let AM(2)\mathsf{AM}^{\ast}\left(2\right) be defined the same way as AM(2)\mathsf{AM}\left(2\right), except that now the Merlins can share an unlimited amount of quantum entanglement. (Their communication with Arthur is still classical.) What can we say about this class? Does our 3Sat protocol become unsound? If so, then can we somehow “immunize” it against entangled provers—as the spectacular work of Ito and Vidick (see also Vidick ) recently managed to do for the original BFL protocol? In the other direction, it’s currently a notorious open problem to prove any upper bound whatsoever on the class MIP\mathsf{MIP}^{\ast} (that is, MIP\mathsf{MIP} with entangled provers): even the set of computable languages! The issue is that we don’t have any a priori upper bound on the amount of entanglement the provers might need; and the more entanglement they use, the longer it could take to simulate them. Does this problem become more tractable if we restrict attention to AM\mathsf{AM}^{\ast} protocols: that is, to protocols with uncorrelated questions?

Can we use our hardness result for FreeGame—or more generally, the idea of birthday repetition—as a starting point for proving nΩ(logn)n^{\Omega\left(\log n\right)} hardness results for other problems? One problem of particular interest is approximate Nash equilibrium. For that problem, Lipton, Markakis, and Mehta gave an nO(ε2logn)n^{O(\varepsilon^{-2}\log n)} approximation algorithm—indeed, one strikingly reminiscent of our algorithm from Theorem 39—while Hazan and Krauthgamer recently showed nΩ(logn)n^{\Omega\left(\log n\right)} hardness, assuming nΩ(logn)n^{\Omega\left(\log n\right)} hardness for the planted clique problem.Similarly, while this reduction is arguably weaker than the one we give, it is not hard to show that FreeGameε requires nΩ(logn)n^{\Omega\left(\log n\right)} time for constant ε\varepsilon, under the assumption that the planted clique problem requires nΩ(logn)n^{\Omega\left(\log n\right)} time. We thank Oded Regev for this observation. We conjecture that, using birthday repetition of 3Sat, one could show nΩ~(ε1logn)n^{\widetilde{\Omega}(\varepsilon^{-1}\log n)} hardness for approximate Nash equilibrium assuming only the ETH. This would solve an open problem explicitly raised by Hazan and Krauthgamer.Technically, Hazan and Krauthgamer asked for a proof that approximate Nash equilibrium is not in P\mathsf{P}, assuming Max Clique requires 2ω(n)2^{\omega\left(\sqrt{n}\right)} time. But this is extremely similar to assuming the ETH.

What can we say about QMA(2)\mathsf{QMA}\left(2\right), the class that originally motivated our study of AM(2)\mathsf{AM}\left(2\right)? Is QMA(2)EXP\mathsf{QMA}\left(2\right)\subseteq\mathsf{EXP}? Are the O~(n)\widetilde{O}(\sqrt{n})-qubit protocols for 3Sat, due to Aaronson et al. and Harrow and Montanaro , optimal assuming the ETH? Is the BSSε problem from Section 4 solvable in nO(ε2logn)n^{O\left(\varepsilon^{-2}\log n\right)} time, as FreeGameε is?

Acknowledgments

We thank Boaz Barak, Oded Regev, Avi Wigderson, and other participants at the 2013 Banff Complexity Theory workshop for helpful discussions. We especially thank Peter Shor for early discussions, Ryan O’Donnell for pointing us to and , Anup Rao for clarifications about parallel repetition, and Aram Harrow for goading us to write this paper up after a four-year delay.

References