The study of total NP search problems (TFNP) was initiated by Megiddo and Papadimitriou [MP91] and Papadimitriou [Pap94] to characterize the complexity of search problems that have a solution for every input and where a given solution can be efficiently checked for validity. Meggido and Papadimitriou [MP91] showed that the notion of NP-hardness is inadequate to capture the complexity of total NP search problems. By now, this theory has flowered into a sprawling jungle of widely-studied syntactic complexity classes (such as PLS [JPY88], PPA/PPAD/PPP [Pap94], CLS [DP11]) that serve to classify the complexities of many relevant search problems.
The goal of identifying naturalFollowing the terminology of many TFNP papers, including [Gri01, FG18, FG19, SZZ18], a natural problem is one that does not have explicitly a circuit or a Turing machine as part of the input. complete problems for these complexity classes lies in the foundation of this sub-field of complexity theory and not only gives a complete picture of the computational complexity of the corresponding search problems, but also provides a better understanding of the complexity classes. Such natural complete problems have also been an essential middle-step for proving the completeness of other important search problems, the same way that the NP-completeness of Sat is an essential middle step in showing the NP-completeness of many other natural problems. Some known natural complete problems for TFNP subclasses are: the PPAD-completeness of NashEquilibrium [DGP09], the PPA-completeness of ConsensusHalving, NecklaceSplitting and HamSandwich problems [FG18, FG19] and the PPP-completeness of natural problems related to lattice-based cryptography [SZZ18]. Finally, the theory of total search problems has found connections beyond its original scope to areas like communication complexity and circuit lower bounds [GKRS19], cryptography [BPR15, KNY19, CHK+19] and the Sum-of-Squares hierarchy [KM18].
Our main result is to identify the first natural complete problem for the classes PPAq, a variant of the class PPA. We also illustrate the relevance of these classes through connections with important search problems from combinatorics and cryptography.
Class PPAq. The class PPAq was defined, in passing, by Papadimitriou [Pap94, p. 520]. It is a modulo-q analog of the well-studied polynomial parity argument class PPA (which corresponds to q=2). The class embodies the following combinatorial principle:
If a bipartite graph has a node of degree not a multiple of q, then there is another such node.
In more detail, PPAq consists of all total NP search problems reducibleHere, we consider a many-one reduction, which is a polynomial time algorithm with one oracle query to the said problem. In contrast, a Turing reduction allows polynomially many oracle queries. See Section 1.5 for a comparison. to the problem \textscBipartiteq defined as follows. An instance of this problem is a balanced bipartite graph G=(V∪U,E), where V∪U={0,1}n together with a designated vertex v⋆∈V∪U. The graph G is implicitly given via a circuit C that computes the neighborhood of every node in G. Let deg(v) be the degree of the node v in G. A valid solution is a node v∈{0,1}n such that, either
In Section 2 we provide some other total search problems (\textscLonelyq, \textscLeafq) that are reducible to and from \textscBipartiteq. Any one of these problems could be used to define PPAq. In fact, \textscLonelyq and \textscLeafq are natural variants of the standard problems Lonely and Leaf which are used to define the class PPA.
Our contributions. We illustrate the importance of the complexity classes PPAq by showing that many important search problems whose computational complexity is not well understood belong to PPAq (see §1.6 for details). These problems span a wide range of scientific areas, from algebraic topology to cryptography. For some of these problems we conjecture that PPAq-completeness is the right notion to characterize their computational complexity. The study of PPAq is also motivated from the connections to other important and well-studied classes like PPAD.
In this paper, we provide a systematic study of the complexity classes PPAq. Our main result is the identification of the first natural complete problem for PPAq together with some structural results. Below we give a more precise overview of our results.
We characterize PPAq in terms of PPAp for prime p.
Our main result is that an explicitFollowing the terminology in [BIQ+17], by explicit we mean that the system of polynomials, which is the input of the computational problems we define, are given as a sum of monic monomials. version of the Chevalley-Warning theorem is complete
for PPAp for prime p. This problem is natural in that it does not involve circuits as part of the input and is the first known natural complete problem for PPAp when p≥3.
As a consequence of the PPAp-completeness of our natural problem, we show that restricting
the input circuits in the definition of PPAp to just constant depth arithmetic formulas doesn’t change the power of the class.
We show a connection between PPAq and the Short Integer Solution (SIS) problem from
the theory of lattices. This connection implies that SIS with constant modulus q belongs to PPAq∩PPP, but also provides a polynomial time algorithm for solving SIS when the modulus q is constant and has only 2 and 3 as prime factors.
We sketch how existing results already paint a near-complete picture of the relative power
of PPAp relative to other TFNP subclasses (via inclusions and oracle separations). We also show that PPAq is closed under Turing reductions.
In Section 1.6, we include a list of open problems that illustrate the broader relevance of PPAq. We note that a concurrent and independent work by Hollender [Hol19] also establishes the structural properties of PPAq corresponding to §1.1 and §1.5.
PPAq=&p∣qPPAp.
A special case of Theorem 1 is that PPApk=PPAp for every prime power pk. Showing the inclusion PPApk⊆PPAp is the crux of our proof. This part of the theorem can be viewed as a total search problem analog of the counting class result of Beigel and Gill [BG92] stating that ModpkP=ModpP; “an unexpected result”, they wrote at the time. Throughout this paper, we use q to denote any integer ≥2 and p to denote a prime integer.
2 A Natural Complete Problem via Chevalley-Warning Theorem
There have been several works focusing on completeness results for the class PPA (i.e. PPA2). Initial works showed the PPA-completeness of (non-natural) total search problems corresponding to topological fixed point theorems [Gri01, ABB15, DEF+16]. Closer to our paper, Belovs et al. [BIQ+17] show the PPA-completeness of computational analogs of Combinatorial Nullstellensatz and the Chevalley–Warning Theorem, but which explicitly involve a circuit as part of the input. More recently, breakthrough results showed PPA-completeness of problems without a circuit or a Turing Machine in the input such as Consensus-Halving, Necklace-Splitting and Ham-Sandwich [FG18, FG19] resolving an open problem since the definition of PPA in [Pap94].
Our main contribution is to provide a natural complete problem for PPAp, for every prime p; thereby also yielding a new complete problem for PPA. Our complete problem is an extension of the problem \textscChevalleyp, defined by Papadimitriou [Pap94], which is a search problem associated to the celebrated Chevalley-Warning Theorem. We first present an abstract way to understand the proof of the Chevalley-Warning Theorem that motivates the definition of our natural complete problem for PPAp.
In 1935, Claude Chevalley [Che35] resolved a hypothesis stated by Emil Artin, that all finite fields are quasi-algebraically closed. Later, Ewald Warning [War36] proved a slight generalization of Chevalley’s theorem. This generalized statement is usually referred to as the Chevalley-Warning Theorem (CWT, for short). Despite its initial algebraic motivation, CWT has found profound applications in combinatorics and number theory as we discuss in §1.4 (and Section 6).
In the above definition, it is important to consider the symbolic expansion of CWf and ignore any cancellation of coefficients that might occur. Observe that, although the expansion of CWf is exponentially large in the description size of f, each monic monomial of CWf can be succinctly described as a combination of monic monomials of the polynomials f1,…,fm. We formally discuss this in Section 4.
Using the definition of max-degree monic monomials, we state the main technical lemma underlying the proof of CWT (with proof in Section 4).
2.2 Proofs of Cancellation
Thus, any condition on f that implies the (Extended CW Condition) can replace (CW Condition) in the Chevalley-Warning Theorem. Note that the (Extended CW Condition) is equivalent to all the max-degree monic monomials in CWf cancelling out. Thus, we call any such condition on f that implies (Extended CW Condition) to be a “proof of cancellation” for the system f.
We can now reinterpret the result of Belovs et al. [BIQ+17] in this framework of “proof of cancellation” conditions. In particular, [BIQ+17] considers the case p=2 and defines the problem PPA-Circuit-Chevalley, in which a “proof of cancellation” is given in a specific form of circuits. These circuits describe the system (f1,…,fm) in the PPA-Circuit-Chevalley problem. It is then shown that PPA-Circuit-Chevalley is PPA2-complete.
2.3 Computational Problems Based on Chevalley-Warning Theorem
Every “proof of cancellation” that is syntactically refutable can be used to define a total search problem that lies in PPAp. By syntactically refutable we mean that whenever the “proof of cancellation” is false, there exists a small witness that certifies so. In this section, we define three computational problems with their corresponding “proof of cancellation”: (1) the \textscChevalleyp problem defined by [Pap94], (2) the \textscGeneralChevalleyp problem that is a generalization of \textscChevalleyp, and (3) the problem \textscChevalleyWithSymmetryp that we show to be PPAp-complete. All these problems are defined for every prime modulus p and are natural in the sense that they do not explicitly involve a circuit or a Turing Machine in their input. In particular, the polynomial systems in the input are explicit in that they are given as a sum of monic monomials.
This is the direct computational analog of the Chevalley-Warning Theorem and was defined by Papadimitriou [Pap94] as the following total search problem:
[Refuting witness] (CW Condition) is not satisfied.
x∈Vf∖{x⋆}.
We will particularly consider a special case where all the fi’s have zero constant term (zecote, for short). In this case, x⋆=0∈Vf, so there is no need to explicitly include x∗ in the input.
As mentioned already, we can define a search problem corresponding to any syntactically refutable condition that implies the (Extended CW Condition). One such condition is to directly assert that
In particular, note that (CW Condition) implies this condition. Moreover, this condition is syntactically refutable by a max-degree monic monomial, which is efficiently representable as a combination of at most m(p−1) monomials of the fi’s. Thus, we can define the following total search problem generalizing \textscChevalleyp.
\textscGeneralChevalleyp
[Refuting Witness] A max-degree monic monomial of CWf.
x∈Vf∖{x⋆}.
Hence, we further generalize \textscGeneralChevalleyp into a problem that incorporates this additional “proof of cancellation” in the form of a permutation σ∈Sn.
We now define the following natural total search problem.
\textscChevalleyWithSymmetryp
[Refuting Witness – 1] A max-degree monic monomial of CWg.
[Refuting Witness – 2] x∈Vg∩Vh such that σ(x)∈/(Vg∩Vh)∖{x}.
x∈Vf∖{x⋆}.
The above problem is natural, because the input consists of a system of polynomial in an explicit form, i.e. as a sum of monic monomials, together with a permutation in Sn given say in one-line notation. Also, observe that when h is empty, the above problem coincides with \textscGeneralChevalleyp (since Vh=∅ when h is empty). Our main result is the following (proved in Section 4).
For any prime p, \textscChevalleyWithSymmetryp is PPAp-complete.
3 Complete Problems via Small Depth Arithmetic Formulas
We hope that this theorem will be helpful in the context of proving PPAp-hardness of other problems. There it would be enough to consider only constant depth arithmetic formulas (and hence NC1 Boolean formulas) in the definitions of PPAp as opposed to unbounded depth circuits. Such a simplification has been a key-step for proving hardness results for other TFNP subclasses, e.g. in the PPAD-hardness proofs of approximate-Nash (cf. [Rub16]).
4 Applications of Chevalley-Warning
Apart from its initial algebraic motivation, the Chevalley-Warning theorem has been used to derive several non-trivial combinatorial results. Alon et al. [AFK84] show that adding an extra edge to any 4-regular graph forces it to contain a 3-regular subgraph. More generally, they prove that certain types of “almost” regular graphs contain regular subgraphs. Another application of CWT is in proving zero-sum theorems similar to the Erdös-Ginzburg-Ziv Theorem. A famous such application is the proof of Kemnitz’s conjecture by Reiher [Rei07].
For a certain range of parameters n,m, it holds that
For all primes p : \textscBISp and \textscSISp are Karp-reducible to \textscChevalleyp, hence are in PPAp.
For all q : \textscBISq and \textscSISq are Turing-reducible to any PPAq–complete problem.
For all k : \textscBIS2k is solvable in polynomial time.
Even though the \textscSISq problem is well-studied in lattice theory, not many results are known in the regime where q is a constant and the number of variables depends linearly on the number of equations. Part (1) of the above theorem establishes a reduction from \textscSISp to \textscChevalleyp for prime p. Part (2) follows by a bootstrapping method that allows us to combine algorithms for \textscSISq1 and \textscSISq2 to give an algorithm for \textscSISq1q2 (for a certain regime for parameters n and m). Finally Parts (3) and (4) results follow by using this bootstrapping method along with the observation that Gaussian elimination provides valid solutions for \textscBIS2 (hence also \textscSIS2) and for \textscSIS3.
5 Structural properties
Buss and Johnson [BJ12, Joh11] had defined a class PMODq which turns out to be slightly weaker than PPAq (refer to Section 7). Despite this slight difference between the definitions of PPAq and PMODq, we can still deduce statements about PPAq from the work of [Joh11]. In particular, it follows that PPAD⊆PPAq (refer to Section 7.1).
More broadly, a near-complete picture of the power of PPAq relative to other subclasses of TFNP is summarized in Figure 1. These relationships (inclusions and oracle separations) mostly follow from prior work in proof complexity [BR98, BGIP01, Joh11, GKRS19] (refer to Section 7.2).
Recall that TFNP subclasses are defined as the set of all total search problems that are many-one reducible (aka Karp–reducible) to the corresponding complete problems. One can ask whether more power is gained by allowing Turing reductions, that is, polynomially many oracle queries to the corresponding complete problem. Buss and Johnson [BJ12] showed that PLS, PPAD, PPADS, PPA are closed under Turing reductions (with a notable exception of PPP, which remains open). We show this for PPAp when p is a prime.
FPPPAp=PPAp for every prime p.
By contrast, it follows from [BJ12, §6] that PPAq is not closed under black-box Turing reductions for non-prime powers q. See Section 7.3 for details.
6 Open questions
It has been shown that Factoring reduces to PPP-complete problems as well as to PPA-complete problems [BO06, Jer16], albeit under randomized reductions (which can be derandomized assuming the Generalized Reimann Hypothesis). It has been asked whether in fact Factoring could be reduced to PPAD-complete problems [Jer16]. As a step towards this problem, we propose the following question.
Is Factoring in PPAp for all primes p (perhaps under randomized reductions)?
This is clearly an easier problem since PPAD⊆PPAp. Interestingly, note that there exists an oracle O relative to which ⋂pPPApO⊈PPADO. Thus, the above problem, even if established for all prime p, is still weaker than showing that Factoring reduces to PPAD-complete problems.
The q\textsc−Necklace−Splitting problem is defined as follows: There is an open necklacean “open necklace” means that the beads form a string, not a cycle with q⋅ai beads of color i, for i∈[n]. The goal is to cut the necklace in (q−1)⋅n places and partition the resulting substrings into k collections, each containing precisely ai beads of color i for each i∈[n].
The fact that such a partition exists was first shown in the case of q=2 by Goldberg and West [GW85] and by Alon and West [AW86]. Later, Alon [Alo87] proved it for all q≥2. As mentioned before, Filos-Ratsikas and Goldberg [FG19] showed that the 2\textsc−Necklace−Splitting problem is PPA-complete. Moreover, they put forth the following question (which we strengthen further).
Is q\textsc−Necklace−Splitting in PPAq? More strongly, is it PPAq-complete?
While we do not know how to prove/disprove this yet, we point out that it was also shown in [FG19] that 2k\textsc−Necklace−Splitting is in fact in PPA2. This is actually well aligned with this conjecture since we showed that PPA2k=PPA2 (Theorem 1).
Alon’s proof of the q-Necklace-Splitting theorem [Alo87] was topological and used a certain generalization of the Borsuk-Ulam theorem due to Bárány, Shlosman and Szücs [BSS81]. Since the computational Borsuk-Ulam problem is PPA-complete, we could ask a similar question about this generalization.
Is \textscBaˊraˊny−Shlosman−Szu¨csp problem in PPAp (perhaps even PPAp-complete)?
We conclude with some interesting directions for further exploring the connections of Chevalley with other computational problems.
Does \textscSISq admit worst-to-average case reductions to other lattice problems in our range of parameters? Or is it average-case hard assuming standard cryptographic assumptions, e.g. the “learning with errors” assumption?
If resolved positively, the above would serve as evidence of the average-case hardness for the class PPAp, similar to the evidence that we have for PPA by reduction from Factoring.
For all primes p, is \textscChevalleyp reducible to \textscBISp?
For all q, is there a non-trivial regime of parameters n, m where \textscBISq is solvable in polynomial time?
A search problem R1 is Karp-reducible (or many-one reducible) to a search problem R2, or R1⪯R2 for short, if there exist polynomial-time computable functions f and g such that given any instance x of R1, f(x) is an instance of R2 such that for any y∈R2(f(x)), it holds that g(x,f(x),y)∈R1(x).
On the other hand, we say that R1 is Turing-reducible to R2, or R1⪯TR2 for short, if there exists a polynomial-time oracle Turing machine that on input x to R1, makes oracle queries to R2, and outputs a y∈R1(x). In this paper, we primarly deal with Karp-reductions, except in Section 7.3, where we compare the two different notions of reductions in the context of PPAq.
We describe several total search problems (parameterized by q) that we show to be inter-reducible. PPAq is then defined as the set of all search problems reducible to either one of the search problems defined below.
A bipartite graph with a non-multiple-of-q degree node has another such node.
Bipartite graph G=(V∪U,E). Designated vertex v∗∈V
▹C:{0,1}n→({0,1}n)k, with ({0,1}n)k interpreted as a k-subset of {0,1}n▹v∗∈{0}×{0,1}n−1 (usually 0n)
V:={0}×{0,1}n−1, U:={1}×{0,1}n−1, E:={(v,u):v∈V∩C(u) and u∈U∩C(v)}
A q-dimensional matching on a non-multiple-of-q many vertices has an isolated node.
q-dimensional matching G=(V,E). Designated vertices V∗⊆V with ∣V∗∣≤q−1
▹C:[q]n→[q]n▹V∗⊆[q]n with ∣V∗∣≤q−1
V:=[q]n. For distinct v1,…,vq, edge e:={v1,…,vq}∈E if C(vi)=vi+1, C(vq)=v1
v∈V∗ if deg(v)=1 and v∈/V∗ if deg(v)=0
A q-uniform hypergraph with a non-multiple-of-q degree node has another such node.
▹C:{0,1}n→({0,1}nq)q, with ({0,1}nq)q interpreted as q many q-subsets of {0,1}n▹v∗∈{0,1}n (usually 0n)
V:={0,1}n. For distinct v1,…,vq, edge e:={v1,…,vq}∈E if e∈C(v) for all v∈e
We remark that \textscLonelyq and \textscLeafq are modulo-q analogs of the PPA-complete problems Lonely and Leaf [Pap94, BCE+98]. We prove the following theorem in Appendix A.
The problems \textscBipartiteq, \textscLonelyq and \textscLeafq are inter-reducible.
We will use the following simple conventions repeatedly, in order to simplify the descriptions of reductions between different search problems.
We will often use “algorithms”, instead of “circuits” to encode our hypergraphs. It is standard to simulate polynomial-time algorithms by polynomial sized circuits.
The above simplification gives us that all our problems have an instance-extension property (cf. [BM04]) – this will be helpful in proving Theorem 4.
To simplify our reductions even further, we’ll often describe the edges/hyperdges directly instead of specifying how to compute the neighbors of a given vertex. This is only for simplicity and it will be easy to see how to compute the neighbors of any vertex locally.
Characterization via Primes
In this section we prove Theorem 1, namely PPAq=&p∣qPPAp. The theorem follows by combining the following two ingredients.
PPApk=PPAp for any prime power pk.
2 Prime power case
We will describe an algorithm that on vertex v∈V outputs a hyperedge of p vertices that contains v (if any). To this end, first fix an algorithm that for any set e:={u1,…,upk}⊆V and for any 1≤i≤pt, computes some “canonical” partition of the set (ie) into subsets of size p, and moreover assigns a canonical cyclic order within each such subset. This is indeed possible because of Eq. 3.2, since t<k.
Given a vertex v:={v1,…,vpt}∈V,
It is easy to see that v is isolated in G iff all v∈v are isolated in G. Moreover, any isolated vertex in V∖V∗ contains at least one isolated vertex in V∖V∗; and a non-isolated vertex in V∗ contains at least one non-isolated vertex in V∗ (in fact pt many).
The edges of G can indeed be computed efficiently with just black-box access to C. In order to complete the reduction, we only need that V is efficiently indexable. This is indeed standard; see [KS98, §2.3] for a reference. See Figure 3 for an illustration of the proof.
Note that the size of the underlying graph blows up polynomially in our reduction. We do not know whether a reduction exists that avoids such a blow-up, although we suspect that the techniques of [BR98] can be used to show that some blow-up is necessary for black-box reductions.
A Natural Complete Problem
We start with some notation that will be useful for the presentation of our results.
We repeat the formal statement of Chevalley-Warning Theorem together with its proof.
A monic monomial of CWf is a product tS(x)=∏i=1mtsi(x) for S=(s1,…,sm)∈U.
We define Mf to be the set of max-degree monic monomials of CWf, i.e. Mf:={S∈U∣tS is a max-degree monic monomial of CWf}.
In words, the monomials t(S) are precisely the ones that arise when symbolically expanding CWf(x). We illustrate this with an example: Let p=3 and f1(x1,x2)=x1+x2 and f2(x1,x2)=x12. Then modulo {x13−x1,x23−x2}, we have
Thus there are 18(=6×3) monic monomials in the system (f1,f2). The monomial corresponding to S=((1,5),(2,2)) is a maximal monomial since the 5-th term in CWf1 is x22 and 2-nd term in CWf2 is x12. Using the above definitions, we now state the main technical lemma of the proof of CWT.
The proof of Chevalley-Warning Theorem follows easily from 4.2.
We have that deg(CWf)≤(p−1)∑i=1mdeg(fi). Thus, if f satisfies (CW Condition), then deg(CWf)<(p−1)n and hence ∣Mf∣=0. CWT now follows from 4.2. ∎
2 The Chevalley-Warning Theorem with Symmetry
In this section, we formalize the intuition that we built in Sections 1.2.2 and 1.2.3 to prove the more general statements to lead to the same conclusion as the Chevalley-Warning Theorem.
First, we prove a theorem that argues about the cardinality of Vf directly using some symmetry of the system of polynomials f. Then, combining this symmetry-based argument with the (General CW Condition) we get the generalization of the Chevalley-Warning Theorem. Our natural PPAp-complete problem is based on this generalization.
Our first theorem highlights the use of symmetry in arguing about the size of ∣Vf∣.
Since σ acts freely on Vf, we can partition Vf into orbits of any x∈Vf under actions of ⟨σ⟩, namely sets of the type {σi(x)}i∈[p] for x∈Vf. Since ⟨σ⟩ acts freely on Vf, each such orbit has size p. Thus, we can conclude that Vf≡0(modp) from which the theorem follows. ∎
We now state and prove an extension of CWT that captures both the argument from 4.2 and the symmetry argument from Theorem 6.
3 Computational Problems Related to Chevalley-Warning Theorem
We now follow the intuition developed in the previous section and in Section 1.2 to formally define the computational problems \textscChevalleyp, \textscGeneralChevalleyp, and \textscChevalleyWithSymmetryp.
General Chevalley-Warning Theorem via (General CW Condition).
A max-degree monic monomial tS(x) of CWf, or
black(\textscChevalleyWithSymmetryp)
Chevalley-Warning Theorem with Symmetry (Theorem 7).
(a) A max-degree monic monomial tS(x) of CWg, or
(b) x∈Vg∩Vh such that σ(x)∈(Vg∩Vh)∖{x}, or
Some observations about the above computational problems follow:
In the problems \textscGeneralChevalleyp and \textscChevalleyWithSymmetryp, we assume that, if the output is a max-degree monic monomial, this is given via the multiset of indices S that describes the monomial as formalized in 4.1.
We have \textscChevalleyp⪯\textscGeneralChevalleyp⪯\textscChevalleyWithSymmetryp. Thus, inclusion of \textscChevalleyWithSymmetryp in PPAp implies that the problems \textscChevalleyp and \textscGeneralChevalleyp are in PPAp. Also, in Section 6 we prove that \textscSISp⪯\textscChevalleyp, where \textscSISp is a cryptographically relevant problem. This shows that the \textscGeneralChevalleyp and the \textscChevalleyWithSymmetryp problems are at least as hard as the \textscSISp problem.
We fist prove that ChevalleyWithSymmetry is in PPAp and then prove its PPAp-hardness.
Even though Papadimitriou [Pap94] provided a rough proof sketch of \textscChevalleyp∈PPAp, a formal proof was not given. We show that \textscChevalleyWithSymmetryp is in PPAp (and so are \textscGeneralChevalleyp and \textscChevalleyp). In order to do so we extend the definition of \textscBipartiteq to instances where the vertices might have exponential degree and edges appear with multiplicity. The key here is to define a \textscBipartiteq instance with unbounded (even exponential) degree, but with additional information that allows us to verify solutions efficiently.
Similar to \textscBipartiteq, but degrees are allowed to be exponentially large, edges are allowed with multiplicities at most q−1.
Let V:={0}×{0,1}n−1 and U:={1}×{0,1}n−1: ▹C:V×U→[q], edge counting circuit ▹ϕV:V×U×[q]→(U×[q])q, grouping pivoted at V▹ϕU:V×U×[q]→(V×[q])q, grouping pivoted at U▹e∗=(v∗,u∗,k∗), designated edge
V:={0}×{0,1}n−1, U:={1}×{0,1}n−1, E:={(v,u,k):1≤k≤C(v,u),(v,u)∈V×U} (here k distinguishes multiplicities) Edge (v,u,k) is grouped with {(v,u′,k′):(u′,k′)∈ϕV(v,u,k)} (pivoting at v), provided ∣ϕV(v,u,k)∣=q, all (v,u′,k′)∈E and ϕV(v,u′,k′)=ϕV(v,u,k). Edge (v,u,k) is grouped with {(v′,u,k′):(v′,k′)∈ϕU(v,u,k)} (pivoting at u), provided ∣ϕU(v,u,k)∣=q, all (v,u′,k′)∈E and ϕU(v′,u,k′)=ϕV(v,u,k).
e∗ if e∗ is grouped, pivoting at v∗, or if e∗ is not grouped pivoting at u∗, OR e=e∗ if e is not grouped pivoting at one of its ends.
In words, \textscSuccinctBipartitep encodes a bipartite graph with arbitrary degree. Instead of listing the neighbors of a vertex using a circuit, we have a circuit that outputs the multiplicity of edges between any two given vertices. We are therefore unable to efficiently count the number of edges incident on any vertex. The grouping function ϕV aims to group edges incident on any vertex v∈V into groups of size q. Similarly, ϕU aims to group edges incident on any vertex u∈U. The underlying principle is that if we have an edge e∗ that is not grouped pivoting at v∗ (one of its endpoints), then either e∗ is not pivoted at u∗ (its other endpoint) or there exists another edge that is also not grouped pivoting at one of its ends. Note that in contrast to the problems previously defined, v∗ might still be an endpoint of a valid solution.
For all primes p, \textscChevalleyWithSymmetryp∈PPAp.
We reduce \textscChevalleyWithSymmetryp to \textscSuccinctBipartitep, which we show to be PPAp–complete in Section A.1. Given an instance of \textscChevalleyWithSymmetryp, namely a zecote polynomial system f=(g,h) and a permutation σ, we construct a bipartite graph G=(U∪V,E) encoded as an instance of \textscSuccinctBipartitep as follows.
We first describe the edges between U and V1, namely include an edge between an assignment x and a monomial t with multiplicity t(x). With these edges in place, the degree of vertices are as follows:
x=0n has a single edge corresponding to the constant monomial 1, since f is zecote. We let this be the designated edge e∗ in the final \textscSuccinctBipartitep instance.
x∈Vg∩Vh such that x=0, or
t∈V1 such that t(x) is a max-degree monomial or
x∈Vg∩Vh such that σ(x)=x or
v∈V2 such that ∃x∈v satisfying x∈Vg∩Vh and σ(x)∈/Vg∩Vh.
These correspond precisely to the solutions of \textscChevalleyWithSymmetryp. To summarize, the edge counting circuit C on input (x,t)∈U×V1 outputs t(x) and on input (x,v)∈U×V2 outputs p−1 if x∈Vg∩Vh, σ(x)=x and v=Σx and 0 otherwise.
The grouping functions ϕU and ϕV are defined as follows (analogous to the so-called “chessplayer algorithm” in [Pap94]):
Grouping ϕU (corresponding to endpoint in U):
For x∈Vg: there exists some i such that CWgi(x)=0. Consider an edge (x,(a1,a2,…,amg),k). We can explicitly list out the multiset containing the monomials tj=(a1,a2,…,ai←j,…,amg) with multiplicity tj(x), for each 1≤j≤Li. Since CWgi(x)=0, this multiset has size multiple of p. Hence, we can canonically divide its elements into groups of size p, counting multiplicities and ϕU returns the subset containing (t,k).
For x∈Vg∩Vh such that σ(x)=x: Note that gip−1(x)=0 for all i∈[mg]. Let v1∈V1 be the vertex corresponding to the constant monomial 1. ϕU groups the edge (x,v1,1) (of multiplicity 1) with the p−1 edges (x,Σx,k) for k∈[p−1]. For any other t∈V1∖{v1} and an edge (x,t,k), we have that t=(a1,…,amg) has ai=0 for some i. We define the multiset containing tj=(a1,…,ai←j,…amg) with multiplicity tj(x) for each 1≤j<Li. Since gip−1(x)=0, this multiset has size which is a multiple of p, which we can canonically partition into groups of size p. Thus, ϕU on input (x,t,k) returns the group containing (t,k).
Grouping ϕV (corresponding to endpoint in V):
We show that \textscLonelyp⪯\textscChevalleyWithSymmetryp. In the \textscChevalleyWithSymmetryp instance that we create, we will ensure that there are no solutions of type 0 (as in 4.8) and thus, the only valid solutions will be of type 1. In order to do so, we introduce the notions of labeling and proper labeling and prove a generalization of CWT that we call Labeled CWT (Theorem 8).
As we will see, the Labeled CWT, is just a re-formulation of the original CWT rather than a generalization. To understand the Labeled CWT we start with some examples that do not seem to satisfy the Chevalley-Warning condition, but where a solution exists.
Ignore Some Terms. The Labeled CWT formalizes the phenomenon observed in Example 1 and shows that under certain conditions we can ignore some terms when defining the degree of each polynomial. For instance, in Example 1, we can ignore the term x12 when computing the degree of f and treat f as a degree 1 polynomial of 2 variables, in which case the condition of CWT is satisfied.
We describe which terms can be ignored by defining a labeling of the terms of each polynomial in the system. The labels take values in {−1,0,+1} and our final goal is to ignore the terms with label +1. Of course, it should not be possible to define any labeling that we want; for example we cannot ignore all the terms of a polynomial. Next, we describe the rules of a proper labeling that will allow us to prove the Labeled CWT. We start with a definition of a labeling.
Example 1 (continued). According to the lexicographic ordering, f(x1,x2)=−x12+x2 and we have the monomials t11=−x12 and t12=x2. Hence, one possible labeling, which as we will see later corresponds to the vanilla Chevalley-Warning Theorem, is λ(1,1)=λ(1,2)=0. According to this labeling, degλ(f)=2. Another possible labeling, that, as we will see, allows us to apply the Labeled CWT , is λ(1,1)=+1 and λ(1,2)=−1. In this case, the labeled degree is degλ(f)=1.
As we highlighted before, our goal is to prove the Chevalley-Warning Theorem, but with the weaker condition that ∑i=1mdegλ(fi)<n instead of ∑i=1mdeg(fi)<n. Of course, we first have to restrict the space of all possible labelings by defining proper labelings. In order to make the condition of proper labelings easier to interpret we start by defining the notion of a labeling graph.
Example 2. Let p=2 and f1(x1,x2,x3,x4)=x1x2−x3, f2(x1,x2,x3,x4)=x1x3−x4. In this system, if we use the lexicographic monomial ordering we have the monomials t11=x1x2, t12=−x3, t21=x1x3, t22=−x4. The following figure shows the graph Gλ for the labeling λ(1,1)=+1, λ(1,2)=−1, λ(2,1)=+1 and λ(2,2)=−1.
For all i, either λ(i,j)∈{−1,1} for all j, or λ(i,j)=0 for all j.
If two monomials tij, tij′ contain the same variable xk, then λ(i,j)=λ(i,j′).
If λ(i,j)=−1, then tij is multilinear.
If xk is a variable in the monomials tij, ti′j′, with i=i′ and λ(i,j)=−1, then λ(i′,j′)=+1.
If λ(i,j)=0, then there exists a j′ such that λ(i,j′)=−1.
The labeling graph Gλ contains no directed cycles.
We give an equivalent way to understand the definition of a proper labeling.
Condition (1) : there is a partition of the polynomial system f into polynomial systems g and h such that all monomials in g are labeled in {+1,−1} and all monomials in h are labeled .
Condition (2) : each polynomial gi in g can be written as gi=gi++gi−, such that gi+ and gi− are polynomials on a disjoint set of variables.
Condition (3) : Each gi− is multilinear.
Condition (4) : Any variable xk can appear in at most one of the gi−. Moreover, if an xk appears in some gi−, it does not appear in any hj in h.
Condition (5) : Every gi− involves at least one variable.
Condition (6) : The graph Gλ is essentially between polynomials in g and the variables that appear in them, with an edge (gi→xk) if xk appears in gi− or an edge (xk→gi) if xk appears in gi+.
Note that degλ(gi)=deg(gi−), whereas degλ(hj)=deg(hj).
It is easy to see that the trivial labeling λ(i,j)=0 is always proper. As we will see this special case of the Labeled CWT corresponds to the original CWT . Note that in this case the labeling graph Gλ is an empty graph. Also, given a system of polynomials f and a labeling λ, it is possible to check in polynomial time whether the labeling λ is proper or not.
Example 2 (continued). It is an instructive exercise to verify that the labeling λ specified was indeed a proper labeling of f.
We can re-write CWf(x) as ∑S⊆[m]∏i∈S(−1)∣S∣fip−1. We’ll show that every monomial appearing in the expansion of ∏i∈Sfip−1 will have at least one variable with degree at most p−1. For simplicity, we focus on the case S=[m] and the other cases of S follow similarly.
We are now ready to prove the PPAp-hardness of \textscChevalleyWithSymmetryp.
For all primes p, \textscChevalleyWithSymmetryp is PPAp-hard.
We prove that \textscLonelyp⪯\textscChevalleyWithSymmetryp. Let us assume (without loss of generality from 7.1) that the \textscLonelyp instance has a single distinguished vertex represented by 0n. We’ll assume that 0n is isolated, otherwise, no further reduction is necessary.
Given circuit C′ with (+,×,1) gates, there exists circuit Cˉ with (+,×) gates such that
Thus, we can transform our original circuit C into a circuit Cˉ with just (+,×) gates. For simplicity, we’ll write Cˉ as simply C from now on.
As an intermiate step in the reduction we describe a system of polynomials fC over 2n+s variables (x1,…,xn,z1,…,zs,y1,…,yn), where s is the size of the circuit C. The variables x=(x1,…,xn) correspond to the input of C, the variables y=(y1,…,yn) correspond to the output and the variables z=(z1,…,zs) correspond to the gates of C. For an addition gate (+) we include a polynomial of the form
where a1 is the variable corresponding to the output of the (+) gate and a2,a3 are the variables corresponding to its two inputs. Similarly for a multiplication (×) gate, we include a polynomial of the form
Finally, for the output of the circuit, we include the polynomial
where a is the variable corresponding to the i-th output gate of C. It holds that
We now describe the reduction from an instance of \textscLonelyp to that of \textscChevalleyWithSymmetryp. In order to do this, we need to specify a system of polynomials (g,h) and a permutation σ such that ∣σ∣=p. In addition, we will provide a proper labeling λ for g satisfying the degree condition. We will also ensure that ⟨σ⟩ acts freely on Vg∩Vh. And hence, the only valid solutions for the resulting \textscChevalleyWithSymmetryp instance will be x∈Vg∩Vh.
The polynomial system g contains the following systems of polynomials.
Note that there are N=(2n+s)p variables in total.
For the polynomials belonging to a system of the form fC, the labeling is equal to −1 for the monomials corresponding to the output of each gate and +1, otherwise. For instance, let a2+a3−a1 be the i-th polynomial of g corresponding to a (+) gate and let a1≺a2≺a3, then λ(i,1)=−1 and λ(i,2)=λ(i,3)=+1.
For the polynomials belonging to a system of the form xi−xi+1, the labeling is equal to −1 for the monomials with variables in xi+1 and +1 for the monomials with variables in xi.
The labeling λ for g is proper.
By 4.15, the labeling λ is proper if the following conditions hold.
For all i, either λ(i,j)∈{−1,1} for all j, or λ(i,j)=0 for all j. In the labeling λ, there are no labels equal to , so this condition holds trivially.
If two monomials tij, tij′ contain the same variable xk, then λ(i,j)=λ(i,j′). By construction of g, no variable appears twice in the same polynomial with a different labeling. For polynomials of fC, this holds because the output variable of a gate is not simultaneously an input variable and all input variables have the same labeling. For polynomials in a system of the form xi−xi+1, each polynomial contains two different variables.
If λ(i,j)=−1, then tij is multilinear. For polynomials of fC, only the output variable of a gate has label −1 and by definition this monomial is linear. For polynomials in a system of the form xi−xi+1, all monomials are linear, so the condition holds trivially.
If xk is a variable in the monomials tij, ti′j′, with i=i′ and λ(i,j)=−1, then λ(i′,j′)=+1. Observe that all monomials with label −1 contain only a single variable, so we refer to a monomial xk with label −1. For a polynomial in fC, a monomial xk with label −1 corresponds to the output of a gate. Hence, if xk appears in other monomials of fC, these monomials correspond to inputs and have label +1. Also, if xk is an output variable of fC, then it might appear in a polynomial of the form a1−a2. However, by construction the monomials of xi−xi+1 that correspond to output variables of fC have label +1.
If λ(i,j)=0, then there exists a j′ such that λ(i,j′)=−1. By the definition of λ, all polynomials of g have a monomial with label −1. These are the monomials that correspond to the outputs of a gate for the systems of the form fC and the monomials that correspond to xi+1 for the systems of the form xi−xi+1.
The labeling graph Gλ contains no cycles. Each system of the form xi−xi+1 has incoming edges with variables appearing only in the i-th copy of fC and outgoing edges with variables appearing only in the (i+1)-th copy of fC. Also, the variables appearing on the i-th copy of fC might appear only in the systems xi−1−xi and xi−xi+1. Hence, Gλ has no cycles that contain vertices of two different copies of fC or of a copy of fC and a system of the form xi−1−xi.
It is left to argue that the labeling graph restricted to a copy of fC does not have any cycles. Let the vertices of fC be ordered according to the topological ordering of C. This restricted part of Gλ corresponds exactly to the graph of C, which by definition is a DAG. Hence, Gλ contains no cycles.∎
We also need to show that for this labeling g satisfies the labeled Chevalley condition.
The labeled Chevalley condition ∑i=1mgdegλ(gi)<N holds for g with labeling λ.
Each polynomial of g has a unique monomial with λ(i,j)=−1 and this monomial has degree 1. Thus, ∑i=1mgdegλ(gi)=mg. On the other hand, the i-th polynomial of g has exactly one variable that has not appeared in any of the previous polynomials. More specifically, the number of variables is equal to mg+n, where n is the size of the input of C. Hence, the labeled Chevalley condition holds for g. ∎
The system of polynomials g allows us to compute the p vertices given by Ci(x) for i∈[p+1]. From the definition of \textscLonelyp and our pre-processing on C, this group of p vertices is a hyperedge if and only if C(x)=x. Since solutions of \textscLonelyp are lonely vertices, we define h to exclude x such that C(x)=x. Namely, we set h to be the system of polynomials
In the description of f=(g,h), we have used the following vector of variables:
We define the permutation σ such that σ(x)=(x3,x4,…,x2p,x1,x2,z3,4,z5,6,…,z2p−1,2p,z1,2), as illustrated in the following figure. The blue arrows indicate the polynomials g and the green arrows indicate the permutation σ in the case of p=3.
z1,2<spanclass="katex−display"><spanclass="katex"><spanclass="katex−mathml"><mathxmlns="http://www.w3.org/1998/Math/MathML"display="block"><semantics><mrow><msub><mimathvariant="bold−italic">x</mi><mn>3</mn></msub></mrow><annotationencoding="application/x−tex">x3</annotation></semantics></math></span><spanclass="katex−html"aria−hidden="true"><spanclass="base"><spanclass="strut"style="height:0.5944em;vertical−align:−0.15em;"></span><spanclass="mord"><spanclass="mord"><spanclass="mord"><spanclass="mordboldsymbol">x</span></span></span><spanclass="msupsub"><spanclass="vlist−tvlist−t2"><spanclass="vlist−r"><spanclass="vlist"style="height:0.3011em;"><spanstyle="top:−2.55em;margin−right:0.05em;"><spanclass="pstrut"style="height:2.7em;"></span><spanclass="sizingreset−size6size3mtight"><spanclass="mordmtight"><spanclass="mordmtight">3</span></span></span></span></span><spanclass="vlist−s"></span></span><spanclass="vlist−r"><spanclass="vlist"style="height:0.15em;"><span></span></span></span></span></span></span></span></span></span></span>=<spanclass="katex−display"><spanclass="katex"><spanclass="katex−mathml"><mathxmlns="http://www.w3.org/1998/Math/MathML"display="block"><semantics><mrow><msub><mimathvariant="bold−italic">x</mi><mn>4</mn></msub></mrow><annotationencoding="application/x−tex">x4</annotation></semantics></math></span><spanclass="katex−html"aria−hidden="true"><spanclass="base"><spanclass="strut"style="height:0.5944em;vertical−align:−0.15em;"></span><spanclass="mord"><spanclass="mord"><spanclass="mord"><spanclass="mordboldsymbol">x</span></span></span><spanclass="msupsub"><spanclass="vlist−tvlist−t2"><spanclass="vlist−r"><spanclass="vlist"style="height:0.3011em;"><spanstyle="top:−2.55em;margin−right:0.05em;"><spanclass="pstrut"style="height:2.7em;"></span><spanclass="sizingreset−size6size3mtight"><spanclass="mordmtight"><spanclass="mordmtight">4</span></span></span></span></span><spanclass="vlist−s"></span></span><spanclass="vlist−r"><spanclass="vlist"style="height:0.15em;"><span></span></span></span></span></span></span></span></span></span></span>z3,4<spanclass="katex−display"><spanclass="katex"><spanclass="katex−mathml"><mathxmlns="http://www.w3.org/1998/Math/MathML"display="block"><semantics><mrow><msub><mimathvariant="bold−italic">x</mi><mn>5</mn></msub></mrow><annotationencoding="application/x−tex">x5</annotation></semantics></math></span><spanclass="katex−html"aria−hidden="true"><spanclass="base"><spanclass="strut"style="height:0.5944em;vertical−align:−0.15em;"></span><spanclass="mord"><spanclass="mord"><spanclass="mord"><spanclass="mordboldsymbol">x</span></span></span><spanclass="msupsub"><spanclass="vlist−tvlist−t2"><spanclass="vlist−r"><spanclass="vlist"style="height:0.3011em;"><spanstyle="top:−2.55em;margin−right:0.05em;"><spanclass="pstrut"style="height:2.7em;"></span><spanclass="sizingreset−size6size3mtight"><spanclass="mordmtight"><spanclass="mordmtight">5</span></span></span></span></span><spanclass="vlist−s"></span></span><spanclass="vlist−r"><spanclass="vlist"style="height:0.15em;"><span></span></span></span></span></span></span></span></span></span></span>=<spanclass="katex−display"><spanclass="katex"><spanclass="katex−mathml"><mathxmlns="http://www.w3.org/1998/Math/MathML"display="block"><semantics><mrow><msub><mimathvariant="bold−italic">x</mi><mn>6</mn></msub></mrow><annotationencoding="application/x−tex">x6</annotation></semantics></math></span><spanclass="katex−html"aria−hidden="true"><spanclass="base"><spanclass="strut"style="height:0.5944em;vertical−align:−0.15em;"></span><spanclass="mord"><spanclass="mord"><spanclass="mord"><spanclass="mordboldsymbol">x</span></span></span><spanclass="msupsub"><spanclass="vlist−tvlist−t2"><spanclass="vlist−r"><spanclass="vlist"style="height:0.3011em;"><spanstyle="top:−2.55em;margin−right:0.05em;"><spanclass="pstrut"style="height:2.7em;"></span><spanclass="sizingreset−size6size3mtight"><spanclass="mordmtight"><spanclass="mordmtight">6</span></span></span></span></span><spanclass="vlist−s"></span></span><spanclass="vlist−r"><spanclass="vlist"style="height:0.15em;"><span></span></span></span></span></span></span></span></span></span></span>z5,6 Claim 4.20. The group ⟨σ⟩ has order p and acts freely on Vg∩Vh.
In order to see that ∣σ∣=p, note that the input of σ consists of 3p blocks of variables. The permutation σ performs a rotation of the first 2p blocks by two positions and of the last p blocks by one position. All that remains is to show that ⟨σ⟩ acts freely on Vg∩Vh. First, we show that ⟨σ⟩ defines a group action on Vg∩Vh, that is for all x∈Vg∩Vh, it holds that σ(x)∈Vg∩Vh. Let x=(x1,x2,…,x2p−1,x2p,z1,2,z3,4,…,z2p−1,2p)∈Vg∩Vh, then
x∈Vg implies that fC(x2i−1,x2i,z2i−1,2i)=0 for i∈[p] and x2i=x2i+1 for i∈[p−1]
x∈Vh implies that x1=x2, that is, C(x1)=x1 since fC(x1,x2,z1,2)=0⇔x2=C(x1).
Now, σ(x)=(x3,x4,…,x1,x2,z3,4,z5,6,…,z1,2)∈Vg∩Vh holds because
fC(x2i−1,x2i,z2i−1,2i)=0 for i∈[p] and x2i=x2i+1 for i∈[p−1], which holds from x∈Vg. Additionally, x1=x2p holds because we pre-processed C such that Cp(x1)=x1,
x3=x4, which holds because x4=C(x3) for i∈[p] and from the definition of C, C(x1)=x1 implies that x2i=x2i−1 for all i∈[p].
Finally, if x∈Vg∩Vh, by construction of C, we have that x2k=x2j for k=j and thus σ(x)=x simply because x3=x1. Thus, we conclude that ⟨σ⟩ acts freely on Vg∩Vh. ∎
The solution of this instance of \textscChevalleyWithSymmetryp cannot be a vector x∈Vg∩Vh with σ(x)∈Vg∩Vh or σ(x)=x, since we know from 4.20 that ⟨σ⟩ acts freely on Vg∩Vh. We also have from Theorem 8 that the solution also cannot be a max-degree monomial in the expansion of CWg(x)=∏(1−gip−1). Thus, the solution must be an x=0 such that f(x)=0. Let x1 denote the first n coordinates of x, then f(x)=0 implies that x1=C(x1) and x=0 implies that x1=0. Hence, x1 corresponds to a lonely vertex of the \textscLonelyp instance. ∎
Complete Problems via Small Depth Arithmetic Circuits
In [Rub16], a similar simplification theorem was shown for PPAD. In fact, this simplification involves only the End-of-Line problem and does not go through a natural complete problem for PPAD (see Theorem 1.5 in [Rub16]). A similar result can be shown for other TFNP subclasses, including PPA. However, it is unclear if these techniques also apply to PPAp classes.
Since the reductions between \textscSuccinctBipartitep and other problems studied in this work (refer to Appendix A) can also be implemented as AC0 circuits, we get the following corollary.
For all primes p, \textscLonelyp[NC1], \textscLeafp[NC1] and \textscBipartitep[NC1] are all PPAp-complete.
Thus, Theorem 9 allows us to consider reductions from these PPAp-complete problems with instances encoded by a shallow formulas rather than an arbitrary circuit. We believe this could be a useful starting point for finding other PPAp-complete problems.
Applications of Chevalley-Warning
x∈{0,1}n such that x=0 and Ax≡0(modq)
x∈{−1,0,1}n such that x=0 and Ax≡0(modq)
For the regime of parameters n, m as in Definitions 6.1 and 6.2,
For all primes p : \textscBISp,\textscSISp⪯\textscChevalleyp.
For all q : \textscBISq,\textscSISq∈FPPPAq,
For all k : \textscBIS2k∈FP,
Part 1. For all primes p, \textscBISp,\textscSISp⪯\textscChevalleyp.
Given an \textscBISp instance A=(aij), we define a zecote polynomial system as follows
Part 2. For all q : \textscBISq,\textscSISq∈FPPPAq.
Finally, we define x:=(z1y1,…,zn1yn1)∈{0,1}n. Observe that since yi and z are binary, x is also binary. Additionally,
Note that for a prime p and any k, we have from Theorem 1, that PPApk=PPAp. Additionally, Theorem 4 shows that PPAp is closed under Turing reductions, so we have the following corollary.
For all primes p and all k : \textscBISpk,\textscSISpk∈PPAp.
Even though the \textscSISq problem is well-studied in lattice theory, not many results are known in the regime we consider where q is a constant. Our results show that solving \textscChevalleyp is at least as hard as finding short integer solutions in p-ary lattices for a specific range of parameters. More specifically, our reduction assumes that q is a constant and, thus, it does not depend on the input lattice, and that the dimension n of lattice is related to the number of constraints in the dual as n>((m+1)/2)N(q)(q−1). On the other hand, we showed (in Parts 3, 4) that there are q-ary lattice for which finding short integer solutions is easy.
In this section, we prove the structural properties of PPAq outlined in Section 1.5.
Buss and Johnson [BJ12, Joh11] defined a problem \textscModq, which is almost identical to \textscLonelyq, with the only difference being that the q-dimensional matching is over a power-of-2 many vertices encoded by C:{0,1}n→{0,1}n, with no designated vertices, except when q is a power of 2 in which case we have one designated vertex. The class PMODq is then defined as the class of total search problems reducible to \textscModq. The restriction of number of vertices to be a power of 2, which arises as an artifact of the binary encoding of circuit inputs, makes the class PMODq slightly weaker than PPAq.
To compare PPAq and PMODq, we define a restricted version of \textscLonelyq, where the number of designated vertices is exactly k; call this problem \textscLonelyqk. Clearly, \textscLonelyqk reduces to \textscLonelyq. We show that a converse holds, but only for prime p; see Section A.2 for proof.
For all primes p and k∈{1,…,p−1}, \textscLonelyp reduces to \textscLonelypk.
For all primes p, PPAp=PMODp.
Johnson [Joh11] already showed that PPAD⊆PMODq which implies that PPAD⊆PPAq. We present a simplified version of that proof.
We reduce the PPAD-complete problem End-of-Line to \textscLonelyq. An instance of End-of-Line is a circuit C that implicitly encodes a directed graph G=(V,E), with in-degree and out-degree at most 1 and a designated vertex v∗ with in-degree and out-degree 1.
2 Oracle separations
Here we explain how PPAq can be separated from other TFNP classes relative to oracles, as summarized in Figure 1. That is, for distinct primes p,p′, there exist oracles O1,…,O5 such that
The usual technique for proving such oracle separations is propositional proof complexity (together with standard diagonalization arguments) [BCE+98, BM04, BJ12]. The main insight is that if a problem S1 reduces to another problem S2 in a black-box manner, then there are “efficient proofs” of the totality of S1 starting from the totality of S2. The discussion below assumes some familiarity with these techniques.
Johnson [Joh11] showed all the above separations with respect to PMODp. Since we showed PPAp=PMODp (7.2), the same oracle separations hold for PPAp.
For a fixed k≥1, consider the problem Sk:=\rotatebox[origin=c]180.0&i∈[k]\textscLonelypi where pi are the primes. Buss et al. [BGIP01] showed that the principle underlying Si is incomparable with the principle underlying \textscLonelypi+1. This translates into an relativized separation ⋂i∈[k]PPApi⊈PPApi+1 which in particular implies ⋂i∈[k]PPApi⊈PPAD. Finally, one can consider the problem S:=Sk(n) where k(n) is a slowly growing function of the input size n. This problem is in ⋂pPPAp since for each fixed p and for large enough input size, S reduces to the PPAp-complete problem. On the other hand, the result of Buss et al. [BGIP01] is robust enough to handle a slowly growing k(n); we omit the details.
3 Closure under Turing reductions
For any prime p and total search problem S, if S⪯T\textscLonelyp, then S⪯m\textscLonelyp.
The key reason why this theorem holds for prime p is 7.1: In a \textscLonelyp instance, we can assume w.l.o.g. that there are exactly p−1 distinguished vertices.
We make the following simplifying assumptions.
For any query the vertices Vi∗ are always isolated in Gi (if some vertex in Vi∗ were to not be isolated, the algorithm could be modified to simply not make the query).
Exactly t queries are made irrespective of the oracle answers.
We reduce x to a single instance of \textscLonelyp as follows.
We’ll define the hyperedge for vertex v=(v1,…,vk) for any k≤t. Let j≤k be the last coordinate such that for all i<j, the vertex vi is a valid solution for the \textscLonelyp instance (Ci,Vi∗), which the algorithm creates on receiving v1,…,vi−1 as answers to previous queries.
Let u1,…,up−1 be the neighbors of vk in a canonical trivial matching over [p]n; e.g. {[p]×w:w∈[p]n−1}. The neighbors of v are {(v1,…,vk−1,ui)}i.
We consider three cases, depending on whether vk is designated, non-isolated or isolated in the \textscLonelyp instance (Ck,Vk∗).
For u1,…,up−1 being the neighbors of vk in Gk, the neighbors of v are {(v1,…,vk−1,ui)}i.
Such a vk is a valid solution for (Ck,Vk∗).
the algorithm will have a next oracle query (Ck+1,Vk+1∗). In this case, for u1,…,up−1 being the designated vertices in Vk+1∗, the neighbors of v are {(v1,…,vk−1,vk,ui)}i.
there are no more queries, and we leave v isolated.
Let u1,…,up−2 be the other designated vertices in Vk∗. The neighbors of v are {(v1,…,vk−1,ui)}i∪{(v1,…,vk−1)}.
V1<spanclass="katex−display"><spanclass="katex"><spanclass="katex−mathml"><mathxmlns="http://www.w3.org/1998/Math/MathML"display="block"><semantics><mrow><msub><mi>V</mi><mn>1</mn></msub><mo>×</mo><msub><mi>V</mi><mn>2</mn></msub></mrow><annotationencoding="application/x−tex">V1×V2</annotation></semantics></math></span><spanclass="katex−html"aria−hidden="true"><spanclass="base"><spanclass="strut"style="height:0.8333em;vertical−align:−0.15em;"></span><spanclass="mord"><spanclass="mordmathnormal"style="margin−right:0.2222em;">V</span><spanclass="msupsub"><spanclass="vlist−tvlist−t2"><spanclass="vlist−r"><spanclass="vlist"style="height:0.3011em;"><spanstyle="top:−2.55em;margin−left:−0.2222em;margin−right:0.05em;"><spanclass="pstrut"style="height:2.7em;"></span><spanclass="sizingreset−size6size3mtight"><spanclass="mordmtight"><spanclass="mordmtight">1</span></span></span></span></span><spanclass="vlist−s"></span></span><spanclass="vlist−r"><spanclass="vlist"style="height:0.15em;"><span></span></span></span></span></span></span><spanclass="mspace"style="margin−right:0.2222em;"></span><spanclass="mbin">×</span><spanclass="mspace"style="margin−right:0.2222em;"></span></span><spanclass="base"><spanclass="strut"style="height:0.8333em;vertical−align:−0.15em;"></span><spanclass="mord"><spanclass="mordmathnormal"style="margin−right:0.2222em;">V</span><spanclass="msupsub"><spanclass="vlist−tvlist−t2"><spanclass="vlist−r"><spanclass="vlist"style="height:0.3011em;"><spanstyle="top:−2.55em;margin−left:−0.2222em;margin−right:0.05em;"><spanclass="pstrut"style="height:2.7em;"></span><spanclass="sizingreset−size6size3mtight"><spanclass="mordmtight"><spanclass="mordmtight">2</span></span></span></span></span><spanclass="vlist−s"></span></span><spanclass="vlist−r"><spanclass="vlist"style="height:0.15em;"><span></span></span></span></span></span></span></span></span></span></span>⋯⋯ It is easy to see that our definition of edges are consistent and the only vertices which are isolated (apart from those in V∗) are of the type (y1,…,yt) where each yi is a valid solution for the \textscLonelyp instance (Ci,Vi∗). Thus, given an isolated vertex y, we can immediately infer a solution for x as L(x,y1,…,yt). This completes the reduction since V is efficiently representable and indexable — see 2.4. ∎
Acknowledgements
We thank Christos Papadimitriou, Robert Robere, Dmitry Sokolov and Noah Stephens-Davidowitz for helpful discussions. We also thank anonymous referees for valuable suggestions.
MG was supported by NSF grant CCF-1412958 (this work was done while MG was at IAS). PK was supported in parts by NSF Award numbers CCF-1733808 and IIS-1741137 and MIT-IBM Watson AI Lab and Research Collaboration Agreement No. W1771646 (this work was done while PK was a student at MIT). MZ is supported by a Google PhD Fellowship. KS is supported in part by NSF/BSF grant #1350619, an MIT-IBM grant, and a DARPA Young Faculty Award, MIT Lincoln Laboratories and Analog Devices.
Appendix A Appendix: Reductions Between Complete Problems
In order to prove Theorem 5, we introduce an additional problem that will serve as intermediate problem in our reductions.
Same as \textscLeafq, but degrees are allowed to be larger (polynomially bounded).
▹C:{0,1}n→({0,1}nq)k, with ({0,1}nq)k interpreted as k many q-subsets of {0,1}n▹v∗∈{0,1}n (usually 0n)
V:={0,1}n. For distinct v1,…,vq, edge e:={v1,…,vq}∈E if e∈C(v) for all v∈e
We show the following inter-reducibilities: (1) \textscLeafq≍\textscLeafq′, (2) \textscLeafq′≍\textscBipartiteq and (3) \textscLeafq≍\textscLonelyq.
black(1a) \textscLeafq⪯\textscLeafq′ Each instance of \textscLeafq is trivially an instance of \textscLeafq′.
black(1b) \textscLeafq′⪯\textscLeafq. We start with a \textscLeafq′ instance (C,v∗), where C encode a q-uniform hypergraph G=(V,E) with degree at most k. Let t=⌈k/q⌉. We construct a \textscLeafq instance encoding a hypergraph G=(V,E) on vertex set V:=V×[t], intuitively making t copies of each vertex.
By 2.4, this completes the reduction since the edges are locally computable with black-box access to C and V is efficiently indexable.
black(2a) \textscLeafq′⪯\textscBipartiteq. We start with a \textscLeafq′ instance (C,v∗), where C encode a q-uniform hypergraph G=(V,E). We construct a \textscBipartiteq instance encoding a graph G=(V∪U,E) such that V=V and U=(qV), i.e. all q-sized subsets of V. We include the edge (v,e)∈E if e∈E is incident on v. The designated vertex for the \textscBipartiteq instance is v∗ in V.
Clearly, all vertices e∈U have degree either q or . For any v∈V, the degree of v in G is same as its degree in G. Thus, any solution to the \textscBipartiteq instance immediately gives a solution to the original \textscLeafq′ instance. By 2.4, this completes the reduction since the edges are locally computable with black-box access to C and V and U are efficiently indexable (cf. [KS98, §2.3] for efficiently indexing U).
black(2b) \textscBipartiteq⪯\textscLeafq′. We start with a \textscBipartiteq instance (C,v∗) encoding a bipartite graph G=(V∪U,E) with maximum degree of any vertex being at most k. We construct a \textscLeafq′ instance encoding a hypergraph G=(V,E) such that V=V with designated vertex v∗.
black(3a) \textscLeafq⪯\textscLonelyq. We start with a \textscLeafq instance (C,v∗), where C encode a q-uniform hypergraph G=(V,E) with degree at most q. If degG(v∗)=q or , then we don’t need any further reduction. Else, we construct a \textscLonelyq instance encoding a q-dimensional matching G=(V,E) on vertex set V=V×[q]. The designated vertices will be V∗={(v,q−i):1≤i≤q−deg(v∗)}. Note that, ∣V∗∣=q−degG(v∗) and hence 1≤∣V∗∣≤q−1.
For any vertex (v,i)∈V, we assign it at most one hyperedge as follows:
If degG(v)=0, we include the hyperedge {(v,i):i∈[q]}.
Else if 0<degG(v)<i, we leave it isolated.
It is easy to see that our definition of hyperedges is consistent and that the designated vertices V∗ are indeed isolated. Moreover, a vertex (v,i) is isolated in G only if 1≤degG(v)≤q−1. Thus, solutions to the \textscLeafq instance naturally maps to solutions to the original \textscLeafq′ instance.
By 2.4, this completes the reduction since the edges are locally computable with black-box access to C and V is efficiently indexable.
black(3b) \textscLonelyq⪯\textscLeafq. We start with a \textscLonelyq instance (C,V∗), where C encode a q-dimensional matching G=(V,E). We construct a \textscLeafq instance encoding a q-uniform hypergraph G=(V,E) on vertex set V that will be specified shortly. We describe the hyperedges in G and it’ll be clear how to compute the hyperedges for any vertex locally with just black-box access to C.
We start with V=V. Our goal is to transform all vertices of degree 1 to degree q, while ensuring that vertices of degree are mapped to vertices of degree not a multiple of q. Towards this goal we let E to be set of edges in E in addition to q−1 canonical q-dimensional matchings over V. For example, for a vertex v:=(x1,…,xn)∈V=[q]n, the corresponding edges in E include an edge in E (if any) and edges of the type ei={(x1,…,xi−1,j,xi+1,…,xn):j∈[q]} for i∈[q−1] (note, this requires us to assume n≥q−1). Adding the q−1 matchings increases the degree of each vertex by q−1. Therefore, vertices with initial degree 1 now have degree q and vertices with initial degree now have degree q−1. However, a couple of issues remain in order to complete the reduction, which we handle next.
Multiplicities. An edge e∈E might have gotten added twice, if it belonged to one of the canonical matchings. To avoid this issue altogether, instead of adding edges directly on V, we augment V to become V:=V∪((qV)×[q−1]), i.e. in addition to V, we have q−1 vertices for every potential hyperedge of G. For any edge e:={v1,…,vq}∈E, instead of adding it directly in G, we add hyperedge {v,(e,1),(e,2),…,(e,q−1)} for each v∈e. Note that, all vertices (e,i)∈(qV)×[q−1] have degree q if e∈E and degree if e∈/E, so they are non-solutions for the \textscLeafq instance. For vertices in V, we still have as before that vertices with initial degree 1 now have degree q and vertices with initial degree now have degree q−1.
Designated vertex. In a \textscLeafq instance, we need to specify a single designated vertex v∗∈V. If the \textscLonelyq instance had a single designated vertex then we would be done. However, in general it is not possible to assume this (for non-prime q). Nevertheless, we provide a way to get around this. We augment V with t=(q−1)(q−k)+1 additional vertices to become V:=V∪((qV)×[q−1])∪{wi,j:i∈[q−k],j∈[q−1]}∪{v∗}, where v∗ will eventually be the single designated vertex for the \textscLeafq instance.
Let V∗={u1,…,uk}⊆V be the set of designated vertices in the \textscLonelyq instance (note 1≤k<q). So far, note that degG(ui)=q−1. The only new hyperedges we add will be among ui’s, wi,j’s and v∗, in such a way that degG(ui) will become q, the degree of all wi,j’s will also be q and degree of v∗ will be q−k.
For each u∈V∗, include {u,w1,1,…,w1,q−1}. So far, degG(u)=q and degG(w1,j)=k.
For each j∈[q−1] and each i∈{2,…,q−k}, include {w1,j,wi,1,…,wi,q−1}. So far, degG(wi,j)=q−1 for all (i,j)∈[q−k]×[q−1].
Finally, for each (i,j)∈[q−k]×[q−1], include {v∗,wi,1,…,wi,q−1}. Now, degG(wi,j)=q for all (i,j)∈[q−k]×[q−1] and degG(v∗)=q−k.
Thus, we have finally reduced to a \textscLeafq instance encoding the graph G=(V,E) with V:=V∪((qV)×[q−1])∪{wi,j:i∈[q−k],j∈[q−1]}∪{v∗}. By 2.4, this completes the reduction, since V is efficiently indexable (again, see [KS98] for a reference on indexing (qV)) and the edges are locally computable using black-box access to C. ∎
We introduce a new intermediate problem to show PPAp–completeness of \textscSuccinctBipartitep.
Two p-dimensional matchings over a common vertex set, with a vertex in exactly one of the matchings, has another such vertex.
Two p-dimensional matchings G0=(V,E0), G1=(V,E1). Designated vertex v∗∈V.
▹C0:{0,1}n→({0,1}n)p and C1:{0,1}n→({0,1}n)p▹v∗∈{0,1}n
V:={0,1}n. For b∈{0,1}, Eb:={e:Cb(v)=e for all v∈e}
v∗ if degG0(v∗)=1 or degG1(v∗)=0 and v=v∗ if degG0(v∗)=degG1(v∗)
Observe that in the case of p=2, \textscTwoMatchingsp can be readily seen as equivalent to \textscLeaf2.
For any prime p, \textscSuccinctBipartitep and \textscTwoMatchingsp are PPAp–complete.
We show that \textscBipartitep⪯\textscSuccinctBipartitep⪯\textscTwoMatchingsp⪯\textscLonelyp.
black\textscSuccinctBipartitep⪯\textscTwoMatchingsp. We reduce to a \textscTwoMatchingsp instance encoding two p-dimensional matchings G0=(V,E0) and G1=(V,E1), over the vertex set V=V×U×[p−1], that is, all possible edges producible in the \textscSuccinctBipartitep instance. The designated vertex v∗ is the designated edge e∗ in the \textscSuccinctBipartitep instance.
For any edges e1,…,ep, which are grouped by ϕV pivoted at some v∈V, we include the hyperedge {e1,…,ep} in E0. Similarly, for any edges e1,…,ep, which are grouped by ϕU pivoted at some u∈U, we include the hyperedge {e1,…,ep} in E1. It is easy to see that points in exactly one of the two matchings G0 or G1 correspond to edges of the \textscSuccinctBipartitep instance that are not grouped at exactly one end. Thus, we can derive a solution to \textscSuccinctBipartitep from a solution to \textscTwoMatchingsp. (Remark: while edges which are not grouped at either end are solutions to \textscSuccinctBipartitep, they do not correspond to a solution in the \textscTwoMatchingsp instance.)
black\textscTwoMatchingsp⪯\textscLonelyp. Given a \textscTwoMatchingsp instance encoding two p-dimensional matchings G0=(V,E0) and G1=(V,E1), we reduce to an instance of \textscLonelyp encoding a p-dimensional matching G=(V,E) such that V=V×[p]. The designated vertex for the \textscLonelyp instance is (v∗,p).
For any hyperedge {v1,…,vp} in E0, we include the hyperedge {(v1,i),(v2,i),…,(vp,i)} in G for each i∈{1,…,p−1}. Similarly, for any hyperedge {v1,…,vp} in E1, we include the hyperedge {(v1,p),(v2,p),…,(vp,p)} in G. If v∈V is isolated in both G0 and G1, then we include the hyperedge {v}×[p].
Observe that, (v∗,p) is isolated by design. A vertex (v,i), for i<p is isolated only if degG0(v)=0 and deg(G1)=1. Similarly, the vertex (v,p) is isolated only if degG0(v)=1 and deg(G1)=0. Thus, isolated vertices in the \textscLonelyp instance correspond to solutions of the \textscTwoMatchingsp instance. ∎
Appendix B Appendix: Proof of Theorem 9
Each polynomial fi has degree at most 2.
Each polynomial fi has at most 3 monomials.
Each polynomial fi has at most 3 variables.
Hence, we can compute each of the polynomials gip−1 explicitly as a sum of monomials. The degree of this polynomial is O(p) and the number of monomials is at most 3p. Observe that since p is a constant, 3p is also a constant.
From Edge Counting Circuit To Edge Counting Formula. As described in the proof of 4.11 the edge counting circuit takes as input a vertex u∈U and a vertex v∈V and outputs the multiplicity of the edge {u,v} in G. Hence, the edge counting formula C, that we want to implement, takes as input a tuple (x,s,a,y). The vector x corresponds to the assignment in U. The vector a corresponds to the description of a monomial of F, as the product ∏i=1mtiai′ where tiai′ is the ai-th monomial of the polynomial 1−gip−1. The vector y=(y1,y2,…,yp) and corresponds to a p-tuple in V2. Finally, s is a selector number to distinguish between v∈V1 and v∈V2, namely if s=1, we have v∈V1 and if s=0, we have that v∈V2. So, the edge counting formula can be written as follows
This way we can define the edge counting formula C1 for when v∈V1 and the edge counting formula C2 for when v∈V2 separately and combine them by using at most two additional layers in the arithmetic formula. Now, C1(x,y,a)=\mathds1(y=0)⋅∏i=1mQi(x,ai) where Qi(x,ai) is the formula to compute the value ti,ai(x). Observe that the factor \mathds1(y=0) can be easily computed and is necessary since C1 should consider only neighbors between x and monomials in V1. Hence, if y is not equal to 0, C1 should return . As we already explained the number of monomials of 1−gip−1 is constant, and hence the formula Qi(x,ai) can be easily implemented in constant depth using a selector between all different monomials similarly to Equation (B.1). Hence, C1 is implemented in constant depth.
The formula C2 has a factor \mathds1(a=0) to ensure only neighbors in V2 have non-zero outputs. The main challenge in the description of C2 is that every distinct p-tuple y has p! equivalent representations, but the modulo p argument of 4.11 applies only when edges appear to precisely one of the equivalent copies of the p-tuple. Thus, we let C2 add edges only to the lexicographically ordered version of y. It is a simple exercise to see that sorting of p! numbers, when p is constant, is possible in constant depth. We leave this folklore observation as an exercise to the reader. Once we make sure that y is lexicographically sorted, we compute a sorted representation of the set Σx={x,σ(x),…,σp−1(x)}, where σ is the permutation in the input of the \textscChevalleyWithSymmetryp problem. Then, we can easily check whether the p-tuple represented by y is the same as the sorted p-tuple Σx. Finally, we observe that edges between x and Σx are only used when x∈Vg∩Vh which again can be checked with constant depth formulas. If these checks pass, then C2 outputs p−1, otherwise it outputs .
For this step we use selectors similarly to Equation (B.1) and sorting as in the description of C2. We consider two different cases for the grouping formula ϕ. When the first argument is in U, i.e. grouping with respect to an assignment, we call the formula ψ and when the first argument is in V, i.e. grouping with respect to monomials/p-tuples, we call the formula χ. Then, ϕ selects between ψ and χ using a selector. This adds at most two layers to ϕ.
First, we describe ψ with inputs x∈U, (s,a,y)∈V and r be the copy of the input edge. We have two cases with respect to whether s=1 or s=0. Let ψ1 be the formula for the first case and ψ2 be the formula for the second case. For the case s=1, we need again to consider two cases: (i) x∈Vg and (ii) x∈Vg. For case (i) we describe the formula ψ11 and for case (ii) we define the formula ψ21. It is easy to see that computing \mathds1(x∈Vg) can be done using a depth 3 formula since g is given in an explicit form. Hence, once again, we can combine ψ11 and ψ21 using a selectors.
The formula ψ11 first computes i⋆=i:1−gip−1(x)=0mini. This is doable in constant depth, since we can compute in parallel the value \mathds1(1−gip−1(x)=0) for all i∈[m1] and then in an extra layer compute for every i whether 1−gip−1(x)=0 and 1−gjp−1(x)=0 for all j<i, which requires just one multiplication gate per i.
We remind that a=0 corresponds to the constant monomial 1 of the polynomial F. If a=0, this case is similar to the previous, except that we use the polynomials gip−1 instead of 1−gip−1, see also the proof of 4.11. If a=0, ψ21 outputs the input edge (1,a,0,1) and p−1 edges of the form (0,0,y,t),t∈[p−1] where y is the lexicographically ordered set Σx.
In this case, the formula ψ2 checks whether the vector y is in lexicographic order as described in the edge counting formula C and a=0. It also checks if x∈Vf1∩Vf2 as described before. If any of these checks fails, the output is 0. Otherwise, if y=Σx, then we output p−1 copies of the edge (0,0,y,t),t∈[p−1], that connects x with y, and the edge (1,0,0,1), that connects x with the constant term of F.
In this case, the input is a monomial ta(x)=∏i=1m1ti,ai(x) and we have to find a variable that appears with degree less than p−1. We first construct a formula χj1 that computes zk, where k is the degree of xj in ta(x). This can be done with a constant size formula that for a given index j multiplies the powers of xj in the monomials of 1−gip−1 appearing in t.
Now, we compute all values χj1(1), …, χj1(p−1) and we check in parallel if at least one of them is different from 1. If this is the case, then the degree of xj in t(x) is less than p−1. Hence, we have computed the formula \bar{\chi}^{1}_{j}(\bm{a})=\mathds{1}(\text{degree ofx_{j}int_{\bm{a}}}\neq p-1). We can find the smallest index j∗ such that χˉj1(a)=1 using the same construction as in ψ1. So, we can construct a formula for each j that is equal to 1 if and only if j=j∗ is the smallest index such that xj∗ has degree less than p−1 in ta. Finally, we use a selector to find the value Cj∗(x)=xj∗−kt(x), by computing Cj(x) for all j. This is done through the product of all variables that appear in ta(x) excluding xj.
For constructing the formula χ2 we first check whether x∈Vf1 and whether y is the lexicographically sorted version of Σx. These can both be done as we have described in the construction of the formula ψ above. If all checks pass, then we output the p edges of the form (z,r) for all z∈Σx, that correspond to the r-th copy of the edge between z and y.
Combining the formulas ψ and χ through a selector concludes the construction of ϕ.