The Classes PPA-$k$: Existence from Arguments Modulo $k$

Alexandros Hollender

Introduction

The complexity class TFNP is the class of all search problems such that every instance has a least one solution and any solution can be checked in polynomial time. It has attracted a lot of interest, because, in some sense, it lies between P and NP. Moreover, TFNP contains many natural problems for which no polynomial-time algorithm is known, such as Factoring (given a integer, find a prime factor) or Nash (given a bimatrix game, find a Nash equilibrium). However, no problem in TFNP can be NP-hard, unless NP == co-NP . Furthermore, it is believed that no TFNP-complete problem exists . Thus, the challenge is to find some way to provide evidence that these TFNP problems are indeed hard.

Papadimitriou proposed the following idea: define subclasses of TFNP and classify the natural problems of interest with respect to these classes. Proving that many natural problems are complete for such a class, shows that they are “equally” hard. Then, investigating how these classes relate to each other, yields a relative classification of all these problems. In other words, it provides a unified framework that gives a better understanding of how these problems relate to each other. TFNP subclasses are based on various non-constructive existence results. Some of these classes and their corresponding existence principle are:

PPAD : given a directed graph and an unbalanced vertex (i.e., out-degree \neq in-degree), there must exist another unbalanced vertex.

PPA : given an undirected graph and vertex with odd degree, there must exist another vertex with odd degree (Handshaking Lemma).

PPP : given a function mapping a finite set to a smaller set, there must exist a collision (Pigeonhole Principle).

Other TFNP subclasses are PPADS, PLS , CLS , PTFNP , EOPL and UEOPL . It is known that PPADPPADSPPP\textup{PPAD}\subseteq\textup{PPADS}\subseteq\textup{PPP}, PPADPPA\textup{PPAD}\subseteq\textup{PPA} and UEOPLEOPLCLSPPADPLS\textup{UEOPL}\subseteq\textup{EOPL}\subseteq\textup{CLS}\subseteq\textup{PPAD}\cap\textup{PLS}. Very recently it was shown that in fact CLS=PPADPLS\textup{CLS}=\textup{PPAD}\cap\textup{PLS} . Any separation between TFNP subclasses would imply P \neq NP, but various oracle separations exist (see Section 2 for more details).

TFNP subclasses have been very successful in capturing the complexity of natural problems. The most famous result is that the problem Nash is PPAD-complete , but various other natural problems have also been shown PPAD-complete . Many local optimisation problems have been proved PLS-complete . Recently, the first natural complete problems were found for PPA and PPP . The famous Factoring problem has been partially related to PPA and PPP .

The natural problem recently shown PPA-complete is a problem in fair division, called the 22-Necklace-Splitting problem . For k2k\geq 2, the premise of the kk-Necklace-Splitting problem is as follows. Imagine that kk thieves have stolen a necklace that has beads of different colours. Since the thieves are unsure of the value of the different beads, they want to divide the necklace into kk parts such that each part contains the same number of beads of each colour. However, the string of the necklace is made of precious metal, so the thieves don’t want to use too many cuts. Alon’s famous result says that this can always be achieved with a limited number of cuts.

The corresponding computational problem can be described as follows. We are given an open necklace (i.e., a segment) with nn beads of cc different colours, i.e., there are aia_{i} beads of colour ii and i=1cai=n\sum_{i=1}^{c}a_{i}=n. Furthermore, assume that for each ii, aia_{i} is divisible by kk (the number of thieves). The goal is to cut the necklace in (at most) c(k1)c(k-1) places and allocate the pieces to the kk thieves, such that every thief gets exactly ai/ka_{i}/k beads of colour ii, for each colour ii. By Alon’s result , a solution always exists, and thus the problem lies in TFNP.

The complexity of this problem has been an open problem for almost 30 years . While the 2-thieves version is now resolved, the complexity of the problem with kk thieves (k3k\geq 3) remains open. The main motivation of the present paper is to investigate the classes PPA-kk, which are believed to be the most likely candidates to capture the complexity of kk-Necklace-Splitting. Indeed, in the conclusion of the paper where they prove that 22-Necklace-Splitting is PPA-complete, Filos-Ratsikas and Goldberg [18, arXiv version] mention:

“What is the computational complexity of kk-thief Necklace-splitting, for kk not a power of 2? As discussed in , the proof that it is a total search problem, does not seem to boil down to the PPA principle. Right now, we do not even know if it belongs to PTFNP .

Interestingly, Papadimitriou in (implicitly) also defined a number of computational complexity classes related to PPA, namely PPA-pp, for a parameter p2p\geq 2. […] Given the discussion above, it could possibly be the case that the principle associated with Necklace-Splitting for kk-thieves is the PPA-kk principle instead.”

PPA-p𝑝p.

The TFNP subclasses PPA-pp were defined by Papadimitriou almost 30 years ago in his seminal paper . Recall that the existence of a solution to a PPA problem is guaranteed by a parity argument, i.e., an argument modulo 22. The classes PPA-pp are a generalisation of this. For every prime pp, the existence of a solution to a PPA-pp problem is guaranteed by an argument modulo pp. In particular, \textup{PPA-2}=\textup{PPA}. Surprisingly, these classes have received very little attention. As far as we know, they have only been studied in the following:

Papadimitriou defined the classes PPA-pp and proved that a problem called Chevalley-mod-pp lies in PPA-pp and a problem called Cubic-Subgraph lies in PPA-33.

In an online thread on Stack Exchange , Jeřábek provided two other equivalent ways to define PPA-33. The problems and proofs can be generalised to any prime pp.

In his thesis , Johnson defined the classes PMODk\text{PMOD}^{k} for any k2k\geq 2, which were intended to capture the complexity of counting arguments modulo kk. He proved various oracle separation results involving his classes and other TFNP classes. While the PPA-pp classes are not mentioned by Johnson, using Jeřábek’s results it is easy to show that \textup{\text{PMOD}^{p}}=\textup{PPA-p} for any prime pp. In Section 6, we characterise PMODk\text{PMOD}^{k} in terms of the classes PPA-pp when kk is not prime. In particular, we show that PMODk\text{PMOD}^{k} only partially captures existence arguments modulo kk.

Our contribution.

Finally, in Section 6 we investigate the classes PMODk\text{PMOD}^{k} defined by Johnson and provide a full characterisation in terms of the classes PPA-kk. In particular, we show that \textup{\text{PMOD}^{k}}=\textup{PPA-k} if kk is a prime power. However, when kk is not a prime power, we provide evidence that PMODk\text{PMOD}^{k} does not capture the full strength of existence arguments modulo kk, unlike PPA-kk. This characterisation of PMODk\text{PMOD}^{k} in terms of PPA-kk leads to some oracle separation results involving PPA-kk and other TFNP classes (using Johnson’s oracle separation results).

We note that a significant fraction of our results were also obtained by Göös, Kamath, Sotiraki and Zampetakis in concurrent and independent work . In their work, they have also provided the first “natural” complete problem for the classes PPA-pp (a variant of Chevalley-mod-pp), namely the first complete problem that does not involve circuits or other computational devices in its description. The present work, and in particular the equivalent characterisations of the classes PPA-kk, have been pivotal in subsequent work showing that the kk-Necklace-Splitting problem lies in PPA-kk under Turing reductions. However, the question of whether kk-Necklace-Splitting is also PPA-kk-hard remains open.

Preliminaries

Let {0,1}\{0,1\}^{*} denote the set of all finite length bit-strings and for w{0,1}w\in\{0,1\}^{*} let w|w| be its length. A computational search problem is given by a binary relation R{0,1}×{0,1}R\subseteq\{0,1\}^{*}\times\{0,1\}^{*}. The problem is: given an instance I{0,1}I\in\{0,1\}^{*}, find an s{0,1}s\in\{0,1\}^{*} such that (I,s)R(I,s)\in R, or return that no such ss exists. The search problem RR is in FNP (Functions in NP), if RR is polynomial-time computable (i.e., (I,s)R(I,s)\in R can be decided in polynomial time in I+s|I|+|s|) and there exists some polynomial pp such that (I,s)R    sp(I)(I,s)\in R\implies|s|\leq p(|I|). Thus, FNP is the search problem version of NP (and FNP-complete problems are equivalent to NP-complete problems under Turing reductions).

The class TFNP (Total Functions in NP ) contains all FNP search problems RR that are total: for every I{0,1}I\in\{0,1\}^{*} there exists s{0,1}s\in\{0,1\}^{*} such that (I,s)R(I,s)\in R. With a slight abuse of notation, we can say that P lies in TFNP. Indeed, if a decision problem is solvable in polynomial time, then both the “yes” and “no” answers can be verified in polynomial time. In this sense, TFNP lies between P and NP.

Note that TFNP problems are not promise problems, i.e., we are not allowed to restrict the instance space {0,1}\{0,1\}^{*}. This means that for any instance in {0,1}\{0,1\}^{*}, there must always exist at least one solution. Nevertheless, TFNP can indirectly capture various settings where the instance space is restricted. For example, if a problem RR in FNP is total only on a subset LL of the instances and LPL\in P, then we can transform it into an equivalent TFNP problem by adding (I,0)(I,0) to RR for all instances ILI\notin L.

Reductions.

Let RR and SS be total search problems in TFNP. We say that RR (many-one) reduces to SS, denoted RSR\leq S, if there exist polynomial-time computable functions f,gf,g such that

Note that if SS is polynomial-time solvable, then so is RR. We say that two problems RR and SS are (polynomial-time) equivalent, if RSR\leq S and SRS\leq R.

There is also a more general type of reduction. A Turing reduction from RR to SS is a polynomial-time oracle Turing machine that solves problem RR with the help of queries to an oracle for SS. Note that a Turing reduction that only makes a single oracle query immediately yields a many-one reduction.

Encoding of Sets.

PPA.

The class PPA (Polynomial Parity Argument) is defined as the set of all TFNP problems that many-one reduce to the problem Leaf : given an undirected graph with maximum degree 2 and a leaf (i.e., a vertex of degree 1), find another leaf. The important thing to note is that the graph is not given explicitly (in which case the problem would be very easy), but it is provided implicitly through a succinct representation.

The vertex set is {0,1}n\{0,1\}^{n} and the undirected graph is represented by a Boolean circuit C:{0,1}nSet2({0,1}n)C:\{0,1\}^{n}\to\textsf{Set}_{\leq 2}(\{0,1\}^{n}). By this we mean that for any x{0,1}nx\in\{0,1\}^{n}, we interpret C(x)C(x) as the set of potential neighbours of xx, where we syntactically enforce that xC(x)x\notin C(x). We say that there is an edge between xx and yy if xC(y)x\in C(y) and yC(x)y\in C(x). Thus, every vertex has at most two neighbours. Note that the size of the graph can be exponential with respect to its description size.

The full formal definition of the problem Leaf is: given a Boolean circuit C:{0,1}nSet2({0,1}n)C:\{0,1\}^{n}\to\textsf{Set}_{\leq 2}(\{0,1\}^{n}) representing an undirected graph on the vertex set {0,1}n\{0,1\}^{n} such that C(0n)=1|C(0^{n})|=1 (i.e., 0n0^{n} is a leaf), find

x0nx\neq 0^{n} such that C(x)=1|C(x)|=1 (another leaf)

or x,yx,y such that xC(y)x\in C(y) but yC(x)y\notin C(x) (an inconsistent edge)

Type 2 Problems and Oracle Separations.

We work in the standard Turing machine model, but TFNP subclasses have also been studied in the black-box model. In this model, one considers the type 2 versions of the problems, namely, the circuits in the input are replaced by black-boxes. In that case, it is possible to prove unconditional separations between type 2 TFNP subclasses (in the standard model this would imply P \neq NP). The interesting point here is that separations between type 2 classes yield separations of the corresponding classes in the standard model with respect to any generic oracle (see for more details on this). This technique has been used to prove various oracle separations between TFNP subclasses . In Section 6 we provide some oracle separations involving PPA-kk and other TFNP subclasses.

On the other hand, any reduction that works in the type 2 setting, also works in the standard setting. Indeed, it suffices to replace the calls to the black boxes by the corresponding circuits that compute them. In this paper, our reductions are stated in the standard model, but they also work in the type 2 setting, because they don’t examine the inner workings of the circuits.

Definition of the Classes

For any prime pp, Papadimitriou defined the class PPA-pp as the set of all TFNP problems that many-one reduce to the following problem, that we call Bipartite-mod-pp: We are given an undirected bipartite graph (implicitly represented by a circuit) and a vertex with degree 0modp\neq 0\mod p (which we call the trivial solution). The goal is to find another such vertex. This problem lies in TFNP: if all other vertices had degree =0modp=0\mod p, then the sum of the degrees of all vertices on each side would have a different value modulo pp, which is impossible.

The problem remains well-defined and total if pp is not a prime, and so we will instead define it for any k2k\geq 2. Let us now provide a formal definition of the problem. A vertex of the bipartite graph is represented as a bit-string in {0,1}×{0,1}n\{0,1\}\times\{0,1\}^{n}, where the first bit indicates whether the vertex lies on the “left” or “right” side of the bipartite graph. The graph will be represented by a Boolean circuit that outputs a set of potential neighbours, just as we did for Leaf. Instead of at most two neighbours, here we allow at most kk neighbours (see Remark 1 for why this is enough). Note that we can syntactically enforce that the graph is bipartite, i.e., that a vertex 0x0x can only have neighbours of the type 1y1y and vice versa.

Let k2k\geq 2. The problem Bipartite-mod-kk is defined as: given a Boolean circuit C:{0,1}×{0,1}nSetk({0,1}×{0,1}n)C:\{0,1\}\times\{0,1\}^{n}\to\textsf{Set}_{\leq k}(\{0,1\}\times\{0,1\}^{n}) representing a bipartite graph on the vertex set ({0}×{0,1}n,{1}×{0,1}n)(\{0\}\times\{0,1\}^{n},\{1\}\times\{0,1\}^{n}) with C(00n){1,,k1}|C(00^{n})|\in\{1,\dots,k-1\}, find

x00nx\neq 00^{n} such that C(x){0,k}|C(x)|\notin\{0,k\}

or x,yx,y such that yC(x)y\in C(x) but xC(y)x\notin C(y).

Here the trivial solution is the vertex 00n00^{n}. The first type of solution corresponds to a vertex with degree 0modk\neq 0\mod k. The second type of solution corresponds to an edge that is not well-defined. We can always ensure that all edges are well-defined by doing some pre-processing. Indeed, in polynomial time we can construct a circuit CC^{\prime} such that all solutions are of the first type and yield a solution for CC. On input 0x0x the circuit CC^{\prime} first computes C(0x)={1y1,,1ym}C(0x)=\{1y_{1},\dots,1y_{m}\} and then for each ii removes 1yi1y_{i} from this set, if 0xC(1yi)0x\notin C(1y_{i}).

Note that in this problem statement we require that all degrees lie in {0,1,,k}\{0,1,\dots,k\}. This is easily seen to be equivalent to the more general formulation where vertices can have more than kk neighbours. Indeed, any vertex that has more than kk edges can be split into multiple copies such that all the copies have or kk edges, except for one copy which is allowed to have any number of edges in {0,1,,k}\{0,1,\dots,k\}. A solution of the original instance is then easily recovered from a solution of this modified instance. Note that since the set of neighbours is given as the output of a circuit, it will have length bounded by some polynomial in the input size and so this argument can indeed be applied.

For any k2k\geq 2, the class PPA-kk is defined as the set of all TFNP problems that many-one reduce to Bipartite-mod-kk.

Recall that PPA can be defined using the canonical complete problem Leaf : given an undirected graph where every vertex has degree at most 2, and a leaf (i.e., degree =1=1), find another leaf. This immediately yields PPA-22 \subseteq PPA, since Bipartite-mod-22 is just a special case of Leaf where the graph is bipartite.

Given an instance of Leaf with graph G=({0,1}n,E)G=(\{0,1\}^{n},E) we construct an instance of Bipartite-mod-22 on the vertex set {0,1}×{0,1}2n\{0,1\}\times\{0,1\}^{2n} as follows. For any u{0,1}nu\in\{0,1\}^{n} we have a vertex xu:=0u0nx_{u}:=0u0^{n} on the left side of the bipartite graph. For any edge {u,v}E\{u,v\}\in E (u,vu,v ordered lexicographically) we have a vertex yuv:=1uvy_{uv}:=1uv on the right side of the bipartite graph and we create the edges {xu,yuv}\{x_{u},y_{uv}\} and {xv,yuv}\{x_{v},y_{uv}\}. All other vertices in {0,1}×{0,1}2n\{0,1\}\times\{0,1\}^{2n} are isolated. In polynomial time we can construct a circuit that computes the neighbours of any vertex. Furthermore, w{0,1}nw\in\{0,1\}^{n} is a leaf, if and only if xwx_{w} has degree 1. Finally, all vertices on the right-hand side have degree 0 or 2. ∎

In the definition of the PPA-kk-complete problem Bipartite-mod-kk (Definition 1) the degree of the trivial solution 00n00^{n} can be any number in {1,,k1}\{1,\dots,k-1\}. In this section we define more refined classes where the degree of the trivial solution is fixed. In Section 5, these classes will be very useful to describe how the PPA-kk classes relate to each other. These definitions are inspired by the corresponding “counting principles” studied in Beame et al. that were also defined in a refined form in order to describe how they relate to each other. We believe that these refined classes will also be useful to capture the complexity of natural problems. Note that for k=2k=2, the degree of the trivial solution will always be 11 and thus the question does not even appear in the study of PPA.

Note that this problem remains in TFNP, since the condition can be checked efficiently.

Let R0R_{0} and R1R_{1} be two TFNP problems. Then the problem R0&R1R_{0}\operatorname*{\&}R_{1} is defined as: given an instance I0I_{0} of R0R_{0}, an instance I1I_{1} of R1R_{1} and a bit b{0,1}b\in\{0,1\}, find a solution to IbI_{b}.

We extend the &\operatorname*{\&} operation to TFNP subclasses in the natural way. Let C0C_{0} and C1C_{1} be TFNP subclasses with complete problems R0R_{0} and R1R_{1} respectively. Then C0&C1C_{0}\operatorname*{\&}C_{1} is the class of all TFNP problems that many-one reduce to R0&R1R_{0}\operatorname*{\&}R_{1}. Note that the choice of complete problems does not matter. Intuitively, this class contains all problems that can be solved in polynomial time by a Turing machine with a single oracle query to either C0C_{0} or C1C_{1}.

Together with Lemma 1, Lemma 2 yields, e.g., PPA-66 == PPA-6[#2]6[\#2] &\operatorname*{\&} PPA-6[#3]6[\#3].

Equivalent Definitions

In this section we show that PPA-kk can be defined by using other problems instead of Bipartite-mod-kk. The totality of these problems is again based on arguments modulo kk. By showing that these problems are indeed PPA-kk-complete, we provide additional support for the claim that PPA-kk captures the complexity of “polynomial arguments modulo kk”. While these problems are not “natural” and thus not interesting in their own right, they provide equivalent ways of defining PPA-kk, which can be very useful when working with these classes. In particular, we make extensive use of these equivalences in this work.

The TFNP problems we consider are the following:

Imbalance-mod-kk : given a directed graph and a vertex that is unbalanced-mod-kk, i.e., out-degree - in-degree 0modk\neq 0\mod k, find another such vertex.

Hypergraph-mod-kk : given a hypergraph and a vertex that has degree 0modk\neq 0\mod k, find another such vertex or a hyperedge that has size k\neq k.

Partition-mod-kk : given a set of size 0modk\neq 0\mod k and a partition into subsets, find a subset that has size k\neq k.

Imbalance-mod-kk, Hypergraph-mod-kk, Partition-mod-kk are PPA-kk-complete.

The problem Imbalance-mod-kk is a generalisation of the PPAD-complete problem Imbalance : given a directed graph and a vertex that is unbalanced (i.e., out-degree - in-degree 0\neq 0), find another unbalanced vertex. It is known that in Imbalance we can assume without loss of generality that the given vertex has imbalance exactly 11. As a result, Imbalance trivially reduces to Imbalance-mod-kk, and thus Theorem 1 also yieldsThis observation was also made by Jeřábek for the classes PPA-pp (pp prime).:

For all k2k\geq 2, we have PPAD \subseteq PPA-kk.

Furthermore, if we use the convention that a=bmod0a=b\mod 0 if and only if a=ba=b, then Imbalance-mod- actually corresponds to Imbalance. Thus, in a certain sense we could define \textup{PPA-0}=\textup{PPAD}. On the other hand, Imbalance-mod-11 is a trivial problem.

Let k2k\geq 2. The problem Imbalance-mod-kk is defined as: given Boolean circuits S,P:{0,1}nSetk({0,1}n)S,P:\{0,1\}^{n}\to\textsf{Set}_{\leq k}(\{0,1\}^{n}) representing a directed graph on the vertex set {0,1}n\{0,1\}^{n} with S(0n)P(0n){k,0,k}|S(0^{n})|-|P(0^{n})|\notin\{-k,0,k\}, find

x0nx\neq 0^{n} such that S(x)P(x){k,0,k}|S(x)|-|P(x)|\notin\{-k,0,k\}

or x,yx,y such that yS(x)y\in S(x) but xP(y)x\notin P(y), or yP(x)y\in P(x) but xS(y)x\notin S(y).

Hypergraph.

A hypergraph on the vertex set {0,1}n\{0,1\}^{n} is represented as follows. For every vertex x{0,1}nx\in\{0,1\}^{n}, a circuit C:{0,1}nSetk(Setk({0,1}n))C:\{0,1\}^{n}\to\textsf{Set}_{\leq k}(\textsf{Set}_{\leq k}(\{0,1\}^{n})) outputs the set C(x)C(x) of all hyperedges containing xx, where each hyperedge is a set of vertices in {0,1}n\{0,1\}^{n}. As usual, we only need to consider the case where every vertex is contained in at most kk hyperedges and every hyperedge has size at most kk. A hyperedge {x1,,xm}\{x_{1},\dots,x_{m}\} exists in the hypergraph, if all the vertices involved indeed agree that it is present, i.e., if {x1,,xm}C(xi)\{x_{1},\dots,x_{m}\}\in C(x_{i}) for all i{1,,m}i\in\{1,\dots,m\}.

Let k2k\geq 2. The problem Hypergraph-mod-kk is defined as: given a Boolean circuit C:{0,1}nSetk(Setk({0,1}n))C:\{0,1\}^{n}\to\textsf{Set}_{\leq k}(\textsf{Set}_{\leq k}(\{0,1\}^{n})) representing a hypergraph on the vertex set {0,1}n\{0,1\}^{n} with C(0n){0,k}|C(0^{n})|\notin\{0,k\}, find

x0nx\neq 0^{n} such that C(x){0,k}|C(x)|\notin\{0,k\}

or xx such that C(x)C(x) contains a hyperedge of size k\neq k

or x,yx,y such that C(x)C(x) and C(y)C(y) are not consistent with one another.

Note that for k=2k=2 this problem essentially corresponds to the PPA-complete problem Leaf and its (equivalent) generalisation Odd : given an undirected graph and a vertex with odd degree, find another one.

Partition.

A partition of {0,1}n\{0,1\}^{n} is represented by a Boolean circuit C:{0,1}n{0,1}nC:\{0,1\}^{n}\to\{0,1\}^{n} as follows: x{0,1}nx\in\{0,1\}^{n} lies in the subset given by the orbit of xx with respect to CC, i.e., {Ci(x):i0}\{C^{i}(x):i\geq 0\}, where Ci(x)=C(C(C(x)))C^{i}(x)=C(C(\dots C(x))\dots) (ii times). The problem we define below is based on the simple observation that a base set of size 0modk\neq 0\mod k cannot be partitioned into sets of size kk. The base set consists of all elements in {0,1}n\{0,1\}^{n} except for mm elements that have been removed, for some m<2nm<2^{n} such that 2nm0modk2^{n}-m\neq 0\mod k. Here it is convenient to identify {0,1}n\{0,1\}^{n} with {0,1,,2n1}\{0,1,\dots,2^{n}-1\} in the natural way. Thus, we can think of the base set as simply being {m,m+1,,2n1}\{m,m+1,\dots,2^{n}-1\}.

Let k2k\geq 2. The problem Partition-mod-kk is defined as: given m<2nm<2^{n} with 2nm0modk2^{n}-m\neq 0\mod k and a Boolean circuit C:{0,1}n{0,1}nC:\{0,1\}^{n}\to\{0,1\}^{n}, such that C(x)=xC(x)=x for all x<mx<m, find

or x{0,1}nx\in\{0,1\}^{n} such that Ck(x)xC^{k}(x)\neq x

The condition “C(x)=xC(x)=x for all x<mx<m” corresponds to excluding elements that do not lie in the base set and it can be enforced syntactically. The first solution type corresponds to finding a set in the partition such that its size divides kk (but is k\neq k), while the second solution type corresponds to finding a set such that its size does not divide kk. Note that a solution is guaranteed to exist since 2nm0modk2^{n}-m\neq 0\mod k.

The definition of this problem can be modified in various ways without changing its complexity. For instance, the first solution type can be changed to simply ask for xmx\geq m such that Cd(x)=xC^{d}(x)=x for some d<kd<k. We have defined the problem in a slightly more complicated way to make the connection with the MODk\textup{MOD}^{k} problems more immediate (see Section 6). Yet another equivalent way of defining the problem would be to consider a Boolean circuit C:{0,1}nSetk({0,1}n)C:\{0,1\}^{n}\to\textsf{Set}_{\leq k}(\{0,1\}^{n}) where C(x){0,1}nC(x)\subseteq\{0,1\}^{n} is interpreted as the set containing xx in the partition. A solution would then be any xmx\geq m with C(x)<k|C(x)|<k or any x,yx,y witnessing an inconsistency in the partition given by CC.

2 Proof of Theorem 1

Mitosis gadgets.

Let k2k\geq 2. We now show how to construct a small bipartite graph such that exactly one vertex on each side has degree 11 and all other vertices have degree kk (or ). This “gadget” can then be used to increase the degree of two vertices (one on each side of the bipartite graph) without adding any solutions, i.e., vertices with degree 0modk\neq 0\mod k.

The gadget is a bipartite graph with k+1k+1 vertices on each side: a1,,ak+1a_{1},\dots,a_{k+1} and b1,,bk+1b_{1},\dots,b_{k+1}. It contains all the edges {ai,bj}\{a_{i},b_{j}\} for i,jki,j\leq k, except the edge {ak,bk}\{a_{k},b_{k}\}. It also contains the edges {ak,bk+1}\{a_{k},b_{k+1}\} and {ak+1,bk}\{a_{k+1},b_{k}\}. Thus, all vertices have degree kk, except for ak+1a_{k+1} and bk+1b_{k+1} which have degree 11.

We call this the “Mitosis” gadget, because it allows us to duplicate edges that already exist. Let uu and vv be two vertices in a bipartite graph, one on each side. Furthermore, consider the case where there is an edge {u,v}\{u,v\}. We would like to increase the degree of uu and vv by 11, but without introducing any new solutions, in particular without introducing any vertex with degree 0modk\neq 0\mod k. Using the Mitosis gadget, we can just add new vertices a1,,aka_{1},\dots,a_{k} and b1,,bkb_{1},\dots,b_{k}, and identify ak+1a_{k+1} with uu and bk+1b_{k+1} with vv. Adding the corresponding vertices of the gadget yields a bipartite graph where the degree of uu and vv has increased by 11, but no new solutions have been introduced. Note that this gadget can, in particular, be used to turn a bipartite graph with multi-edges into one without them, without changing the degree of existing vertices and without adding any new solutions.

Relationship Between the Classes

In this section, we present some results that provide deeper insights into how the classes relate to each other. For any k2k\geq 2, PF(k)\operatorname{\mathsf{PF}}(k) denotes the set of all prime factors of kk. The main conceptual result is that PPA-kk is entirely determined by the set of prime factors of kk:

For any k2k\geq 2 we have PPA-kk == &pPF(k)\operatorname*{\&}\limits_{p\in\operatorname{\mathsf{PF}}(k)} PPA-pp.

This equation can be understood as saying the following:

Given a single query to an oracle for PPA-kk, we can solve any problem in PPA-pp for any pPF(k)p\in\operatorname{\mathsf{PF}}(k)

Given a single query to an oracle that solves any PPA-pp problem for any pPF(k)p\in\operatorname{\mathsf{PF}}(k), we can solve any problem in PPA-kk.

For k1,k22k_{1},k_{2}\geq 2, if PF(k1)PF(k2)\operatorname{\mathsf{PF}}(k_{1})\subseteq\operatorname{\mathsf{PF}}(k_{2}), then PPA-k1k_{1} \subseteq PPA-k2k_{2}.

For all k1,k22k_{1},k_{2}\geq 2, PPA-k1k2k_{1}k_{2} == PPA-k1k_{1} &\operatorname*{\&} PPA-k2k_{2}.

For all k2k\geq 2 and all r1r\geq 1 we have PPA-krk^{r} == PPA-kk.

The proof of Theorem 3 can be found in the next section. Before we move on to that, let us briefly show that Theorem 2 follows from Theorem 3.

All containment results follow from Theorem 4 below, except

Inspired by the definition of the PPA-complete problem Lonely , Buss and Johnson defined TFNP problems called MODp\textup{MOD}^{p} to represent arguments modulo some prime pp. Their main motivation was to use these problems to show separations (in the type 2 setting) between Turing reductions with mm oracle queries and Turing reductions with m+1m+1 oracle queries. In his thesis , Johnson generalised the definition of MODk\textup{MOD}^{k} to any k2k\geq 2 and defined corresponding classes PMODk\text{PMOD}^{k}. He also proved some separations between these classes and other TFNP classes in the type 2 setting (which yield oracle separations in the standard setting). It seems that Johnson was not aware of Papadimitriou’s PPA-pp classes.

In this section, we study the classes PMODk\text{PMOD}^{k} and prove a characterisation in terms of the classes PPA-pp. In particular, we show that PMODk\text{PMOD}^{k} does not capture the full strength of arguments modulo kk, when kk is not a prime power. This characterisation also allows us to use Johnson’s separations to obtain some oracle separations involving PPA-kk and other TFNP classes.

Informally, the problem MODk\textup{MOD}^{k} can be defined as follows. We are given a partition of {0,1}n\{0,1\}^{n} into subsets and the goal is to find one of these subsets that has size k\neq k. If kk is not a power of 22, then such a subset must exist. If kk is a power of 22, then we instead consider {0,1}n{0n}\{0,1\}^{n}\setminus\{0^{n}\} and the problem remains total.

Let k2k\geq 2. The problem MODk\textup{MOD}^{k} is defined as: given a Boolean circuit CC with nn inputs and outputs,

or x{0,1}nx\in\{0,1\}^{n} such that Ck(x)xC^{k}(x)\neq x

If kk is a power of 22: Let additionally C(0n)=0nC(0^{n})=0^{n} and find

or x{0,1}nx\in\{0,1\}^{n} such that Ck(x)xC^{k}(x)\neq x

For any k2k\geq 2, the class PMODk\text{PMOD}^{k} is defined as the set of all TFNP problems that many-one reduce to MODk\textup{MOD}^{k}.

Johnson proves a lemma [25, Lemma 7.4.5] that gives some idea of how the PMODk\text{PMOD}^{k} classes relate to each other. It can be stated as follows: if k=p1p2prk=p_{1}p_{2}\dots p_{r}, where the pip_{i} are distinct primes, then \textup{\text{PMOD}^{k}}=\cap_{i}\textup{\text{PMOD}^{p_{i}}}. He proves this if all pi2p_{i}\neq 2 and claims that the proof also works if some pi=2p_{i}=2. However, if some pi=2p_{i}=2 then the proof does not work. This is easy to see, since our results below prove that \textup{\text{PMOD}^{6}}=\textup{\text{PMOD}^{3}} which is not equal to \textup{\text{PMOD}^{2}}\cap\textup{\text{PMOD}^{3}}, unless \textup{\text{PMOD}^{2}}\subseteq\textup{\text{PMOD}^{3}}. However, Johnson proves that \textup{\text{PMOD}^{2}}\not\subseteq\textup{\text{PMOD}^{3}} in the type 2 setting.

The following result provides a full characterisation of PMODk\text{PMOD}^{k} in terms of the classes PPA-pp.

If kk is not a power of 22, then \textup{\text{PMOD}^{k}}=\textup{PPA-\widetilde{k}[\#1]}=\cap_{p\in\operatorname{\mathsf{PF}}(\widetilde{k})}\textup{PPA-p} where k~\widetilde{k} is the largest odd divisor of kk.

If kk is a power of 22, then \textup{\text{PMOD}^{k}}=\textup{PPA-2}.

The proof of Theorem 5 is given below in Section 6.1.

for all primes pp and r1r\geq 1, \textup{\text{PMOD}^{p^{r}}}=\textup{PPA-p^{r}}=\textup{PPA-p}

for all k2k\geq 2, \textup{\text{PMOD}^{2k}}=\textup{\text{PMOD}^{k}}

for all odd k3k\geq 3, \textup{\text{PMOD}^{k}}=\textup{PPA-k[\#1]}=\cap_{p\in\operatorname{\mathsf{PF}}(k)}\textup{PPA-p}

If kk is a prime power, then PMODk\text{PMOD}^{k} is the same as PPA-kk. However, for other values of kk, we argue that PMODk\text{PMOD}^{k} fails to capture the full strength of arguments modulo kk. For example, \textup{\text{PMOD}^{15}}=\textup{PPA-15[\#1]}=\textup{PPA-3}\cap\textup{PPA-5}, whereas \textup{PPA-15}=\textup{PPA-3}\operatorname*{\&}\textup{PPA-5}. This means that PPA-1515 can solve any problem that lies in PPA-33 or PPA-55, while PMOD15\text{PMOD}^{15} can only solve problems that lie both in PPA-33 and PPA-55. In particular, if \textup{\text{PMOD}^{15}}=\textup{PPA-15}, then it would follow that \textup{PPA-3}=\textup{PPA-5}, which is not believed to hold (see oracle separations below). Even worse perhaps, is the fact that \textup{\text{PMOD}^{2k}}=\textup{\text{PMOD}^{k}} for any k2k\geq 2. In particular, this means that \textup{\text{PMOD}^{6}}=\textup{\text{PMOD}^{3}}, which indicates that PMOD6\text{PMOD}^{6} does not really capture arguments modulo 66.

Nevertheless, Johnson’s oracle separation results (obtained from the corresponding type 2 separations as in ) also yield corresponding results for the PPA-kk classes (using Theorem 5). We briefly mention a few of the results obtained this way. See Johnson [25, Chapter 8] for additional results. Relative to any generic oracle (see ):

\textup{PPA-p}\not\subseteq\textup{PPA-q} for any distinct primes p,qp,q

\textup{PPA-k}\not\subseteq\textup{PPP}, \textup{PPA-k}\not\subseteq\textup{PLS}, \textup{PPA-k}\not\subseteq\textup{PPADS} for any k2k\geq 2

\textup{PPP}\not\subseteq\textup{PPA-p}, \textup{PLS}\not\subseteq\textup{PPA-p} for any prime pp

For k=2k=2, MOD2\textup{MOD}^{2} corresponds to the PPA-complete problem Lonely , and thus \textup{\text{PMOD}^{2}}=\textup{PPA}=\textup{PPA-2}. Let r2r\geq 2. Consider an instance (C,m)(C,m) of Partition-mod-2r[#(2r1)]2^{r}[\#(2^{r}-1)] on the set {0,1}n\{0,1\}^{n}. Without loss of generality, assume nrn\geq r. Then 2n=0mod2r2^{n}=0\mod 2^{r} and thus m=2n(2nm)=(2r1)mod2r=1mod2rm=2^{n}-(2^{n}-m)=-(2^{r}-1)\mod 2^{r}=1\mod 2^{r}. This means that we can (efficiently) partition {0,1,,m1}\{0,1,\dots,m-1\} into subsets of size 2r2^{r}, leaving only 0=0n0=0^{n} out. Thus, we have reduced Partition-mod-2r[#(2r1)]2^{r}[\#(2^{r}-1)] to MOD2r\textup{MOD}^{2^{r}}. Since \textup{PPA-2^{r}[\#(2^{r}-1)]}=\textup{PPA-2} (Theorem 3), we obtain \textup{PPA-2}\subseteq\textup{\text{PMOD}^{2^{r}}}. On the other hand we also have \textup{\text{PMOD}^{2^{r}}}\subseteq\textup{PPA-2^{r}}=\textup{PPA-2} by Corollary 2.

Consider some k3k\geq 3 that is not a power of 22. First, let us show that \textup{\text{PMOD}^{2k}}=\textup{\text{PMOD}^{k}}. MOD2k\textup{MOD}^{2k} reduces to MODk\textup{MOD}^{k} by splitting every subset into two subsets of size kk (or less, if the subset has size <2k<2k). Conversely, consider an instance of MODk\textup{MOD}^{k} on the set {0,1}n\{0,1\}^{n}. Make a copy of the instance, thus obtaining an instance on the set {0,1}n+1\{0,1\}^{n+1}. For every subset of the original instance, take the union with its copy. If the subset had size kk, the new subset has size 2k2k. Thus, we have reduced to MOD2k\textup{MOD}^{2k}.

Let k3k\geq 3 be coprime with 22. We will show \textup{\text{PMOD}^{k}}=\textup{PPA-k[\#1]}. Consider an instance of MODk\textup{MOD}^{k} on the set {0,1}n\{0,1\}^{n}. Since kk and 22 are coprime, there exists i{0,,k1}i\in\{0,\dots,k-1\} such that 2n+i=1modk2^{n+i}=1\mod k (e.g., by using Euler’s theorem). Thus, we take 2i2^{i} copies of the instance and obtain an instance on the set {0,1}n+i\{0,1\}^{n+i}, which is an instance of Partition-mod-k[#1]k[\#1] (with m=0m=0), since 2n+i=1modk2^{n+i}=1\mod k. Conversely, consider an instance (C,m)(C,m) of Partition-mod-k[#1]k[\#1] on the set {0,1}n\{0,1\}^{n}. As before, there exists i{0,,k1}i\in\{0,\dots,k-1\} such that 2n+i=1modk2^{n+i}=1\mod k. We construct an instance CC^{\prime} of MODk\textup{MOD}^{k} on {0,1}n+i\{0,1\}^{n+i} as follows. The element x{0,1}nx\in\{0,1\}^{n} of the original instance corresponds to the element 1ix{0,1}n1^{i}x\in\{0,1\}^{n} of the new instance. If xmx\geq m, set C(1ix)=1iC(x)C^{\prime}(1^{i}x)=1^{i}C(x). The number of elements that have not yet been assigned to a subset is m+(2i1)2n=(m2n)+2n+i=0modkm+(2^{i}-1)2^{n}=(m-2^{n})+2^{n+i}=0\mod k. Thus, we can efficiently partition them into subsets of size kk without introducing any solution. We have obtained an instance of MODk\textup{MOD}^{k}.

Many-one vs Turing Reductions

For any prime p2p\geq 2, PPA-pp is closed under Turing reductions.

In particular, PPA-prp^{r} = PPA-pp is also closed under Turing reductions. The proof of Theorem 6 can be found in Section 7.1. Furthermore, we also obtain:

If kk is not a prime power, then it is not known whether PPA-kk is closed under Turing reductions. Using our results from Section 6, we can actually provide an oracle separation between PPA-kk and the Turing-closure of PPA-kk, i.e., an oracle under which PPA-kk is not closed under Turing reductions. Let R1,,RkR_{1},\dots,R_{k} be TFNP problems. Following Johnson we define j=1kRj\bigotimes_{j=1}^{k}R_{j} as the problem: given instances (I1,,Ik)(I_{1},\dots,I_{k}), where IjI_{j} is an instance of RjR_{j}, solve IjI_{j} for all jj. As we did with the &\operatorname*{\&} operation, with a slight abuse of notation, we can also use the operation \otimes with the PPA-kk classes. In [25, Theorem 7.6.1], Johnson proved that for m2m\geq 2 and distinct primes p1,,pmp_{1},\dots,p_{m}, i=1m\bigotimes_{i=1}^{m} MODpi\textup{MOD}^{p_{i}} does not many-one reduce to &i=1m\operatorname*{\&}_{i=1}^{m} MODpi\textup{MOD}^{p_{i}} in the type 2 setting. Together with our Theorems 2 and 5 this yields:

Let k2k\geq 2 not a power of a prime. Relative to any generic oracle, it holds that \bigotimes_{p\in\operatorname{\mathsf{PF}}(k)}\textup{PPA-p}\not\subseteq\textup{PPA-k}. In particular, relative to any generic oracle, PPA-kk is not closed under Turing reductions.

S=\bigotimes_{p\in\operatorname{\mathsf{PF}}(k)}\textup{PPA-p} corresponds to solving PPA-pp for all prime factors pp of kk simultaneously. In particular, this can be done by using PF(k)|\operatorname{\mathsf{PF}}(k)| queries to PPA-kk, i.e., a Turing reduction to PPA-kk. Thus, SS lies in the Turing closure of PPA-kk, but not in PPA-kk (relative to any generic oracle).

We essentially apply the same technique that was used by Buss and Johnson to show that PPA, PPAD, PPADS and PLS are closed under Turing reductions.

Let Π\Pi be a problem that Turing-reduces to some problem in PPA-pp. This means that there exists a Turing machine MM with access to a PPA-pp-oracle that solves Π\Pi in polynomial time. Since Imbalance-mod-pp is PPA-pp-complete (Theorem 1), we assume that the oracle provides solutions to Imbalance-mod-pp instances. Our goal is to show that all the oracle queries can be combined into a single one. Indeed, a Turing reduction that always uses a single oracle query immediately yields a many-one reduction. Thus, by the definition of PPA-pp, this would yield \Pi\in\textup{PPA-p}.

We begin by showing that any Imbalance-mod-pp-instance can be efficiently transformed into an instance that has a particular form, namely: the starting node has imbalance +1+1 (in-degree and out-degree 11), and any solution has imbalance 1-1 (in-degree 11 and out-degree ). This can be achieved by the following steps:

Ensure that all vertices have in- and out-degree at most pp (by splitting vertices into multiple copies).

Ensure that any unbalanced vertex has in- or out-degree (by creating a copy that will take all the edges that yield the imbalance).

Since pp is prime, we can ensure that the starting vertex has imbalance +1+1.

Ensure that all vertices that have imbalance 0modp\neq 0\mod p, actually have imbalance +1+1 or 1-1 (by splitting every such vertex into pp vertices, each getting at most one edge).

Transform every solution that has imbalance +1+1 into p1p-1 solutions with imbalance 1-1 instead (by pointing to p1p-1 new vertices).

It remains to show that this graph GG can be constructed in polynomial time from II, i.e., we can efficiently construct circuits that compute the edges incident on any given node. This is easy to see, because any node contains enough information to simulate a run of MM up to the point that is needed to determine the neighbours in GG. We omit the full details, since the formal arguments are analogous to the ones in the corresponding proofs in .

Acknowledgements

I would like to thank Aris Filos-Ratsikas and Paul Goldberg for helpful discussions, as well as an anonymous reviewer for suggestions that helped improve the presentation of the paper. This work was supported by an EPSRC doctoral studentship (Reference 1892947).

References

Appendix A Technical Lemmas for Theorem 4

The proof ideas from are used to construct some of these reductions.

The circuit CC^{\prime} determines the image of aSa\in S by first computing x1,,xsx_{1},\dots,x_{s} and α1,,αs\alpha_{1},\dots,\alpha_{s} as described above, and determining the smallest index ii as explained above. Let ai=aO(xi)a_{i}=a\cap O(x_{i}). The circuit outputs

Let k1,k22k_{1},k_{2}\geq 2. If all prime factors of k2k_{2} also divide k1k_{1}, then \textup{PPA-k_{1}[\#1]}\subseteq\textup{PPA-k_{2}[\#1]}.

Similarly to our proof of Lemma 5, we adapt the proof of the corresponding statement for the counting formulas from Beame et al. [2, Lemma 2.5] in order to obtain a polynomial-time reduction.

The final step is to set m=2nr(2nm)rm^{\prime}=2^{nr}-(2^{n}-m)^{r} and construct an efficient bijection between {m,,2nr1}\{m^{\prime},\dots,2^{nr}-1\} and {m,,2n1}r\{m,\dots,2^{n}-1\}^{r}, which is easy to do. ∎