The Complexity of Splitting Necklaces and Bisecting Ham Sandwiches

Aris Filos-Ratsikas, Paul W. Goldberg

Computational complexity; Tucker’s Lemma; TFNP; fair division

Introduction

The complexity classes PPA and PPAD were introduced in a seminal paper of Papadimitriou in 1994, in an attempt to classify several natural problems in the class TFNP . TFNP is the class of total search problems in NP for which a solution exists for every instance, and solutions can be efficiently verified. Various important problems were subsequently proven to be complete for the class PPAD, such as the complexity of many versions of Nash equilibrium , market equilibrium computation , and others . As evidence of computational hardness, PPA-completeness is stronger than PPAD-completeness, i.e., $\mbox{{PPAD}}\subseteq\mbox{{PPA}}$ . Indeed, Jeřábek shows that it indicates cryptographic hardness in a strong sense: gives a randomised reduction from FACTORING to PPA-complete problems. This is not known for PPAD-complete problems. For more details, and the significance of PPA-completeness, we refer the reader to the related discussion in . PPA is the class of problems reducible to Leaf (Definition 1), and a PPA-complete problem is polynomial-time equivalent to Leaf.

An instance of the problem Leaf consists of an undirected graph $G$ whose vertices have degree at most 2; $G$ has $2^{n}$ vertices represented by bitstrings of length $n$ ; $G$ is presented concisely via a circuit that takes as input a vertex and outputs its neighbour(s). We stipulate that vertex $0^{n}$ has degree 1. The challenge is to find some other vertex having degree 1.

Complete problems for the class PPA seemed to be much more elusive than PPAD-complete ones, especially when one is interested in “natural” problems, where “natural” here has the very specific meaning of problems that do not explicitly contain a circuit in their definition. Besides Papadimitriou , other papers asking about the possible existence of natural PPA-complete problems include . In a recent precursor to the present paper we identified the first example of such a problem, namely the approximate Consensus-halving problem, dispelling the suspicion that such problems might not exist. In this paper we build on that result and settle the complexity of two natural and important problems whose complexity status were raised explicitly as open problems in Papadimitriou’s paper itself, and in many other papers beginning in the 1980s. Specifically, we prove that Necklace-splitting (with two thieves, see Definition 2) and Discrete Ham Sandwich are both PPA-complete.

In the $k$ -Necklace-splitting problem there is an open necklace with $ka_{i}$ beads of colour $i$ , for $1\leq i\leq n$ . An “open necklace” means that the beads form a string, not a cycle. The task is to cut the necklace in $(k-1)\cdot n$ places and partition the resulting substrings into $k$ collections, each containing precisely $a_{i}$ beads of colour $i$ , $1\leq i\leq n$ .

In Definition 2, $k$ is thought of as the number of thieves who desire to split the necklace in such a way that the beads of each colour are equally shared. In this paper, usually we have $k=2$ and we refer to this special case as Necklace-splitting.

In the Discrete Ham Sandwich problem, there are $n$ sets of points in $n$ dimensions having integer coordinates (equivalently one could use rationals). A solution consists of a hyperplane that splits each set of points into subsets of equal size (if any points lie on the plane, we are allowed to place them on either side, or even split them arbitrarily).

In Definition 3, each point set represents an ingredient of the sandwich, which is to be cut by a hyperplane in such a way that all ingredients are equally split.

The necklace-splitting problem was introduced in a 1982 paper of Bhatt and Leiserson (, Section 5), where it arose in the context of VLSI circuit design (the version defined in is the 2-thief case proved PPA-complete in the present paper). In 1985 and 1986, the 2-thief case was shown to have guaranteed solutions (as defined in Definition 2) by Goldberg and West and Alon and West and then in 1987, Alon proved existence of solutions for $k$ thieves as well. Early papers that explicitly raise its complexity-theoretic status as an open problem are Goldberg and West and Alon . Subsequently, the necklace-splitting problem was found to be closely related to “paint-shop scheduling”, a line of work in which several papers such as explicitly mention the question of the computational complexity of necklace-splitting. Meunier notes that the search for a minimum number of cuts admitting a fair division (which may be smaller than the number $(k-1)n$ that is guaranteed to suffice) is NP-hard, even for a subclass of instances of the 2-thief case. (That is a result of Bonsma et al. , for the “paint shop problem with words”, equivalent to 2-thief Necklace-splitting with 2 beads of each colour.)

In , we showed Necklace-splitting to be computationally equivalent to $\varepsilon$ -Consensus-halving for inverse-polynomial precision parameter $\varepsilon$ , but the PPA-completeness of $\varepsilon$ -Consensus-halving was only shown for inverse-exponential $\varepsilon$ . established PPAD-hardness of Necklace-splitting, applying the main result of . In this paper, we prove that $\varepsilon$ -Consensus-halving is PPA-complete for $\varepsilon$ inversely polynomial, thus obtaining the desired PPA-completeness of Necklace-splitting. While some structural parts of our reduction are extensions of those presented in , obtaining the result for inverse-polynomial precision is much more challenging, as the construction needs to move to a high-dimensional space (rather than the two-dimensional space which is sufficient for the result in ). We highlight the main new techniques that we have developed in this paper in Section 2.1, where we provide an overview of the reduction. Our PPA-completeness result gives a convincing negative answer to Meunier and Neveu’s questions about possible polynomial-time solvability or membership of PPAD for Necklace-splitting; likewise it runs counter to Alon’s cautious optimism at ICM 1990 (, Section 4) that the problem may be solvable in polynomial time.

The Ham Sandwich Theorem is of enduring and widespread interest due to its colourful and intuitive statement, and its relevance and applications in topology, social choice theory, and computational geometry. Roughly, it states that given $d$ measures in Euclidean $d$ -space, there exists a hyperplane that cuts them all simultaneously in half. Early work on variants and applications of the theorem focused on non-constructive existence proofs and mostly did not touch on the algorithmics. A 1983 paper by Hill hints at possible interest in the corresponding computational challenge, in the context of a related land division problem. The computational problem (and its complexity) was first properly studied in a line of work in computational geometry beginning in the 1980s, for example . The problem envisages input data consisting of $d$ sets of $n$ points in Euclidean $d$ -space, and asks for a hyperplane that splits all point sets in half. (The problem Discrete Ham Sandwich (Definition 3) as named in is essentially this, with $d$ set equal to $n$ to emphasise that we care about the high-dimensional case.) In this work in computational geometry, the emphasis has been on efficient algorithms for small values of $d$ ; Lo et al. improve the dependence on $d$ but it is still exponential, and the present paper shows for the first time that we should not expect to improve on that exponential dependence. More recently, Grandoni et al. apply the “Generalized Ham Sandwich Theorem” to a problem in multi-objective optimisation and note that a constructive proof would allow a more efficient algorithm to emerge. The only computational hardness result we know of is Knauer et al. who obtain a $W$ -hardness result for a constrained version of the problem; points out the importance of the computational complexity of the general problem. The PPA-completeness result of the present paper is the first hardness result of any kind for Discrete Ham Sandwich, and as we noted, is a strong notion of computational hardness. Karpic and Saha showing a form of equivalence between the Ham Sandwich Theorem and Borsuk-Ulam, explicitly mention the possible PPA-completeness of Discrete Ham Sandwich as an “interesting and challenging open problem”.

We prove the PPA-completeness of Discrete Ham Sandwich via a simple reduction from Necklace-splitting. Ours is not the first paper to develop the close relationship between the two problems: Blagojević and Soberón shows a generalisation, where multiple agents may share a “sandwich”, dividing it into convex pieces. Further papers to explicitly point out their computational complexity as open problems include Deng et al. (mentioning that both problems “show promise to be complete for PPA”), Aisenberg et al. , and Belovs et al. .

Further Related Work: The class TFNP was defined in and several of its subclasses were studied over the years, such as PPA, PPAD and PPP , PLS and CLS ; here we focus on the most recent results. As we mentioned earlier, in we identified the first natural complete problem for PPA, the approximate Consensus-halving problem. In a recent paper, Sotiraki et al. identified the first natural problem for the class PPP, the class of problems whose totality is established by an argument based on the pigeonhole principle. For the class CLS, both Daskalakis et al. and Fearnley et al. identified complete problems (two versions of the Contraction Map problem, where a metric or a meta-metric are given as part of the input). In the latter paper, the authors define a new class, namely EOPL (for “End of Potential Line”), and show that it is a subclass of CLS. Furthermore, they show that two well-known problems in CLS, the P-Matrix Linear Complementarity Problem (P-LCP), and finding a fixpoint of a piecewise-linear contraction map (PL-Contraction) belong to the class. The End of Potential Line problem of is closely related to the End of Metered Line of .

Problems and Results

We present and discuss our main results, and in Section 2.1 we give an overview of the proof and new techniques, in particular with respect to the precursors to this paper.

An instance $I_{CH}$ incorporates, for $1\leq i\leq n$ , a non-negative measure $\mu_{i}$ of a finite line interval $A=[0,x]$ , where each $\mu_{i}$ integrates to 1 and $x>0$ is part of the input. We assume that $\mu_{i}$ are step functions represented in a standard way, in terms of the endpoints of intervals where $\mu_{i}$ is constant, and the value taken in each such interval. We use the bit model (logarithmic cost model) of numbers. $I_{CH}$ specifies a value $\varepsilon\geq 0$ also using the bit model. We regard $\mu_{i}$ as the value function held by agent $i$ for subintervals of $A$ .

A solution consists firstly of a set of $n$ cut points in $A$ (also given in the bit model of numbers). These points partition $A$ into (at most) $n+1$ subintervals, and the second element of a solution is that each subinterval is labelled $A_{+}$ or $A_{-}$ . This labelling is a correct solution provided that for each $i$ , $|\mu_{i}(A_{+})-\mu_{i}(A_{-})|\leq\varepsilon$ , i.e. each agent has a value in the range $[\frac{1}{2}-\frac{\varepsilon}{2},\frac{1}{2}+\frac{\varepsilon}{2}]$ for the subintervals labelled $A_{+}$ (hence also values the subintervals labelled $A_{-}$ in that range).

We assume without loss of generality that in a valid solution, labels $A_{+}$ and $A_{-}$ alternate. We also assume that the alternating label sequence begins with label $A_{+}$ on the left-hand side of $A$ (i.e. $A_{+}$ denotes the leftmost label in a Consensus-halving solution).

The Consensus-halving problem of Definition 4 is a computational version of the Hobby-Rice theorem . Most of the present paper is devoted to proving the following theorem.

$\varepsilon$ -Consensus-halving is PPA-complete for some inverse-polynomial $\varepsilon$ .

As mentioned in the introduction, in it was proven that 2-thief Necklace-splitting and $\varepsilon$ -Consensus-halving for $\varepsilon$ inversely-polynomial are computationally equivalent, i.e. they reduce to each other in polynomial time. Therefore, by and the “in PPA” result proven in Section B.1, we immediately get the following corollary.

Necklace-splitting is PPA-complete when there are $k=2$ thieves.

If we knew that $k$ -Necklace-splitting belonged to PPA for other values of $k$ , we could of course make the blanket statement “Necklace-splitting is PPA-complete”. Alas, the proofs that Necklace-splitting is a total search problem for $k>2$ do not seem to boil down to the parity argument on an undirected graph! That being said, we do manage to establish membership of PPA for $k$ being a power of 2 (essentially an insight of ), see Section B of the Appendix for the details and a related discussion. Of course, the $k=2$ result strongly suggests that $k$ -Necklace-splitting is a hard problem for other values of $k$ .

As it happens, the PPA-completeness of Discrete Ham Sandwich follows straightforwardly, and we present that next. The basic idea of Theorem 2.3 of embedding the necklace in the moment curve appears already in and , p.48.

Inclusion in PPA is shown in Section B.1 of the Appendix. For PPA-hardness, we reduce from 2-thief Necklace-splitting which is PPA-complete by Theorem 2.2.

The idea is to embed the necklace into the moment curve $\gamma=\{(\alpha,\alpha^{2},\ldots,\alpha^{n}):\alpha\in\}$ . Assume all beads lie in the unit interval $ $. A bead having colour$ i\in[n] $located at$ \alpha\in $becomes a point mass of ingredient$ i $of the ham sandwich located at$ (\alpha,\alpha^{2},\ldots,\alpha^{n})\in\mbox{{IR}}^{n} $. It is known that any hyperplane intersects the moment curve$ \gamma $in at most$ n $points, (e.g. see , Lemma 5.4.2), therefore a solution to Discrete Ham Sandwich corresponds directly to a solution to Necklace-splitting, where the two thieves splitting the necklace take alternating pieces. (In the$ k=2$ case, we may assume without loss of generality that they do in fact take alternating pieces).

A limitation to Theorem 2.3 is that the coordinates may be exponentially large numbers; they could not be written in unary. We leave it as an open problem whether a unary-coordinate version is also PPA-complete. As defined in , Discrete Ham Sandwich stipulated that each of the $n$ sets of points is of size $2n$ , whereas Definition 3 allows polynomial-sized sets. We can straightforwardly extend PPA-completeness to the version of by adding “dummy dimensions” whose purpose is to allow larger sets of each ingredient; the new ingredients that are introduced, consist of compact clusters of point masses, each cluster in general position relative to the other clusters and the subspace of dimension $n$ that contains the points of interest.

We use the standard notation $[n]$ to denote the set $\{1,\ldots,n\}$ , and we also use $\pm[n]$ to denote $\{1,-1,2,-2,\ldots,n,-n\}$ . We often refer to elements of $\pm[n]$ as “labels” or “colours”. $\lambda$ is usually used to denote a labelling function (so its codomain is $\pm[n]$ ).

We let $A$ denote the domain of an instance of Consensus-halving; if that instance has complexity $n$ then $A$ will be the interval $[0,poly(n)]$ , where $poly(n)$ is some number bounded by a polynomial in $n$ . Recall by Definition 4 that $\mu_{a}$ denotes the value function, or measure, of agent $a$ on the domain $A$ , in a Consensus-halving instance. We also associate each agent with its own cut (recall that the number of agents and cuts is supposed to be equal) and we let $c(a)$ denote the cut associated with agent $a$ .

We let $p^{C}(n)$ be a polynomial that represents the number of “circuit-encoders” that we use in our reduction (see Section 5.1); we usually denote it $p^{C}$ , dropping the $n$ from the notation.

Finally, $B$ denotes the $n$ -cube (or “box”) $^{n}$ .

In an instance of Consensus-halving, a value-block of an agent $a$ denotes a sub-interval of the domain where $a$ possesses positive value, uniformly distributed on that interval. In our construction, value-blocks tend to be scattered rather sparsely along the domain.

1 Overview of the proof

We review the ground covered by the precursors to this paper, then we give an overview of the technical innovations of the present paper.

established PPAD-hardness of Consensus-halving. An arithmetic circuit can be encoded by an instance to $\varepsilon$ -Consensus-halving, by letting each gate have a cut whose location (in a solution) represents the (approximate) value taken at that gate. Agents’ valuation functions ensure that values taken at the gates behave according to the type of gate. A “PPAD circuit” can then be represented using an instance of Consensus-halving.

noted that the search space of solutions to instances as constructed by , is oriented. A radical new idea was needed to encode the non-oriented feature of topological spaces representable by PPA. That was achieved by using two cuts to represent the coordinates of a point on a triangular region faving two sides identified to form a Möbius strip. (These cuts are the only ones that lie in a specific subinterval of the interval $A$ of a Consensus-halving instance, called the “coordinate-encoding (c-e) region”. The two cuts are called the “coordinate-encoding cuts”.) Identifying two sides in this way is done by exploiting the equivalence of taking a cut on the LHS of the c-e region, and moving it to the RHS. In order to embed a hard search problem into the surface of a standard 2-dimensional Möbius strip, it was necessary to work at exponentially-fine resolution, which immediately required inverse-exponential $\varepsilon$ for instances of $\varepsilon$ -Consensus-halving. In we reduced from the PPA-complete problem 2D-Tucker (Definition 5 below), a search problem on an exponential-sized 2-dimensional grid.

In , the rest of $A$ is called the “circuit-encoding region” $R$ , and the cuts occurring within $R$ do the job of performing computation on the location of cuts in the c-e region. The present paper retains this high-level structure (Section 4.1). As in we use multiple copies of the circuit that performs the computation, each in its own subregion of $R$ . Here we use $p^{C}(n)$ copies where $p^{C}$ is a polynomial; in we used 100 copies. Each copy is called a circuit-encoder. The purpose of multiple copies is to make the system robust; a small fraction of copies may be unreliable: as in we have to account for the possibility that one of the c-e cuts may occur in the circuit-encoding region, rendering one of the copies unreliable. We re-use the “double negative lemma” of that such a cut is not too harmful. We also adapt a result of that when a cut is moved from the one end to the other end of the c-e region, this corresponds to identifying two facets of a simplex to form a Möbius strip.

uses a sequence of “sensor agents” to identify the endpoints of intervals labelled $A_{+}$ and $A_{-}$ in the coordinate-encoding region, and feed this information into the above mentioned circuit-encoders, which perform computation on those values. As in we use sensor agents. We obtain a simplification with respect to which is that we do not need the gadgets used there to perform “bit-extraction” (converting the position of a c-e cut into $n$ boolean values). In , a solution to an instance of Consensus-halving was associated with a sequence of 100 points in the Möbius-simplex (referred there as the “simplex-domain”), and the “averaging manoeuvre” introduced in was applied. In this paper, for a polynomial $p^{C}(n)$ , we sample a sequence of $p^{C}$ points in a more elegant manner, again exploiting the inverse-polynomial precision of solutions that we care about.

1.2 Technical innovations

As in , we reduce from the PPA-complete problem 2D-Tucker (Definition 5). That computational problem uses an exponentially-fine 2D grid, and (unlike ), in Section 3 we apply the snake-embedding technique invented in (versions of which are used in in the context of PPA) to convert this to a grid of fixed resolution, at the expense of going from 2 to $n$ dimensions. The new problem, Variant high-D Tucker (Definition 7) envisages a $7\times 7\times\cdots\times 7$ grid. Here, we design the snake-embedding in such a way that PPA-completeness holds for instances of the high-dimensional problem that obey a further constraint on the way the high-dimensional grid is coloured, that we exploit subsequently. A further variant, New variant high-D Tucker (Definition 8) switches to a “dual” version where a hypercube is divided into cubelets, and points in the hypercube are coloured such that interiors of cubelets are monochromatic. A pair of points is sought having equal and opposite colours and distant by much less than the size of the cubelets.

We encode a point in $n$ dimensions using a solution to an instance of Consensus-halving as follows. Instead of having just 2 cuts in the coordinate-encoding region (as in ), suppose we ensure that up to $n$ cuts lie there. These cuts split this interval into $n+1$ pieces whose total length is constant, so represent a point in the unit $n$ -simplex (in , the unit 2-simplex). This “Möbius-simplex” (Definition 17; Figure 10) has the further property that two facets are identified with each other in a way that effectively turns the simplex into an $n$ -dimensional Möbius strip.

In Section 5.2 we define a crucially-important coordinate transformation (see Figure 11) with the following key properties

the transformation and its inverse can be computed efficiently, and distances between transformed coordinate vectors are polynomially related to distances between un-transformed vectors;

at the two facets that are identified with each other, the coordinates of corresponding points are the negations of each other; our colouring function (that respects Tucker-style boundary conditions) has the effect that antipodal points get equal and opposite colours, and no undesired solutions are introduced at these facets.

This is the “smooth embedding” referred to in the abstract.

With the aid of the above coordinate transformation, we divide up the Möbius-simplex:

The twisted tunnel (Definition 23) is an inverse-polynomially thick strip, connecting the two facets that are identified in forming the Möbius-simplex. It contains at its centre an embedded copy of the hypercube domain of an instance $I_{VT}$ of New variant high-D Tucker. Outside of this embedded copy, it is “coloured in” (using our new coordinate system) in a way that avoids introducing solutions that do not encode solutions of $I_{VT}$ .

The Significant Region contains the twisted tunnel and constitutes a somewhat thicker strip connecting the two facets. It serves as a buffer zone between the twisted tunnel and the rest of the Möbius-simplex. It is subdivided into subregions where each subregion has a unique set of labels, or colours, from $\pm[n]$ . (We sometimes refer to these as “colour-regions”.) It is shown that any solution to an instance of Consensus-halving constructed as in our reduction, represents a point in the Significant Region.

If, alternatively, a set of cuts represents a point from outside the Significant Region, then certain agents (so-called “blanket-sensor agents”) will observe severe imbalances between labels $A_{+}$ and $A_{-}$ , precluding a solution.

In , it was relatively straightforward to integrate the subset of the 2-dimensional Möbius-simplex that corresponds with the twisted tunnel, with the parts of the domain where the blanket-sensor agent became active (ruling out a solution) in a way that avoided introducing solutions that fail to encode solutions of Tucker. In the present paper, that gap has to be “coloured-in” in a carefully-designed way (Section 5.3, list item 3), and this is the role of the part of the Significant Region that is not the twisted tunnel. The proofs that they work correctly (Sections 6.2, 6.3) become more complicated.

Snake embedding reduction

The purpose of this section is to establish the PPA-completeness of New variant high-D Tucker, Definition 8. The snake embedding construction was devised in , in order to prove that $\varepsilon$ -Nash equilibria are PPAD-complete to find when $\varepsilon$ is inverse polynomial; without this trick the result is just obtained for $\varepsilon$ being inverse exponential. We do a similar trick here. We will use as a starting-point the PPA-completeness of 2D-Tucker, from , which is the following problem:

(Aisenberg et al. ) An instance of 2D-Tucker consists of a labelling $\lambda:[m]\times[m]\rightarrow\{\pm 1,\pm 2\}$ such that for $1\leq i,j\leq m$ , $\lambda(i,1)=-\lambda(m-i+1,m)$ and $\lambda(1,j)=-\lambda(m,m-j+1)$ . A solution to such an instance of 2D-Tucker is a pair of vertices $(x_{1},y_{1})$ , $(x_{2},y_{2})$ with $|x_{1}-x_{2}|\leq 1$ and $|y_{1}-y_{2}|\leq 1$ such that $\lambda(x_{1},y_{1})=-\lambda(x_{2},y_{2})$ .

The hardness of the problem in Definition 5 arises when $m$ is exponentially-large, and the labelling function is presented by means of a boolean circuit.

We aim to prove the following is PPA-complete, even when the values $m_{i}$ are all upper-bounded by some constant (specifically, 7).

(Aisenberg et al. ) An instance of $n$ D-Tucker consists of a labelling $\lambda:[m_{1}]\times\cdots\times[m_{n}]\rightarrow\{\pm 1,\cdots,\pm n\}$ such that if a point ${\bf x}=(x_{1},\ldots,x_{n})$ lies on the boundary of this grid (i.e., $x_{i}=1$ or $x_{i}=m_{i}$ for some $i$ ), then letting $\bar{\bf x}$ be the antipodal point of x, we have $\lambda(\bar{\bf x})=-\lambda({\bf x})$ . (Two boundary points are antipodal if they lie at opposite ends of a line segment passing through the centre of the grid.) A solution consists of two points z, ${\bf z}^{\prime}$ on this grid, having opposite labels ( $\lambda({\bf z})=-\lambda({\bf z}^{\prime})$ ), each of whose coordinates differ (coordinate-wise) by at most 1.

It is assumed that $\lambda$ is presented in the form of a circuit, syntactically constrained to give opposite labels to antipodal grid points.

An instance of Variant high-D Tucker is similar to Definition 6 but whose instances obey the following additional constraints. The $m_{i}$ are upper bounded by the constant 7. We impose the further constraint that the facets of the cube are coloured with labels from $\pm[n]$ such that all colours are used, and opposite facets have opposite labels, and for $2\leq i\leq n$ it holds that the facet with label $i$ (resp. $-i$ ) has no grid-point on that facet with label $i$ (resp. $-i$ ).

A snake-embedding consists of a reduction from $k$ D-Tucker to $(k+1)$ D-Tucker, which we describe informally as follows. See Figure 1. Let $I$ be an instance of $k$ D-Tucker, on the grid $[m_{1}]\times\cdots\times[m_{k}]$ . Embed $I$ in $(k+1)$ -dimensional space, so that it lies in the grid $[m_{1}]\times\cdots\times[m_{k}]\times$ . Then sandwich $I$ between two layers, where all points in the top layer get labelled $k+1$ , and points in the bottom layer get labelled $-(k+1)$ , as in the left part of Figure 1. We now have points in the grid $[m_{1}]\times\cdots\times[m_{k}]\times$ , and notice that this construction preserves the required property that points on the boundary have labels opposite to their antipodal points.

Then, the main idea of the snake embedding is the following. We fold this grid into three layers, by analogy with folding a sheet of paper along two parallel lines so that the cross-section is a zigzag line, and one dimension of the folded paper is one-third of the unfolded version, the other dimension being unchanged (see the right hand side of Figure 1). In higher dimension, suppose that $m_{1}$ is the largest value of any $m_{i}$ . Then, we can reduce $m_{1}$ by a factor of about 3, while causing the final coordinate to go up from 3 to 9. By merging layers of label $k+1$ and $-(k+1)$ , the thickness of 9 reduces to 7. This operation preserves the labelling rule for antipodal boundary points.

However, there are two points that need extra care for the reduction to go through:

Firstly, simply folding the layers such that their cross-sections are zigzag lines may introduce diagonal adjacencies between cubelets that were not present in the original instance in $k$ -dimensions, i.e. we might end up generating adjacent cubelets with equal-and-opposite colours, see the left part of Figure 2 for an illustration. To remedy this, we will “copy” (or “duplicate”) the cubelets at the folding points, essentially having three cubelets of the same colour, whose cross-sections are the short vertical section in the right hand side of Figure 1, see also the right hand side of Figure 2 for an illustration. From now on, when referring to “folding”, we will mean the version where we also duplicate the cubelets at the folding points, as described above.

Secondly, the folding and duplicating operation only works if $m_{1}$ is a multiple of $3$ , as otherwise the $(k+1)$ -dimensional instance may not satisfy the boundary conditions of Definition 6, i.e. we might end up with antipodal cubelets that do not have equal-and-opposite colours. To ensure that $m_{1}$ is a multiple of $3$ before folding, we can add $1$ or $2$ additional layers of cubelets to $m_{1}$ , (depending on whether the remainder of the division of $m_{1}$ by $3$ is either $2$ or $1$ respectively). These layers are duplicate copies of the outer layers of cubelets at opposite ends of the length- $m_{1}$ direction; if there is only one additional layer to be added, we can add on either side. Note that this operation does not generate any cubelets of equal-and-opposite labels that were not there before and the same will be true for the instance after the folding operation. See Figure 3 for an illustration.

Let $I$ be an instance of $k$ D-Tucker having coordinates in ranges $[m_{1}],\ldots,[m_{k}]$ and label function $\lambda$ . Select the largest $m_{i}$ , breaking ties lexicographically. Assume for simplicity in what follows that $m_{1}$ is largest.

if $x_{1}^{\prime}\leq m_{1}$ , ${\bf x}^{\prime}$ is mapped to ${\bf x}=(x_{1}^{\prime},\ldots,x_{k}^{\prime})$ and $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ .

if $x_{1}^{\prime}=m_{1}+1$ , ${\bf x}^{\prime}$ is mapped to ${\bf x}=(m_{1},\ldots,x_{k}^{\prime})$ and $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ .

In other words, points for which the first coordinate ranges from $1$ to $x_{1}^{\prime}$ , are mapped to themselves and receive their own label, and points for which the first coordinate is $m_{1}+1$ are mapped to the points where the first coordinate is $m_{1}$ , receiving the label of that point. This essentially “duplicates” the layer of cubelets on the right endpoint of the $m_{1}$ -direction. See Figure 3 for an illustration.

if $x_{1}^{\prime}=1$ , ${\bf x}^{\prime}$ is mapped to ${\bf x}=(1,\ldots,x_{k}^{\prime})$ and $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ .

if $2\leq x_{1}^{\prime}\leq m_{1}+1$ , ${\bf x}^{\prime}$ is mapped to ${\bf x}=(x_{1}^{\prime}-1,\ldots,x_{k}^{\prime})$ and $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ . This is similar to the mapping and labelling in the previous case, except for the fact that we need to “shift” the labels of the points, since we essentially introduced a copy of the layer of cubelets on the left endpoint of the $m_{1}$ -direction. See Figure 3 for an illustration.

From $k$ to $k+1$ dimensions. Starting from an instance $I$ of $kD$ -Tucker, we will construct an instance $I^{\prime}$ of $(k+1)D-$ Tucker as follows. Let ${\bf x}=(x_{1},\ldots,x_{k})$ be a point in $[m_{1}]\times\ldots\times[m_{k}]$ with labelling function $\lambda$ . We will associate each such point with a corresponding point ${\bf x}^{\prime}$ in $\left[\frac{m_{1}}{3}+2\right]\times\ldots\times[m_{k}]\times$ and a label $\lambda^{\prime}({\bf x}^{\prime})$ as follows.

If $x_{1}\leq\frac{m_{1}}{3}$ , then ${\bf x}$ is mapped to ${\bf x}^{\prime}=(x_{1},\ldots,x_{k},2)$ , and $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ .

If $x_{1}=\frac{m_{1}}{3}+1$ (the first “folding” point), then ${\bf x}$ is mapped to the following three points in $I^{\prime}$ and receives the following colours (see the shaded cubelets at the right-hand side of Figure 2):

${\bf x}^{\prime}=(\frac{m_{1}}{3}+1,\ldots,x_{k},2)$ (the original cubelet) and $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ .

${\bf x}^{\prime}=(\frac{m_{1}}{3}+1,\ldots,x_{k},3)$ (the first copy) and $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ .

${\bf x}^{\prime}=(\frac{m_{1}}{3}+1,\ldots,x_{k},4)$ (the second copy) and $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ .

If $\frac{m_{1}}{3}+2\leq x_{1}\leq\frac{2m_{1}}{3}-1$ , then x is mapped to ${\bf x}^{\prime}=(\frac{2m_{1}}{3}+2-x_{1},x_{2},\ldots,x_{k},4)$ , with $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ .

If $x_{1}=\frac{2m_{1}}{3}$ (the second “folding” point), then ${\bf x}$ is mapped to the following three points in $I^{\prime}$ and receives the following colours:

${\bf x}^{\prime}=(2,\ldots,x_{k},4)$ (the original cubelet) and $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ .

${\bf x}^{\prime}=(2,\ldots,x_{k},5)$ (the first copy) and $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ .

${\bf x}^{\prime}=(2,\ldots,x_{k},6)$ (the second copy) and $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ .

If $\frac{2m_{1}}{3}+1\leq x_{1}\leq m_{1}$ , then ${\bf x}$ is mapped to ${\bf x}^{\prime}=(x_{1}+2-\frac{2m_{1}}{3},x_{2},\ldots,x_{k},6)$ , with $\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x})$ .

Set $\lambda^{\prime}(1,\ldots,1)=-k$ , along with any point x connected to it via a path of points that have not been labelled by the above procedure.

Set $\lambda^{\prime}(\frac{m_{1}}{3}+2,m_{2},\ldots,m_{k},7)=k$ , along with any point x connected to it via a path of points that have not been labelled by the above procedure.

First, it is not hard to check that the the composition of $O(n)$ snake-embeddings is a polynomial-time reduction. Also note that, by the way the high-dimensional instances is constructed, we have not introduced any adjacencies that did not already exist, i.e. if there is a pair of adjacent cubelets with equal-and-opposite labels in the instance $I^{\prime}$ of the high-dimensional version, this pair is present in the instance $I$ of the $2D$ version as well, and it is easy to recover it in polynomial time. Therefore, it suffices to show how to obey the additional constraint of Variant high-D Tucker, namely that for $i\geq 2$ , a side having label $i$ has no grid-points with label $i$ , and similarly for $-i$ .

We begin as in (see Figure 1 in that paper), by taking the original $2D$ instance $I$ , of size $m\times m$ , and extend to an instance of size $3m\times m$ as follows. The original instance is embedded in the centre of the new instance. Each region $R$ to the sides (of size $m\times m$ ) are labelled by copying the edge of $I$ facing $R$ , along an adjacent edge of $R$ , and connecting these two edges with paths that have two straight sections and connect 2 points of the same label, and points along that path have that label. The outermost path then labels a side of the new instance of length $m$ , so these two opposite sides get opposite labels. We may assume (by switching $1$ ’s and $2$ ’s if needed) that these new opposite sides are labelled $\pm 2$ .

The $S$ -fold approach shown in Figure 1 (in this paper) can be checked to retain this property. When we sandwich a cuboid between two layers of opposite (new) colours (call them $c$ and $-c$ ), as shown in Figure 1, we label the new facets thus formed with $-c$ and $c$ respectively. We label the other facets with their original labels (each of these facets has acquired the labels $c$ and $-c$ , and no other labels). The folding operation has a natural correspondence between the facets of the unfolded and folded versions of the cuboid. It can be checked that the set of colours of a facet before folding is the same as the set of colours of the corresponding facets after folding.

It is convenient to define the following problem, whose PPA-completeness follows fairly directly from the PPA-completeness of Variant high-D Tucker.

An instance of New variant high-D Tucker in $n$ dimensions is presented by a boolean circuit $C_{VT}$ that takes as input coordinates of a point in the hypercube $B=^{n}$ and outputs a label in $\pm[n]$ (assume $C_{VT}$ has $2n$ output gates, one for each label, and is syntactically constrained such that exactly one output gate will evaluate to true), having the following constraints that may be enforced syntactically.

Dividing $B$ into $7^{n}$ cubelets of edge length $2/7$ using axis-aligned hyperplanes, all points in the same cubelet get the same label by $C_{VT}$ ;

Interiors of antipodal boundary cubelets get opposite labels;

Points on the boundary of two or more cubelets get a label belonging to one of the adjacent cubelets;

Facets of $B$ are coloured with labels from $\pm[n]$ such that all colours are used, and opposite facets have opposite labels. For $2\leq i\leq n$ it also holds that the facet with label $i$ (resp. $-i$ ) does not intersect any cubelet having label $i$ (resp. $-i$ ). Facets coloured $\pm 1$ are unrestricted (we call them the “panchromatic facets”).

A solution consists of a polynomial number $p^{C}$ of points that all lie within an inverse polynomial distance $\delta(n)$ of each other (for concreteness, assume $\delta(n)=1/100n$ ). At least two of those points should receive equal and opposite labels by $C_{VT}$ .

New variant high-D Tucker corresponds to the problem Variant Tucker in ; in that paper a solution only contained 100 points, while here we use $p^{C}$ points. Here we need more points since we are in $n$ dimensions, and our analysis needs to tolerate $n$ points receiving unreliable labels.

Some building-blocks and definitions

Here we set up some of the general structure of instances of Consensus-halving constructed in our reduction. We identify some basic properties of solutions to these instances. We define the Möbius-simplex and the manner in which a solution encodes a point on the Möbius-simplex. The encoding of the circuitry is covered in Section 5.

We use the following values throughout the paper.

$\delta^{\rm tiny}$ is an inverse-polynomial quantity in $n$ , chosen to be substantially smaller than any other inverse-polynomial quantity that we use in the reduction, apart from $\varepsilon$ (below).

$\delta^{T}$ is an inverse-polynomial quantity in $n$ , which is smaller than any other inverse-polynomial quantity apart from $\delta^{\rm tiny}$ and is larger than $\delta^{\rm tiny}$ by an inverse-polynomial amount. The quantity $\delta^{T}$ denotes the width of the so-called “twisted tunnel” (see Definition 23).

$p^{\rm huge}$ denotes a large polynomial in $n$ ; specifically we let $p^{\rm huge}=n/\delta^{\rm tiny}$ . The quantity $p^{\rm huge}$ represents the number of sensor agents for each circuit encoder (see Definition 13).

$p^{\rm large}$ denotes a large polynomial in $n$ , which is however smaller than $p^{\rm huge}$ by a polynomial factor. The quantity $p^{\rm large}$ will be used in the definition of the “blanket-sensor agents” (see Definition 14) and will quantify the extent to which the cuts in the “coordinate-encoding region” (Definition 9) are allowed to differ from being evenly spaced, before the blanket-sensor agents become active (see Section 4). The choice of $p^{\rm large}$ controls the value $\delta_{w}$ of the radius of the Significant Region (see Proposition 4.4), with larger $p^{\rm large}$ meaning larger $\delta_{w}$ .

$\varepsilon$ is the precision parameter in the Consensus-Halving solution, i.e. each agent $i$ is satisfied with a partition as long as $|\mu_{i}(A_{+})-\mu_{i}(A_{-})|\leq\varepsilon$ . Henceforth, we will set $\varepsilon=\delta^{\rm tiny}/10$ .

1 Basic building-blocks

We consider instances $I_{CH}$ of Consensus-halving that have been derived from instances $I_{VT}$ of New variant high-D Tucker in $n$ dimensions. The general aim is to get any solution of such an instance $I_{CH}$ to encode a point in $n$ dimensions that “localises” a solution to $I_{VT}$ , by which we mean that from the solution of $I_{CH}$ , we will be able to find a point on the $I_{VT}$ instance that can be transformed to a solution of $I_{VT}$ in polynomial time and fairly straightforwardly.

Coordinate-encoding region (c-e region) Given an instance of Variant high-D Tucker in $n$ dimensions, the corresponding instance of Consensus-halving has a coordinate-encoding region, the interval $[0,n]$ , a (prefix) subinterval of $A$ .

The valuation functions of agents in an instance $I_{CH}$ of Consensus-halving obtained by our reduction from an instance of New variant high-D Tucker in $n$ dimensions, will be designed in such a way that either $n-1$ or $n$ cuts (typically $n$ ) must occur in the coordinate-encoding region, in any solution. Furthermore, the distance between consecutive cuts must be close to 1 (an additive difference from 1 that is upper-bounded by an inverse polynomial), shown in Proposition 4.4.

Coordinate-encoding agents (c-e agents). Given an instance of New variant high-D Tucker in $n$ dimensions, the corresponding instance of Consensus-halving has $n$ coordinate-encoding agents denoted $\{a_{1},\ldots,a_{n}\}$ .

The $n$ c-e agents have associated $n$ coordinate-encoding cuts (Definition 11). It will be seen that the c-e cuts typically occur in the c-e region. The c-e agents do not have any value for the coordinate-encoding region; their value functions are only ever positive elsewhere. In particular, they have blocks of value whose labels $A_{+}/A_{-}$ are affected by the output gates of the circuitry that is encoded to the right of the c-e region.

Coordinate-encoding cuts (c-e cuts). We identify $n$ cuts as the coordinate-encoding cuts. In the instances of Consensus-halving that we construct, in any (sufficiently good approximate) solution to the Consensus-halving instance, all other cuts will be constrained to lie outside the c-e region (and it will be straightforward to see that the value functions of their associated agents impose this constraint). A c-e cut is not straightforwardly constrained to lie in the c-e region, but it will ultimately be proved that in any approximate solution, the c-e cuts do in fact lie in the c-e region.

Recall that $p^{\rm huge}=n/\delta^{\rm tiny}$ from Section 2, which implies that the c-e region can be divided into $p^{\rm huge}$ intervals of length $\delta^{\rm tiny}$ (see also Figure 4).

$\sigma$ -shifted version. Given a value function $f$ (or measure) on the c-e region $[0,n]$ , we say that another function $f^{\prime}$ on the c-e region is a $\sigma$ -shifted version of $f$ , when we have that $f^{\prime}((x-\sigma)\mod n)=f(x)$ .

Recall that the circuit-encoding region (details in Section 5) contains $p^{C}$ circuit-encoders, mentioned in the following definitions.

Sensor agents. Each circuit-encoder $C_{i}$ , $i=1,\ldots,p^{C}$ , has a set ${\cal S}_{i}$ of sensor agents. ${\cal S}_{i}=\{s_{i,1},\ldots,s_{i,p^{\rm huge}}\}$ where the $s_{i,j}$ are defined as follows. When $i=1$ , $s_{1,j}$ has value $\frac{1}{10}$ uniformly distributed over the interval

For $i>1$ , $s_{i,j}$ is a $\frac{1}{p^{C}}(i-1)\delta^{\rm tiny}$ -shifted version of $s_{1,j}$ .

Each sensor agent $s_{i,j}$ also has valuation outside the c-e region, in non-overlapping intervals of the circuit-encoding region $R_{i}$ (see Section 5.1). This valuation consists of two valuation blocks of value $\frac{9}{20}$ each, with no other valuation block in between. These are exactly as described in , see also Appendix A and Figure 16 for an illustration.

This value gadget for $s_{i,j}$ causes the $j$ -th input gate in the circuit-encoder $C_{i}$ to be set according to the label received by $s_{i}$ ’s block of value in the c-e region, i.e. jump to the left or to the right in order to indicate that the corresponding value-block of $s_{i}$ in the c-e region is labelled $A_{+}$ or $A_{-}$ .

According to the definitions above, $C_{1}$ has a sequence of (a large polynomial number of) sensor agents that have blocks of value in a sequence of small intervals going from left to right of the c-e region (see Figure 4). For $1<i\leq p^{C}$ , $C_{i}$ has a similar sequence, shifted slightly to the right on the c-e region (by $\delta^{\rm tiny}(i-1)/p^{C}$ ). For $j\in[p^{\rm huge}]$ , the intervals defined by the value-blocks of the sensor agents $s_{1,j},\ldots,s_{p^{C},j}$ (for $C_{1},\ldots,C_{p^{C}}$ ) partition the interval $[(j-1)\delta^{\rm tiny},j\delta^{\rm tiny}]$ .

Remark: Note that a c-e cut may divide one of the above value-blocks held by a sensor agent in the c-e region, and in that case the input being supplied (using the gadget of ) to its circuit-encoder is unreliable. However, only $n$ sensor agents may be affected in that way, and their circuit-encoders will get “out-voted” by the ones that receive valid boolean inputs. This is part of the reason why we use $p^{C}$ circuit-encoders in total. More details on this averaging argument are provided in Section 5.

Blanket-sensor agents. Each circuit-encoder $C_{i}$ shall have $n-1$ blanket-sensor agents $b_{i,2},\ldots,b_{i,n}$ .

In $C_{1}$ , for each $j=2,\ldots,n$ , blanket-sensor agent $b_{1,j}$ has value $1/10$ distributed over $[j-2,j]$ (see Figure 4). This value consists of a sequence

of $2/\delta^{\rm tiny}=2p^{\rm huge}/n$ value-blocks, each of length $\delta^{\rm tiny}/p^{C}$ and of value $\frac{1}{10}\cdot(\delta^{\rm tiny}/2)$ .

In each $C_{i}$ , $1<i\leq p^{C}$ , and for each $j=2,\ldots,n$ , the value function of $b_{i,j}$ that lies in the c-e region is an $(i-1)\frac{\delta^{\rm tiny}}{p^{C}}$ -shifted version of $b_{1,j}$ .

The remaining value $9/10$ of each $b_{i,j}$ consists of 3 value-blocks of width $\delta^{\rm tiny}$ lying in a subinterval $I_{i,j}$ of the circuit-encoding region $R_{i}$ (see Section 5.1), such that:

respectively, where $\kappa=\Bigl{(}\frac{1}{10}\frac{\delta^{\rm tiny}}{2}\Bigr{)}p^{\rm large}$ .

$I_{i,j}$ contains also value-blocks of agents for each gate that takes the value of $b_{i,j}$ as input (the feedback gadgetry, see Section 5.1.2).

The value of the blanket-sensor agents in $I_{i,j}$ is very similar to the gadget used in , see Appendix A (of the present paper) and Figure 17. The structure of the blanket-sensor agents in the c-e region is shown in Figure 4.

Each blanket-sensor agent $b_{i,j}$ has an associated cut $c(b_{i,j})$ that lies in the subinterval $I_{i,j}$ . Agent $b_{i,j}$ “monitors” an interval of length 2, namely the interval $[j-2,j]$ within which the sequence of $2/\delta^{\rm tiny}$ value-blocks lie. If, in this interval, the number of these value-blocks labelled $A_{+}$ exceeds the number labelled $A_{-}$ by at least $p^{\rm large}$ (recall that $p^{\rm large}$ is a large polynomial which is however polynomially smaller than $p^{\rm huge}$ ) then (in any $\varepsilon$ -approximate solution to $I_{CH}$ , where, recall, $\varepsilon=\delta^{\rm tiny}/10$ ), the cut $c(b_{i,j})$ in $I_{i,j}$ lies in either the right-hand or the left-hand value-block, otherwise it lies in the central value-block. Note that these three possible positions may be converted to boolean values that influence circuit-encoder $C_{i}$ ; this was referred to as a “bit-detection gadget” in , see Appendix A for more details.

We say that blanket-sensor $b_{i,j}$ is active if $b_{i,j}$ in fact observes a sufficiently large label discrepancy in the c-e region, that $c(b_{i,j})$ lies in one of the two outer positions, left or right, and not in the central position. We say that $b_{i,j}$ is active towards $A_{+}$ if $A_{+}$ is the overrepresented label, with similar terminology for $A_{-}$ .

When blanket-sensor agent $b_{i,j}$ is active, it provides input to $C_{i}$ that causes $C_{i}$ to label the value held by $a_{j}$ and controlled by $C_{i}$ , to be either $A_{+}$ or $A_{-}$ ; the choice depends on the over-represented label in $[j-2,j]$ and the parity of the index of the blanket-sensor agent. The precise feedback mechanism to the c-e agent $a_{j}$ by the blanket-sensor $b_{i,j}$ is described in Section 5.1.3.

When no blanket-sensors are active, the sequence of c-e cuts encodes a point in the Significant Region (Definition 18).

In , we worked just in two dimensions and there was just one blanket-sensor agent for the entire c-e region, for each circuit-encoder. Note also that there, the blanket-sensor agent had a single value-block of length 2; here we split it into a polynomial sequence of small value-blocks. The advantage of using a polynomial sequence of value-blocks (which could not have been done in due to the exponential precision requirement) is that we can argue that in all but at most $n$ circuit-encoders, the blanket-sensor agents have value-blocks that are not cut by the c-e cuts, so we can be precise about how big a disparity between blocks labelled $A_{+}$ and $A_{-}$ cause a blanket-sensor to be active, and for at most $n$ circuit-encoders, we regard them as having unreliable inputs (see Definition 16 and Observation 4.2).

2 Features of solutions

The main result of this section is Proposition 4.4, that in a solution to approximate Consensus-halving as constructed here, the sequence of cuts in the c-e region are “evenly spaced” in the sense that the gap between consecutive cuts differs from 1 by at most an inverse-polynomial.

Given an instance $I_{CH}$ derived by our reduction from an instance of New variant high-D Tucker in $n$ dimensions, any inverse-polynomial approximate solution of $I_{CH}$ has the property that at most $n$ cuts lie in the coordinate-encoding region. This is because all other cuts are associated with agents who have at least $9/10$ of their value strictly to the right of the c-e region, thus in a solution, those cuts cannot lie in the c-e region.

We will say that a circuit-encoder receives reliable input, if no coordinate-encoding cut passes through value-blocks of its sensor agents.

At most $n$ circuit-encoders fail to receive reliable input (by Observation 4.1 and the fact that sensors of distinct circuit-encoders have value in distinct intervals).

When a circuit-encoder receives reliable input, it is straightforward to interpret the labels allocated to its sensors, as boolean values, and simulate a circuit computation on those values, ultimately passing feedback to the c-e agents via value-blocks that get labelled according to the output gates of the circuit being simulated. This is done in a conceptually similar way to that described in (e.g. see Sections 4.4.2 and 4.6 in ), see also Appendix A of the present paper.

The Möbius-simplex. The Möbius-simplex in $n$ dimensions consists of points ${\bf x}$ in $\mbox{{IR}}^{n+1}$ whose coordinates are non-negative and sum to 1. We identify every point $(x_{1},\ldots,x_{n},0)$ with the point $(0,x_{1},\ldots,x_{n})$ , for all non-negative $x_{1},\ldots,x_{n}$ summing to 1. We use the following metric $d(\cdot,\cdot)$ on the Möbius-simplex, letting $L_{1}$ be the standard $L_{1}$ distance on vectors:

where $(0,x_{1},\ldots,x_{n})\equiv(x_{1},\ldots,x_{n},0)$ .

Let $I_{CH}$ be an instance of Consensus-halving, obtained by reduction from New variant high-D Tucker in $n$ dimensions, hence having c-e region $[0,n]$ . Note that, by Observation 4.1, at most $n$ cuts may lie in the c-e region. A set of $k\leq n$ cuts of the coordinate-encoding region splits it into $k+1$ pieces. We associate such a split with a point ${\bf x}$ in $\mbox{{IR}}^{n+1}$ as follows. The first coordinate is the distance from the LHS of the consensus-halving domain to the first cut, divided by $n$ , the length of the c-e region. For $2\leq i\leq k+1$ , the $i$ -th coordinate of ${\bf x}$ is the distance between the $i-1$ -st and $i$ -th cuts, divided by $n$ . Remaining coordinates are 0.

If there are $n-1$ cuts in the c-e region, suppose we add a cut at either the LHS or the RHS. These two alternative choices correspond to a pair of points that have been identified as the same point, as described in Definition 17. (Observation 5.3 makes a similar point regarding transformed coordinates.)

Each circuit-encoder reads in “input” representing a point in the Möbius-simplex. Any circuit-encoder $C_{i}$ ( $i\in[p^{C}]$ ) behaves like $C_{1}$ on a point ${\bf x}_{i}$ , for which (for all $i,j\in[p^{C}]$ ) $d({\bf x}_{i},{\bf x}_{j})\leq\delta^{\rm tiny}$ (recall $d$ is defined in (1)). Consequently their collective output (the split between $A_{+}$ and $A_{-}$ of the value held by the c-e agents) is the output of a single circuit-encoder averaged over a collection of $p^{C}$ points in the Möbius-simplex, all within $\delta^{\rm tiny}$ of each other.

This follows by inspection of the way the $p^{C}$ circuit-encoders differ from each other: their sensor-agents are shifted but their internal circuitry is the same.

The Significant Region of the Möbius-simplex $D$ . The Significant Region of $D$ consists of all points in $D$ where no blanket-sensors are active (where “blanket-sensors” and “active” are defined in Definition 15).

There is an inverse-polynomial value $\delta^{w}$ such that all points ${\bf x}=(x_{1},\ldots,x_{n+1})$ in the Significant Region have coordinates $x_{i}$ that for $2\leq i\leq n$ differ from $1/n$ by at most $\delta^{w}$ , if ${\bf x}$ is encoded by the c-e cuts of an $\varepsilon$ -approximate solution to one of our instances of Consensus-halving. (Recall that $\varepsilon=\delta^{\rm tiny}/10$ ).

Thus, if an instance $I_{CH}$ of Consensus-halving (obtained using our reduction) has a solution $S_{CH}$ , then all the c-e cuts in $S_{CH}$ have the property that the distance between two consecutive c-e cuts differs from 1 by at most some inverse-polynomial amount.

Before we proceed with the proof of the proposition, we will state a few simple lemmas that will be used throughout the proof. We start with the following definition.

Intuitively, cuts that are $\delta$ -close to integer points lie close (within distance at most $\delta$ ) to either the endpoints or the midpoint of some monitored interval $[j-2,j]$ .

An interval $I$ is a monochromatic interval if it is not intersected by any cuts (which means that it receives a single label). Additionally, if for $A_{j}\in\{A_{+},A_{-}\}$ , $I$ is labelled with $A_{j}$ , then we will say that $I$ is a monochromatic interval of label $A_{j}$ .

It should be clear that if any monitored interval $[j-2,j]$ has a large enough (larger than $1+\delta$ ) monochromatic subinterval, then the blanket-sensor agent $b_{1,j}$ is active.

In $I$ , there are at least $k-1$ intervals monitored by blanket-sensor agents and we only have at most $k-2$ cuts at our disposal. With $k-2$ cuts, we can partition an interval of length $k$ in at most $k-1$ intervals, the largest of which, call it $I_{\text{max}}$ , will have length at least $1+1/k$ . Since $\delta<1/2n$ , the length of $I_{max}$ is actually larger than $1+\delta$ . The lemma follows then from the fact that, since the monitored intervals partition the interval $I$ , $I_{max}$ will contain a monochromatic interval of length at least $1+\delta$ , which will be entirely contained in some monitored interval, and the corresponding blanket-sensor agent will be active.

each of the $k-1$ cuts in $I$ will be $\delta$ -close to a different integer point and these integer points will be the midpoints of the monitored subintervals contained entirely in $I$ or

at least one of the blanket-sensors monitoring the subintervals in $I$ will be active.

In $I$ , there are at least $k-1$ intervals monitored by blanket-sensor agents and we have $k-1$ cuts at our disposal. Assume that there exists some integer point $j-1$ which is a midpoint of some monitored interval $[j-2,j]\subseteq I$ such that there exists no cut that is $\delta$ -close to $j-1$ . This implies that either $[j-2,j]$ is intersected by at least $2$ cuts, or the blanket-sensor agent corresponding to $[j-2,j]$ will be active. To see this, note that if there were not any cuts in $[j-2,j]$ then obviously the whole interval $[j-2,j]$ would be monochromatic and the corresponding blanket-sensor would be active. If $[j-2,j]$ was intersected by a single cut, since the cut lies at distance at least $\delta$ from the midpoint of the interval, there would exist a monochromatic subinterval of $[j-2,j]$ of length at least $1+\delta$ , activating the corresponding blanket-sensor agent $b_{1,j}$ .

However, for the blanket-sensor agent $b_{1,j}$ to not be active, it would have to be the case that some other cut in the interval $[j-2,j]$ is also not $\delta$ -close to the midpoint of one of the adjacent monitored intervals, therefore generating an imbalance in labels that has to be compensated with at least one additional cut in that interval. Given that there are only $k-1$ cuts in the interval $I$ , it follows that in some monitored subinterval $[j^{\prime},j^{\prime}-2]$ there will be a large enough imbalance, i.e a monochromatic subinterval of length at least $1+\delta$ , and the corresponding blanket-sensor agent $b_{1,j^{\prime}}$ will be active. See the right-hand side of Figure 5 for an illustration.

We are now ready to proceed with the proof of Proposition 4.4.

First, recall that by Observation 4.1, at most $n$ cuts can lie in the c-e region. Also recall that from Definition 14, for the circuit encoder $C_{1}$ , the blanket-sensor agent $b_{1,j}$ , $j\in\{2,\ldots,n\}$ has valuation only in the interval $[j-2,j]$ of the c-e region, i.e. it “monitors” the interval $[j-2,j]$ . The blanket-sensor agent $b_{i,j}$ for $i\in\{2,\ldots,p^{C}\}$ is a $(i-1)\frac{\delta^{\rm tiny}}{p^{C}}$ -shifted version of $b_{1,j}$ . We will make the argument for the blanket-sensor agents of the circuit-encoder $C_{1}$ ; the argument for any $C_{i}$ , with $i\neq 1$ is very similar.

It suffices to prove that if consecutive cuts are too far apart or too close together, some blanket-sensor agent will be active.

Case 1: The cuts are too far apart. First consider the case when two consecutive cuts are too far apart (by more than $1$ plus some inverse-polynomial amount $2\delta$ ). More formally, assume that there are two cuts $c_{1}$ and $c_{2}$ such that $c_{2}>c_{1}$ and $c_{2}-c_{1}>1+2\delta$ . Then, as we explain below, there is some $j\in\{2,\ldots,n\}$ such that some subinterval $I_{j}=[j_{1},j_{2}]\subseteq[j-2,j]$ with $j_{2}-j_{1}>1+\delta$ will receive a single label, either $A_{+}$ or $A_{-}$ . In particular, we have the following cases:

There is a $j$ such that $[c_{1},c_{2}]\subseteq[j-2,j]$ . In that case, $[c_{1},c_{2}]$ is such a monochromatic subinterval.

There is a $j$ such that $[j-2,j]\subseteq[c_{1},c_{2}]$ . In that case, the whole monitored subinterval $[j-2,j]$ is such a monochromatic subinterval.

Case 2: The cuts are too close together. Now consider the case when two consecutive cuts are too close together, closer than $1-2n\delta$ . More formally, assume that there are two consecutive cuts $c_{1}$ and $c_{2}$ in the c-e region such that $c_{2}>c_{1}$ and $c_{2}-c_{1}<1-2n\delta$ . Since the cuts are close together, there exists a monitored interval that is intersected by both $c_{1}$ and $c_{2}$ and let $[j-2,j]$ be such an interval. Notice that if there exists no other cut that intersects $[j-2,j]$ , then $[j-2,c_{1}]\cup[c_{2},j]$ is a union of subintervals of length at least $1+\delta$ that receive the same label, and we are done. Therefore there must exist at least $3$ cuts that lie in $[j-2,j]$ . We consider three cases.

There are 5 or more cuts in $[j-2,j]$ . This is an easy case to argue, as if that happens, there will be some interval, either $[0,j-2]$ or $[j,n]$ of length $k$ that is only intersected by at most $k-2$ cuts. By Lemma 4.5, some blanket-sensor agent will be active and we are done.

There are 4 cuts in $[j-2,j]$ . Consider the intervals $[0,j-2]$ and $[j,n]$ . If either $[0,j-2]$ is intersected by at most $(j-2)-2$ cuts or $[j,n]$ is intersected by at most $n-j-2$ cuts, then by Lemma 4.5, some blanket-sensor will be active and we are done. Note also for completeness that, if $j=2$ (respectively $j=n$ ), it is necessarily the case that $[j-2,n]$ (respectively $[0,j-2]$ ) is intersected by $n-4$ cuts and Lemma 4.5 again applies. Therefore, we can assume that $j\in\{3,\ldots,n-1\}$ , and that there are exactly $(j-2)-1$ cuts in $[0,j-2]$ and $n-j-1$ cuts in $[j,n]$ .

Consider the interval $[j,n]$ without loss of generality, as the argument for $[0,j-2]$ is symmetric. By Lemma 4.6, we know that the cuts in $[j,n]$ are $\delta$ -close to integer points and particularly, they are $\delta$ -close to the midpoints of the monitored intervals $[j,j+2],\ldots,[n-2,n]$ . This implies that in the monitored subinterval $[j,j+2]$ , the subinterval $[j,j+1-\delta]$ will be a monochromatic interval of label $A_{j}$ for some $A_{j}\in\{A_{+},A_{-}\}$ ,

In turn, this implies that $[j-1,j]$ has a monochromatic subinterval of length at least $1-\delta$ that receives the label $A_{-j}$ , where $A_{-j}\in\{A_{+},A_{-}\}$ is the complementary label to $A_{j}$ , for the blanket-sensor agent to not be activated, which is only possible if one of the $4$ cuts in $[j-2,j]$ is $\delta$ -close to the integer point $j$ . Propagating the effect of this cut sequence/labelling into the monitored interval $[j-2,j]$ in question, we obtain that $[j-2,j-1]$ also contains a monochromatic interval of length at least $1-\delta$ and label $A_{j}$ , as otherwise blanket-sensor agent $b_{1,j}$ would be active. From this discussion, it follows that:

all the cuts in $[j-2,j]$ are $\delta$ -close to integer coordinates within the interval and

there is at least one cut in $[j-2,j]$ that is $\delta$ -close to the midpoint $j-1$ of the monitored interval, one cut that is $\delta$ -close to the right endpoint $j$ of the monitored interval and at least one cut that is $\delta$ -close to the left endpoint $j-2$ of the monitored interval,

where the very last statement follows from the symmetric argument to the one developed above, for the interval $[0,j-2]$ . See Figure 6 for an illustration.

There are $3$ cuts in $[j-2,j]$ . Again, considering the intervals $[0,j-2]$ and $[j,n]$ as we did in the case of $4$ cuts in $[j-2,j]$ , we can now observe that one of the intervals will be intersected by at most $k-1$ cuts, where $k\in\{j-2,n-j\}$ is its length. Furthermore, if it is intersected by fewer than $k-1$ cuts, by Lemma 4.5 some blanket-sensor agent will be active and we are done. Therefore, we will consider the case when one of the intervals is intersected by exactly $k-1$ cuts and let $[j,n]$ be that interval, without loss of generality, as the argument for $[0,j-2]$ is symmetric.

Following exactly the same arguments as in the second and third paragraph of the case of $4$ cuts above, we can establish a very similar statement, namely that:

all the cuts in $[j-2,j]$ are $\delta$ -close to integer coordinates within the interval and

there is at least one cut in $[j-2,j]$ that is $\delta$ -close to the midpoint $j-1$ of the monitored interval, and one cut that is $\delta$ -close to the right endpoint $j$ of the monitored interval.

Remarks: Looking ahead, certain points in the Significant Region encode New variant high-D Tucker (namely, the ones in the “twisted tunnel”, Definition 23). The Significant Region contains the twisted tunnel, being a somewhat wider 1-dimensional “tunnel” of inverse-polynomial width at most $1/p_{w}(n)$ , whose central axis is the set of points $(\alpha,1/n,\ldots,1/n,1/n-\alpha)$ , where the endpoints are identified together (noting Definition 17). Topologically, the Significant Region is a high-dimensional Möbius strip.

Reducing from New variant high-D Tucker to Consensus-halving

In Sections 5.1 we give an overview of aspects of how we construct an instance $I_{CH}$ of $\varepsilon$ -Consensus-halving (in poly-time) from an instance of New variant high-D Tucker, for inverse polynomial $\varepsilon$ . Section 5.2 describes the new coordinate system for the Möbius-simplex $D$ and establishes key properties. Section 5.3 presents a colouring function of $D$ in terms of the coordinate system of Section 5.2. Section 5.4 describes how to construct a purported solution to $n$ -dimensional New variant high-D Tucker from a solution to $\varepsilon$ -Consensus-halving. In Section 6 we prove that a solution to New variant high-D Tucker that is obtained by reducing to $\varepsilon$ -Consensus-halving, solving it, and converting that solution to a solution to $n$ -dimensional New variant high-D Tucker, really is a valid solution.

We define the reduction from New variant high-D Tucker (Definition 8) to $\varepsilon$ -Consensus-halving.

Let $I_{VT}$ be an instance of New variant high-D Tucker in $n$ dimensions; let $C_{VT}$ be the boolean circuit that represents it. $I_{CH}$ will be the corresponding instance of Consensus-halving. We list ingredients of $I_{CH}$ and give notation to represent them, as follows. $A$ is the consensus-halving domain, an interval of the form $[0,poly(n)]$ . Any agent $a$ has a measure $\mu_{a}:A\longrightarrow\mbox{{IR}}$ represented as a step function (thus having a polynomial number of steps).

$I_{CH}$ has $n$ coordinate-encoding agents $a_{1},\ldots,a_{n}$ (Definition 10). See Figure 9.

The consensus-halving domain $A$ of $I_{CH}$ has a coordinate-encoding region (c-e region) (Definition 9) consisting of the interval $[0,n]$ .

$I_{CH}$ has $p^{C}$ circuit-encoders (Sections 5.1.1, 5.1.4), $C_{1},\ldots,C_{p^{C}}$ .

Each $C_{i}$ has a set ${\cal A}_{i}$ of agents (see Figure 9) which includes $C_{i}$ ’s sensor agents, also circuit-encoding agents (below).

Each $C_{i}$ has an associated circuit-encoding region $R_{i}$ of $A$ ; each $R_{i}$ is an interval of polynomial length, and the $R_{i}$ do not intersect with each other or with the coordinate-encoding region.

${\cal A}_{i}$ contains a polynomial number of circuit-encoding agents (one for each gate of $C_{VT}$ ), having value in $R_{i}$ .

Each $C_{i}$ has $p^{\rm huge}$ sensor agents as defined in Definition 13 each of which has a block of value $1/10$ in a small subinterval of the c-e region as specified in Definition 13, and further value in region $R_{i}$ .

Each $C_{i}$ has $n-1$ blanket-sensor agents as in Definition 14.

Remarks: We associate one cut with each agent; let $c(a)$ be the cut associated with agent $a$ . The cuts $c(a_{i})$ for coordinate-encoding agents, are called the coordinate-encoding cuts (or c-e cuts). A straightforward consequence of Proposition 4.4 is that in any solution, either all $n$ , or $n-1$ , of the coordinate-encoding cuts must lie in the coordinate-encoding region. All other cuts must lie in the regions $R_{i}$ , indeed, every cut, other than the c-e cuts, is constrained by the value of its associated agent, to lie in a small interval that does not overlap any other such intervals. In the event that a c-e cut lies outside the c-e region, we refer to it as a “stray cut”, and while such a cut may initially appear to interfere with the functioning of the circuitry, similarly to we have that the duplication of the circuit using $p^{C}$ circuit-encoders, allows the circuitry to be robust to this problem. See Appendix A for more details.

Recall $C_{VT}$ is the boolean circuit in the instance $I_{VT}$ of New variant high-D Tucker.

We assume that $C_{VT}$ has $2n$ output gates $g_{1},\ldots,g_{n}$ and $g_{-1},\ldots,g_{-n}$ having the property that exactly one of them will take value true (this may be enforced syntactically). $g_{i}$ getting value 1 (true) means that the point at coordinates represented by the input gets coloured $i$ .

$C_{VT}$ has $n\cdot{\rm polylog}(n)$ input gates, representing the coordinates of a point in $B=^{n}$ , each represented with inverse-polynomial precision.

We describe how circuit-encoder $C_{1}$ is derived from $C_{VT}$ . The subsequent circuit-encoders can then be specified in terms of $C_{1}$ . Each gate $g$ of $C_{VT}$ is simulated using a “gate agent” $a(g)$ , as constructed in (see Appendix A for a more detailed exposition). $a(g)$ ’s cut $c(a(g))$ occupies a right position, or a left position, representing true or false, as a function of the cut(s) that represent boolean inputs to $g$ .

The circuit-encoding agents ${\cal A}_{1}$ of $C_{1}$ thus include $2n$ gate agents whose corresponding cuts simulate the values of the output gates of $C_{VT}$ , provided that the input represented by the c-e cuts lies in the “Significant Region” (Definition 18). The positions of these cuts affect the labels of blocks of value held by the $n$ coordinate-encoding agents, as detailed in Section 5.1.2.

Reference sensor-agent. Noting from Definition 13 that the sensor agents for $C_{i}$ are denoted ${\cal S}_{i}=\{s_{i,1},\ldots,s_{i,p^{\rm huge}}\}$ , we let $s_{1,1}$ be the reference sensor-agent: Outputs produced by the circuit $C_{i}$ are taken with reference to the valueWe are using $s_{1,1}$ to denote the boolean value taken by $s_{1,1}$ as well as the sensor itself. $s_{1,1}$ , in the sense that after simulating $C_{VT}$ we take the exclusive-or with $s_{1,1}$ .

We used this crucial technique of Definition 21 in : it performs the task of disorienting the domain while at the same time ensuring continuity when we move a cut from the left-hand side of the c-e region to the right-hand side.

For each $C_{i}$ , we take all $p^{\rm huge}$ input bits, which appear in up to $n+1$ blocks of consecutive 1’s and 0’s, and convert them into the coordinates of a point in the Möbius-simplex (Definition 17). As noted earlier (Observation 4.2) at most $n$ circuit-encoders may receive ill-defined inputs caused by c-e cuts cutting through value-blocks in the c-e region that belong to their sensor agents; we simply assume that the output of those agents is unreliable, indeed adversarially chosen.

We then perform a coordinate transformation described in Section 5.2. A subset of points in the Möbius-simplex maps to a copy of the domain $B$ of the instance $I_{VT}$ (recall Definition 8). These points get their coordinates passed directly to a copy of $C_{VT}$ , and the outputs of $C_{CV}$ are used to provide feedback to the c-e agents as described in Section 5.1.2 (and discussed in Observations 4.2 and 4.3). Other points get coloured in a manner that avoids allowing bogus solutions to $I_{CH}$ (i.e. ones that do not encode solutions to $I_{VT}$ ).

The following generalises . $C_{VT}$ has output gates $g_{j}$ , $j\in\pm[n]$ , with the property that when inputs are well-defined, exactly one output gate evaluates to true. A circuit-encoder simulates $C_{VT}$ using the gate gadgets introduced in (see Appendix A). Let $x_{REF}\in\{\mbox{{\sc true}},\mbox{{\sc false}}\}$ be the negation of the value of the reference sensor (Definition 21). We use additional gates $g^{\prime}_{j}$ , $j\in\pm[n]$ where

if $g_{j}=g_{-j}=\mbox{{\sc false}}$ then $g^{\prime}_{|j|}=\mbox{{\sc true}}$ and $g^{\prime}_{-|j|}=\mbox{{\sc false}}$ ;

if $j>0$ and $g_{j}=\mbox{{\sc true}}$ (so $g_{-j}=\mbox{{\sc false}}$ ) then $g^{\prime}_{j}=g^{\prime}_{-j}=\mbox{{\sc true}}\oplus x_{REF}$ ;

if $j<0$ and $g_{j}=\mbox{{\sc true}}$ (so $g_{-j}=\mbox{{\sc false}}$ ) then $g^{\prime}_{j}=g^{\prime}_{-j}=\mbox{{\sc false}}\oplus x_{REF}$ ;

Each of the c-e agents $a_{1},\ldots,a_{n}$ has $2$ value-blocks of value $1/(2p^{C})$ in region $R_{1}$ , and each gate $g^{\prime}_{j}$ of $C_{1}$ is able to select the label of one of these value-blocks (recall that the boolean value at a gate is represented by two positions that may be taken by the corresponding cut, so that a block of value lies between these two positions.) Figure 20 in Appendix A shows an example of how this feedback works.

Let $A_{j}\in\{A_{+},A_{-}\}$ and let $A_{-j}\in\{A_{+},A_{-}\}$ , $A_{-j}\neq A_{j}$ be the complementary label. The blanket-sensor agents $b_{1,2},\ldots,b_{1,n}$ affect the output of the circuit-encoders as follows:

If none are active, the $2n$ outputs of $C_{1}$ are computed as described in Section 5.1.2.

If $j$ is odd and $b_{1,j}$ is active in direction $A_{j}$ then the output gates $g^{\prime}_{j},g^{\prime}_{-j}$ are both set to the value that causes c-e agent $a_{j}$ to observe more $A_{j}$ .

If $j$ is even and $b_{1,j}$ is active in direction $A_{j}$ then the output gates $g^{\prime}_{j},g^{\prime}_{-j}$ are both set to the value that causes c-e agent $a_{j}$ to observe more $A_{-j}$ .

Rules 2 and 3 override Rule 1, which allocates values that directly encode values output by $C_{VT}$ . Note that the gadgetry of the circuit can ensure that either an excess of $A_{+}$ or an excess of $A_{-}$ can be shown to the corresponding c-e agent as feedback, as the circuit can convert the input value encoded by the value gadget of the blanket-sensor agent in $R_{1}$ to either a “right” or “left” output position, depending on the parity of the index. Also, if more than one blanket-sensor agent is active, they all affect their corresponding gates. The reason for requiring blanket-sensors of different parities to feedback different labels to the c-e agents is to be consistent with the definition of “consistent colours”, see Definition 24.

Note that we do not define the behaviour of the blanket-sensor agents in terms of the reference sensor. They essentially look for an imbalance between $A_{+}$ and $A_{-}$ within some interval of length 2, and when they find a sufficiently large imbalance, they force the circuit $C_{1}$ to show their associated c-e agent more of the over-represented or under-represented label, depending on their parity, which can be done using gadgetry of .

Consider the operation of moving a cut from near the left-hand side of the c-e region to the right-hand side, which corresponds to two points in the Möbius-simplex that are close to each other via a path through the facets that have been identified according to Definition 17. Suppose also that within the c-e region, we do not change the label of any point. Then the blanket-sensor agents behave the same way: if some blanket-sensor agent sees an excess of $A_{+}$ in its interval then it will continue to see an excess of $A_{+}$ . Regarding the (non-blanket) sensor agents, our reduction will make them “want” to produce opposite outputs, but due to the flipping of $x_{REF}$ , the reference sensor value, the final output values produced by $g^{\prime}_{j}$ , $j\in\pm[n]$ are the same, and we will have continuity across this facet.

We next describe how the $p^{C}$ circuit-encoders differ from each other. Each $C_{i}$ has a set of circuit-encoding agents ${\cal A}_{i}$ , which contains $C_{i}$ ’s sensor agents ${\cal S}_{i}$ . For $i\in[n]$ let ${\cal A}_{i}$ be the agents $a_{i,1},\ldots,a_{i,p}$ for some polynomial $p$ .

For all $i,j$ , $\mu_{a_{i,j}}(x)=\mu_{a_{1,j}}(y)$ where $x$ and $y$ are corresponding points in $R_{i}$ and $R_{j}$ . By “corresponding points” here we mean points that lie in the same distance from the left-endpoint of the respective intervals $R_{i}$ and $R_{j}$ ; see , Section 4.4.3 for the precise definition.

For all $i,j$ , all $x$ in the c-e region, $\mu_{a_{i,j}}(x)$ , is specified in Definition 14.

The second of these items says that in the c-e region, the valuation function of the agents that make up $C_{i}$ differ from those of $C_{1}$ by having been shifted to the right by $\delta^{\rm tiny}(i-1)$ , where this shift wraps around in the event that we shift beyond $n$ (the right-hand point of the c-e region). In other respects, $C_{i}$ is an exact copy of $C_{1}$ , save that $C_{i}$ ’s internal circuitry lies in $R_{i}$ rather than $R_{1}$ .

For each $C_{i}$ , the c-e agents have a further $2n$ value-blocks of value $1/(2p^{C})$ in region $R_{i}$ , whose labels are governed by the outputs produced by $C_{i}$ in the same way as for $C_{1}$ . Consequently we have the following observation.

2 An alternative coordinate system for the Möbius-simplex

Recall that the Möbius-simplex $D$ is the $n$ -simplex consisting of points $(x_{1},\ldots,x_{n+1})$ whose components are non-negative and sum to 1. Furthermore, a typical point in $D$ is directly encoded via the positions of $n$ cuts in the c-e region.

Here we specify a transformed coordinate system that is needed in order to encode instances of New variant high-D Tucker. We will embed the hypercube-shaped domain of an instance of New variant high-D Tucker in a hypercube in the transformed coordinates, and then use properties of the transformed coordinate system to extend the labelling function to the rest of the domain in a way that does not introduce bogus solutions (i.e. fixpoints of the extended function that lie outside the hypercube and do not encode solutions of New variant high-D Tucker).

Let $F_{0}$ be the set of points in $D$ of the form $(0,x_{2},\ldots,x_{n},0)$ ; thus $F_{0}$ is a $(n-2)$ -face of $D$ . See Figure 10. For $\tau\in$ , let ${\bf x}_{\tau}$ be the point

(So, ${\bf x}_{0}$ and ${\bf x}_{1}$ are the endpoints of the 1-dimensional edge of $D$ that is not contained in $F_{0}$ .) Let $D_{\tau}$ be the $(n-1)$ -simplex consisting of convex combinations of $F_{0}$ , and ${\bf x}_{\tau}$ . Thus $D_{0}$ and $D_{1}$ are the two facets of $D$ that have been identified together as in Definition 17.

$D_{\tau}$ contains the point ${\bf 0}_{\tau}=(\tau/n,1/n,\ldots,1/n,(1-\tau)/n)$ , which we regard as the origin of $D_{\tau}$ . The set of points $\{{\bf 0}_{\tau}:0\leq\tau\leq 1\}$ will be referred to as the axis; it will transpire that all solutions must lie within an inverse polynomial distance from the axis (in particular will be in the Significant Region).

We then refer to points in $D_{\tau}$ by means of the coordinates in a coordinate system that itself is a linear function of $\tau$ . With respect to any fixed $\tau\in$ we define $n-1$ vectors $(d^{\tau}_{2},\ldots,d^{\tau}_{n})$ as follows. A key feature is that $(d^{\tau}_{2},\ldots,d^{\tau}_{n})$ form a basis of $D_{\tau}$ (so that with respect to the origin ${\bf 0}_{\tau}$ , any point in $D_{\tau}$ has unique coordinates). The other key feature (Observation 5.3) is that at $\tau=0$ the coordinate/directions are “equal and opposite” to the coordinates at $\tau=1$ . Also, the coordinate system varies suitably smoothly.

As a warm-up we start by considering $d^{\tau}_{2}$ :

$d^{\tau}_{2}$ consists of increasing the second coordinate at the expense of its neighbours. For small $\tau$ we increase mainly at the expense of the third coordinate and as $\tau$ increases, we increase the second coordinate more at the expense of the first. Notice in particular that at $\tau=\frac{1}{2}$ we have $d^{\tau}_{2}=(-\frac{1}{2},1,-\frac{1}{2},0,\ldots,0)$ .

Thus, again this consists of the $i$ -th coordinate increasing at the expense of its neighbours, and we have in particular

Observation 5.2 makes the important point that by linearity, the vectors $d^{\tau}_{i}$ , $i=2,\ldots,n$ , can be used as a coordinate system to refer to points in $D_{\tau}$ .

Any point ${\bf x}$ in $D_{\tau}$ can be uniquely expressed as a sum

To see this, note first that $D_{0}$ is points in $D$ of the form $(0,x_{2},\ldots,x_{n+1})$ , and $D_{1}$ is points of the form $(x_{1},\ldots,x_{n},0)$ . Note that the observation certainly works for $\tau=0$ or $\tau=1$ . To see that it works for intermediate $\tau$ , note that the vectors $d^{\tau}_{i}$ are linearly independent, and the reason why they span $D_{\tau}$ is that any vector $d^{\tau}_{i}$ is equal to $(1-\tau)$ multiplied by a vector in $D_{0}$ , added to $\tau$ multiplied by an equal-length vector in $D_{1}$ . So these vectors do indeed lie in $D_{\tau}$ .

where $(0;\alpha_{2},\ldots,\alpha_{n})\equiv(1;-\alpha_{2},\ldots,-\alpha_{n})$ .

With regard to Definition 22, consider two points ${\bf x}=(0;\alpha_{2},\ldots,\alpha_{n})$ , ${\bf x}^{\prime}=(1;-\alpha_{2},\ldots,-\alpha_{n})$ , that have been equated with each other. Assume these points are near the axis, specifically $|\alpha_{j}|<1/10n$ for all $j$ . Notice that

the cuts in the c-e region for ${\bf x}$ and ${\bf x}^{\prime}$ partition the c-e region in the same way.

(with reference to Figure 11) when we move from a point in $D_{1-\varepsilon}$ to a nearby point in $D_{\varepsilon}$ , for any $j\in\{2,\ldots,n\}$ , the direction of increasing $\alpha_{j}$ segues smoothly to the direction of decreasing $\alpha_{j}$ .

Our assumption that $|\alpha_{j}|<1/10n$ ensures that cuts are fairly evenly-spaced, and movement in any of the directions $d^{\tau}_{j}$ does not cause the cuts to cross each other.

In Subsection 5.2.1, we show that for points near the axis (i.e. within Euclidean distance $1/10n^{2}$ of the axis), the computation of the coordinate transformation —and its inverse— have the property that small perturbations of the input values lead to inverse-polynomial upper bounds on the resulting perturbations of the output values. Since we have this for both the transformation and its inverse, it follows that there are also inverse-polynomial lower bounds on the resulting perturbations of the output values.

Note that Proposition 5.4 does not hold for all points in $D$ ; the restriction to a neighbourhood of the axis is needed. For points on $F_{0}$ , all values of $\tau$ are equivalent, and for points close to $F_{0}$ , perturbed versions of them could result in large perturbations of $\tau$ .

We show in the next section that the coordinate transformation, and its inverse, can be computed in polynomial time, for points in $D$ that are within some inverse polynomial distance from the axis.

We verify here that (for points in the vicinity of the axis), our transformation may be performed efficiently, and that small perturbations of inputs lead to small perturbations of the outputs (in either direction). The easy direction is the computation of $(x_{1},\ldots,x_{n+1})$ from $(\tau;\alpha_{2},\ldots,\alpha_{n})$ . Recall that a point ${\bf x}$ on the original domain can be expressed in terms of the transformed coordinates $(\tau;\alpha_{2},\ldots,\alpha_{n})$ and the origin ${\bf 0}_{\tau}=(\frac{\tau}{n},\frac{1}{n},\ldots,\frac{1}{n},\frac{1-\tau}{n})$ as ${\bf x}={\bf 0}_{\tau}+\sum_{i=2}^{n}\alpha_{i}d^{\tau}_{i}$ . Therefore we have:

In the other direction, given $(x_{1},\ldots,x_{n+1})$ we first compute the value of $\tau$ for the transformed coordinate system: Note that $(x_{1},\ldots,x_{n+1})$ must be a convex combination of $x_{\tau}$ and $F_{0}$ (where recall that $x_{\tau}=(\tau,0,\ldots,0,1-\tau)$ and $F_{0}$ is points of the form $(0,x_{2},\ldots,x_{n},0)$ ), therefore $\tau$ can be computed as the solution to the equation

Note that the dependence of $\tau$ on $x_{1}$ and $x_{n+1}$ is not excessively sensitive near the axis, since $x_{1}+x_{n+1}$ is close to $1/n$ . Having computed $\tau\in$ , we could simply solve the equations above (the ones used to compute $x_{1},\ldots,x_{n+1}$ ) for $\alpha_{2}$ , $\alpha_{3}$ and so on successively, using the derived formulas for $\alpha_{i}$ for the computation of $\alpha_{i+1}$ and express each $\alpha_{i}$ as a function of only $\tau$ and the values $x_{1},x_{2},\ldots x_{n+1}$ . However, in the extremal cases of $\tau=0$ and $\tau=1$ , some of the $\alpha_{i}$ values might “disappear”; for example, for $\tau=0$ , expressing $\alpha_{3}$ in terms of only $\tau$ , $x_{1}$ and $x_{2}$ is not possible, since for $\tau=0$ we do not obtain a formula for $\alpha_{2}$ to substitute into the equation for $x_{2}$ . To remedy this, we consider two cases:

Case 1: $\tau\geq\frac{1}{2}$ . We compute $\alpha_{2},\ldots,\alpha_{n}$ as follows:

and so on for $\alpha_{4},\ldots,\alpha_{n}$ .

Case 2: $\tau\leq\frac{1}{2}$ . We compute $\alpha_{2},\ldots,\alpha_{n}$ starting at the opposite end:

and so on for $\alpha_{n-2},\ldots,\alpha_{2}$ .

Note that inverse polynomial-size perturbations of the $x_{i}$ lead to inverse polynomial-size perturbations of the transformed coordinates. As a sanity check, note that at the boundary (points with $\tau=0$ are the same as points with $\tau=1$ ), if we move a cut at the LHS to the RHS (so $(0,x_{2},\ldots,x_{n+1})$ becomes $(x_{2},\ldots,x_{n+1},0)$ ), it can be checked that the $\alpha_{i}$ get negated.

Note that these computations should be done with a precision (or rounding error) polynomially smaller than $\delta^{\rm tiny}$ .

This section defines a partial function $f:D\rightarrow\{-1,0,1\}^{n}$ ( $D$ being the $n$ -dimensional Möbius-simplex (Definition 17)). $f$ is constructed in polynomial time based on an instance $I_{VT}$ of New variant high-D Tucker in $n$ dimensions, defined using circuit $C_{VT}$ . $f$ is defined in the Significant Region (Definition 18) which is the set of points where no blanket-sensor agents (Definition 14) are active, thus it includes the twisted tunnel $T$ (Definition 23). $f$ is computable by a circuit $C$ , that is used to define the operations of the circuit-encoders in a derived instance of Consensus-halving. Within the Significant Region, $f$ determines the outputs of the circuit-encoders.

The function $f$ maps a point ${\bf x}$ in the Significant Region to a vector of length $n$ , $e_{f}({\bf x})$ , where $e_{f}({\bf x})_{j}=1$ means that the point receives colour $j$ , $e_{f}({\bf x})_{j}=-1$ means that the point receives colour $-j$ and $e_{f}({\bf x})_{j}=0$ means that the point does not receives colour $j$ or $-j$ . In general, $e_{f}({\bf x})$ may have multiple non-zero entries. We will use the term the colour of ${\bf x}$ for points that only receive a single colour (and therefore their outputs are vectors with only one non-zero entry).

The function $f$ will have a corresponding vector-valued function $f^{\prime}$ (Section 6.1) that more closely represents choices of labels $A_{+}$ / $A_{-}$ that the circuit shows to the c-e agents. We will do this in such a way that no “bogus” solutions result from the transition to parts of $D$ where blanket-sensor agents are active. By construction, there are no solutions where blanket-sensor agents are active, so all solutions occur where $f$ is defined.

In Section 6.1 we then define a vector-valued “Borsuk-Ulam style” function $F$ in terms of $f$ . Letting $I_{VT}$ be an instance of New variant high-D Tucker, $F({\bf x})$ will be approximately zero iff given ${\bf x}$ , we can derive an approximate consensus-halving solution to $I_{VT}$ . It will be shown that approximate zeroes of $F$ provide solutions to $I_{VT}$ .

Recall that $D$ is the set of points $(x_{1},\ldots,x_{n+1})$ whose components are non-negative and sum to 1. And, the “Significant Region” (Definition 18) of $D$ consists of points $(x_{1},\ldots,x_{n+1})$ for which coordinates $x_{i}$ ( $2\leq i\leq n-1$ ) differ from $1/n$ by at most an inverse polynomial $\delta^{w}=1/p^{w}(n)$ (Proposition 4.4). ( $\delta^{w}$ represents an upper bound on the thickness of the Significant Region.)

Let $B$ be the $n$ -dimensional “box” associated with $I_{VT}$ (recall $I_{VT}$ is represented by circuit $C_{VT}$ that maps points in $B$ to $\pm[n]$ ). We embed a copy of $B$ in $D$ as follows. Recall the way facets of $B$ are coloured in Definition 8. Let $(x_{1},\ldots,x_{n})$ denote a typical point in $B$ , and assume that the facets of $B$ with maximum and minimum $x_{1}$ (i.e. $x_{1}=1$ and $x_{1}=-1$ respectively) are the panchromatic facets of $B$ (as in Definition 8), and for $i\geq 2$ the facet of $B$ with maximum $x_{i}$ ( $x_{i}=1$ ) consists of points that do not have colour $i$ , and the the facet of $B$ with minimum $x_{i}$ ( $x_{i}=-1$ ) consists of points that do not have colour $-i$ .

The twisted tunnel $T$ is defined as follows. The axis of $T$ is the set of all points ${\bf 0}_{\tau}$ as defined in Section 5.2. The twisted tunnel is the set of all points with transformed coordinates $(\tau;\alpha_{2},\ldots,\alpha_{n})$ such that for all $i$ , $|\alpha_{i}|<\delta^{T}$ . Note that $\delta^{T}$ is an inverse polynomial quantity sufficiently small that $T$ is a subset of the Significant Region; this is achieved since by definition, $\delta^{T}$ is polynomially smaller than $\delta^{w}$ of Proposition 4.4. Thus, $T$ has (with respect to the transformed coordinates) a $(n-1)$ -cube-shaped intersection with any $D_{\tau}$ .

We define the behaviour of $f$ over the Significant Region (Definition 18) in 3 stages, as follows.

A point ${\bf x}=(x_{1},\ldots,x_{n})$ in $B$ is mapped to a point $g({\bf x})$ in $D$ as follows. $g({\bf x})$ lies in $D_{\tau}$ , where we choose $\tau=\frac{1}{2}+\delta^{T}\cdot x_{1}$ . Then (noting (4)) we set $g({\bf x})$ equal to ${\bf 0}_{\tau}+\sum_{i=2}^{n}\delta^{T}\cdot x_{i}d^{\tau}_{i}$ (i.e. $g({\bf x})$ has transformed coordinates $(\frac{1}{2}+\delta^{T}x_{1};\delta^{T}x_{2},\ldots,\delta^{T}x_{n})$ ). $g({\bf x})$ will receive a single colour; the colour of $g({\bf x})$ — i.e. the non-zero entry of $f(g({\bf x}))$ — is set equal to the colour allocated to ${\bf x}$ in $B$ by $I_{VT}$ . (Notice that the centre of $B$ is mapped to $(1/2n,1/n,\ldots,1/n,1/2n)$ , which is the origin of $D_{\frac{1}{2}}$ , and the centre of the Significant Region. This point has (recalling Definition 22) transformed coordinates $(\frac{1}{2};0,\ldots,0)$ where the first entry is the value of $\tau$ .)

We also colour other points in $T$ as follows — these will also receive single colours. Suppose ${\bf y}$ belongs to $D_{\tau}$ , where $\tau<\frac{1}{2}-\delta^{T}$ or $\tau>\frac{1}{2}+\delta^{T}$ . According to (4), ${\bf y}={\bf 0}_{\tau}+\sum_{i=2}^{n}\alpha_{i}d^{\tau}_{i}$ , and ${\bf y}$ has transformed coordinates $(\tau;\alpha_{2},\ldots,\alpha_{n})$ . Suppose all the $\alpha_{i}$ lie in the range $[-\delta^{T},\delta^{T}]$ . Then if $\tau<\frac{1}{2}-\delta^{T}$ , we set the colour of ${\bf y}$ to the colour of a point ${\bf y}^{\prime}=(\frac{1}{2}-\delta^{T};\alpha_{2},\ldots,\alpha_{n})$ . Thus ${\bf y}^{\prime}\in D_{\frac{1}{2}-\delta^{T}}$ , and the other transformed coordinates (Definition 22) are the same for ${\bf y}$ and for ${\bf y}^{\prime}$ . We do a similar thing for points in $D_{\tau}$ for $\tau>\frac{1}{2}+\delta^{T}$ . That is, if ${\bf y}$ has transformed coordinates $(\tau;\alpha_{2},\ldots,\alpha_{n})$ where $\tau>\frac{1}{2}+\delta^{T}$ and the $\alpha_{i}$ are all at most $\delta^{T}$ in absolute value, then ${\bf y}$ gets the same colour as a point ${\bf y}^{\prime}$ whose transformed coordinates are $(\frac{1}{2}+\delta^{T};\alpha_{2},\ldots,\alpha_{n})$ .

The Significant Region (Definition 18) is points in $D$ where no blanket-sensor agents are active, a subset of points that are close to the axis in the sense of Proposition 4.4. Consider ${\bf x}\in D\setminus T$ with transformed coordinates $(\tau;\alpha_{2},\ldots,\alpha_{n})$ .

For each $j\in\{2,\ldots,n\}$ if $\alpha_{j}>\delta^{T}$ then ${\bf x}$ gets colour $-j$ ;

For each $j\in\{2,\ldots,n\}$ if $\alpha_{j}<-\delta^{T}$ then ${\bf x}$ gets colour $j$ ;

these are not mutually exclusive, ${\bf x}$ gets at least one colour, possibly more.

Notice that (within the subspace $D_{\tau}$ ) the side(s) of the twisted tunnel $T$ closest to ${\bf x}$ is guaranteed not to be opposite to any colour of ${\bf x}$ .

For a subset $S$ of colours, let $R(S)$ be the region with colours in $S$ . We call these the “outer regions”.

Proposition 6.7 notes that when colour-regions meet each other at opposite ends of $T$ (which have been identified with each other according to the definition of the Significant Region), they will have equal and opposite colours.

4 How to compute a solution to New variant high-D Tucker from a solution to Consensus-halving

Suppose we have a solution $S_{CH}$ to an instance $I_{CH}$ of Consensus-halving, derived by our reduction from an instance $I_{VT}$ of New variant high-D Tucker.

Let ${\bf x}$ be the point in the Möbius-simplex represented by the c-e cuts of $S_{CH}$ . Proposition 4.4 already tells us that ${\bf x}$ must lie within some inverse-polynomial distance of the axis, since if not, some blanket-sensor agent will be active. We prove in Section 6 that ${\bf x}$ must lie within, or very close to, the twisted tunnel.

From this we identify two colour-regions that have equal and opposite colours, as follows. Let ${\bf x}$ have transformed coordinates $(\tau;\alpha_{2},\ldots,\alpha_{n})$ , which can be computed with inverse-polynomial precision from the c-e cuts.

If ${\bf x}$ occurs within the embedded copy of $B$ (Section 5.3) then identify two circuit-encoders $C_{i}$ and $C_{i^{\prime}}$ that both receive reliable inputs and have equal and opposite outputs. The proof of Proposition 6.4 tells us that this is always possible.

If ${\bf x}$ lies in $T$ but not in the embedded copy of $B$ , then we similarly find two distinct colour-regions that are adjacent and with opposite colours. Since these colour-regions are just extensions of the cubelets that lie on the panchromatic facets of the embedded instance of New variant high-D Tucker, we are done. Formally, if $S_{CH}$ represents a point with transformed coordinates $(\tau;\alpha_{2},\ldots,\alpha_{n})$ where $\tau>\frac{1}{2}+\delta^{T}$ , we take the point $(\frac{1}{2}+\delta^{T};\alpha_{2},\ldots,\alpha_{n})$ , and similarly, if $S_{CH}$ represents a point with transformed coordinates $(\tau;\alpha_{2},\ldots,\alpha_{n})$ where $\tau<\frac{1}{2}-\delta^{T}$ , we take the point $(\frac{1}{2}-\delta^{T};\alpha_{2},\ldots,\alpha_{n})$ .

Completing the Proof of Theorem 2.1

Let $I_{VT}$ be an instance of New variant high-D Tucker in $n$ dimensions, given in terms of circuit $C_{VT}$ ; suppose Consensus-halving instance $I_{CH}$ is derived from it by our reduction. Recall that $\varepsilon=\delta^{\rm tiny}/10$ . We show that any $\varepsilon$ -approximate solution $S_{CH}$ to $I_{CH}$ allows a solution to $I_{VT}$ to be recovered using Section 5.4.

In $S_{CH}$ , only the c-e cuts may lie in the c-e region (Observation 4.1), and every other cut $c(a)$ for $a$ not a c-e agent, must lie in some interval outside the c-e region (in order for $a$ ’s value to be evenly split). It follows from Proposition 4.4 that at least $n-1$ c-e cuts must lie in the c-e region, and they are evenly-spaced (the gaps between them differ from 1 by an inverse polynomial). The remaining cut may occur elsewhere, in which case it becomes what we called a “stray cut” in , and in that case, the “double negative lemma” of may be applied to prove that it has little effect on the quality of a solution: it causes a single circuit-encoder to have unreliable output.

Most or all of the points are in the twisted tunnel. In that case it will be proved that the procedure of Section 5.4 identifies a solution to New variant high-D Tucker; see Sections 6.1, 6.2.

Most or all are in the outer regions. In that case we are not at a solution since the colours cannot cancel each other.

Some of the points are outside the Significant Region. In that case we are far from the twisted tunnel, and Section 6.3 argues that no solution is possible here.

All points are outside the Significant Region. Then, we are certainly far from a solution since various blanket-sensor agents will be active, and the points are so close together (within $\delta^{\rm tiny}$ ) that the directions in which they are active, cannot cancel.

Recall that $D$ denotes the $n$ -dimensional Möbius-simplex (Definition 17). We start by defining a function $f^{\prime}:D\rightarrow^{n}$ based on $f$ defined as in Section 5.3. $f^{\prime}$ simulates the effect of the blanket-sensor agents as described in Section 5.1.3. Let ${\bf x}\in D$ .

When no blanket-sensor agents are active at ${\bf x}$ . Here we are in the Significant Region.

In $T$ , $f^{\prime}$ behaves like $f$ in the sense that if $f$ assigns the colour $i$ to ${\bf x}$ (recall that it assigns a single colour to points in $T$ ), then set $f^{\prime}({\bf x}):={\bf e}_{i}$ if $i>0$ , $f^{\prime}({\bf x}):=-{\bf e}_{|i|}$ for $i<0$ .

Outside $T$ , in outer region $R(S)$ , $f^{\prime}({\bf x}):=\sum_{i\in S,i>0}{\bf e}_{i}+\sum_{i\in S,i<0}-{\bf e}_{|i|}$ .

When one or more blanket-sensor agents are active. If the $j$ -th blanket-sensor agent of $C_{1}$ , $b_{1,j}$ is active towards $A_{+}$ (respectively $A_{-}$ ) then the $j$ -th entry of $f^{\prime}({\bf x})$ is set to $1$ if $j$ is odd and to $-1$ if $j$ is even (respectively, to $-1$ if $j$ is odd and $1$ is $j$ is even). This is done for all active blanket-sensor agents, thus $f^{\prime}({\bf x})$ can contain multiple 1’s and $-1$ ’s.

The following points are similar to Observation 4.3:

Suppose that circuit-encoder $C_{i}$ (some $i\in[p^{C}]$ ) of $I_{CH}$ receives reliable inputs. (Observation 4.2 tells us that at most $n$ of them fail to receive reliable inputs.) Then $C_{i}$ computes $f^{\prime}$ at a point within distance $\delta^{\rm tiny}$ from the ${\bf x}\in D$ encoded by the c-e cuts, in the sense that the value observed by each c-e agent $a_{j}$ that is labelled by $A_{+}$ , minus the amount labelled $A_{-}$ , restricted to that part of $a_{j}$ ’s value that lies in $R_{i}$ and so is governed by the output of $C_{i}$ , is the $j$ -th component of $f^{\prime}$ .

This follows from the construction of Section 5.1.2 and the association of boolean values $\mbox{{\sc true}},\mbox{{\sc false}}$ with the labels $A_{+}$ and $A_{-}$ . For ${\bf x}\in D$ , $F({\bf x})$ is the average of the outputs of the $C_{i}$ ; Proposition 6.2 provides the details.

$I_{CH}$ computes a function $F$ in the following sense. Let ${\bf x}$ be the point encoded by the c-e agents. Suppose all agents other than the c-e agents have error (i.e. discrepancy between $A_{+}$ and $A_{-}$ that they observe) at most $\varepsilon$ . Then the error of the c-e agents is within additive distance $1/n$ from the average value of $f^{\prime}$ , averaged over a set of points all within $\delta^{\rm tiny}$ of ${\bf x}$ .

We put together various observations about the way $I_{CH}$ is constructed. Observation 4.3 told us that the values observed by the c-e agents are the average of a set of points all within distance $\delta^{\rm tiny}$ of each other. The additive distance $1/n$ results from the existence of up to $n$ circuit-encoders that either fail to receive good inputs (Observation 4.2), or are affected by the stray cut, taken in conjunction with the fact that we average over $p^{C}$ points, where $p^{C}$ can be taken to be at least $2n^{2}$ .

$\delta^{\rm tiny}$ can be chosen to be sufficiently small (but still inverse-polynomial) that given a set of $p^{C}$ points ${\bf x}_{1},\ldots,{\bf x}_{p^{C}}\in D$ within distance $\delta^{\rm tiny}$ of each other, when we compute their transformed coordinates ${\bf y}_{1},\ldots,{\bf y}_{p^{C}}$ , we have:

Every pair of points ${\bf y},{\bf y}^{\prime}\in\{{\bf y}_{1},\ldots,{\bf y}_{p^{C}}\}$ has the property that they either lie in the same colour-region, or adjacent colour-regions (where a “colour-region” is one of monochromatic regions of Section 5.3), or one of the outer regions.

In identifying which colour-region a point with transformed coordinates ${\bf y}$ belongs to, for the colour-regions in the twisted tunnel $T$ , we compare coordinates with certain threshold values. These threshold values differ from each other by inverse-polynomial amounts, and the smallest difference between any pair of them is inverse-polynomial. Applying Proposition 5.4, we can keep the ${\bf y}_{i}$ closer to each other than this. (Colour-regions lie in the Significant Region, so these points lie within $1/10n^{2}$ of the axis, so Proposition 5.4 is applicable.)

This applies also to the outer regions $R(S)$ for sets of colours $S$ . Identifying which $R(S)$ a point ${\bf y}$ belongs to, uses the same information on comparisons of its coordinates with inverse-polynomials.

We call $F$ a Borsuk-Ulam-style function — The suffix “style” is to note that we define a kind of function that has desirable properties similar to those of a Borsuk-Ulam function, but for example the domain of the function is $D$ as opposed to a sphere. Also, the function $F$ is “approximately Lipschitz” rather then truly continuous, which is good enough for our purposes.

$|F({\bf x})|\leq\varepsilon$ (here, $|F|$ denotes the $L_{\infty}$ or “maximum” norm of $F$ ) iff ${\bf x}$ encodes an approximate Consensus-halving solution. Regarding this point, $F$ is not simulating a Borsuk-Ulam function, but rather simulating a function consisting of the difference between the values taken by a Borsuk-Ulam function, at two antipodal points.

2 Encoding the output of F𝐹F with a Consensus-halving solution

Let $S_{CH}$ be an $\varepsilon$ -approximate solution to $I_{CH}$ . Suppose that the c-e cuts of $S_{CH}$ represent a point ${\bf x}$ that lies in the twisted tunnel. Then we can reconstruct a solution to $I_{VT}$ in polynomial time.

Recall that $S_{CH}$ , $I_{CH}$ and $I_{VT}$ and $\varepsilon$ are as introduced at the start of Section 6.

Observation 6.1 tells us that if a circuit-encoder $C_{i}$ receives reliable inputs, it outputs the colour of a point in the Möbius-simplex that lies within $\delta^{\rm tiny}$ of ${\bf x}$ .

We note next that the feedback received by the c-e agents in $I_{CH}$ corresponds to the average (over $i\in[p^{C}]$ ) of the feedback received by the individual circuit-encoders $C_{i}$ . In detail, the $i$ -th coordinate of a typical point in $B=^{n}$ is obtained by taking c-e agent $a_{i}$ , and (given any attempt at a consensus-halving solution $S$ ) subtracting $a_{i}$ ’s value for the parts of the consensus-halving domain labelled $A_{-}$ according to $S$ , from those labelled $A_{+}$ . The resulting point is at the centre (or origin) of $B$ iff the c-e agents have balanced allocations of $A_{+}$ and $A_{-}$ (as required for a consensus-halving solution), and more generally, a point in $^{n}$ is close to the centre of $B$ iff the c-e agents have approximately balanced allocations of $A_{+}$ and $A_{-}$ .

Observation 4.2 tells us that at most $n$ circuit-encoders fail to receive reliable input. If all circuit-encoders received reliable input, then the total error at a solution would be at most $\varepsilon$ , i.e. the precision parameter of the $I_{CH}$ instance. However, since at most $n$ of them receive unreliable input, we might have an added discrepancy of at most $n/p^{C}$ when taking the average and therefore we need to get within distance $\varepsilon+\frac{2n}{p^{C}}$ of the centre of $B$ . For this to be possible, we need some of the cancelling to take place amongst the outputs of the circuit-encoders that received reliable inputs, so we really can find a pair of correctly oppositely-coloured points.

Proposition 6.3 tell us that if ${\bf x}\in D$ is the point in $D$ represented by some Consensus-halving solution, then provided ${\bf x}$ lies in the twisted tunnel, the corresponding cluster of $p^{C}$ points must be mapped by the circuit-encoders $C_{i}$ that mostly cancel each other out, so we find pairs of points that belong to oppositely-labelled colour-regions, from which Section 5.4 tells us how to recover two oppositely-coloured cubelets of $I_{VT}$ .

It remains to rule out the possibility of ${\bf x}$ occurring outside the twisted tunnel.

3 No bogus approximate-zeroes of F𝐹F at boundary of Significant Region

Proposition 6.4 tells us that $\varepsilon$ -approximate zeroes of $F$ (inputs for which $F$ has value in $[-\varepsilon,\varepsilon]^{n}$ ) within the twisted tunnel $T$ must encode solutions. Around $T$ , there are outer regions $R(S)$ ; note that if $i\in S$ then $-i\not\in S$ and moreover there is an inverse-polynomial lower bound on the distance between any pair of points belonging to outer regions containing opposite colours. But we have to rule out points with colour $j\in S$ being averaged with nearby points that are “coloured” $-j$ due to a blanket-sensor agent. In more detail, if ${\bf x}\in R(S)$ and ${\bf x}$ is within $\delta^{\rm tiny}$ of ${\bf x}^{\prime}$ for which the $j$ -th blanket-sensor agent is active and provides feedback corresponding to $-j$ , then we will prove that $S$ contains some other colour $k\not=j$ and no point in a $\delta^{\rm tiny}$ -neighbourhood of ${\bf x}$ activates the $k$ -th blanket-sensor $b_{1,k}$ to provide feedback corresponding to $-k$ .

In the following, we will refer to cuts in the following manner: “cut $i$ ” refers to the $i$ -th cut (from left to right) in the c-e region. Also, recall that the width $\delta^{T}$ of the twisted tunnel is smaller than any other inverse-polynomial quantities of interest, apart from $\delta^{\rm tiny}$ , which itself is smaller than all other inverse-polynomials of interest, including $\delta^{T}$ . We provide the following definition of a consistent colour.

For ${\bf x}=(\tau;\alpha_{2},\ldots,\alpha_{n})$ in the Significant Region, colour $j\in\{\pm 2,\ldots,\pm n\}$ , let $A_{j}\in\{A_{+},A_{-}\}$ be the label that tends to increase in interval $[j-2,j]$ when the $|j|$ -th coordinate $\alpha_{j}$ of ${\bf x}$ is increased if $j>0$ , or decreased if $j<0$ . ( $A_{j}$ depends on the sign and parity of $j$ .) We say that ${\bf x}$ has consistent colour $j$ if

if $j>0$ then $\alpha_{j}>2\delta^{T}$ ; if $j<0$ then $\alpha_{|j|}<-2\delta^{T}$ ;

at least $\frac{1}{2}-\frac{p^{\rm large}}{2p^{\rm huge}}$ of the interval $[j-2,j]$ gets the label $A_{j}$ .

Condition 1 says that at ${\bf x}\in R(S)$ , colour $j$ is a member of $S$ and the corresponding transformed coordinate is sufficiently far from the twisted tunnel. Condition 2 says that we are at least some (small but significant) distance from triggering the $j$ -th blanket-sensor in a direction that corresponds to excessive colour $-j$ . In other words, for the $j$ -th blanket-sensor to become active in direction $-j$ , we would have to increase $A_{-j}$ by an inverse-polynomial amount.

The following proposition establishes that for points in the outer regions $R(S)$ , consistent colours exist.

Suppose ${\bf x}=(\tau;\alpha_{2},\ldots,\alpha_{n})$ belongs to outer region $R(S)$ and that ${\bf x}$ is within distance $\delta^{\rm tiny}$ of the boundary of the Significant Region. Then ${\bf x}$ has a consistent colour in $\{\pm 2,\ldots,\pm n\}$ .

Case 1: $j>0$ (i.e., $\alpha_{j}>0$ ). In this case, moving in direction $d^{\tau}_{j}$ causes cuts $j-1$ and $j$ to move away from each other; this is illustrated in Figure 13.

We claim that $j$ is a consistent colour for ${\bf x}$ . Note first that $\alpha_{j}>2\delta^{T}$ and therefore Condition 1 is satisfied, since $j>0$ in this case. $\alpha_{j}>2\delta^{T}$ follows from the fact that $j\in\arg\max_{i\in\{2,\ldots,n\}}|\alpha_{i}|$ , we are close to the boundary of the Significant Region, and the width of Significant Region is polynomially larger than that of the twisted tunnel. In order to be close to the boundary of the Significant Region, we must have moved more than $2\delta^{T}$ in some direction from $\bf{0}_{\tau}$ and by the choice of $j$ , it holds that $\alpha_{j}>2\delta^{T}$ .

For Condition 2, recall first that ${\bf x}$ has transformed coordinates $(\tau;\alpha_{2},\ldots,\alpha_{n})$ , and that the origin of $D_{\tau}$ has transformed coordinates ${\bf 0}_{\tau}=(\tau;0,\ldots,0)$ . The $(j-1)$ -st and $j$ -th cuts corresponding to the point ${\bf 0}_{\tau}$ are located at positions $j-2+\tau$ and $j-1+\tau$ respectively, and are shown in red in Figure 13. Near the axis, where the cuts are evenly-spaced (see Proposition 4.4), movement in direction $d^{\tau}_{j}$ corresponds to moving the $(j-1)$ -st and $j$ -th cuts (in the c-e region) away from each other. We will consider moving from ${\bf 0}_{\tau}$ to ${\bf x}$ via a point ${\bf x}_{j}$ in which we will only have increased the transformed coordinate $\alpha_{j}$ .

First, consider moving from ${\bf 0}_{\tau}$ to point ${\bf x}_{j}=(\tau;0,\ldots,0,\alpha_{j},0,\ldots,0)$ for $\alpha_{j}>0$ . In this process, we move the $(j-1)$ -st cut to the left by $\alpha_{j}\cdot\tau$ and the $j$ -th cut to the right by $\alpha_{j}\cdot(1-\tau)$ ; all this takes place within the interval $[j-2,j]$ , see Figure 13. Now consider moving from ${\bf x}_{j}$ to ${\bf x}$ . In this process, the $(j-1)$ -st cut moves to the right by $\alpha_{j-1}\cdot(1-\tau)$ and the $(j+1)$ -st cut moves to the left by $\alpha_{j+1}\cdot\tau$ . From the choice of $j$ to be $j\in\arg\max_{i\in\{2,\ldots,n\}}|\alpha_{i}|$ , it follows that $\alpha_{j-1}\leq\alpha_{j}$ and $\alpha_{j+1}\leq\alpha_{j}$ . Then, there is a sub-interval of $[j-2,j]$ that contains the unit-length interval $I=[j-2+\tau+\alpha_{j}\cdot(1-2\tau),j-1+\tau+\alpha_{j}\cdot(1-2\tau)]$ which ends up coloured entirely $A_{j}$ , implying Condition 2. Overall, we obtain that $j$ is a consistent colour.

Case 2: $j<0$ (i.e., $\alpha_{j}<0$ ). In this case, moving in direction $d^{\tau}_{j}$ causes cuts $|j|-1$ and $|j|$ to move towards each other; this is illustrated in Figure 14.

Case 2a: $\tau\in[1/2n,1-(1/2n)]$ . In this case, all movements of the cuts, in and around the Significant Region, are in distances upper-bounded by $\delta^{w}$ , which by Proposition 4.4 is smaller than $1/2n$ by an inverse-polynomial amount. This means that if we start at ${\bf 0}_{\tau}$ and re-set individual transformed coordinates to those of ${\bf x}$ , in any order (i.e. going through any intermediate point ${\bf x}_{j}$ , similarly to above), the movement of the cuts will never force them to cross integer-valued thresholds. In other words, in moving from ${\bf 0}_{\tau}$ to ${\bf x}$ , only the relevant cuts $j-1$ and $j$ will lie in the interval $[j-2,j]$ . This case can be seen in the illustration of Figure 13, if one reverses the direction of the arrows, switches the labels $A_{j}$ to $A_{-j}$ and vice-versa, and substitutes $j$ by $|j|$ in the labelling of cuts. The argument establishing the existence of a consistent colour is exactly symmetric to that of Case 1 above.

Case 2b: $\tau\in[0,1/2n]\cup[1-(1/2n),1]$ . Here, we consider the case where $\tau\in[0,1/2n]$ ; the other case is similar by symmetry. This case is illustrated in Figure 14; note that the sequence of labels $A_{j}/A_{-j}$ is switched to make $A_{j}$ the label that increases when we move in direction $d^{\tau}_{j}$ .

Moving in direction $d^{\tau}_{j}$ causes an increase of the label $A_{j}$ in the interval $[|j|-2,|j|]$ . For $j$ not to be a consistent colour, we should observe an excess of the label $A_{-j}$ in this interval. In generating cut locations from coordinates of ${\bf x}$ , the amount of $A_{-j}$ in $[|j|-2,|j|]$ can be raised in the following ways (see Figure 14):

By increasing the transformed coordinate $\alpha_{|j|-1}$ in the negative direction, moving in direction $d^{\tau}_{-(|j|-1)}$ . This causes cuts $|j|-2$ and $|j|-1$ to move towards each other and therefore importantly for us here, cut $|j|-1$ to move to the left (towards integer point $|j|-2$ ).

By increasing the transformed coordinate $\alpha_{j+1}$ in the negative direction, moving in direction $d^{\tau}_{-(|j|+1)}$ . This causes cuts $|j|$ and $|j|+1$ to move towards each other.

Note that what may happen in this last case, is that cut $|j|+1$ which used to lie to the right of the integer point $|j|+2$ before moving in direction $d^{\tau}_{-(|j|+1)}$ , now lies to the left of the integer point $|j|+2$ after the movement, therefore increasing the label $A_{-j}$ at the right-hand-side of $[|j|-2,|j|]$ . We consider two more cases, depending on whether or not this is the case.

Case 2b(i): At ${\bf x}$ , cut $|j|+1$ is to the right of location $|j|$ or at location $|j|$ .

There are two ways to restore the deficit of $A_{-j}$ that resulted from moving in direction $d^{\tau}_{j}$ from ${\bf 0}_{\tau}$ to ${\bf x}_{j}$ . Moving in direction $d^{\tau}_{-(|j|-1)}$ moves cut $|j|-1$ to the left, and moving in direction $d^{\tau}_{-(|j|+1)}$ moves cut $|j|$ to the right. (Note that the movement of cut $|j|+1$ to the left has not changed the balance of $A_{j}$ and $A_{-j}$ in the interval $[|j|-2,|j|]$ any further, by the assumption of the case). Since $j$ was chosen to be in $\arg\max_{i\in\{2,\ldots,n\}}|\alpha_{i}|$ , it is easy to verify that

Cut $|j|-1$ has moved to the left as a result of moving in direction $d^{\tau}_{-(|j|-1)}$ at most as much as cut $|j|$ has moved to the left as a result of moving in direction $d^{\tau}_{j}$ (from ${\bf 0}_{\tau}$ to ${\bf x}_{j}$ ).

Cut $|j|$ has moved to the right as a result of moving in direction $d^{\tau}_{-(|j|+1)}$ at most as much as cut $|j|-1$ has moved to the right as a result of moving in direction $d^{\tau}_{j}$ (from ${\bf 0}_{\tau}$ to ${\bf x}_{j}$ ).

Therefore a large enough subinterval of $[|j|-2,|j|]$ has been coloured with $A_{j}$ , which means $j$ is a consistent colour.

Case 2b(ii): At ${\bf x}$ , cut $|j|+1$ is to the left of location $|j|$ .

In this case, we have $\alpha_{|j|+1}<0$ and movement in direction $d^{\tau}_{-(|j|+1)}$ causes cuts $|j|$ and $|j|+1$ to move towards each other. Note also that besides the effect of the movement in direction $d^{\tau}_{-(|j|+1)}$ , cut $|j|+1$ may move to the left due to movement in direction $d^{\tau}_{|j|+2}$ , since such a movement would cause cuts $|j|+1$ and $|j|+2$ to move away from each other and therefore, cut $|j|+1$ to move to the left. However, the distance moved in direction $d^{\tau}_{|j|+2}$ is small; it is at most $\tau\cdot|\alpha_{j}|$ , which is at most $\tau\cdot\delta^{+}$ . Therefore, we need movement at least $\tau(1-\delta^{+})$ in direction $d^{\tau}_{-(|j|+1)}$ in order to cover the distance moved in direction $d^{\tau}_{j}$ .

First, we verify that $-(|j|+1)$ satisfies Condition 1 of Definition 24, i.e. that $\alpha_{|j|+1}<-2\delta^{T}$ (at this point we know that $\alpha_{j+1}$ is a negative quantity). We consider two cases, depending on whether $\tau$ is “small” or “large” (relatively to the small interval $[0,1/2n]$ ).

In the case when $\tau<\frac{1}{4}|\alpha_{j}|$ , the largest part of the deficit of $A_{-j}$ introduced by moving from ${\bf 0}_{\tau}$ to ${\bf x}_{j}$ results from moving cut $|j|$ to the left. However, letting $c(|j|-1)$ denote the position of cut $|j|-1$ after this movement, the interval $[|j|-2,c(|j|-1)]$ is too small for the movement of cut $|j|-1$ in direction $d^{\tau}_{-(|j|-1)}$ to compensate. In other words, even if movement in direction $d^{\tau}_{-(|j|-1)}$ moves cut $|j|-1$ to the left endpoint of the interval $[|j|-2,|j|]$ , this is not enough to make up for the deficit of $A_{-j}$ introduced from the movement in direction $d^{\tau}_{j}$ . This means that cut $|j|+1$ needs to move to the left as well and in particular, it needs to move by more than $\tau/4$ to the left of location $j$ . This is only possible if $\alpha_{|j|+1}<-2\delta^{T}$ .

In the case when $\tau\geq\frac{1}{4}|\alpha_{j}|$ , since $\tau$ is large enough, cut $|j|+1$ needs to move a substantial distance to the left, in order to end up positioned to the left of integer position $|j|$ . In particular, it needs to move at least $\frac{1}{4}|\alpha_{j}|-\tau\cdot\delta^{+}$ to the left. This implies that Condition 1 is satisfied for colour $-(|j|+1)$ .

Now consider what needs to happen in order for the second condition to fail. Consider the interval $[|j|-1,|j|+1]$ (which is monitored by the $(j+1)$ -st blanket-sensor). Since cut $|j|+1$ is located to the left of location $|j|$ (the midpoint of this interval), there exists a subinterval of length at most $1$ labelled $A_{j}$ , within $[|j|-1,|j|+1]$ . This means that either

the colour $-(|j|+1)$ is a consistent colour and we are done, or

there is an additional amount of label $A_{j}$ within interval $[|j|-1,|j|+1]$ and the total number of value-blocks labelled $A_{j}$ outnumbers that of those labelled $A_{-j}$ by at least $p^{\rm large}$ . The only way this can happen is if cut $|j|+2$ lies to the left of the integer location $|j|+1$ , and in fact, it has to lie an inverse-polynomial distance, at least $\frac{p^{\rm large}}{2p^{\rm huge}}$ , to the left of $|j|+1$ .

In case that happens, we move on to consider interval $[j,j+2]$ and we apply the same argument. Again, $\alpha_{j+2}$ is negative, and since cut $|j|+2$ is to the left of location $j+1$ by a margin $\frac{p^{\rm large}}{2p^{\rm huge}}<2\delta^{T}$ , $-(|j|+2)$ satisfies Condition 1 to be a consistent colour. It will also satisfy Condition 2, unless cut $|j|+3$ lies to the left of location $|j|+2$ by an inverse-polynomial distance, at least $\frac{p^{\rm large}}{2p^{\rm huge}}$ , similarly to before.

Continuing like this, we will either find a consistent colour in some interval $[j-2,j]$ with $j<n$ , or we will reach interval $[n-2,n]$ . When we reach interval $[n-2,n]$ , cut $n$ has had to move to the left of integer location $n-1$ in order to prevent $-(n-1)$ from being a consistent colour (as otherwise we would have identified a consistent colour in some already examined interval). But then $-n$ is a consistent colour, since we have moved an inverse-polynomial distance (at least $\frac{p^{\rm large}}{2p^{\rm huge}}<2\delta^{T}$ ) in direction $d^{\tau}_{-n}$ (Condition 1), and at least $1/2$ of the interval $[n-2,n]$ is coloured in a way that agrees with this (Condition 2).

A solution $S_{CH}$ to $I_{CH}$ cannot encode a point ${\bf x}$ in the Significant Region, within distance $\delta^{\rm tiny}$ of the boundary (where blanket-sensor agent(s) become active).

Observation 6.1 tells us that the $k$ -th component of $f^{\prime}$ is the difference between $A_{+}$ and $A_{-}$ observed by c-e agent $a_{k}$ , and Proposition 6.2 tells us that all these components, averaged over a set of points within $\delta^{\rm tiny}$ of ${\bf x}$ need to be close to zero, at a solution.

Proposition 6.5 tells us that ${\bf x}$ has some consistent colour $k$ . All points within $\delta^{\rm tiny}$ of ${\bf x}$ cause two outputs (gates $g^{\prime}_{k}$ , $g^{\prime}_{-k}$ as defined in Section 5.1.2) of the circuit-encoders to represent colour $k$ . This includes points where blanket-sensor agents are active, since by the properties of consistent colours, we are at least an inverse-polynomial distance from any point where any $b_{i,k}$ can be active in the wrong direction. In Section 5.1.3 the blanket-sensor agents are designed to agree with the definition of consistent colour, Definition 24. So c-e agent $a_{k}$ observes a large imbalance between $A_{+}$ and $A_{-}$ .

4 No bogus approximate-zeroes of F𝐹F due to the connecting facet

Let ${\bf x}$ , ${\bf x}^{\prime}$ be points in the Significant Region having transformed coordinates $(\tau;\alpha_{2},\ldots,\alpha_{n})$ and $(1-\tau;-\alpha_{2},\ldots,-\alpha_{n})$ respectively, for $\tau<\frac{1}{2}-\delta^{T}$ . Then $f({\bf x})=-f({\bf x}^{\prime})$ .

The proposition extends Observation 5.3. The points ${\bf x}$ and ${\bf x}^{\prime}$ have been coloured according to Item 2 of Section 5.3, and they belong to two long thin colour-regions that extend the cubelets that lie on the panchromatic facets of the cube embedded at the centre of $T$ , all the way to the ends of $T$ . From the boundary conditions on the colouring of box $B$ in New variant high-D Tucker, and the way $f$ is constructed above, their colours are equal and opposite.

Remark: The ${\bf x}$ , ${\bf x}^{\prime}$ in Proposition 6.7 will “approach each other” as $\tau\rightarrow 0$ . That is, they correspond to sequences of Consensus-halving cuts where the left-hand cut in the c-e region “wraps around” to the right-hand side of the c-e region. Proposition 6.7 may thus seem to create Borsuk-Ulam directions that are in conflict with each other as we cross from facet $D_{0}$ to $D_{1}$ , but in fact the flip of labels in Consensus-halving that occurs when we move from $D_{0}$ to $D_{1}$ will mean that they are in agreement with each other.

We consider the case where the set of $p^{C}$ points in $D$ represented by the solution $S_{CH}$ to $I_{CH}$ , contains points on opposite sides of the facets of $D$ that have been identified with each other. Proposition 6.7 tells us that colour-regions are adjacent to colour-regions having the opposite colour. We need to verify that for a pair ${\bf x},{\bf x}^{\prime}$ of points that are close together but have opposite colours (due to lying in such a pair of colour-regions) the same (and not opposite) feedback is provided to the c-e agents. (So, in contrast with a pair of opposite-colour points that represent a solution, whose feedback to the c-e agents cancel each other out.)

In reasoning about these elements ${\bf x},{\bf x}^{\prime}\in D$ , it is helpful to depart from our convention that the label-sequence begins with $A_{+}$ , and suppose that for ${\bf x}^{\prime}$ , the label-sequence begins with $A_{-}$ . Suppose ${\bf x},{\bf x}^{\prime}$ have corresponding circuit-encoders $C_{i},C_{i^{\prime}}$ and assume that $C_{i}$ and $C_{i^{\prime}}$ receive reliable inputs, recalling that only $n$ circuit-encoders may fail to receive reliable inputs. Notice that if ${\bf x}$ causes a blanket-sensor agent $b_{i,j}$ to be active in direction $A_{+}$ , then ${\bf x}^{\prime}$ typically causes $b_{i^{\prime},j}$ to be active in direction $A_{+}$ also (the over-represented label is fed back to c-e agent $a_{j}$ ).

In the case that no blanket-sensor agents are active, if ${\bf x},{\bf x}^{\prime}$ receive opposite colours from $C_{i},C_{i^{\prime}}$ , then, reverting to our convention that the shared label-sequence begins with $A_{+}$ , we note that their reference-sensor agents get opposite labels, which causes $C_{i}$ and $C_{i^{\prime}}$ to agree with each other.

Remark. For intuition, it is possibly helpful to think about the move from ${\bf x}$ to ${\bf x}^{\prime}$ in terms of operations on the coordinate-encoding cuts. At ${\bf x}$ , there is a cut on the right-hand side of the c-e region, and in moving to ${\bf x}^{\prime}$ we move that cut to the left-hand side. If we move the cut while leaving the labels of the c-e region unchanged (apart from at the ends) we expect the circuit to behave as before, but since we have switched the roles of labels $A_{+}$ and $A_{-}$ , the feedback to agents $a_{1},\ldots,a_{n}$ gets inverted. We re-invert this feedback by reversing the colour, and hence the output of $f^{\prime}$ . This is very similar (in fact, an $n$ -dimensional analogue) to the handling of the “wrap-around points” in the sequence via the interpretation in terms of the virtual cuts in .

Further work

What is the computational complexity of $k$ -thief Necklace-splitting, for $k$ not a power of 2? As discussed in , the proof that it is a total search problem, does not seem to boil down to the PPA principle. Right now, we do not even even know if it belongs to PTFNP .

Interestingly, Papadimitriou in (implicitly) also defined a number of computational complexity classes related to PPA, namely PPA- $p$ , for a parameter $p\geq 2$ . PPA- $p$ is defined with respect to an input bipartite graph and a given vertex with degree which is not a multiple of $p$ , and the goal is to find another vertex with degree which is not a multiple of $p$ (it follows that PPA=PPA- $2$ ). This was done in the context of classifying the computational problem related to Chévalley’s Theorem from number theory, and it was proven that for prime $p$ , Chevally mod $p$ is in PPA- $p$ . Given the discussion above, it could possibly be the case that the the principle associated with Necklace-splitting for $k$ -thieves is the PPA- $k$ principle instead.

What about the computational hardness of the problem? Is 3-thief Necklace-splitting hard for PPA? At first glance, it seems like a more complicated problem, but there this is not obvious; for example, there is no way to cause the third thief to be a dummy agent and therefore a straightforward reduction is unlikely. However, it is worth mentioning here that the computational equivalence between $\varepsilon$ -Consensus-halving and Necklace-splitting that was proven in is actually established between the Necklace Splitting problem for any $k$ and the corresponding approximate $1/k$ -Division problem, a generalization of $\varepsilon$ -Consensus-halving (see ); a PPA-hardness or PPA-membership result for $k>2$ for the latter problem would imply a corresponding result for Necklace-splitting with $k>2$ . En route to either of these results, a possible approach that was suggested in , would be to define an appropriate generalisation of Tucker’s Lemma and prove it constructively (see Section 8 in ).

We have left open the questions of whether $\varepsilon$ -Consensus-halving remains PPA-complete for constant $\varepsilon$ , and whether Discrete Ham Sandwich remains PPA-complete when coordinates of points are given in unary. Recall that for the former problem, a PPAD-hardness result is known from ; it would be quite interesting to settle this, to verify whether it is possible for the precision parameter to play such an important role in the problem classification.

In classifying a problem as polynomial-time solvable versus NP-complete, this is usually seen as a statement about its computational (in)tractability. The distinction between PPAD-completeness and PPA-completeness is one of expressive power: we believe that PPAD-complete problems are hard, meanwhile PPA-complete problems are “at least as hard”, but of course are still in NP. The expressive power of totality principles that underpin TFNP problems is a topic of enduring interest ; note also the related work on Bounded Arithmetic discussed in . Our results highlight the distinction between computational (in)tractability and expressive power. In analysing the relationships between these complexity classes, it may be fruitful to focus on expressive power.

Finally, initiates an interesting experimental study of path-following algorithms for 2-thief Necklace-splitting, obtaining positive results when the number of bead colours is not too large. However, path-following seems to be inapplicable for, say, 3 thieves. The Necklace-splitting problem may constitute an interesting class of challenge-instances for SAT-solvers, now that it is known to be a very hard total search problem.

We thank Alex Hollender for detailed and insightful proof-reading of earlier versions of this paper.

References

APPENDIX

Appendix A Details from [29].

In this section, we include several details from that are being used (or extended) in the present paper as well.

First, the ability to detect the position of the cuts in the c-e region and feed this information to the circuit lies in the presence of gadgets developed in and used in , referred to as “bit detection gadgets” in . A bit detection gadget consists typically of two thin and dense valuation blocks of relatively large height and relatively small length, situated next to each other (e.g., see the rightmost set of value-blocks in Figure 16 or Figure 20, top). These value-blocks constitute most of the agent’s valuation over the related interval. The point of these gadgets is that if the discrepancy between $A_{+}$ and $A_{-}$ is (significantly) in the favour of one against the other, there will be a cut intersecting one of the two valuation blocks; which block is intersected will correspond to a $0/1$ value, i.e. a bit that indicates the “direction” of the discrepancy in the two labels.

These gadgets are used in several parts of the reduction, e.g.,

at the value-blocks of the sensor agents (Definition 13) in the region $R_{i}$ (see Figure 16) that assumes the right or left position depending on whether their small value-blocks in the c-e region are labelled $A_{+}$ or $A_{-}$ ,

in the encoding of the circuit $C_{VT}$ of New variant high-D Tucker using the circuit-encoding agents $C_{i}$ (also see Section 5.1),

at the value-blocks of the blanket-sensor agents (Definition 14) in the region $R_{i}$ , with the difference that in that case, a small value-block (of value $9\kappa/10$ ) lies between the two thin and dense valuation blocks of the bit detection gadget (see Figure 17).

Sensor and blanket sensor agents

The formal definitions of the sensor agents and the blanket sensor agents were given in the main text, see Definition 13 and Definition 14 respectively. Here, we explain in more detail how these agents make use of the bit-detection gadgets (which lie in the region $R_{i}$ ) to detect the positions of the cuts (for the sensor agents) or to detect large discrepancies on the volumes of the two labels in the c-e region (for the blanket sensor agents).

Starting from the sensor agents, recall that each such agent of $S_{i}\subset C_{i}$ has a small value-block (of value $1/10$ ) in the c-e region and its remaining value ( $9/10$ ) lies in the circuit encoding region and particularly, in the sub-region $R_{i}$ (recall that the circuit-encoding region $R$ is partitioned into sub-regions $R_{i}$ , one for each circuit encoder $C_{i}$ , where most of the gadgetry of the encoder lies). In particular, the sensor agent has two thin blocks of value $9/20$ in the c-e region and this is precisely the bit detection gadget of the agent, as described in the previous subsection - see Figure 20, top, for an illustration. If the value-block on the left-hand side (in the c-e region) is labelled $A_{-}$ , then the cut on the right-hand side intersects the rightmost value-block (i.e., “jumps” to the right) and if it is $A_{+}$ , then it “jumps” to the left. This information is then passed on to the next level of circuit encoding agents, those that implement the pre-processing unit of the circuit (see Section 5.1.1) and the subsection following this one. In , we referred to this information as “raw data” (although the cut extraction mechanism there had to be more elaborate, due to the inversely-exponential precision). The pre-processing unit is responsible for converting the raw data into appropriate inputs for the circuit $C_{VT}$ , which encode coordinate of points on the Möbius-simplex. These inputs are then “propagated” through the encoding of the circuit $C_{VT}$ , to produce the appropriate labels at the output gates $g_{j}$ , $j\in\pm[n]$ , as described in th following subsection.

The blanket sensor agents use very similar bit detection gadgets in their outputs (i.e., in their value-blocks in region $R_{i}$ ), but between their thin and dense value-blocks, they have an additional small value-blocks (the block of value $9\kappa/10$ in Definition 14). This is because the blanket sensor agents needs to be able to assume three states: “excess of $A_{+}$ ”, “excess of $A_{-}$ ” and “(approximately) balanced labels”. The latter option corresponds to the cut associated with the blanket sensor agent intersecting the middle value-block (therefore not “jumping” to either side), whereas the other two options correspond to the cut “jumping” to either the right or the left side, where the choice depends on the over-represented label and the parity of the index of the blanket sensor agent. It is straightforward (as before) to interpret these positions as boolean values. The main idea in is that if the blanket sensor agents are active, then this information will “over-ride” the circuit $C_{VT}$ and generate an imbalance of labels in the feedback provided to the c-e agents directly, essentially bypassing the output gates of $C_{VT}$ . We use the same principle here, but since we now have many blanket sensor agents, extra care must be taken to make sure that no artificial solutions are introduced when colouring the domain. The details on how the input from the blanket-sensors affects the feedback to the c-e agents are presented in Section 5.1.3.

Before we explain how the circuit of New variant high-D Tucker is encoded using the circuit encoders $C_{i}$ , we present the main building blocks for simulating boolean circuits based on bit-detection gadget, first presented in and later adapted and used in .

Consider a boolean gate that is an AND, an OR, or a NOT gate, denoted $g_{\land}$ , $g_{\lor}$ and $g_{\neg}$ respectively. Let $in_{1}$ , $in_{2}$ and $out$ be intervals such that $|in_{1}|=|in_{2}|=|out|=1$ . We will encode these gates using gate gadgets, shown in Figure 18 (from ).

Note that the gadget corresponding to the NOT gate only has one input, whereas the gadgets for the AND and OR gates have two inputs. In the interval $out$ , each gadget has two bit detection gadgets - in the case of the NOT gate these are even, but in the case of the AND and OR gates, they are uneven. Also note that for the inputs, as well as the output of the NOT gate, the label on the left-hand side of the cut is $A_{+}$ and the label on the right-hand side will be $A_{-}$ , whereas for the outputs to the OR and AND gate, the label on the left-hand side of the cut is $A_{-}$ and the label on the right hand side is $A_{+}$ . This can be achieved with the appropriate use of parity gadgets, i.e., simple valuation blocks that force cuts to lie in specific positions, only to change the parity of the cut sequence (see for more details).

As we mentioned earlier, the bit-detection gagdets allow us to extract boolean values corresponding to the positions of the cuts in the c-e region - this is achieved via the use of the sensor agents. In the next step, these values (referred earlier as the “raw data”) are supplied into a gadget that is referred to as the pre-processing circuit in . The role of this circuit is to convert this information to coordinates of points on the Möbius-simplex, which then will go through a coordinate transformation (see Section 5.2) and will be used as inputs to the encoding of $C_{VT}$ . The pre-processing circuit can be implemented using the boolean gate gadgets described above. The same is true for the actual circuit $C_{VT}$ as well, which can be simulated using the same set of gadgets, using the principle described in Figure 19 (from ). The outputs of the pre-preprocessing circuit are inputed to the input gates of $C_{VT}$ and their outputs, in turn, are inputted to the gates on the next level and this process carries on until the output gates of $C_{VT}$ .

Unreliable circuits and the stray cut

In the main text, we mentioned that we are guaranteed that at most $n$ cuts will lie in the c-e region (Observation 4.1) and at least $n-1$ cuts will lie in the c-e region, as otherwise some blanket sensor agent would be active (Proposition 4.4). If $n-1$ cuts lie in the c-e region, this means that one of the $n$ c-e cuts has moved into the circuit encoding region; following , we will refer to that cut as a stray cut. As we highlighted in , the presence of a stray cut may have the following two consequences.

It intersects the circuit-encoding region $R_{i}$ of some circuit encoder $C_{i}$ , for $i\in\{1,\ldots,p^{C}\}$ .

It flips the parity of the circuit encoders $C_{i}$ , with $R_{i}<c$ , where $c$ is the position of the stray cut in $R_{i-1}$ . In other words, if the first cut in $R_{i}$ was expecting to see $A_{+}$ on its left-hand side, it now sees $A_{-}$ and vice-versa.

The first effect is not much of a problem, we will simply assume that the corresponding circuit is unreliable. As explained in the main text, an unreliable circuit can have an arbitrary (or even adversarial) effect on the volume of $A_{+}$ and $A_{-}$ supplied as feedback to the c-e agents, but since there will be at most $1$ unreliable circuit, its effect will be “outvoted” by the remaining reliable circuits (circuits that do not receive reliable inputs will also be outvoted by those that do, since there are at most $n$ of them, see Observation 4.2).

The parity flip that the stray cut induces on other circuit-encoders however seems potentially more worrisome., since it could flip the output of the sensor agents that detect the positions of the cuts. This is taken care of by the presence of the reference sensor agent (Definition 21) which achieves the disorientation of the domain. As explained in Section 5.1.1, the outputs of circuit $C_{i}$ are taken with reference to the value $s_{1,1}$ of the reference senso agent, in the sense that after simulating $C_{VT}$ we take the exclusive-or with $s_{1,1}$ ; this can be implemented in the circuit using the boolean gate gadgets explained above, in a manner similar to (via the use of the XOR sub-circuit, see Section 4.4.2 in ). This allows us to use a similar “double-negative lemma” as the one we used in (see Lemma 5.4).

Appendix B Membership in PPA

We show that Discrete Ham Sandwich belongs to PPA, which appears to be folklore that has not, to out knowledge, been written down. (In fact, in the problem was claimed to belong to PPAD, which is now seen to be incorrect subject to $\mbox{{PPAD}}\not=\mbox{{PPA}}$ , by the result in ). Since Theorem 2.3 reduces from 2-thief Necklace-splitting to Discrete Ham Sandwich, it follows immediately that 2-thief Necklace-splitting belongs to PPA. In Section B.2 we go a bit further for Necklace-splitting: we show that Necklace-splitting belongs to PPA whenever the number of thieves is a power of 2.

We use Freund and Todd’s construction of an undirected graph with known degree-1 vertex, based on a “special triangulation” of a high-dimensional $L_{1}$ ball (i.e. a high-dimensional octahedron, also known as the crosspolytope ). Given an instance $I$ of Discrete Ham Sandwich, we show how to construct a suitable triangulation. $I$ contains $n$ sets of points $\{S_{1},\ldots,S_{n}\}$ in $n$ -dimensional space; we assume coordinates are represented as fractions whose numerators and denominators are give via standard binary expansions. (Recall that we leave it as an open problem whether Discrete Ham Sandwich remains PPA-hard for points presented in unary. Of course, the “in PPA” result follows immediately for that restricted version.)

Based on $I$ we identify an exponentially-large collection of “candidate hyperplanes” as follows, which contains a solution to $I$ . A candidate hyperplane $H$ is represented by a gradient vector $g_{H}$ whose coordinates are assumed to be normalised to 1 ( $L_{1}$ norm; their absolute values sum to 1). Given $g_{H}$ , $H$ is obtained by using binary search to efficiently find a hyperplane with gradient $g_{H}$ that bisects the union of the sets $S_{i}$ ; it is then easy to check whether $H$ is a solution. Note that there exists an integer $N$ whose binary expansion has length polynomial in $|I|$ such that entries of some solution $g_{H}$ can be assumed to be multiples of $1/N$ .

Fix a point $p\in\mbox{{IR}}^{n}$ such that $p$ is not contained in any candidate solution to $I$ , e.g. $p=(1/2N,1,\ldots,1)$ . Given any $H$ , the “positive” side of $H$ is the side that contains $p$ . Assume we chose $N$ large enough so that if two hyperplanes $H$ and $H^{\prime}$ have gradients $g_{H}$ and $g_{H^{\prime}}$ whose coordinates differ by at most $1/N$ , then for any point $x$ in $I$ where they disagree, $x$ lies in $H$ or $H^{\prime}$ (and does not lie strictly on the positive side of one and the negative side of the other).

For any $H$ , label it as follows. Find the set $S_{i}$ that is most unevenly split by $H$ breaking ties lexicographically. Label $H$ with $i$ if most of $S_{i}$ lies on the positive side, otherwise label $H$ with $-i$ . Each hyperplane $H$ has a $g_{H}$ that is a point on the $L_{1}$ -norm sphere $S^{n-1}$ . By construction, antipodal points receive opposite labels. We can use these as the vertices of a special triangulation of the octahedral ball $B^{n}$ , in which the origin is used as an additional point and is connected to all the points on $S^{n-1}$ that correspond to candidate solutions. The graph defined in is a degree 2 undirected graph with a single known degree-1 vertex (the origin), and for which any other degree-1 vertex represents a pair of hyperplanes that bisect all the $S_{i}$ .

B.2 Necklace-splitting is in PPA, if the number of thieves is a power of 2

The version of the problem with two thieves, 2-thief Necklace-splitting, belongs to PPA since we reduced it to Discrete Ham Sandwich which is shown in Section B.1 to belong to PPA. We can extend the membership to PPA for $k$ -Necklace-splitting, when $k$ is a power of $2$ , by using the argument of Proposition 3.2 of Alon (here, we take both $k$ and $l$ to be $2$ ). In particular, we will reduce Necklace-splitting to $4$ -Necklace-splitting.

We start from an instance of $4$ -Necklace-splitting (with $4$ thieves) and we regard it as an instance of Necklace-splitting (with $2$ thieves), which we solve using an algorithm for the latter problem. The solution is a sequence of intervals defined by the endpoints of the necklace and $n$ cuts, each belonging to one of the two collections (corresponding to the two thieves), such that each collection contains exactly half of the beads of each colour. Then, we set the beads that lie in intervals belonging to each collection aside and form two new instances of Necklace-splitting (essentially by “gluing” the different sub-intervals of the same collection together); note that each new instance will have an even number of beads of each colour, since we initial number of beads from each colour was a multiple of $4$ . Then we run the algorithm again on the resulting instances of Necklace-splitting to obtain a partition into $4$ collections ( $2$ for each individual instance), which consitutes a partition of the $4$ -Necklace-splitting into $4$ collections according to the definition of the problem. If $n$ is the number of colours, the total number of cuts is (at most) $3n$ , and therefore this partition is a solution to $4$ -Necklace-splitting.

The above is a Turing reduction, which can be extended straightforwardly to the case of $k$ a power of 2. We can convert such a reduction into a many-one reduction by applying Theorem 6.1 of Buss and Johnson , which shows that PPA and some related complexity classes are closed under Turing reductions.