The Complexity of Splitting Necklaces and Bisecting Ham Sandwiches

Aris Filos-Ratsikas, Paul W. Goldberg

Computational complexity; Tucker’s Lemma; TFNP; fair division

Introduction

The complexity classes PPA and PPAD were introduced in a seminal paper of Papadimitriou in 1994, in an attempt to classify several natural problems in the class TFNP . TFNP is the class of total search problems in NP for which a solution exists for every instance, and solutions can be efficiently verified. Various important problems were subsequently proven to be complete for the class PPAD, such as the complexity of many versions of Nash equilibrium , market equilibrium computation , and others . As evidence of computational hardness, PPA-completeness is stronger than PPAD-completeness, i.e., \mboxPPAD\mboxPPA\mbox{{PPAD}}\subseteq\mbox{{PPA}}. Indeed, Jeřábek shows that it indicates cryptographic hardness in a strong sense: gives a randomised reduction from FACTORING to PPA-complete problems. This is not known for PPAD-complete problems. For more details, and the significance of PPA-completeness, we refer the reader to the related discussion in . PPA is the class of problems reducible to Leaf (Definition 1), and a PPA-complete problem is polynomial-time equivalent to Leaf.

An instance of the problem Leaf consists of an undirected graph GG whose vertices have degree at most 2; GG has 2n2^{n} vertices represented by bitstrings of length nn; GG is presented concisely via a circuit that takes as input a vertex and outputs its neighbour(s). We stipulate that vertex 0n0^{n} has degree 1. The challenge is to find some other vertex having degree 1.

Complete problems for the class PPA seemed to be much more elusive than PPAD-complete ones, especially when one is interested in “natural” problems, where “natural” here has the very specific meaning of problems that do not explicitly contain a circuit in their definition. Besides Papadimitriou , other papers asking about the possible existence of natural PPA-complete problems include . In a recent precursor to the present paper we identified the first example of such a problem, namely the approximate Consensus-halving problem, dispelling the suspicion that such problems might not exist. In this paper we build on that result and settle the complexity of two natural and important problems whose complexity status were raised explicitly as open problems in Papadimitriou’s paper itself, and in many other papers beginning in the 1980s. Specifically, we prove that Necklace-splitting (with two thieves, see Definition 2) and Discrete Ham Sandwich are both PPA-complete.

In the kk-Necklace-splitting problem there is an open necklace with kaika_{i} beads of colour ii, for 1in1\leq i\leq n. An “open necklace” means that the beads form a string, not a cycle. The task is to cut the necklace in (k1)n(k-1)\cdot n places and partition the resulting substrings into kk collections, each containing precisely aia_{i} beads of colour ii, 1in1\leq i\leq n.

In Definition 2, kk is thought of as the number of thieves who desire to split the necklace in such a way that the beads of each colour are equally shared. In this paper, usually we have k=2k=2 and we refer to this special case as Necklace-splitting.

In the Discrete Ham Sandwich problem, there are nn sets of points in nn dimensions having integer coordinates (equivalently one could use rationals). A solution consists of a hyperplane that splits each set of points into subsets of equal size (if any points lie on the plane, we are allowed to place them on either side, or even split them arbitrarily).

In Definition 3, each point set represents an ingredient of the sandwich, which is to be cut by a hyperplane in such a way that all ingredients are equally split.

The necklace-splitting problem was introduced in a 1982 paper of Bhatt and Leiserson (, Section 5), where it arose in the context of VLSI circuit design (the version defined in is the 2-thief case proved PPA-complete in the present paper). In 1985 and 1986, the 2-thief case was shown to have guaranteed solutions (as defined in Definition 2) by Goldberg and West and Alon and West and then in 1987, Alon proved existence of solutions for kk thieves as well. Early papers that explicitly raise its complexity-theoretic status as an open problem are Goldberg and West and Alon . Subsequently, the necklace-splitting problem was found to be closely related to “paint-shop scheduling”, a line of work in which several papers such as explicitly mention the question of the computational complexity of necklace-splitting. Meunier notes that the search for a minimum number of cuts admitting a fair division (which may be smaller than the number (k1)n(k-1)n that is guaranteed to suffice) is NP-hard, even for a subclass of instances of the 2-thief case. (That is a result of Bonsma et al. , for the “paint shop problem with words”, equivalent to 2-thief Necklace-splitting with 2 beads of each colour.)

In , we showed Necklace-splitting to be computationally equivalent to ε\varepsilon-Consensus-halving for inverse-polynomial precision parameter ε\varepsilon, but the PPA-completeness of ε\varepsilon-Consensus-halving was only shown for inverse-exponential ε\varepsilon. established PPAD-hardness of Necklace-splitting, applying the main result of . In this paper, we prove that ε\varepsilon-Consensus-halving is PPA-complete for ε\varepsilon inversely polynomial, thus obtaining the desired PPA-completeness of Necklace-splitting. While some structural parts of our reduction are extensions of those presented in , obtaining the result for inverse-polynomial precision is much more challenging, as the construction needs to move to a high-dimensional space (rather than the two-dimensional space which is sufficient for the result in ). We highlight the main new techniques that we have developed in this paper in Section 2.1, where we provide an overview of the reduction. Our PPA-completeness result gives a convincing negative answer to Meunier and Neveu’s questions about possible polynomial-time solvability or membership of PPAD for Necklace-splitting; likewise it runs counter to Alon’s cautious optimism at ICM 1990 (, Section 4) that the problem may be solvable in polynomial time.

The Ham Sandwich Theorem is of enduring and widespread interest due to its colourful and intuitive statement, and its relevance and applications in topology, social choice theory, and computational geometry. Roughly, it states that given dd measures in Euclidean dd-space, there exists a hyperplane that cuts them all simultaneously in half. Early work on variants and applications of the theorem focused on non-constructive existence proofs and mostly did not touch on the algorithmics. A 1983 paper by Hill hints at possible interest in the corresponding computational challenge, in the context of a related land division problem. The computational problem (and its complexity) was first properly studied in a line of work in computational geometry beginning in the 1980s, for example . The problem envisages input data consisting of dd sets of nn points in Euclidean dd-space, and asks for a hyperplane that splits all point sets in half. (The problem Discrete Ham Sandwich (Definition 3) as named in is essentially this, with dd set equal to nn to emphasise that we care about the high-dimensional case.) In this work in computational geometry, the emphasis has been on efficient algorithms for small values of dd; Lo et al. improve the dependence on dd but it is still exponential, and the present paper shows for the first time that we should not expect to improve on that exponential dependence. More recently, Grandoni et al. apply the “Generalized Ham Sandwich Theorem” to a problem in multi-objective optimisation and note that a constructive proof would allow a more efficient algorithm to emerge. The only computational hardness result we know of is Knauer et al. who obtain a WW-hardness result for a constrained version of the problem; points out the importance of the computational complexity of the general problem. The PPA-completeness result of the present paper is the first hardness result of any kind for Discrete Ham Sandwich, and as we noted, is a strong notion of computational hardness. Karpic and Saha showing a form of equivalence between the Ham Sandwich Theorem and Borsuk-Ulam, explicitly mention the possible PPA-completeness of Discrete Ham Sandwich as an “interesting and challenging open problem”.

We prove the PPA-completeness of Discrete Ham Sandwich via a simple reduction from Necklace-splitting. Ours is not the first paper to develop the close relationship between the two problems: Blagojević and Soberón shows a generalisation, where multiple agents may share a “sandwich”, dividing it into convex pieces. Further papers to explicitly point out their computational complexity as open problems include Deng et al. (mentioning that both problems “show promise to be complete for PPA”), Aisenberg et al. , and Belovs et al. .

Further Related Work: The class TFNP was defined in and several of its subclasses were studied over the years, such as PPA, PPAD and PPP , PLS and CLS ; here we focus on the most recent results. As we mentioned earlier, in we identified the first natural complete problem for PPA, the approximate Consensus-halving problem. In a recent paper, Sotiraki et al. identified the first natural problem for the class PPP, the class of problems whose totality is established by an argument based on the pigeonhole principle. For the class CLS, both Daskalakis et al. and Fearnley et al. identified complete problems (two versions of the Contraction Map problem, where a metric or a meta-metric are given as part of the input). In the latter paper, the authors define a new class, namely EOPL (for “End of Potential Line”), and show that it is a subclass of CLS. Furthermore, they show that two well-known problems in CLS, the P-Matrix Linear Complementarity Problem (P-LCP), and finding a fixpoint of a piecewise-linear contraction map (PL-Contraction) belong to the class. The End of Potential Line problem of is closely related to the End of Metered Line of .

Problems and Results

We present and discuss our main results, and in Section 2.1 we give an overview of the proof and new techniques, in particular with respect to the precursors to this paper.

An instance ICHI_{CH} incorporates, for 1in1\leq i\leq n, a non-negative measure μi\mu_{i} of a finite line interval A=[0,x]A=[0,x], where each μi\mu_{i} integrates to 1 and x>0x>0 is part of the input. We assume that μi\mu_{i} are step functions represented in a standard way, in terms of the endpoints of intervals where μi\mu_{i} is constant, and the value taken in each such interval. We use the bit model (logarithmic cost model) of numbers. ICHI_{CH} specifies a value ε0\varepsilon\geq 0 also using the bit model. We regard μi\mu_{i} as the value function held by agent ii for subintervals of AA.

A solution consists firstly of a set of nn cut points in AA (also given in the bit model of numbers). These points partition AA into (at most) n+1n+1 subintervals, and the second element of a solution is that each subinterval is labelled A+A_{+} or AA_{-}. This labelling is a correct solution provided that for each ii, μi(A+)μi(A)ε|\mu_{i}(A_{+})-\mu_{i}(A_{-})|\leq\varepsilon, i.e. each agent has a value in the range [12ε2,12+ε2][\frac{1}{2}-\frac{\varepsilon}{2},\frac{1}{2}+\frac{\varepsilon}{2}] for the subintervals labelled A+A_{+} (hence also values the subintervals labelled AA_{-} in that range).

We assume without loss of generality that in a valid solution, labels A+A_{+} and AA_{-} alternate. We also assume that the alternating label sequence begins with label A+A_{+} on the left-hand side of AA (i.e. A+A_{+} denotes the leftmost label in a Consensus-halving solution).

The Consensus-halving problem of Definition 4 is a computational version of the Hobby-Rice theorem . Most of the present paper is devoted to proving the following theorem.

ε\varepsilon-Consensus-halving is PPA-complete for some inverse-polynomial ε\varepsilon.

As mentioned in the introduction, in it was proven that 2-thief Necklace-splitting and ε\varepsilon-Consensus-halving for ε\varepsilon inversely-polynomial are computationally equivalent, i.e. they reduce to each other in polynomial time. Therefore, by and the “in PPA” result proven in Section B.1, we immediately get the following corollary.

Necklace-splitting is PPA-complete when there are k=2k=2 thieves.

If we knew that kk-Necklace-splitting belonged to PPA for other values of kk, we could of course make the blanket statement “Necklace-splitting is PPA-complete”. Alas, the proofs that Necklace-splitting is a total search problem for k>2k>2 do not seem to boil down to the parity argument on an undirected graph! That being said, we do manage to establish membership of PPA for kk being a power of 2 (essentially an insight of ), see Section B of the Appendix for the details and a related discussion. Of course, the k=2k=2 result strongly suggests that kk-Necklace-splitting is a hard problem for other values of kk.

As it happens, the PPA-completeness of Discrete Ham Sandwich follows straightforwardly, and we present that next. The basic idea of Theorem 2.3 of embedding the necklace in the moment curve appears already in and , p.48.

Inclusion in PPA is shown in Section B.1 of the Appendix. For PPA-hardness, we reduce from 2-thief Necklace-splitting which is PPA-complete by Theorem 2.2.

The idea is to embed the necklace into the moment curve γ={(α,α2,,αn):α}\gamma=\{(\alpha,\alpha^{2},\ldots,\alpha^{n}):\alpha\in\}. Assume all beads lie in the unit interval $.Abeadhavingcolour. A bead having colouri\in[n]locatedatlocated at\alpha\inbecomesapointmassofingredientbecomes a point mass of ingredientiofthehamsandwichlocatedatof the ham sandwich located at(\alpha,\alpha^{2},\ldots,\alpha^{n})\in\mbox{{IR}}^{n}.Itisknownthatanyhyperplaneintersectsthemomentcurve. It is known that any hyperplane intersects the moment curve\gammainatmostin at mostnpoints,(e.g.see,Lemma5.4.2),thereforeasolutiontoDiscreteHamSandwichcorrespondsdirectlytoasolutiontoNecklacesplitting,wherethetwothievessplittingthenecklacetakealternatingpieces.(Inthepoints, (e.g. see , Lemma 5.4.2), therefore a solution to Discrete Ham Sandwich corresponds directly to a solution to Necklace-splitting, where the two thieves splitting the necklace take alternating pieces. (In thek=2$ case, we may assume without loss of generality that they do in fact take alternating pieces).

A limitation to Theorem 2.3 is that the coordinates may be exponentially large numbers; they could not be written in unary. We leave it as an open problem whether a unary-coordinate version is also PPA-complete. As defined in , Discrete Ham Sandwich stipulated that each of the nn sets of points is of size 2n2n, whereas Definition 3 allows polynomial-sized sets. We can straightforwardly extend PPA-completeness to the version of by adding “dummy dimensions” whose purpose is to allow larger sets of each ingredient; the new ingredients that are introduced, consist of compact clusters of point masses, each cluster in general position relative to the other clusters and the subspace of dimension nn that contains the points of interest.

We use the standard notation [n][n] to denote the set {1,,n}\{1,\ldots,n\}, and we also use ±[n]\pm[n] to denote {1,1,2,2,,n,n}\{1,-1,2,-2,\ldots,n,-n\}. We often refer to elements of ±[n]\pm[n] as “labels” or “colours”. λ\lambda is usually used to denote a labelling function (so its codomain is ±[n]\pm[n]).

We let AA denote the domain of an instance of Consensus-halving; if that instance has complexity nn then AA will be the interval [0,poly(n)][0,poly(n)], where poly(n)poly(n) is some number bounded by a polynomial in nn. Recall by Definition 4 that μa\mu_{a} denotes the value function, or measure, of agent aa on the domain AA, in a Consensus-halving instance. We also associate each agent with its own cut (recall that the number of agents and cuts is supposed to be equal) and we let c(a)c(a) denote the cut associated with agent aa.

We let pC(n)p^{C}(n) be a polynomial that represents the number of “circuit-encoders” that we use in our reduction (see Section 5.1); we usually denote it pCp^{C}, dropping the nn from the notation.

Finally, BB denotes the nn-cube (or “box”) n^{n}.

In an instance of Consensus-halving, a value-block of an agent aa denotes a sub-interval of the domain where aa possesses positive value, uniformly distributed on that interval. In our construction, value-blocks tend to be scattered rather sparsely along the domain.

1 Overview of the proof

We review the ground covered by the precursors to this paper, then we give an overview of the technical innovations of the present paper.

established PPAD-hardness of Consensus-halving. An arithmetic circuit can be encoded by an instance to ε\varepsilon-Consensus-halving, by letting each gate have a cut whose location (in a solution) represents the (approximate) value taken at that gate. Agents’ valuation functions ensure that values taken at the gates behave according to the type of gate. A “PPAD circuit” can then be represented using an instance of Consensus-halving.

noted that the search space of solutions to instances as constructed by , is oriented. A radical new idea was needed to encode the non-oriented feature of topological spaces representable by PPA. That was achieved by using two cuts to represent the coordinates of a point on a triangular region faving two sides identified to form a Möbius strip. (These cuts are the only ones that lie in a specific subinterval of the interval AA of a Consensus-halving instance, called the “coordinate-encoding (c-e) region”. The two cuts are called the “coordinate-encoding cuts”.) Identifying two sides in this way is done by exploiting the equivalence of taking a cut on the LHS of the c-e region, and moving it to the RHS. In order to embed a hard search problem into the surface of a standard 2-dimensional Möbius strip, it was necessary to work at exponentially-fine resolution, which immediately required inverse-exponential ε\varepsilon for instances of ε\varepsilon-Consensus-halving. In we reduced from the PPA-complete problem 2D-Tucker (Definition 5 below), a search problem on an exponential-sized 2-dimensional grid.

In , the rest of AA is called the “circuit-encoding region” RR, and the cuts occurring within RR do the job of performing computation on the location of cuts in the c-e region. The present paper retains this high-level structure (Section 4.1). As in we use multiple copies of the circuit that performs the computation, each in its own subregion of RR. Here we use pC(n)p^{C}(n) copies where pCp^{C} is a polynomial; in we used 100 copies. Each copy is called a circuit-encoder. The purpose of multiple copies is to make the system robust; a small fraction of copies may be unreliable: as in we have to account for the possibility that one of the c-e cuts may occur in the circuit-encoding region, rendering one of the copies unreliable. We re-use the “double negative lemma” of that such a cut is not too harmful. We also adapt a result of that when a cut is moved from the one end to the other end of the c-e region, this corresponds to identifying two facets of a simplex to form a Möbius strip.

uses a sequence of “sensor agents” to identify the endpoints of intervals labelled A+A_{+} and AA_{-} in the coordinate-encoding region, and feed this information into the above mentioned circuit-encoders, which perform computation on those values. As in we use sensor agents. We obtain a simplification with respect to which is that we do not need the gadgets used there to perform “bit-extraction” (converting the position of a c-e cut into nn boolean values). In , a solution to an instance of Consensus-halving was associated with a sequence of 100 points in the Möbius-simplex (referred there as the “simplex-domain”), and the “averaging manoeuvre” introduced in was applied. In this paper, for a polynomial pC(n)p^{C}(n), we sample a sequence of pCp^{C} points in a more elegant manner, again exploiting the inverse-polynomial precision of solutions that we care about.

1.2 Technical innovations

As in , we reduce from the PPA-complete problem 2D-Tucker (Definition 5). That computational problem uses an exponentially-fine 2D grid, and (unlike ), in Section 3 we apply the snake-embedding technique invented in (versions of which are used in in the context of PPA) to convert this to a grid of fixed resolution, at the expense of going from 2 to nn dimensions. The new problem, Variant high-D Tucker (Definition 7) envisages a 7×7××77\times 7\times\cdots\times 7 grid. Here, we design the snake-embedding in such a way that PPA-completeness holds for instances of the high-dimensional problem that obey a further constraint on the way the high-dimensional grid is coloured, that we exploit subsequently. A further variant, New variant high-D Tucker (Definition 8) switches to a “dual” version where a hypercube is divided into cubelets, and points in the hypercube are coloured such that interiors of cubelets are monochromatic. A pair of points is sought having equal and opposite colours and distant by much less than the size of the cubelets.

We encode a point in nn dimensions using a solution to an instance of Consensus-halving as follows. Instead of having just 2 cuts in the coordinate-encoding region (as in ), suppose we ensure that up to nn cuts lie there. These cuts split this interval into n+1n+1 pieces whose total length is constant, so represent a point in the unit nn-simplex (in , the unit 2-simplex). This “Möbius-simplex” (Definition 17; Figure 10) has the further property that two facets are identified with each other in a way that effectively turns the simplex into an nn-dimensional Möbius strip.

In Section 5.2 we define a crucially-important coordinate transformation (see Figure 11) with the following key properties

the transformation and its inverse can be computed efficiently, and distances between transformed coordinate vectors are polynomially related to distances between un-transformed vectors;

at the two facets that are identified with each other, the coordinates of corresponding points are the negations of each other; our colouring function (that respects Tucker-style boundary conditions) has the effect that antipodal points get equal and opposite colours, and no undesired solutions are introduced at these facets.

This is the “smooth embedding” referred to in the abstract.

With the aid of the above coordinate transformation, we divide up the Möbius-simplex:

The twisted tunnel (Definition 23) is an inverse-polynomially thick strip, connecting the two facets that are identified in forming the Möbius-simplex. It contains at its centre an embedded copy of the hypercube domain of an instance IVTI_{VT} of New variant high-D Tucker. Outside of this embedded copy, it is “coloured in” (using our new coordinate system) in a way that avoids introducing solutions that do not encode solutions of IVTI_{VT}.

The Significant Region contains the twisted tunnel and constitutes a somewhat thicker strip connecting the two facets. It serves as a buffer zone between the twisted tunnel and the rest of the Möbius-simplex. It is subdivided into subregions where each subregion has a unique set of labels, or colours, from ±[n]\pm[n]. (We sometimes refer to these as “colour-regions”.) It is shown that any solution to an instance of Consensus-halving constructed as in our reduction, represents a point in the Significant Region.

If, alternatively, a set of cuts represents a point from outside the Significant Region, then certain agents (so-called “blanket-sensor agents”) will observe severe imbalances between labels A+A_{+} and AA_{-}, precluding a solution.

In , it was relatively straightforward to integrate the subset of the 2-dimensional Möbius-simplex that corresponds with the twisted tunnel, with the parts of the domain where the blanket-sensor agent became active (ruling out a solution) in a way that avoided introducing solutions that fail to encode solutions of Tucker. In the present paper, that gap has to be “coloured-in” in a carefully-designed way (Section 5.3, list item 3), and this is the role of the part of the Significant Region that is not the twisted tunnel. The proofs that they work correctly (Sections 6.2, 6.3) become more complicated.

Snake embedding reduction

The purpose of this section is to establish the PPA-completeness of New variant high-D Tucker, Definition 8. The snake embedding construction was devised in , in order to prove that ε\varepsilon-Nash equilibria are PPAD-complete to find when ε\varepsilon is inverse polynomial; without this trick the result is just obtained for ε\varepsilon being inverse exponential. We do a similar trick here. We will use as a starting-point the PPA-completeness of 2D-Tucker, from , which is the following problem:

(Aisenberg et al. ) An instance of 2D-Tucker consists of a labelling λ:[m]×[m]{±1,±2}\lambda:[m]\times[m]\rightarrow\{\pm 1,\pm 2\} such that for 1i,jm1\leq i,j\leq m, λ(i,1)=λ(mi+1,m)\lambda(i,1)=-\lambda(m-i+1,m) and λ(1,j)=λ(m,mj+1)\lambda(1,j)=-\lambda(m,m-j+1). A solution to such an instance of 2D-Tucker is a pair of vertices (x1,y1)(x_{1},y_{1}), (x2,y2)(x_{2},y_{2}) with x1x21|x_{1}-x_{2}|\leq 1 and y1y21|y_{1}-y_{2}|\leq 1 such that λ(x1,y1)=λ(x2,y2)\lambda(x_{1},y_{1})=-\lambda(x_{2},y_{2}).

The hardness of the problem in Definition 5 arises when mm is exponentially-large, and the labelling function is presented by means of a boolean circuit.

We aim to prove the following is PPA-complete, even when the values mim_{i} are all upper-bounded by some constant (specifically, 7).

(Aisenberg et al. ) An instance of nnD-Tucker consists of a labelling λ:[m1]××[mn]{±1,,±n}\lambda:[m_{1}]\times\cdots\times[m_{n}]\rightarrow\{\pm 1,\cdots,\pm n\} such that if a point x=(x1,,xn){\bf x}=(x_{1},\ldots,x_{n}) lies on the boundary of this grid (i.e., xi=1x_{i}=1 or xi=mix_{i}=m_{i} for some ii), then letting xˉ\bar{\bf x} be the antipodal point of x, we have λ(xˉ)=λ(x)\lambda(\bar{\bf x})=-\lambda({\bf x}). (Two boundary points are antipodal if they lie at opposite ends of a line segment passing through the centre of the grid.) A solution consists of two points z, z{\bf z}^{\prime} on this grid, having opposite labels (λ(z)=λ(z)\lambda({\bf z})=-\lambda({\bf z}^{\prime})), each of whose coordinates differ (coordinate-wise) by at most 1.

It is assumed that λ\lambda is presented in the form of a circuit, syntactically constrained to give opposite labels to antipodal grid points.

An instance of Variant high-D Tucker is similar to Definition 6 but whose instances obey the following additional constraints. The mim_{i} are upper bounded by the constant 7. We impose the further constraint that the facets of the cube are coloured with labels from ±[n]\pm[n] such that all colours are used, and opposite facets have opposite labels, and for 2in2\leq i\leq n it holds that the facet with label ii (resp. i-i) has no grid-point on that facet with label ii (resp. i-i).

A snake-embedding consists of a reduction from kkD-Tucker to (k+1)(k+1)D-Tucker, which we describe informally as follows. See Figure 1. Let II be an instance of kkD-Tucker, on the grid [m1]××[mk][m_{1}]\times\cdots\times[m_{k}]. Embed II in (k+1)(k+1)-dimensional space, so that it lies in the grid [m1]××[mk]×[m_{1}]\times\cdots\times[m_{k}]\times. Then sandwich II between two layers, where all points in the top layer get labelled k+1k+1, and points in the bottom layer get labelled (k+1)-(k+1), as in the left part of Figure 1. We now have points in the grid [m1]××[mk]×[m_{1}]\times\cdots\times[m_{k}]\times, and notice that this construction preserves the required property that points on the boundary have labels opposite to their antipodal points.

Then, the main idea of the snake embedding is the following. We fold this grid into three layers, by analogy with folding a sheet of paper along two parallel lines so that the cross-section is a zigzag line, and one dimension of the folded paper is one-third of the unfolded version, the other dimension being unchanged (see the right hand side of Figure 1). In higher dimension, suppose that m1m_{1} is the largest value of any mim_{i}. Then, we can reduce m1m_{1} by a factor of about 3, while causing the final coordinate to go up from 3 to 9. By merging layers of label k+1k+1 and (k+1)-(k+1), the thickness of 9 reduces to 7. This operation preserves the labelling rule for antipodal boundary points.

However, there are two points that need extra care for the reduction to go through:

Firstly, simply folding the layers such that their cross-sections are zigzag lines may introduce diagonal adjacencies between cubelets that were not present in the original instance in kk-dimensions, i.e. we might end up generating adjacent cubelets with equal-and-opposite colours, see the left part of Figure 2 for an illustration. To remedy this, we will “copy” (or “duplicate”) the cubelets at the folding points, essentially having three cubelets of the same colour, whose cross-sections are the short vertical section in the right hand side of Figure 1, see also the right hand side of Figure 2 for an illustration. From now on, when referring to “folding”, we will mean the version where we also duplicate the cubelets at the folding points, as described above.

Secondly, the folding and duplicating operation only works if m1m_{1} is a multiple of 33, as otherwise the (k+1)(k+1)-dimensional instance may not satisfy the boundary conditions of Definition 6, i.e. we might end up with antipodal cubelets that do not have equal-and-opposite colours. To ensure that m1m_{1} is a multiple of 33 before folding, we can add 11 or 22 additional layers of cubelets to m1m_{1}, (depending on whether the remainder of the division of m1m_{1} by 33 is either 22 or 11 respectively). These layers are duplicate copies of the outer layers of cubelets at opposite ends of the length-m1m_{1} direction; if there is only one additional layer to be added, we can add on either side. Note that this operation does not generate any cubelets of equal-and-opposite labels that were not there before and the same will be true for the instance after the folding operation. See Figure 3 for an illustration.

Let II be an instance of kkD-Tucker having coordinates in ranges [m1],,[mk][m_{1}],\ldots,[m_{k}] and label function λ\lambda. Select the largest mim_{i}, breaking ties lexicographically. Assume for simplicity in what follows that m1m_{1} is largest.

if x1m1x_{1}^{\prime}\leq m_{1}, x{\bf x}^{\prime} is mapped to x=(x1,,xk){\bf x}=(x_{1}^{\prime},\ldots,x_{k}^{\prime}) and λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}).

if x1=m1+1x_{1}^{\prime}=m_{1}+1, x{\bf x}^{\prime} is mapped to x=(m1,,xk){\bf x}=(m_{1},\ldots,x_{k}^{\prime}) and λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}).

In other words, points for which the first coordinate ranges from 11 to x1x_{1}^{\prime}, are mapped to themselves and receive their own label, and points for which the first coordinate is m1+1m_{1}+1 are mapped to the points where the first coordinate is m1m_{1}, receiving the label of that point. This essentially “duplicates” the layer of cubelets on the right endpoint of the m1m_{1}-direction. See Figure 3 for an illustration.

if x1=1x_{1}^{\prime}=1, x{\bf x}^{\prime} is mapped to x=(1,,xk){\bf x}=(1,\ldots,x_{k}^{\prime}) and λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}).

if 2x1m1+12\leq x_{1}^{\prime}\leq m_{1}+1, x{\bf x}^{\prime} is mapped to x=(x11,,xk){\bf x}=(x_{1}^{\prime}-1,\ldots,x_{k}^{\prime}) and λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}). This is similar to the mapping and labelling in the previous case, except for the fact that we need to “shift” the labels of the points, since we essentially introduced a copy of the layer of cubelets on the left endpoint of the m1m_{1}-direction. See Figure 3 for an illustration.

From kk to k+1k+1 dimensions. Starting from an instance II of kDkD-Tucker, we will construct an instance II^{\prime} of (k+1)D(k+1)D-Tucker as follows. Let x=(x1,,xk){\bf x}=(x_{1},\ldots,x_{k}) be a point in [m1]××[mk][m_{1}]\times\ldots\times[m_{k}] with labelling function λ\lambda. We will associate each such point with a corresponding point x{\bf x}^{\prime} in [m13+2]××[mk]×\left[\frac{m_{1}}{3}+2\right]\times\ldots\times[m_{k}]\times and a label λ(x)\lambda^{\prime}({\bf x}^{\prime}) as follows.

If x1m13x_{1}\leq\frac{m_{1}}{3}, then x{\bf x} is mapped to x=(x1,,xk,2){\bf x}^{\prime}=(x_{1},\ldots,x_{k},2), and λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}).

If x1=m13+1x_{1}=\frac{m_{1}}{3}+1 (the first “folding” point), then x{\bf x} is mapped to the following three points in II^{\prime} and receives the following colours (see the shaded cubelets at the right-hand side of Figure 2):

x=(m13+1,,xk,2){\bf x}^{\prime}=(\frac{m_{1}}{3}+1,\ldots,x_{k},2) (the original cubelet) and λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}).

x=(m13+1,,xk,3){\bf x}^{\prime}=(\frac{m_{1}}{3}+1,\ldots,x_{k},3) (the first copy) and λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}).

x=(m13+1,,xk,4){\bf x}^{\prime}=(\frac{m_{1}}{3}+1,\ldots,x_{k},4) (the second copy) and λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}).

If m13+2x12m131\frac{m_{1}}{3}+2\leq x_{1}\leq\frac{2m_{1}}{3}-1, then x is mapped to x=(2m13+2x1,x2,,xk,4){\bf x}^{\prime}=(\frac{2m_{1}}{3}+2-x_{1},x_{2},\ldots,x_{k},4), with λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}).

If x1=2m13x_{1}=\frac{2m_{1}}{3} (the second “folding” point), then x{\bf x} is mapped to the following three points in II^{\prime} and receives the following colours:

x=(2,,xk,4){\bf x}^{\prime}=(2,\ldots,x_{k},4) (the original cubelet) and λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}).

x=(2,,xk,5){\bf x}^{\prime}=(2,\ldots,x_{k},5) (the first copy) and λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}).

x=(2,,xk,6){\bf x}^{\prime}=(2,\ldots,x_{k},6) (the second copy) and λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}).

If 2m13+1x1m1\frac{2m_{1}}{3}+1\leq x_{1}\leq m_{1}, then x{\bf x} is mapped to x=(x1+22m13,x2,,xk,6){\bf x}^{\prime}=(x_{1}+2-\frac{2m_{1}}{3},x_{2},\ldots,x_{k},6), with λ(x)=λ(x)\lambda^{\prime}({\bf x}^{\prime})=\lambda({\bf x}).

Set λ(1,,1)=k\lambda^{\prime}(1,\ldots,1)=-k, along with any point x connected to it via a path of points that have not been labelled by the above procedure.

Set λ(m13+2,m2,,mk,7)=k\lambda^{\prime}(\frac{m_{1}}{3}+2,m_{2},\ldots,m_{k},7)=k, along with any point x connected to it via a path of points that have not been labelled by the above procedure.

First, it is not hard to check that the the composition of O(n)O(n) snake-embeddings is a polynomial-time reduction. Also note that, by the way the high-dimensional instances is constructed, we have not introduced any adjacencies that did not already exist, i.e. if there is a pair of adjacent cubelets with equal-and-opposite labels in the instance II^{\prime} of the high-dimensional version, this pair is present in the instance II of the 2D2D version as well, and it is easy to recover it in polynomial time. Therefore, it suffices to show how to obey the additional constraint of Variant high-D Tucker, namely that for i2i\geq 2, a side having label ii has no grid-points with label ii, and similarly for i-i.

We begin as in (see Figure 1 in that paper), by taking the original 2D2D instance II, of size m×mm\times m, and extend to an instance of size 3m×m3m\times m as follows. The original instance is embedded in the centre of the new instance. Each region RR to the sides (of size m×mm\times m) are labelled by copying the edge of II facing RR, along an adjacent edge of RR, and connecting these two edges with paths that have two straight sections and connect 2 points of the same label, and points along that path have that label. The outermost path then labels a side of the new instance of length mm, so these two opposite sides get opposite labels. We may assume (by switching 11’s and 22’s if needed) that these new opposite sides are labelled ±2\pm 2.

The SS-fold approach shown in Figure 1 (in this paper) can be checked to retain this property. When we sandwich a cuboid between two layers of opposite (new) colours (call them cc and c-c), as shown in Figure 1, we label the new facets thus formed with c-c and cc respectively. We label the other facets with their original labels (each of these facets has acquired the labels cc and c-c, and no other labels). The folding operation has a natural correspondence between the facets of the unfolded and folded versions of the cuboid. It can be checked that the set of colours of a facet before folding is the same as the set of colours of the corresponding facets after folding.

It is convenient to define the following problem, whose PPA-completeness follows fairly directly from the PPA-completeness of Variant high-D Tucker.

An instance of New variant high-D Tucker in nn dimensions is presented by a boolean circuit CVTC_{VT} that takes as input coordinates of a point in the hypercube B=nB=^{n} and outputs a label in ±[n]\pm[n] (assume CVTC_{VT} has 2n2n output gates, one for each label, and is syntactically constrained such that exactly one output gate will evaluate to true), having the following constraints that may be enforced syntactically.

Dividing BB into 7n7^{n} cubelets of edge length 2/72/7 using axis-aligned hyperplanes, all points in the same cubelet get the same label by CVTC_{VT};

Interiors of antipodal boundary cubelets get opposite labels;

Points on the boundary of two or more cubelets get a label belonging to one of the adjacent cubelets;

Facets of BB are coloured with labels from ±[n]\pm[n] such that all colours are used, and opposite facets have opposite labels. For 2in2\leq i\leq n it also holds that the facet with label ii (resp. i-i) does not intersect any cubelet having label ii (resp. i-i). Facets coloured ±1\pm 1 are unrestricted (we call them the “panchromatic facets”).

A solution consists of a polynomial number pCp^{C} of points that all lie within an inverse polynomial distance δ(n)\delta(n) of each other (for concreteness, assume δ(n)=1/100n\delta(n)=1/100n). At least two of those points should receive equal and opposite labels by CVTC_{VT}.

New variant high-D Tucker corresponds to the problem Variant Tucker in ; in that paper a solution only contained 100 points, while here we use pCp^{C} points. Here we need more points since we are in nn dimensions, and our analysis needs to tolerate nn points receiving unreliable labels.

Some building-blocks and definitions

Here we set up some of the general structure of instances of Consensus-halving constructed in our reduction. We identify some basic properties of solutions to these instances. We define the Möbius-simplex and the manner in which a solution encodes a point on the Möbius-simplex. The encoding of the circuitry is covered in Section 5.

We use the following values throughout the paper.

δtiny\delta^{\rm tiny} is an inverse-polynomial quantity in nn, chosen to be substantially smaller than any other inverse-polynomial quantity that we use in the reduction, apart from ε\varepsilon (below).

δT\delta^{T} is an inverse-polynomial quantity in nn, which is smaller than any other inverse-polynomial quantity apart from δtiny\delta^{\rm tiny} and is larger than δtiny\delta^{\rm tiny} by an inverse-polynomial amount. The quantity δT\delta^{T} denotes the width of the so-called “twisted tunnel” (see Definition 23).

phugep^{\rm huge} denotes a large polynomial in nn; specifically we let phuge=n/δtinyp^{\rm huge}=n/\delta^{\rm tiny}. The quantity phugep^{\rm huge} represents the number of sensor agents for each circuit encoder (see Definition 13).

plargep^{\rm large} denotes a large polynomial in nn, which is however smaller than phugep^{\rm huge} by a polynomial factor. The quantity plargep^{\rm large} will be used in the definition of the “blanket-sensor agents” (see Definition 14) and will quantify the extent to which the cuts in the “coordinate-encoding region” (Definition 9) are allowed to differ from being evenly spaced, before the blanket-sensor agents become active (see Section 4). The choice of plargep^{\rm large} controls the value δw\delta_{w} of the radius of the Significant Region (see Proposition 4.4), with larger plargep^{\rm large} meaning larger δw\delta_{w}.

ε\varepsilon is the precision parameter in the Consensus-Halving solution, i.e. each agent ii is satisfied with a partition as long as μi(A+)μi(A)ε|\mu_{i}(A_{+})-\mu_{i}(A_{-})|\leq\varepsilon. Henceforth, we will set ε=δtiny/10\varepsilon=\delta^{\rm tiny}/10.

1 Basic building-blocks

We consider instances ICHI_{CH} of Consensus-halving that have been derived from instances IVTI_{VT} of New variant high-D Tucker in nn dimensions. The general aim is to get any solution of such an instance ICHI_{CH} to encode a point in nn dimensions that “localises” a solution to IVTI_{VT}, by which we mean that from the solution of ICHI_{CH}, we will be able to find a point on the IVTI_{VT} instance that can be transformed to a solution of IVTI_{VT} in polynomial time and fairly straightforwardly.

Coordinate-encoding region (c-e region) Given an instance of Variant high-D Tucker in nn dimensions, the corresponding instance of Consensus-halving has a coordinate-encoding region, the interval [0,n][0,n], a (prefix) subinterval of AA.

The valuation functions of agents in an instance ICHI_{CH} of Consensus-halving obtained by our reduction from an instance of New variant high-D Tucker in nn dimensions, will be designed in such a way that either n1n-1 or nn cuts (typically nn) must occur in the coordinate-encoding region, in any solution. Furthermore, the distance between consecutive cuts must be close to 1 (an additive difference from 1 that is upper-bounded by an inverse polynomial), shown in Proposition 4.4.

Coordinate-encoding agents (c-e agents). Given an instance of New variant high-D Tucker in nn dimensions, the corresponding instance of Consensus-halving has nn coordinate-encoding agents denoted {a1,,an}\{a_{1},\ldots,a_{n}\}.

The nn c-e agents have associated nn coordinate-encoding cuts (Definition 11). It will be seen that the c-e cuts typically occur in the c-e region. The c-e agents do not have any value for the coordinate-encoding region; their value functions are only ever positive elsewhere. In particular, they have blocks of value whose labels A+/AA_{+}/A_{-} are affected by the output gates of the circuitry that is encoded to the right of the c-e region.

Coordinate-encoding cuts (c-e cuts). We identify nn cuts as the coordinate-encoding cuts. In the instances of Consensus-halving that we construct, in any (sufficiently good approximate) solution to the Consensus-halving instance, all other cuts will be constrained to lie outside the c-e region (and it will be straightforward to see that the value functions of their associated agents impose this constraint). A c-e cut is not straightforwardly constrained to lie in the c-e region, but it will ultimately be proved that in any approximate solution, the c-e cuts do in fact lie in the c-e region.

Recall that phuge=n/δtinyp^{\rm huge}=n/\delta^{\rm tiny} from Section 2, which implies that the c-e region can be divided into phugep^{\rm huge} intervals of length δtiny\delta^{\rm tiny} (see also Figure 4).

σ\sigma-shifted version. Given a value function ff (or measure) on the c-e region [0,n][0,n], we say that another function ff^{\prime} on the c-e region is a σ\sigma-shifted version of ff, when we have that f((xσ)modn)=f(x)f^{\prime}((x-\sigma)\mod n)=f(x).

Recall that the circuit-encoding region (details in Section 5) contains pCp^{C} circuit-encoders, mentioned in the following definitions.

Sensor agents. Each circuit-encoder CiC_{i}, i=1,,pCi=1,\ldots,p^{C}, has a set Si{\cal S}_{i} of sensor agents. Si={si,1,,si,phuge}{\cal S}_{i}=\{s_{i,1},\ldots,s_{i,p^{\rm huge}}\} where the si,js_{i,j} are defined as follows. When i=1i=1, s1,js_{1,j} has value 110\frac{1}{10} uniformly distributed over the interval

For i>1i>1, si,js_{i,j} is a 1pC(i1)δtiny\frac{1}{p^{C}}(i-1)\delta^{\rm tiny}-shifted version of s1,js_{1,j}.

Each sensor agent si,js_{i,j} also has valuation outside the c-e region, in non-overlapping intervals of the circuit-encoding region RiR_{i} (see Section 5.1). This valuation consists of two valuation blocks of value 920\frac{9}{20} each, with no other valuation block in between. These are exactly as described in , see also Appendix A and Figure 16 for an illustration.

This value gadget for si,js_{i,j} causes the jj-th input gate in the circuit-encoder CiC_{i} to be set according to the label received by sis_{i}’s block of value in the c-e region, i.e. jump to the left or to the right in order to indicate that the corresponding value-block of sis_{i} in the c-e region is labelled A+A_{+} or AA_{-}.

According to the definitions above, C1C_{1} has a sequence of (a large polynomial number of) sensor agents that have blocks of value in a sequence of small intervals going from left to right of the c-e region (see Figure 4). For 1<ipC1<i\leq p^{C}, CiC_{i} has a similar sequence, shifted slightly to the right on the c-e region (by δtiny(i1)/pC\delta^{\rm tiny}(i-1)/p^{C}). For j[phuge]j\in[p^{\rm huge}], the intervals defined by the value-blocks of the sensor agents s1,j,,spC,js_{1,j},\ldots,s_{p^{C},j} (for C1,,CpCC_{1},\ldots,C_{p^{C}}) partition the interval [(j1)δtiny,jδtiny][(j-1)\delta^{\rm tiny},j\delta^{\rm tiny}].

Remark: Note that a c-e cut may divide one of the above value-blocks held by a sensor agent in the c-e region, and in that case the input being supplied (using the gadget of ) to its circuit-encoder is unreliable. However, only nn sensor agents may be affected in that way, and their circuit-encoders will get “out-voted” by the ones that receive valid boolean inputs. This is part of the reason why we use pCp^{C} circuit-encoders in total. More details on this averaging argument are provided in Section 5.

Blanket-sensor agents. Each circuit-encoder CiC_{i} shall have n1n-1 blanket-sensor agents bi,2,,bi,nb_{i,2},\ldots,b_{i,n}.

In C1C_{1}, for each j=2,,nj=2,\ldots,n, blanket-sensor agent b1,jb_{1,j} has value 1/101/10 distributed over [j2,j][j-2,j] (see Figure 4). This value consists of a sequence

of 2/δtiny=2phuge/n2/\delta^{\rm tiny}=2p^{\rm huge}/n value-blocks, each of length δtiny/pC\delta^{\rm tiny}/p^{C} and of value 110(δtiny/2)\frac{1}{10}\cdot(\delta^{\rm tiny}/2).

In each CiC_{i}, 1<ipC1<i\leq p^{C}, and for each j=2,,nj=2,\ldots,n, the value function of bi,jb_{i,j} that lies in the c-e region is an (i1)δtinypC(i-1)\frac{\delta^{\rm tiny}}{p^{C}}-shifted version of b1,jb_{1,j}.

The remaining value 9/109/10 of each bi,jb_{i,j} consists of 3 value-blocks of width δtiny\delta^{\rm tiny} lying in a subinterval Ii,jI_{i,j} of the circuit-encoding region RiR_{i} (see Section 5.1), such that:

respectively, where \kappa=\Bigl{(}\frac{1}{10}\frac{\delta^{\rm tiny}}{2}\Bigr{)}p^{\rm large}.

Ii,jI_{i,j} contains also value-blocks of agents for each gate that takes the value of bi,jb_{i,j} as input (the feedback gadgetry, see Section 5.1.2).

The value of the blanket-sensor agents in Ii,jI_{i,j} is very similar to the gadget used in , see Appendix A (of the present paper) and Figure 17. The structure of the blanket-sensor agents in the c-e region is shown in Figure 4.

Each blanket-sensor agent bi,jb_{i,j} has an associated cut c(bi,j)c(b_{i,j}) that lies in the subinterval Ii,jI_{i,j}. Agent bi,jb_{i,j} “monitors” an interval of length 2, namely the interval [j2,j][j-2,j] within which the sequence of 2/δtiny2/\delta^{\rm tiny} value-blocks lie. If, in this interval, the number of these value-blocks labelled A+A_{+} exceeds the number labelled AA_{-} by at least plargep^{\rm large} (recall that plargep^{\rm large} is a large polynomial which is however polynomially smaller than phugep^{\rm huge}) then (in any ε\varepsilon-approximate solution to ICHI_{CH}, where, recall, ε=δtiny/10\varepsilon=\delta^{\rm tiny}/10), the cut c(bi,j)c(b_{i,j}) in Ii,jI_{i,j} lies in either the right-hand or the left-hand value-block, otherwise it lies in the central value-block. Note that these three possible positions may be converted to boolean values that influence circuit-encoder CiC_{i}; this was referred to as a “bit-detection gadget” in , see Appendix A for more details.

We say that blanket-sensor bi,jb_{i,j} is active if bi,jb_{i,j} in fact observes a sufficiently large label discrepancy in the c-e region, that c(bi,j)c(b_{i,j}) lies in one of the two outer positions, left or right, and not in the central position. We say that bi,jb_{i,j} is active towards A+A_{+} if A+A_{+} is the overrepresented label, with similar terminology for AA_{-}.

When blanket-sensor agent bi,jb_{i,j} is active, it provides input to CiC_{i} that causes CiC_{i} to label the value held by aja_{j} and controlled by CiC_{i}, to be either A+A_{+} or AA_{-}; the choice depends on the over-represented label in [j2,j][j-2,j] and the parity of the index of the blanket-sensor agent. The precise feedback mechanism to the c-e agent aja_{j} by the blanket-sensor bi,jb_{i,j} is described in Section 5.1.3.

When no blanket-sensors are active, the sequence of c-e cuts encodes a point in the Significant Region (Definition 18).

In , we worked just in two dimensions and there was just one blanket-sensor agent for the entire c-e region, for each circuit-encoder. Note also that there, the blanket-sensor agent had a single value-block of length 2; here we split it into a polynomial sequence of small value-blocks. The advantage of using a polynomial sequence of value-blocks (which could not have been done in due to the exponential precision requirement) is that we can argue that in all but at most nn circuit-encoders, the blanket-sensor agents have value-blocks that are not cut by the c-e cuts, so we can be precise about how big a disparity between blocks labelled A+A_{+} and AA_{-} cause a blanket-sensor to be active, and for at most nn circuit-encoders, we regard them as having unreliable inputs (see Definition 16 and Observation 4.2).

2 Features of solutions

The main result of this section is Proposition 4.4, that in a solution to approximate Consensus-halving as constructed here, the sequence of cuts in the c-e region are “evenly spaced” in the sense that the gap between consecutive cuts differs from 1 by at most an inverse-polynomial.

Given an instance ICHI_{CH} derived by our reduction from an instance of New variant high-D Tucker in nn dimensions, any inverse-polynomial approximate solution of ICHI_{CH} has the property that at most nn cuts lie in the coordinate-encoding region. This is because all other cuts are associated with agents who have at least 9/109/10 of their value strictly to the right of the c-e region, thus in a solution, those cuts cannot lie in the c-e region.

We will say that a circuit-encoder receives reliable input, if no coordinate-encoding cut passes through value-blocks of its sensor agents.

At most nn circuit-encoders fail to receive reliable input (by Observation 4.1 and the fact that sensors of distinct circuit-encoders have value in distinct intervals).

When a circuit-encoder receives reliable input, it is straightforward to interpret the labels allocated to its sensors, as boolean values, and simulate a circuit computation on those values, ultimately passing feedback to the c-e agents via value-blocks that get labelled according to the output gates of the circuit being simulated. This is done in a conceptually similar way to that described in (e.g. see Sections 4.4.2 and 4.6 in ), see also Appendix A of the present paper.

The Möbius-simplex. The Möbius-simplex in nn dimensions consists of points x{\bf x} in \mboxIRn+1\mbox{{IR}}^{n+1} whose coordinates are non-negative and sum to 1. We identify every point (x1,,xn,0)(x_{1},\ldots,x_{n},0) with the point (0,x1,,xn)(0,x_{1},\ldots,x_{n}), for all non-negative x1,,xnx_{1},\ldots,x_{n} summing to 1. We use the following metric d(,)d(\cdot,\cdot) on the Möbius-simplex, letting L1L_{1} be the standard L1L_{1} distance on vectors:

where (0,x1,,xn)(x1,,xn,0)(0,x_{1},\ldots,x_{n})\equiv(x_{1},\ldots,x_{n},0).

Let ICHI_{CH} be an instance of Consensus-halving, obtained by reduction from New variant high-D Tucker in nn dimensions, hence having c-e region [0,n][0,n]. Note that, by Observation 4.1, at most nn cuts may lie in the c-e region. A set of knk\leq n cuts of the coordinate-encoding region splits it into k+1k+1 pieces. We associate such a split with a point x{\bf x} in \mboxIRn+1\mbox{{IR}}^{n+1} as follows. The first coordinate is the distance from the LHS of the consensus-halving domain to the first cut, divided by nn, the length of the c-e region. For 2ik+12\leq i\leq k+1, the ii-th coordinate of x{\bf x} is the distance between the i1i-1-st and ii-th cuts, divided by nn. Remaining coordinates are 0.

If there are n1n-1 cuts in the c-e region, suppose we add a cut at either the LHS or the RHS. These two alternative choices correspond to a pair of points that have been identified as the same point, as described in Definition 17. (Observation 5.3 makes a similar point regarding transformed coordinates.)

Each circuit-encoder reads in “input” representing a point in the Möbius-simplex. Any circuit-encoder CiC_{i} (i[pC]i\in[p^{C}]) behaves like C1C_{1} on a point xi{\bf x}_{i}, for which (for all i,j[pC]i,j\in[p^{C}]) d(xi,xj)δtinyd({\bf x}_{i},{\bf x}_{j})\leq\delta^{\rm tiny} (recall dd is defined in (1)). Consequently their collective output (the split between A+A_{+} and AA_{-} of the value held by the c-e agents) is the output of a single circuit-encoder averaged over a collection of pCp^{C} points in the Möbius-simplex, all within δtiny\delta^{\rm tiny} of each other.

This follows by inspection of the way the pCp^{C} circuit-encoders differ from each other: their sensor-agents are shifted but their internal circuitry is the same.

The Significant Region of the Möbius-simplex DD. The Significant Region of DD consists of all points in DD where no blanket-sensors are active (where “blanket-sensors” and “active” are defined in Definition 15).

There is an inverse-polynomial value δw\delta^{w} such that all points x=(x1,,xn+1){\bf x}=(x_{1},\ldots,x_{n+1}) in the Significant Region have coordinates xix_{i} that for 2in2\leq i\leq n differ from 1/n1/n by at most δw\delta^{w}, if x{\bf x} is encoded by the c-e cuts of an ε\varepsilon-approximate solution to one of our instances of Consensus-halving. (Recall that ε=δtiny/10\varepsilon=\delta^{\rm tiny}/10).

Thus, if an instance ICHI_{CH} of Consensus-halving (obtained using our reduction) has a solution SCHS_{CH}, then all the c-e cuts in SCHS_{CH} have the property that the distance between two consecutive c-e cuts differs from 1 by at most some inverse-polynomial amount.

Before we proceed with the proof of the proposition, we will state a few simple lemmas that will be used throughout the proof. We start with the following definition.

Intuitively, cuts that are δ\delta-close to integer points lie close (within distance at most δ\delta) to either the endpoints or the midpoint of some monitored interval [j2,j][j-2,j].

An interval II is a monochromatic interval if it is not intersected by any cuts (which means that it receives a single label). Additionally, if for Aj{A+,A}A_{j}\in\{A_{+},A_{-}\}, II is labelled with AjA_{j}, then we will say that II is a monochromatic interval of label AjA_{j}.

It should be clear that if any monitored interval [j2,j][j-2,j] has a large enough (larger than 1+δ1+\delta) monochromatic subinterval, then the blanket-sensor agent b1,jb_{1,j} is active.

In II, there are at least k1k-1 intervals monitored by blanket-sensor agents and we only have at most k2k-2 cuts at our disposal. With k2k-2 cuts, we can partition an interval of length kk in at most k1k-1 intervals, the largest of which, call it ImaxI_{\text{max}}, will have length at least 1+1/k1+1/k. Since δ<1/2n\delta<1/2n, the length of ImaxI_{max} is actually larger than 1+δ1+\delta. The lemma follows then from the fact that, since the monitored intervals partition the interval II, ImaxI_{max} will contain a monochromatic interval of length at least 1+δ1+\delta, which will be entirely contained in some monitored interval, and the corresponding blanket-sensor agent will be active.

each of the k1k-1 cuts in II will be δ\delta-close to a different integer point and these integer points will be the midpoints of the monitored subintervals contained entirely in II or

at least one of the blanket-sensors monitoring the subintervals in II will be active.

In II, there are at least k1k-1 intervals monitored by blanket-sensor agents and we have k1k-1 cuts at our disposal. Assume that there exists some integer point j1j-1 which is a midpoint of some monitored interval [j2,j]I[j-2,j]\subseteq I such that there exists no cut that is δ\delta-close to j1j-1. This implies that either [j2,j][j-2,j] is intersected by at least 22 cuts, or the blanket-sensor agent corresponding to [j2,j][j-2,j] will be active. To see this, note that if there were not any cuts in [j2,j][j-2,j] then obviously the whole interval [j2,j][j-2,j] would be monochromatic and the corresponding blanket-sensor would be active. If [j2,j][j-2,j] was intersected by a single cut, since the cut lies at distance at least δ\delta from the midpoint of the interval, there would exist a monochromatic subinterval of [j2,j][j-2,j] of length at least 1+δ1+\delta, activating the corresponding blanket-sensor agent b1,jb_{1,j}.

However, for the blanket-sensor agent b1,jb_{1,j} to not be active, it would have to be the case that some other cut in the interval [j2,j][j-2,j] is also not δ\delta-close to the midpoint of one of the adjacent monitored intervals, therefore generating an imbalance in labels that has to be compensated with at least one additional cut in that interval. Given that there are only k1k-1 cuts in the interval II, it follows that in some monitored subinterval [j,j2][j^{\prime},j^{\prime}-2] there will be a large enough imbalance, i.e a monochromatic subinterval of length at least 1+δ1+\delta, and the corresponding blanket-sensor agent b1,jb_{1,j^{\prime}} will be active. See the right-hand side of Figure 5 for an illustration.

We are now ready to proceed with the proof of Proposition 4.4.

First, recall that by Observation 4.1, at most nn cuts can lie in the c-e region. Also recall that from Definition 14, for the circuit encoder C1C_{1}, the blanket-sensor agent b1,jb_{1,j}, j{2,,n}j\in\{2,\ldots,n\} has valuation only in the interval [j2,j][j-2,j] of the c-e region, i.e. it “monitors” the interval [j2,j][j-2,j]. The blanket-sensor agent bi,jb_{i,j} for i{2,,pC}i\in\{2,\ldots,p^{C}\} is a (i1)δtinypC(i-1)\frac{\delta^{\rm tiny}}{p^{C}}-shifted version of b1,jb_{1,j}. We will make the argument for the blanket-sensor agents of the circuit-encoder C1C_{1}; the argument for any CiC_{i}, with i1i\neq 1 is very similar.

It suffices to prove that if consecutive cuts are too far apart or too close together, some blanket-sensor agent will be active.

Case 1: The cuts are too far apart. First consider the case when two consecutive cuts are too far apart (by more than 11 plus some inverse-polynomial amount 2δ2\delta). More formally, assume that there are two cuts c1c_{1} and c2c_{2} such that c2>c1c_{2}>c_{1} and c2c1>1+2δc_{2}-c_{1}>1+2\delta. Then, as we explain below, there is some j{2,,n}j\in\{2,\ldots,n\} such that some subinterval Ij=[j1,j2][j2,j]I_{j}=[j_{1},j_{2}]\subseteq[j-2,j] with j2j1>1+δj_{2}-j_{1}>1+\delta will receive a single label, either A+A_{+} or AA_{-}. In particular, we have the following cases:

There is a jj such that [c1,c2][j2,j][c_{1},c_{2}]\subseteq[j-2,j]. In that case, [c1,c2][c_{1},c_{2}] is such a monochromatic subinterval.

There is a jj such that [j2,j][c1,c2][j-2,j]\subseteq[c_{1},c_{2}]. In that case, the whole monitored subinterval [j2,j][j-2,j] is such a monochromatic subinterval.

Case 2: The cuts are too close together. Now consider the case when two consecutive cuts are too close together, closer than 12nδ1-2n\delta. More formally, assume that there are two consecutive cuts c1c_{1} and c2c_{2} in the c-e region such that c2>c1c_{2}>c_{1} and c2c1<12nδc_{2}-c_{1}<1-2n\delta. Since the cuts are close together, there exists a monitored interval that is intersected by both c1c_{1} and c2c_{2} and let [j2,j][j-2,j] be such an interval. Notice that if there exists no other cut that intersects [j2,j][j-2,j], then [j2,c1][c2,j][j-2,c_{1}]\cup[c_{2},j] is a union of subintervals of length at least 1+δ1+\delta that receive the same label, and we are done. Therefore there must exist at least 33 cuts that lie in [j2,j][j-2,j]. We consider three cases.

There are 5 or more cuts in [j2,j][j-2,j]. This is an easy case to argue, as if that happens, there will be some interval, either [0,j2][0,j-2] or [j,n][j,n] of length kk that is only intersected by at most k2k-2 cuts. By Lemma 4.5, some blanket-sensor agent will be active and we are done.

There are 4 cuts in [j2,j][j-2,j]. Consider the intervals [0,j2][0,j-2] and [j,n][j,n]. If either [0,j2][0,j-2] is intersected by at most (j2)2(j-2)-2 cuts or [j,n][j,n] is intersected by at most nj2n-j-2 cuts, then by Lemma 4.5, some blanket-sensor will be active and we are done. Note also for completeness that, if j=2j=2 (respectively j=nj=n), it is necessarily the case that [j2,n][j-2,n] (respectively [0,j2][0,j-2]) is intersected by n4n-4 cuts and Lemma 4.5 again applies. Therefore, we can assume that j{3,,n1}j\in\{3,\ldots,n-1\}, and that there are exactly (j2)1(j-2)-1 cuts in [0,j2][0,j-2] and nj1n-j-1 cuts in [j,n][j,n].

Consider the interval [j,n][j,n] without loss of generality, as the argument for [0,j2][0,j-2] is symmetric. By Lemma 4.6, we know that the cuts in [j,n][j,n] are δ\delta-close to integer points and particularly, they are δ\delta-close to the midpoints of the monitored intervals [j,j+2],,[n2,n][j,j+2],\ldots,[n-2,n]. This implies that in the monitored subinterval [j,j+2][j,j+2], the subinterval [j,j+1δ][j,j+1-\delta] will be a monochromatic interval of label AjA_{j} for some Aj{A+,A}A_{j}\in\{A_{+},A_{-}\},

In turn, this implies that [j1,j][j-1,j] has a monochromatic subinterval of length at least 1δ1-\delta that receives the label AjA_{-j}, where Aj{A+,A}A_{-j}\in\{A_{+},A_{-}\} is the complementary label to AjA_{j}, for the blanket-sensor agent to not be activated, which is only possible if one of the 44 cuts in [j2,j][j-2,j] is δ\delta-close to the integer point jj. Propagating the effect of this cut sequence/labelling into the monitored interval [j2,j][j-2,j] in question, we obtain that [j2,j1][j-2,j-1] also contains a monochromatic interval of length at least 1δ1-\delta and label AjA_{j}, as otherwise blanket-sensor agent b1,jb_{1,j} would be active. From this discussion, it follows that:

all the cuts in [j2,j][j-2,j] are δ\delta-close to integer coordinates within the interval and

there is at least one cut in [j2,j][j-2,j] that is δ\delta-close to the midpoint j1j-1 of the monitored interval, one cut that is δ\delta-close to the right endpoint jj of the monitored interval and at least one cut that is δ\delta-close to the left endpoint j2j-2 of the monitored interval,

where the very last statement follows from the symmetric argument to the one developed above, for the interval [0,j2][0,j-2]. See Figure 6 for an illustration.

There are 33 cuts in [j2,j][j-2,j]. Again, considering the intervals [0,j2][0,j-2] and [j,n][j,n] as we did in the case of 44 cuts in [j2,j][j-2,j], we can now observe that one of the intervals will be intersected by at most k1k-1 cuts, where k{j2,nj}k\in\{j-2,n-j\} is its length. Furthermore, if it is intersected by fewer than k1k-1 cuts, by Lemma 4.5 some blanket-sensor agent will be active and we are done. Therefore, we will consider the case when one of the intervals is intersected by exactly k1k-1 cuts and let [j,n][j,n] be that interval, without loss of generality, as the argument for [0,j2][0,j-2] is symmetric.

Following exactly the same arguments as in the second and third paragraph of the case of 44 cuts above, we can establish a very similar statement, namely that:

all the cuts in [j2,j][j-2,j] are δ\delta-close to integer coordinates within the interval and

there is at least one cut in [j2,j][j-2,j] that is δ\delta-close to the midpoint j1j-1 of the monitored interval, and one cut that is δ\delta-close to the right endpoint jj of the monitored interval.

Remarks: Looking ahead, certain points in the Significant Region encode New variant high-D Tucker (namely, the ones in the “twisted tunnel”, Definition 23). The Significant Region contains the twisted tunnel, being a somewhat wider 1-dimensional “tunnel” of inverse-polynomial width at most 1/pw(n)1/p_{w}(n), whose central axis is the set of points (α,1/n,,1/n,1/nα)(\alpha,1/n,\ldots,1/n,1/n-\alpha), where the endpoints are identified together (noting Definition 17). Topologically, the Significant Region is a high-dimensional Möbius strip.

Reducing from New variant high-D Tucker to Consensus-halving

In Sections 5.1 we give an overview of aspects of how we construct an instance ICHI_{CH} of ε\varepsilon-Consensus-halving (in poly-time) from an instance of New variant high-D Tucker, for inverse polynomial ε\varepsilon. Section 5.2 describes the new coordinate system for the Möbius-simplex DD and establishes key properties. Section 5.3 presents a colouring function of DD in terms of the coordinate system of Section 5.2. Section 5.4 describes how to construct a purported solution to nn-dimensional New variant high-D Tucker from a solution to ε\varepsilon-Consensus-halving. In Section 6 we prove that a solution to New variant high-D Tucker that is obtained by reducing to ε\varepsilon-Consensus-halving, solving it, and converting that solution to a solution to nn-dimensional New variant high-D Tucker, really is a valid solution.

We define the reduction from New variant high-D Tucker (Definition 8) to ε\varepsilon-Consensus-halving.

Let IVTI_{VT} be an instance of New variant high-D Tucker in nn dimensions; let CVTC_{VT} be the boolean circuit that represents it. ICHI_{CH} will be the corresponding instance of Consensus-halving. We list ingredients of ICHI_{CH} and give notation to represent them, as follows. AA is the consensus-halving domain, an interval of the form [0,poly(n)][0,poly(n)]. Any agent aa has a measure μa:A\mboxIR\mu_{a}:A\longrightarrow\mbox{{IR}} represented as a step function (thus having a polynomial number of steps).

ICHI_{CH} has nn coordinate-encoding agents a1,,ana_{1},\ldots,a_{n} (Definition 10). See Figure 9.

The consensus-halving domain AA of ICHI_{CH} has a coordinate-encoding region (c-e region) (Definition 9) consisting of the interval [0,n][0,n].

ICHI_{CH} has pCp^{C} circuit-encoders (Sections 5.1.1, 5.1.4), C1,,CpCC_{1},\ldots,C_{p^{C}}.

Each CiC_{i} has a set Ai{\cal A}_{i} of agents (see Figure 9) which includes CiC_{i}’s sensor agents, also circuit-encoding agents (below).

Each CiC_{i} has an associated circuit-encoding region RiR_{i} of AA; each RiR_{i} is an interval of polynomial length, and the RiR_{i} do not intersect with each other or with the coordinate-encoding region.

Ai{\cal A}_{i} contains a polynomial number of circuit-encoding agents (one for each gate of CVTC_{VT}), having value in RiR_{i}.

Each CiC_{i} has phugep^{\rm huge} sensor agents as defined in Definition 13 each of which has a block of value 1/101/10 in a small subinterval of the c-e region as specified in Definition 13, and further value in region RiR_{i}.

Each CiC_{i} has n1n-1 blanket-sensor agents as in Definition 14.

Remarks: We associate one cut with each agent; let c(a)c(a) be the cut associated with agent aa. The cuts c(ai)c(a_{i}) for coordinate-encoding agents, are called the coordinate-encoding cuts (or c-e cuts). A straightforward consequence of Proposition 4.4 is that in any solution, either all nn, or n1n-1, of the coordinate-encoding cuts must lie in the coordinate-encoding region. All other cuts must lie in the regions RiR_{i}, indeed, every cut, other than the c-e cuts, is constrained by the value of its associated agent, to lie in a small interval that does not overlap any other such intervals. In the event that a c-e cut lies outside the c-e region, we refer to it as a “stray cut”, and while such a cut may initially appear to interfere with the functioning of the circuitry, similarly to we have that the duplication of the circuit using pCp^{C} circuit-encoders, allows the circuitry to be robust to this problem. See Appendix A for more details.

Recall CVTC_{VT} is the boolean circuit in the instance IVTI_{VT} of New variant high-D Tucker.

We assume that CVTC_{VT} has 2n2n output gates g1,,gng_{1},\ldots,g_{n} and g1,,gng_{-1},\ldots,g_{-n} having the property that exactly one of them will take value true (this may be enforced syntactically). gig_{i} getting value 1 (true) means that the point at coordinates represented by the input gets coloured ii.

CVTC_{VT} has npolylog(n)n\cdot{\rm polylog}(n) input gates, representing the coordinates of a point in B=nB=^{n}, each represented with inverse-polynomial precision.

We describe how circuit-encoder C1C_{1} is derived from CVTC_{VT}. The subsequent circuit-encoders can then be specified in terms of C1C_{1}. Each gate gg of CVTC_{VT} is simulated using a “gate agent” a(g)a(g), as constructed in (see Appendix A for a more detailed exposition). a(g)a(g)’s cut c(a(g))c(a(g)) occupies a right position, or a left position, representing true or false, as a function of the cut(s) that represent boolean inputs to gg.

The circuit-encoding agents A1{\cal A}_{1} of C1C_{1} thus include 2n2n gate agents whose corresponding cuts simulate the values of the output gates of CVTC_{VT}, provided that the input represented by the c-e cuts lies in the “Significant Region” (Definition 18). The positions of these cuts affect the labels of blocks of value held by the nn coordinate-encoding agents, as detailed in Section 5.1.2.

Reference sensor-agent. Noting from Definition 13 that the sensor agents for CiC_{i} are denoted Si={si,1,,si,phuge}{\cal S}_{i}=\{s_{i,1},\ldots,s_{i,p^{\rm huge}}\}, we let s1,1s_{1,1} be the reference sensor-agent: Outputs produced by the circuit CiC_{i} are taken with reference to the valueWe are using s1,1s_{1,1} to denote the boolean value taken by s1,1s_{1,1} as well as the sensor itself. s1,1s_{1,1}, in the sense that after simulating CVTC_{VT} we take the exclusive-or with s1,1s_{1,1}.

We used this crucial technique of Definition 21 in : it performs the task of disorienting the domain while at the same time ensuring continuity when we move a cut from the left-hand side of the c-e region to the right-hand side.

For each CiC_{i}, we take all phugep^{\rm huge} input bits, which appear in up to n+1n+1 blocks of consecutive 1’s and 0’s, and convert them into the coordinates of a point in the Möbius-simplex (Definition 17). As noted earlier (Observation 4.2) at most nn circuit-encoders may receive ill-defined inputs caused by c-e cuts cutting through value-blocks in the c-e region that belong to their sensor agents; we simply assume that the output of those agents is unreliable, indeed adversarially chosen.

We then perform a coordinate transformation described in Section 5.2. A subset of points in the Möbius-simplex maps to a copy of the domain BB of the instance IVTI_{VT} (recall Definition 8). These points get their coordinates passed directly to a copy of CVTC_{VT}, and the outputs of CCVC_{CV} are used to provide feedback to the c-e agents as described in Section 5.1.2 (and discussed in Observations 4.2 and 4.3). Other points get coloured in a manner that avoids allowing bogus solutions to ICHI_{CH} (i.e. ones that do not encode solutions to IVTI_{VT}).

The following generalises . CVTC_{VT} has output gates gjg_{j}, j±[n]j\in\pm[n], with the property that when inputs are well-defined, exactly one output gate evaluates to true. A circuit-encoder simulates CVTC_{VT} using the gate gadgets introduced in (see Appendix A). Let xREF{\mbox\sctrue,\mbox\scfalse}x_{REF}\in\{\mbox{{\sc true}},\mbox{{\sc false}}\} be the negation of the value of the reference sensor (Definition 21). We use additional gates gjg^{\prime}_{j}, j±[n]j\in\pm[n] where

if gj=gj=\mbox\scfalseg_{j}=g_{-j}=\mbox{{\sc false}} then gj=\mbox\sctrueg^{\prime}_{|j|}=\mbox{{\sc true}} and gj=\mbox\scfalseg^{\prime}_{-|j|}=\mbox{{\sc false}};

if j>0j>0 and gj=\mbox\sctrueg_{j}=\mbox{{\sc true}} (so gj=\mbox\scfalseg_{-j}=\mbox{{\sc false}}) then gj=gj=\mbox\sctruexREFg^{\prime}_{j}=g^{\prime}_{-j}=\mbox{{\sc true}}\oplus x_{REF};

if j<0j<0 and gj=\mbox\sctrueg_{j}=\mbox{{\sc true}} (so gj=\mbox\scfalseg_{-j}=\mbox{{\sc false}}) then gj=gj=\mbox\scfalsexREFg^{\prime}_{j}=g^{\prime}_{-j}=\mbox{{\sc false}}\oplus x_{REF};

Each of the c-e agents a1,,ana_{1},\ldots,a_{n} has 22 value-blocks of value 1/(2pC)1/(2p^{C}) in region R1R_{1}, and each gate gjg^{\prime}_{j} of C1C_{1} is able to select the label of one of these value-blocks (recall that the boolean value at a gate is represented by two positions that may be taken by the corresponding cut, so that a block of value lies between these two positions.) Figure 20 in Appendix A shows an example of how this feedback works.

Let Aj{A+,A}A_{j}\in\{A_{+},A_{-}\} and let Aj{A+,A}A_{-j}\in\{A_{+},A_{-}\}, AjAjA_{-j}\neq A_{j} be the complementary label. The blanket-sensor agents b1,2,,b1,nb_{1,2},\ldots,b_{1,n} affect the output of the circuit-encoders as follows:

If none are active, the 2n2n outputs of C1C_{1} are computed as described in Section 5.1.2.

If jj is odd and b1,jb_{1,j} is active in direction AjA_{j} then the output gates gj,gjg^{\prime}_{j},g^{\prime}_{-j} are both set to the value that causes c-e agent aja_{j} to observe more AjA_{j}.

If jj is even and b1,jb_{1,j} is active in direction AjA_{j} then the output gates gj,gjg^{\prime}_{j},g^{\prime}_{-j} are both set to the value that causes c-e agent aja_{j} to observe more AjA_{-j}.

Rules 2 and 3 override Rule 1, which allocates values that directly encode values output by CVTC_{VT}. Note that the gadgetry of the circuit can ensure that either an excess of A+A_{+} or an excess of AA_{-} can be shown to the corresponding c-e agent as feedback, as the circuit can convert the input value encoded by the value gadget of the blanket-sensor agent in R1R_{1} to either a “right” or “left” output position, depending on the parity of the index. Also, if more than one blanket-sensor agent is active, they all affect their corresponding gates. The reason for requiring blanket-sensors of different parities to feedback different labels to the c-e agents is to be consistent with the definition of “consistent colours”, see Definition 24.

Note that we do not define the behaviour of the blanket-sensor agents in terms of the reference sensor. They essentially look for an imbalance between A+A_{+} and AA_{-} within some interval of length 2, and when they find a sufficiently large imbalance, they force the circuit C1C_{1} to show their associated c-e agent more of the over-represented or under-represented label, depending on their parity, which can be done using gadgetry of .

Consider the operation of moving a cut from near the left-hand side of the c-e region to the right-hand side, which corresponds to two points in the Möbius-simplex that are close to each other via a path through the facets that have been identified according to Definition 17. Suppose also that within the c-e region, we do not change the label of any point. Then the blanket-sensor agents behave the same way: if some blanket-sensor agent sees an excess of A+A_{+} in its interval then it will continue to see an excess of A+A_{+}. Regarding the (non-blanket) sensor agents, our reduction will make them “want” to produce opposite outputs, but due to the flipping of xREFx_{REF}, the reference sensor value, the final output values produced by gjg^{\prime}_{j}, j±[n]j\in\pm[n] are the same, and we will have continuity across this facet.

We next describe how the pCp^{C} circuit-encoders differ from each other. Each CiC_{i} has a set of circuit-encoding agents Ai{\cal A}_{i}, which contains CiC_{i}’s sensor agents Si{\cal S}_{i}. For i[n]i\in[n] let Ai{\cal A}_{i} be the agents ai,1,,ai,pa_{i,1},\ldots,a_{i,p} for some polynomial pp.

For all i,ji,j, μai,j(x)=μa1,j(y)\mu_{a_{i,j}}(x)=\mu_{a_{1,j}}(y) where xx and yy are corresponding points in RiR_{i} and RjR_{j}. By “corresponding points” here we mean points that lie in the same distance from the left-endpoint of the respective intervals RiR_{i} and RjR_{j}; see , Section 4.4.3 for the precise definition.

For all i,ji,j, all xx in the c-e region, μai,j(x)\mu_{a_{i,j}}(x), is specified in Definition 14.

The second of these items says that in the c-e region, the valuation function of the agents that make up CiC_{i} differ from those of C1C_{1} by having been shifted to the right by δtiny(i1)\delta^{\rm tiny}(i-1), where this shift wraps around in the event that we shift beyond nn (the right-hand point of the c-e region). In other respects, CiC_{i} is an exact copy of C1C_{1}, save that CiC_{i}’s internal circuitry lies in RiR_{i} rather than R1R_{1}.

For each CiC_{i}, the c-e agents have a further 2n2n value-blocks of value 1/(2pC)1/(2p^{C}) in region RiR_{i}, whose labels are governed by the outputs produced by CiC_{i} in the same way as for C1C_{1}. Consequently we have the following observation.

2 An alternative coordinate system for the Möbius-simplex

Recall that the Möbius-simplex DD is the nn-simplex consisting of points (x1,,xn+1)(x_{1},\ldots,x_{n+1}) whose components are non-negative and sum to 1. Furthermore, a typical point in DD is directly encoded via the positions of nn cuts in the c-e region.

Here we specify a transformed coordinate system that is needed in order to encode instances of New variant high-D Tucker. We will embed the hypercube-shaped domain of an instance of New variant high-D Tucker in a hypercube in the transformed coordinates, and then use properties of the transformed coordinate system to extend the labelling function to the rest of the domain in a way that does not introduce bogus solutions (i.e. fixpoints of the extended function that lie outside the hypercube and do not encode solutions of New variant high-D Tucker).

Let F0F_{0} be the set of points in DD of the form (0,x2,,xn,0)(0,x_{2},\ldots,x_{n},0); thus F0F_{0} is a (n2)(n-2)-face of DD. See Figure 10. For τ\tau\in, let xτ{\bf x}_{\tau} be the point

(So, x0{\bf x}_{0} and x1{\bf x}_{1} are the endpoints of the 1-dimensional edge of DD that is not contained in F0F_{0}.) Let DτD_{\tau} be the (n1)(n-1)-simplex consisting of convex combinations of F0F_{0}, and xτ{\bf x}_{\tau}. Thus D0D_{0} and D1D_{1} are the two facets of DD that have been identified together as in Definition 17.

DτD_{\tau} contains the point 0τ=(τ/n,1/n,,1/n,(1τ)/n){\bf 0}_{\tau}=(\tau/n,1/n,\ldots,1/n,(1-\tau)/n), which we regard as the origin of DτD_{\tau}. The set of points {0τ:0τ1}\{{\bf 0}_{\tau}:0\leq\tau\leq 1\} will be referred to as the axis; it will transpire that all solutions must lie within an inverse polynomial distance from the axis (in particular will be in the Significant Region).

We then refer to points in DτD_{\tau} by means of the coordinates in a coordinate system that itself is a linear function of τ\tau. With respect to any fixed τ\tau\in we define n1n-1 vectors (d2τ,,dnτ)(d^{\tau}_{2},\ldots,d^{\tau}_{n}) as follows. A key feature is that (d2τ,,dnτ)(d^{\tau}_{2},\ldots,d^{\tau}_{n}) form a basis of DτD_{\tau} (so that with respect to the origin 0τ{\bf 0}_{\tau}, any point in DτD_{\tau} has unique coordinates). The other key feature (Observation 5.3) is that at τ=0\tau=0 the coordinate/directions are “equal and opposite” to the coordinates at τ=1\tau=1. Also, the coordinate system varies suitably smoothly.

As a warm-up we start by considering d2τd^{\tau}_{2}:

d2τd^{\tau}_{2} consists of increasing the second coordinate at the expense of its neighbours. For small τ\tau we increase mainly at the expense of the third coordinate and as τ\tau increases, we increase the second coordinate more at the expense of the first. Notice in particular that at τ=12\tau=\frac{1}{2} we have d2τ=(12,1,12,0,,0)d^{\tau}_{2}=(-\frac{1}{2},1,-\frac{1}{2},0,\ldots,0).

Thus, again this consists of the ii-th coordinate increasing at the expense of its neighbours, and we have in particular

Observation 5.2 makes the important point that by linearity, the vectors diτd^{\tau}_{i}, i=2,,ni=2,\ldots,n, can be used as a coordinate system to refer to points in DτD_{\tau}.

Any point x{\bf x} in DτD_{\tau} can be uniquely expressed as a sum

To see this, note first that D0D_{0} is points in DD of the form (0,x2,,xn+1)(0,x_{2},\ldots,x_{n+1}), and D1D_{1} is points of the form (x1,,xn,0)(x_{1},\ldots,x_{n},0). Note that the observation certainly works for τ=0\tau=0 or τ=1\tau=1. To see that it works for intermediate τ\tau, note that the vectors diτd^{\tau}_{i} are linearly independent, and the reason why they span DτD_{\tau} is that any vector diτd^{\tau}_{i} is equal to (1τ)(1-\tau) multiplied by a vector in D0D_{0}, added to τ\tau multiplied by an equal-length vector in D1D_{1}. So these vectors do indeed lie in DτD_{\tau}.

where (0;α2,,αn)(1;α2,,αn)(0;\alpha_{2},\ldots,\alpha_{n})\equiv(1;-\alpha_{2},\ldots,-\alpha_{n}).

With regard to Definition 22, consider two points x=(0;α2,,αn){\bf x}=(0;\alpha_{2},\ldots,\alpha_{n}), x=(1;α2,,αn){\bf x}^{\prime}=(1;-\alpha_{2},\ldots,-\alpha_{n}), that have been equated with each other. Assume these points are near the axis, specifically αj<1/10n|\alpha_{j}|<1/10n for all jj. Notice that

the cuts in the c-e region for x{\bf x} and x{\bf x}^{\prime} partition the c-e region in the same way.

(with reference to Figure 11) when we move from a point in D1εD_{1-\varepsilon} to a nearby point in DεD_{\varepsilon}, for any j{2,,n}j\in\{2,\ldots,n\}, the direction of increasing αj\alpha_{j} segues smoothly to the direction of decreasing αj\alpha_{j}.

Our assumption that αj<1/10n|\alpha_{j}|<1/10n ensures that cuts are fairly evenly-spaced, and movement in any of the directions djτd^{\tau}_{j} does not cause the cuts to cross each other.

In Subsection 5.2.1, we show that for points near the axis (i.e. within Euclidean distance 1/10n21/10n^{2} of the axis), the computation of the coordinate transformation —and its inverse— have the property that small perturbations of the input values lead to inverse-polynomial upper bounds on the resulting perturbations of the output values. Since we have this for both the transformation and its inverse, it follows that there are also inverse-polynomial lower bounds on the resulting perturbations of the output values.

Note that Proposition 5.4 does not hold for all points in DD; the restriction to a neighbourhood of the axis is needed. For points on F0F_{0}, all values of τ\tau are equivalent, and for points close to F0F_{0}, perturbed versions of them could result in large perturbations of τ\tau.

We show in the next section that the coordinate transformation, and its inverse, can be computed in polynomial time, for points in DD that are within some inverse polynomial distance from the axis.

We verify here that (for points in the vicinity of the axis), our transformation may be performed efficiently, and that small perturbations of inputs lead to small perturbations of the outputs (in either direction). The easy direction is the computation of (x1,,xn+1)(x_{1},\ldots,x_{n+1}) from (τ;α2,,αn)(\tau;\alpha_{2},\ldots,\alpha_{n}). Recall that a point x{\bf x} on the original domain can be expressed in terms of the transformed coordinates (τ;α2,,αn)(\tau;\alpha_{2},\ldots,\alpha_{n}) and the origin 0τ=(τn,1n,,1n,1τn){\bf 0}_{\tau}=(\frac{\tau}{n},\frac{1}{n},\ldots,\frac{1}{n},\frac{1-\tau}{n}) as x=0τ+i=2nαidiτ{\bf x}={\bf 0}_{\tau}+\sum_{i=2}^{n}\alpha_{i}d^{\tau}_{i}. Therefore we have:

In the other direction, given (x1,,xn+1)(x_{1},\ldots,x_{n+1}) we first compute the value of τ\tau for the transformed coordinate system: Note that (x1,,xn+1)(x_{1},\ldots,x_{n+1}) must be a convex combination of xτx_{\tau} and F0F_{0} (where recall that xτ=(τ,0,,0,1τ)x_{\tau}=(\tau,0,\ldots,0,1-\tau) and F0F_{0} is points of the form (0,x2,,xn,0)(0,x_{2},\ldots,x_{n},0)), therefore τ\tau can be computed as the solution to the equation

Note that the dependence of τ\tau on x1x_{1} and xn+1x_{n+1} is not excessively sensitive near the axis, since x1+xn+1x_{1}+x_{n+1} is close to 1/n1/n. Having computed τ\tau\in, we could simply solve the equations above (the ones used to compute x1,,xn+1x_{1},\ldots,x_{n+1}) for α2\alpha_{2}, α3\alpha_{3} and so on successively, using the derived formulas for αi\alpha_{i} for the computation of αi+1\alpha_{i+1} and express each αi\alpha_{i} as a function of only τ\tau and the values x1,x2,xn+1x_{1},x_{2},\ldots x_{n+1}. However, in the extremal cases of τ=0\tau=0 and τ=1\tau=1, some of the αi\alpha_{i} values might “disappear”; for example, for τ=0\tau=0, expressing α3\alpha_{3} in terms of only τ\tau, x1x_{1} and x2x_{2} is not possible, since for τ=0\tau=0 we do not obtain a formula for α2\alpha_{2} to substitute into the equation for x2x_{2}. To remedy this, we consider two cases:

Case 1: τ12\tau\geq\frac{1}{2}. We compute α2,,αn\alpha_{2},\ldots,\alpha_{n} as follows:

and so on for α4,,αn\alpha_{4},\ldots,\alpha_{n}.

Case 2: τ12\tau\leq\frac{1}{2}. We compute α2,,αn\alpha_{2},\ldots,\alpha_{n} starting at the opposite end:

and so on for αn2,,α2\alpha_{n-2},\ldots,\alpha_{2}.

Note that inverse polynomial-size perturbations of the xix_{i} lead to inverse polynomial-size perturbations of the transformed coordinates. As a sanity check, note that at the boundary (points with τ=0\tau=0 are the same as points with τ=1\tau=1), if we move a cut at the LHS to the RHS (so (0,x2,,xn+1)(0,x_{2},\ldots,x_{n+1}) becomes (x2,,xn+1,0)(x_{2},\ldots,x_{n+1},0)), it can be checked that the αi\alpha_{i} get negated.

Note that these computations should be done with a precision (or rounding error) polynomially smaller than δtiny\delta^{\rm tiny}.

This section defines a partial function f:D{1,0,1}nf:D\rightarrow\{-1,0,1\}^{n} (DD being the nn-dimensional Möbius-simplex (Definition 17)). ff is constructed in polynomial time based on an instance IVTI_{VT} of New variant high-D Tucker in nn dimensions, defined using circuit CVTC_{VT}. ff is defined in the Significant Region (Definition 18) which is the set of points where no blanket-sensor agents (Definition 14) are active, thus it includes the twisted tunnel TT (Definition 23). ff is computable by a circuit CC, that is used to define the operations of the circuit-encoders in a derived instance of Consensus-halving. Within the Significant Region, ff determines the outputs of the circuit-encoders.

The function ff maps a point x{\bf x} in the Significant Region to a vector of length nn, ef(x)e_{f}({\bf x}), where ef(x)j=1e_{f}({\bf x})_{j}=1 means that the point receives colour jj, ef(x)j=1e_{f}({\bf x})_{j}=-1 means that the point receives colour j-j and ef(x)j=0e_{f}({\bf x})_{j}=0 means that the point does not receives colour jj or j-j. In general, ef(x)e_{f}({\bf x}) may have multiple non-zero entries. We will use the term the colour of x{\bf x} for points that only receive a single colour (and therefore their outputs are vectors with only one non-zero entry).

The function ff will have a corresponding vector-valued function ff^{\prime} (Section 6.1) that more closely represents choices of labels A+A_{+}/AA_{-} that the circuit shows to the c-e agents. We will do this in such a way that no “bogus” solutions result from the transition to parts of DD where blanket-sensor agents are active. By construction, there are no solutions where blanket-sensor agents are active, so all solutions occur where ff is defined.

In Section 6.1 we then define a vector-valued “Borsuk-Ulam style” function FF in terms of ff. Letting IVTI_{VT} be an instance of New variant high-D Tucker, F(x)F({\bf x}) will be approximately zero iff given x{\bf x}, we can derive an approximate consensus-halving solution to IVTI_{VT}. It will be shown that approximate zeroes of FF provide solutions to IVTI_{VT}.

Recall that DD is the set of points (x1,,xn+1)(x_{1},\ldots,x_{n+1}) whose components are non-negative and sum to 1. And, the “Significant Region” (Definition 18) of DD consists of points (x1,,xn+1)(x_{1},\ldots,x_{n+1}) for which coordinates xix_{i} (2in12\leq i\leq n-1) differ from 1/n1/n by at most an inverse polynomial δw=1/pw(n)\delta^{w}=1/p^{w}(n) (Proposition 4.4). (δw\delta^{w} represents an upper bound on the thickness of the Significant Region.)

Let BB be the nn-dimensional “box” associated with IVTI_{VT} (recall IVTI_{VT} is represented by circuit CVTC_{VT} that maps points in BB to ±[n]\pm[n]). We embed a copy of BB in DD as follows. Recall the way facets of BB are coloured in Definition 8. Let (x1,,xn)(x_{1},\ldots,x_{n}) denote a typical point in BB, and assume that the facets of BB with maximum and minimum x1x_{1} (i.e. x1=1x_{1}=1 and x1=1x_{1}=-1 respectively) are the panchromatic facets of BB (as in Definition 8), and for i2i\geq 2 the facet of BB with maximum xix_{i} (xi=1x_{i}=1) consists of points that do not have colour ii, and the the facet of BB with minimum xix_{i} (xi=1x_{i}=-1) consists of points that do not have colour i-i.

The twisted tunnel TT is defined as follows. The axis of TT is the set of all points 0τ{\bf 0}_{\tau} as defined in Section 5.2. The twisted tunnel is the set of all points with transformed coordinates (τ;α2,,αn)(\tau;\alpha_{2},\ldots,\alpha_{n}) such that for all ii, αi<δT|\alpha_{i}|<\delta^{T}. Note that δT\delta^{T} is an inverse polynomial quantity sufficiently small that TT is a subset of the Significant Region; this is achieved since by definition, δT\delta^{T} is polynomially smaller than δw\delta^{w} of Proposition 4.4. Thus, TT has (with respect to the transformed coordinates) a (n1)(n-1)-cube-shaped intersection with any DτD_{\tau}.

We define the behaviour of ff over the Significant Region (Definition 18) in 3 stages, as follows.

A point x=(x1,,xn){\bf x}=(x_{1},\ldots,x_{n}) in BB is mapped to a point g(x)g({\bf x}) in DD as follows. g(x)g({\bf x}) lies in DτD_{\tau}, where we choose τ=12+δTx1\tau=\frac{1}{2}+\delta^{T}\cdot x_{1}. Then (noting (4)) we set g(x)g({\bf x}) equal to 0τ+i=2nδTxidiτ{\bf 0}_{\tau}+\sum_{i=2}^{n}\delta^{T}\cdot x_{i}d^{\tau}_{i} (i.e. g(x)g({\bf x}) has transformed coordinates (12+δTx1;δTx2,,δTxn)(\frac{1}{2}+\delta^{T}x_{1};\delta^{T}x_{2},\ldots,\delta^{T}x_{n})). g(x)g({\bf x}) will receive a single colour; the colour of g(x)g({\bf x}) — i.e. the non-zero entry of f(g(x))f(g({\bf x})) — is set equal to the colour allocated to x{\bf x} in BB by IVTI_{VT}. (Notice that the centre of BB is mapped to (1/2n,1/n,,1/n,1/2n)(1/2n,1/n,\ldots,1/n,1/2n), which is the origin of D12D_{\frac{1}{2}}, and the centre of the Significant Region. This point has (recalling Definition 22) transformed coordinates (12;0,,0)(\frac{1}{2};0,\ldots,0) where the first entry is the value of τ\tau.)

We also colour other points in TT as follows — these will also receive single colours. Suppose y{\bf y} belongs to DτD_{\tau}, where τ<12δT\tau<\frac{1}{2}-\delta^{T} or τ>12+δT\tau>\frac{1}{2}+\delta^{T}. According to (4), y=0τ+i=2nαidiτ{\bf y}={\bf 0}_{\tau}+\sum_{i=2}^{n}\alpha_{i}d^{\tau}_{i}, and y{\bf y} has transformed coordinates (τ;α2,,αn)(\tau;\alpha_{2},\ldots,\alpha_{n}). Suppose all the αi\alpha_{i} lie in the range [δT,δT][-\delta^{T},\delta^{T}]. Then if τ<12δT\tau<\frac{1}{2}-\delta^{T}, we set the colour of y{\bf y} to the colour of a point y=(12δT;α2,,αn){\bf y}^{\prime}=(\frac{1}{2}-\delta^{T};\alpha_{2},\ldots,\alpha_{n}). Thus yD12δT{\bf y}^{\prime}\in D_{\frac{1}{2}-\delta^{T}}, and the other transformed coordinates (Definition 22) are the same for y{\bf y} and for y{\bf y}^{\prime}. We do a similar thing for points in DτD_{\tau} for τ>12+δT\tau>\frac{1}{2}+\delta^{T}. That is, if y{\bf y} has transformed coordinates (τ;α2,,αn)(\tau;\alpha_{2},\ldots,\alpha_{n}) where τ>12+δT\tau>\frac{1}{2}+\delta^{T} and the αi\alpha_{i} are all at most δT\delta^{T} in absolute value, then y{\bf y} gets the same colour as a point y{\bf y}^{\prime} whose transformed coordinates are (12+δT;α2,,αn)(\frac{1}{2}+\delta^{T};\alpha_{2},\ldots,\alpha_{n}).

The Significant Region (Definition 18) is points in DD where no blanket-sensor agents are active, a subset of points that are close to the axis in the sense of Proposition 4.4. Consider xDT{\bf x}\in D\setminus T with transformed coordinates (τ;α2,,αn)(\tau;\alpha_{2},\ldots,\alpha_{n}).

For each j{2,,n}j\in\{2,\ldots,n\} if αj>δT\alpha_{j}>\delta^{T} then x{\bf x} gets colour j-j;

For each j{2,,n}j\in\{2,\ldots,n\} if αj<δT\alpha_{j}<-\delta^{T} then x{\bf x} gets colour jj;

these are not mutually exclusive, x{\bf x} gets at least one colour, possibly more.

Notice that (within the subspace DτD_{\tau}) the side(s) of the twisted tunnel TT closest to x{\bf x} is guaranteed not to be opposite to any colour of x{\bf x}.

For a subset SS of colours, let R(S)R(S) be the region with colours in SS. We call these the “outer regions”.

Proposition 6.7 notes that when colour-regions meet each other at opposite ends of TT (which have been identified with each other according to the definition of the Significant Region), they will have equal and opposite colours.

4 How to compute a solution to New variant high-D Tucker from a solution to Consensus-halving

Suppose we have a solution SCHS_{CH} to an instance ICHI_{CH} of Consensus-halving, derived by our reduction from an instance IVTI_{VT} of New variant high-D Tucker.

Let x{\bf x} be the point in the Möbius-simplex represented by the c-e cuts of SCHS_{CH}. Proposition 4.4 already tells us that x{\bf x} must lie within some inverse-polynomial distance of the axis, since if not, some blanket-sensor agent will be active. We prove in Section 6 that x{\bf x} must lie within, or very close to, the twisted tunnel.

From this we identify two colour-regions that have equal and opposite colours, as follows. Let x{\bf x} have transformed coordinates (τ;α2,,αn)(\tau;\alpha_{2},\ldots,\alpha_{n}), which can be computed with inverse-polynomial precision from the c-e cuts.

If x{\bf x} occurs within the embedded copy of BB (Section 5.3) then identify two circuit-encoders CiC_{i} and CiC_{i^{\prime}} that both receive reliable inputs and have equal and opposite outputs. The proof of Proposition 6.4 tells us that this is always possible.

If x{\bf x} lies in TT but not in the embedded copy of BB, then we similarly find two distinct colour-regions that are adjacent and with opposite colours. Since these colour-regions are just extensions of the cubelets that lie on the panchromatic facets of the embedded instance of New variant high-D Tucker, we are done. Formally, if SCHS_{CH} represents a point with transformed coordinates (τ;α2,,αn)(\tau;\alpha_{2},\ldots,\alpha_{n}) where τ>12+δT\tau>\frac{1}{2}+\delta^{T}, we take the point (12+δT;α2,,αn)(\frac{1}{2}+\delta^{T};\alpha_{2},\ldots,\alpha_{n}), and similarly, if SCHS_{CH} represents a point with transformed coordinates (τ;α2,,αn)(\tau;\alpha_{2},\ldots,\alpha_{n}) where τ<12δT\tau<\frac{1}{2}-\delta^{T}, we take the point (12δT;α2,,αn)(\frac{1}{2}-\delta^{T};\alpha_{2},\ldots,\alpha_{n}).

Completing the Proof of Theorem 2.1

Let IVTI_{VT} be an instance of New variant high-D Tucker in nn dimensions, given in terms of circuit CVTC_{VT}; suppose Consensus-halving instance ICHI_{CH} is derived from it by our reduction. Recall that ε=δtiny/10\varepsilon=\delta^{\rm tiny}/10. We show that any ε\varepsilon-approximate solution SCHS_{CH} to ICHI_{CH} allows a solution to IVTI_{VT} to be recovered using Section 5.4.

In SCHS_{CH}, only the c-e cuts may lie in the c-e region (Observation 4.1), and every other cut c(a)c(a) for aa not a c-e agent, must lie in some interval outside the c-e region (in order for aa’s value to be evenly split). It follows from Proposition 4.4 that at least n1n-1 c-e cuts must lie in the c-e region, and they are evenly-spaced (the gaps between them differ from 1 by an inverse polynomial). The remaining cut may occur elsewhere, in which case it becomes what we called a “stray cut” in , and in that case, the “double negative lemma” of may be applied to prove that it has little effect on the quality of a solution: it causes a single circuit-encoder to have unreliable output.

Most or all of the points are in the twisted tunnel. In that case it will be proved that the procedure of Section 5.4 identifies a solution to New variant high-D Tucker; see Sections 6.1, 6.2.

Most or all are in the outer regions. In that case we are not at a solution since the colours cannot cancel each other.

Some of the points are outside the Significant Region. In that case we are far from the twisted tunnel, and Section 6.3 argues that no solution is possible here.

All points are outside the Significant Region. Then, we are certainly far from a solution since various blanket-sensor agents will be active, and the points are so close together (within δtiny\delta^{\rm tiny}) that the directions in which they are active, cannot cancel.

Recall that DD denotes the nn-dimensional Möbius-simplex (Definition 17). We start by defining a function f:Dnf^{\prime}:D\rightarrow^{n} based on ff defined as in Section 5.3. ff^{\prime} simulates the effect of the blanket-sensor agents as described in Section 5.1.3. Let xD{\bf x}\in D.

When no blanket-sensor agents are active at x{\bf x}. Here we are in the Significant Region.

In TT, ff^{\prime} behaves like ff in the sense that if ff assigns the colour ii to x{\bf x} (recall that it assigns a single colour to points in TT), then set f(x):=eif^{\prime}({\bf x}):={\bf e}_{i} if i>0i>0, f(x):=eif^{\prime}({\bf x}):=-{\bf e}_{|i|} for i<0i<0.

Outside TT, in outer region R(S)R(S), f(x):=iS,i>0ei+iS,i<0eif^{\prime}({\bf x}):=\sum_{i\in S,i>0}{\bf e}_{i}+\sum_{i\in S,i<0}-{\bf e}_{|i|}.

When one or more blanket-sensor agents are active. If the jj-th blanket-sensor agent of C1C_{1}, b1,jb_{1,j} is active towards A+A_{+} (respectively AA_{-}) then the jj-th entry of f(x)f^{\prime}({\bf x}) is set to 11 if jj is odd and to 1-1 if jj is even (respectively, to 1-1 if jj is odd and 11 is jj is even). This is done for all active blanket-sensor agents, thus f(x)f^{\prime}({\bf x}) can contain multiple 1’s and 1-1’s.

The following points are similar to Observation 4.3:

Suppose that circuit-encoder CiC_{i} (some i[pC]i\in[p^{C}]) of ICHI_{CH} receives reliable inputs. (Observation 4.2 tells us that at most nn of them fail to receive reliable inputs.) Then CiC_{i} computes ff^{\prime} at a point within distance δtiny\delta^{\rm tiny} from the xD{\bf x}\in D encoded by the c-e cuts, in the sense that the value observed by each c-e agent aja_{j} that is labelled by A+A_{+}, minus the amount labelled AA_{-}, restricted to that part of aja_{j}’s value that lies in RiR_{i} and so is governed by the output of CiC_{i}, is the jj-th component of ff^{\prime}.

This follows from the construction of Section 5.1.2 and the association of boolean values \mbox\sctrue,\mbox\scfalse\mbox{{\sc true}},\mbox{{\sc false}} with the labels A+A_{+} and AA_{-}. For xD{\bf x}\in D, F(x)F({\bf x}) is the average of the outputs of the CiC_{i}; Proposition 6.2 provides the details.

ICHI_{CH} computes a function FF in the following sense. Let x{\bf x} be the point encoded by the c-e agents. Suppose all agents other than the c-e agents have error (i.e. discrepancy between A+A_{+} and AA_{-} that they observe) at most ε\varepsilon. Then the error of the c-e agents is within additive distance 1/n1/n from the average value of ff^{\prime}, averaged over a set of points all within δtiny\delta^{\rm tiny} of x{\bf x}.

We put together various observations about the way ICHI_{CH} is constructed. Observation 4.3 told us that the values observed by the c-e agents are the average of a set of points all within distance δtiny\delta^{\rm tiny} of each other. The additive distance 1/n1/n results from the existence of up to nn circuit-encoders that either fail to receive good inputs (Observation 4.2), or are affected by the stray cut, taken in conjunction with the fact that we average over pCp^{C} points, where pCp^{C} can be taken to be at least 2n22n^{2}.

δtiny\delta^{\rm tiny} can be chosen to be sufficiently small (but still inverse-polynomial) that given a set of pCp^{C} points x1,,xpCD{\bf x}_{1},\ldots,{\bf x}_{p^{C}}\in D within distance δtiny\delta^{\rm tiny} of each other, when we compute their transformed coordinates y1,,ypC{\bf y}_{1},\ldots,{\bf y}_{p^{C}}, we have:

Every pair of points y,y{y1,,ypC}{\bf y},{\bf y}^{\prime}\in\{{\bf y}_{1},\ldots,{\bf y}_{p^{C}}\} has the property that they either lie in the same colour-region, or adjacent colour-regions (where a “colour-region” is one of monochromatic regions of Section 5.3), or one of the outer regions.

In identifying which colour-region a point with transformed coordinates y{\bf y} belongs to, for the colour-regions in the twisted tunnel TT, we compare coordinates with certain threshold values. These threshold values differ from each other by inverse-polynomial amounts, and the smallest difference between any pair of them is inverse-polynomial. Applying Proposition 5.4, we can keep the yi{\bf y}_{i} closer to each other than this. (Colour-regions lie in the Significant Region, so these points lie within 1/10n21/10n^{2} of the axis, so Proposition 5.4 is applicable.)

This applies also to the outer regions R(S)R(S) for sets of colours SS. Identifying which R(S)R(S) a point y{\bf y} belongs to, uses the same information on comparisons of its coordinates with inverse-polynomials.

We call FF a Borsuk-Ulam-style function — The suffix “style” is to note that we define a kind of function that has desirable properties similar to those of a Borsuk-Ulam function, but for example the domain of the function is DD as opposed to a sphere. Also, the function FF is “approximately Lipschitz” rather then truly continuous, which is good enough for our purposes.

F(x)ε|F({\bf x})|\leq\varepsilon (here, F|F| denotes the LL_{\infty} or “maximum” norm of FF) iff x{\bf x} encodes an approximate Consensus-halving solution. Regarding this point, FF is not simulating a Borsuk-Ulam function, but rather simulating a function consisting of the difference between the values taken by a Borsuk-Ulam function, at two antipodal points.

2 Encoding the output of F𝐹F with a Consensus-halving solution

Let SCHS_{CH} be an ε\varepsilon-approximate solution to ICHI_{CH}. Suppose that the c-e cuts of SCHS_{CH} represent a point x{\bf x} that lies in the twisted tunnel. Then we can reconstruct a solution to IVTI_{VT} in polynomial time.

Recall that SCHS_{CH}, ICHI_{CH} and IVTI_{VT} and ε\varepsilon are as introduced at the start of Section 6.

Observation 6.1 tells us that if a circuit-encoder CiC_{i} receives reliable inputs, it outputs the colour of a point in the Möbius-simplex that lies within δtiny\delta^{\rm tiny} of x{\bf x}.

We note next that the feedback received by the c-e agents in ICHI_{CH} corresponds to the average (over i[pC]i\in[p^{C}]) of the feedback received by the individual circuit-encoders CiC_{i}. In detail, the ii-th coordinate of a typical point in B=nB=^{n} is obtained by taking c-e agent aia_{i}, and (given any attempt at a consensus-halving solution SS) subtracting aia_{i}’s value for the parts of the consensus-halving domain labelled AA_{-} according to SS, from those labelled A+A_{+}. The resulting point is at the centre (or origin) of BB iff the c-e agents have balanced allocations of A+A_{+} and AA_{-} (as required for a consensus-halving solution), and more generally, a point in n^{n} is close to the centre of BB iff the c-e agents have approximately balanced allocations of A+A_{+} and AA_{-}.

Observation 4.2 tells us that at most nn circuit-encoders fail to receive reliable input. If all circuit-encoders received reliable input, then the total error at a solution would be at most ε\varepsilon, i.e. the precision parameter of the ICHI_{CH} instance. However, since at most nn of them receive unreliable input, we might have an added discrepancy of at most n/pCn/p^{C} when taking the average and therefore we need to get within distance ε+2npC\varepsilon+\frac{2n}{p^{C}} of the centre of BB. For this to be possible, we need some of the cancelling to take place amongst the outputs of the circuit-encoders that received reliable inputs, so we really can find a pair of correctly oppositely-coloured points.

Proposition 6.3 tell us that if xD{\bf x}\in D is the point in DD represented by some Consensus-halving solution, then provided x{\bf x} lies in the twisted tunnel, the corresponding cluster of pCp^{C} points must be mapped by the circuit-encoders CiC_{i} that mostly cancel each other out, so we find pairs of points that belong to oppositely-labelled colour-regions, from which Section 5.4 tells us how to recover two oppositely-coloured cubelets of IVTI_{VT}.

It remains to rule out the possibility of x{\bf x} occurring outside the twisted tunnel.

3 No bogus approximate-zeroes of F𝐹F at boundary of Significant Region

Proposition 6.4 tells us that ε\varepsilon-approximate zeroes of FF (inputs for which FF has value in [ε,ε]n[-\varepsilon,\varepsilon]^{n}) within the twisted tunnel TT must encode solutions. Around TT, there are outer regions R(S)R(S); note that if iSi\in S then i∉S-i\not\in S and moreover there is an inverse-polynomial lower bound on the distance between any pair of points belonging to outer regions containing opposite colours. But we have to rule out points with colour jSj\in S being averaged with nearby points that are “coloured” j-j due to a blanket-sensor agent. In more detail, if xR(S){\bf x}\in R(S) and x{\bf x} is within δtiny\delta^{\rm tiny} of x{\bf x}^{\prime} for which the jj-th blanket-sensor agent is active and provides feedback corresponding to j-j, then we will prove that SS contains some other colour kjk\not=j and no point in a δtiny\delta^{\rm tiny}-neighbourhood of x{\bf x} activates the kk-th blanket-sensor b1,kb_{1,k} to provide feedback corresponding to k-k.

In the following, we will refer to cuts in the following manner: “cut ii” refers to the ii-th cut (from left to right) in the c-e region. Also, recall that the width δT\delta^{T} of the twisted tunnel is smaller than any other inverse-polynomial quantities of interest, apart from δtiny\delta^{\rm tiny}, which itself is smaller than all other inverse-polynomials of interest, including δT\delta^{T}. We provide the following definition of a consistent colour.

For x=(τ;α2,,αn){\bf x}=(\tau;\alpha_{2},\ldots,\alpha_{n}) in the Significant Region, colour j{±2,,±n}j\in\{\pm 2,\ldots,\pm n\}, let Aj{A+,A}A_{j}\in\{A_{+},A_{-}\} be the label that tends to increase in interval [j2,j][j-2,j] when the j|j|-th coordinate αj\alpha_{j} of x{\bf x} is increased if j>0j>0, or decreased if j<0j<0. (AjA_{j} depends on the sign and parity of jj.) We say that x{\bf x} has consistent colour jj if

if j>0j>0 then αj>2δT\alpha_{j}>2\delta^{T}; if j<0j<0 then αj<2δT\alpha_{|j|}<-2\delta^{T};

at least 12plarge2phuge\frac{1}{2}-\frac{p^{\rm large}}{2p^{\rm huge}} of the interval [j2,j][j-2,j] gets the label AjA_{j}.

Condition 1 says that at xR(S){\bf x}\in R(S), colour jj is a member of SS and the corresponding transformed coordinate is sufficiently far from the twisted tunnel. Condition 2 says that we are at least some (small but significant) distance from triggering the jj-th blanket-sensor in a direction that corresponds to excessive colour j-j. In other words, for the jj-th blanket-sensor to become active in direction j-j, we would have to increase AjA_{-j} by an inverse-polynomial amount.

The following proposition establishes that for points in the outer regions R(S)R(S), consistent colours exist.

Suppose x=(τ;α2,,αn){\bf x}=(\tau;\alpha_{2},\ldots,\alpha_{n}) belongs to outer region R(S)R(S) and that x{\bf x} is within distance δtiny\delta^{\rm tiny} of the boundary of the Significant Region. Then x{\bf x} has a consistent colour in {±2,,±n}\{\pm 2,\ldots,\pm n\}.

Case 1: j>0j>0 (i.e., αj>0\alpha_{j}>0). In this case, moving in direction djτd^{\tau}_{j} causes cuts j1j-1 and jj to move away from each other; this is illustrated in Figure 13.

We claim that jj is a consistent colour for x{\bf x}. Note first that αj>2δT\alpha_{j}>2\delta^{T} and therefore Condition 1 is satisfied, since j>0j>0 in this case. αj>2δT\alpha_{j}>2\delta^{T} follows from the fact that jargmaxi{2,,n}αij\in\arg\max_{i\in\{2,\ldots,n\}}|\alpha_{i}|, we are close to the boundary of the Significant Region, and the width of Significant Region is polynomially larger than that of the twisted tunnel. In order to be close to the boundary of the Significant Region, we must have moved more than 2δT2\delta^{T} in some direction from 0τ\bf{0}_{\tau} and by the choice of jj, it holds that αj>2δT\alpha_{j}>2\delta^{T}.

For Condition 2, recall first that x{\bf x} has transformed coordinates (τ;α2,,αn)(\tau;\alpha_{2},\ldots,\alpha_{n}), and that the origin of DτD_{\tau} has transformed coordinates 0τ=(τ;0,,0){\bf 0}_{\tau}=(\tau;0,\ldots,0). The (j1)(j-1)-st and jj-th cuts corresponding to the point 0τ{\bf 0}_{\tau} are located at positions j2+τj-2+\tau and j1+τj-1+\tau respectively, and are shown in red in Figure 13. Near the axis, where the cuts are evenly-spaced (see Proposition 4.4), movement in direction djτd^{\tau}_{j} corresponds to moving the (j1)(j-1)-st and jj-th cuts (in the c-e region) away from each other. We will consider moving from 0τ{\bf 0}_{\tau} to x{\bf x} via a point xj{\bf x}_{j} in which we will only have increased the transformed coordinate αj\alpha_{j}.

First, consider moving from 0τ{\bf 0}_{\tau} to point xj=(τ;0,,0,αj,0,,0){\bf x}_{j}=(\tau;0,\ldots,0,\alpha_{j},0,\ldots,0) for αj>0\alpha_{j}>0. In this process, we move the (j1)(j-1)-st cut to the left by αjτ\alpha_{j}\cdot\tau and the jj-th cut to the right by αj(1τ)\alpha_{j}\cdot(1-\tau); all this takes place within the interval [j2,j][j-2,j], see Figure 13. Now consider moving from xj{\bf x}_{j} to x{\bf x}. In this process, the (j1)(j-1)-st cut moves to the right by αj1(1τ)\alpha_{j-1}\cdot(1-\tau) and the (j+1)(j+1)-st cut moves to the left by αj+1τ\alpha_{j+1}\cdot\tau. From the choice of jj to be jargmaxi{2,,n}αij\in\arg\max_{i\in\{2,\ldots,n\}}|\alpha_{i}|, it follows that αj1αj\alpha_{j-1}\leq\alpha_{j} and αj+1αj\alpha_{j+1}\leq\alpha_{j}. Then, there is a sub-interval of [j2,j][j-2,j] that contains the unit-length interval I=[j2+τ+αj(12τ),j1+τ+αj(12τ)]I=[j-2+\tau+\alpha_{j}\cdot(1-2\tau),j-1+\tau+\alpha_{j}\cdot(1-2\tau)] which ends up coloured entirely AjA_{j}, implying Condition 2. Overall, we obtain that jj is a consistent colour.

Case 2: j<0j<0 (i.e., αj<0\alpha_{j}<0). In this case, moving in direction djτd^{\tau}_{j} causes cuts j1|j|-1 and j|j| to move towards each other; this is illustrated in Figure 14.

Case 2a: τ[1/2n,1(1/2n)]\tau\in[1/2n,1-(1/2n)]. In this case, all movements of the cuts, in and around the Significant Region, are in distances upper-bounded by δw\delta^{w}, which by Proposition 4.4 is smaller than 1/2n1/2n by an inverse-polynomial amount. This means that if we start at 0τ{\bf 0}_{\tau} and re-set individual transformed coordinates to those of x{\bf x}, in any order (i.e. going through any intermediate point xj{\bf x}_{j}, similarly to above), the movement of the cuts will never force them to cross integer-valued thresholds. In other words, in moving from 0τ{\bf 0}_{\tau} to x{\bf x}, only the relevant cuts j1j-1 and jj will lie in the interval [j2,j][j-2,j]. This case can be seen in the illustration of Figure 13, if one reverses the direction of the arrows, switches the labels AjA_{j} to AjA_{-j} and vice-versa, and substitutes jj by j|j| in the labelling of cuts. The argument establishing the existence of a consistent colour is exactly symmetric to that of Case 1 above.

Case 2b: τ[0,1/2n][1(1/2n),1]\tau\in[0,1/2n]\cup[1-(1/2n),1]. Here, we consider the case where τ[0,1/2n]\tau\in[0,1/2n]; the other case is similar by symmetry. This case is illustrated in Figure 14; note that the sequence of labels Aj/AjA_{j}/A_{-j} is switched to make AjA_{j} the label that increases when we move in direction djτd^{\tau}_{j}.

Moving in direction djτd^{\tau}_{j} causes an increase of the label AjA_{j} in the interval [j2,j][|j|-2,|j|]. For jj not to be a consistent colour, we should observe an excess of the label AjA_{-j} in this interval. In generating cut locations from coordinates of x{\bf x}, the amount of AjA_{-j} in [j2,j][|j|-2,|j|] can be raised in the following ways (see Figure 14):

By increasing the transformed coordinate αj1\alpha_{|j|-1} in the negative direction, moving in direction d(j1)τd^{\tau}_{-(|j|-1)}. This causes cuts j2|j|-2 and j1|j|-1 to move towards each other and therefore importantly for us here, cut j1|j|-1 to move to the left (towards integer point j2|j|-2).

By increasing the transformed coordinate αj+1\alpha_{j+1} in the negative direction, moving in direction d(j+1)τd^{\tau}_{-(|j|+1)}. This causes cuts j|j| and j+1|j|+1 to move towards each other.

Note that what may happen in this last case, is that cut j+1|j|+1 which used to lie to the right of the integer point j+2|j|+2 before moving in direction d(j+1)τd^{\tau}_{-(|j|+1)}, now lies to the left of the integer point j+2|j|+2 after the movement, therefore increasing the label AjA_{-j} at the right-hand-side of [j2,j][|j|-2,|j|]. We consider two more cases, depending on whether or not this is the case.

Case 2b(i): At x{\bf x}, cut j+1|j|+1 is to the right of location j|j| or at location j|j|.

There are two ways to restore the deficit of AjA_{-j} that resulted from moving in direction djτd^{\tau}_{j} from 0τ{\bf 0}_{\tau} to xj{\bf x}_{j}. Moving in direction d(j1)τd^{\tau}_{-(|j|-1)} moves cut j1|j|-1 to the left, and moving in direction d(j+1)τd^{\tau}_{-(|j|+1)} moves cut j|j| to the right. (Note that the movement of cut j+1|j|+1 to the left has not changed the balance of AjA_{j} and AjA_{-j} in the interval [j2,j][|j|-2,|j|] any further, by the assumption of the case). Since jj was chosen to be in argmaxi{2,,n}αi\arg\max_{i\in\{2,\ldots,n\}}|\alpha_{i}|, it is easy to verify that

Cut j1|j|-1 has moved to the left as a result of moving in direction d(j1)τd^{\tau}_{-(|j|-1)} at most as much as cut j|j| has moved to the left as a result of moving in direction djτd^{\tau}_{j} (from 0τ{\bf 0}_{\tau} to xj{\bf x}_{j}).

Cut j|j| has moved to the right as a result of moving in direction d(j+1)τd^{\tau}_{-(|j|+1)} at most as much as cut j1|j|-1 has moved to the right as a result of moving in direction djτd^{\tau}_{j} (from 0τ{\bf 0}_{\tau} to xj{\bf x}_{j}).

Therefore a large enough subinterval of [j2,j][|j|-2,|j|] has been coloured with AjA_{j}, which means jj is a consistent colour.

Case 2b(ii): At x{\bf x}, cut j+1|j|+1 is to the left of location j|j|.

In this case, we have αj+1<0\alpha_{|j|+1}<0 and movement in direction d(j+1)τd^{\tau}_{-(|j|+1)} causes cuts j|j| and j+1|j|+1 to move towards each other. Note also that besides the effect of the movement in direction d(j+1)τd^{\tau}_{-(|j|+1)}, cut j+1|j|+1 may move to the left due to movement in direction dj+2τd^{\tau}_{|j|+2}, since such a movement would cause cuts j+1|j|+1 and j+2|j|+2 to move away from each other and therefore, cut j+1|j|+1 to move to the left. However, the distance moved in direction dj+2τd^{\tau}_{|j|+2} is small; it is at most ταj\tau\cdot|\alpha_{j}|, which is at most τδ+\tau\cdot\delta^{+}. Therefore, we need movement at least τ(1δ+)\tau(1-\delta^{+}) in direction d(j+1)τd^{\tau}_{-(|j|+1)} in order to cover the distance moved in direction djτd^{\tau}_{j}.

First, we verify that (j+1)-(|j|+1) satisfies Condition 1 of Definition 24, i.e. that αj+1<2δT\alpha_{|j|+1}<-2\delta^{T} (at this point we know that αj+1\alpha_{j+1} is a negative quantity). We consider two cases, depending on whether τ\tau is “small” or “large” (relatively to the small interval [0,1/2n][0,1/2n]).

In the case when τ<14αj\tau<\frac{1}{4}|\alpha_{j}|, the largest part of the deficit of AjA_{-j} introduced by moving from 0τ{\bf 0}_{\tau} to xj{\bf x}_{j} results from moving cut j|j| to the left. However, letting c(j1)c(|j|-1) denote the position of cut j1|j|-1 after this movement, the interval [j2,c(j1)][|j|-2,c(|j|-1)] is too small for the movement of cut j1|j|-1 in direction d(j1)τd^{\tau}_{-(|j|-1)} to compensate. In other words, even if movement in direction d(j1)τd^{\tau}_{-(|j|-1)} moves cut j1|j|-1 to the left endpoint of the interval [j2,j][|j|-2,|j|], this is not enough to make up for the deficit of AjA_{-j} introduced from the movement in direction djτd^{\tau}_{j}. This means that cut j+1|j|+1 needs to move to the left as well and in particular, it needs to move by more than τ/4\tau/4 to the left of location jj. This is only possible if αj+1<2δT\alpha_{|j|+1}<-2\delta^{T}.

In the case when τ14αj\tau\geq\frac{1}{4}|\alpha_{j}|, since τ\tau is large enough, cut j+1|j|+1 needs to move a substantial distance to the left, in order to end up positioned to the left of integer position j|j|. In particular, it needs to move at least 14αjτδ+\frac{1}{4}|\alpha_{j}|-\tau\cdot\delta^{+} to the left. This implies that Condition 1 is satisfied for colour (j+1)-(|j|+1).

Now consider what needs to happen in order for the second condition to fail. Consider the interval [j1,j+1][|j|-1,|j|+1] (which is monitored by the (j+1)(j+1)-st blanket-sensor). Since cut j+1|j|+1 is located to the left of location j|j| (the midpoint of this interval), there exists a subinterval of length at most 11 labelled AjA_{j}, within [j1,j+1][|j|-1,|j|+1]. This means that either

the colour (j+1)-(|j|+1) is a consistent colour and we are done, or

there is an additional amount of label AjA_{j} within interval [j1,j+1][|j|-1,|j|+1] and the total number of value-blocks labelled AjA_{j} outnumbers that of those labelled AjA_{-j} by at least plargep^{\rm large}. The only way this can happen is if cut j+2|j|+2 lies to the left of the integer location j+1|j|+1, and in fact, it has to lie an inverse-polynomial distance, at least plarge2phuge\frac{p^{\rm large}}{2p^{\rm huge}}, to the left of j+1|j|+1.

In case that happens, we move on to consider interval [j,j+2][j,j+2] and we apply the same argument. Again, αj+2\alpha_{j+2} is negative, and since cut j+2|j|+2 is to the left of location j+1j+1 by a margin plarge2phuge<2δT\frac{p^{\rm large}}{2p^{\rm huge}}<2\delta^{T}, (j+2)-(|j|+2) satisfies Condition 1 to be a consistent colour. It will also satisfy Condition 2, unless cut j+3|j|+3 lies to the left of location j+2|j|+2 by an inverse-polynomial distance, at least plarge2phuge\frac{p^{\rm large}}{2p^{\rm huge}}, similarly to before.

Continuing like this, we will either find a consistent colour in some interval [j2,j][j-2,j] with j<nj<n, or we will reach interval [n2,n][n-2,n]. When we reach interval [n2,n][n-2,n], cut nn has had to move to the left of integer location n1n-1 in order to prevent (n1)-(n-1) from being a consistent colour (as otherwise we would have identified a consistent colour in some already examined interval). But then n-n is a consistent colour, since we have moved an inverse-polynomial distance (at least plarge2phuge<2δT\frac{p^{\rm large}}{2p^{\rm huge}}<2\delta^{T}) in direction dnτd^{\tau}_{-n} (Condition 1), and at least 1/21/2 of the interval [n2,n][n-2,n] is coloured in a way that agrees with this (Condition 2).

A solution SCHS_{CH} to ICHI_{CH} cannot encode a point x{\bf x} in the Significant Region, within distance δtiny\delta^{\rm tiny} of the boundary (where blanket-sensor agent(s) become active).

Observation 6.1 tells us that the kk-th component of ff^{\prime} is the difference between A+A_{+} and AA_{-} observed by c-e agent aka_{k}, and Proposition 6.2 tells us that all these components, averaged over a set of points within δtiny\delta^{\rm tiny} of x{\bf x} need to be close to zero, at a solution.

Proposition 6.5 tells us that x{\bf x} has some consistent colour kk. All points within δtiny\delta^{\rm tiny} of x{\bf x} cause two outputs (gates gkg^{\prime}_{k}, gkg^{\prime}_{-k} as defined in Section 5.1.2) of the circuit-encoders to represent colour kk. This includes points where blanket-sensor agents are active, since by the properties of consistent colours, we are at least an inverse-polynomial distance from any point where any bi,kb_{i,k} can be active in the wrong direction. In Section 5.1.3 the blanket-sensor agents are designed to agree with the definition of consistent colour, Definition 24. So c-e agent aka_{k} observes a large imbalance between A+A_{+} and AA_{-}.

4 No bogus approximate-zeroes of F𝐹F due to the connecting facet

Let x{\bf x}, x{\bf x}^{\prime} be points in the Significant Region having transformed coordinates (τ;α2,,αn)(\tau;\alpha_{2},\ldots,\alpha_{n}) and (1τ;α2,,αn)(1-\tau;-\alpha_{2},\ldots,-\alpha_{n}) respectively, for τ<12δT\tau<\frac{1}{2}-\delta^{T}. Then f(x)=f(x)f({\bf x})=-f({\bf x}^{\prime}).

The proposition extends Observation 5.3. The points x{\bf x} and x{\bf x}^{\prime} have been coloured according to Item 2 of Section 5.3, and they belong to two long thin colour-regions that extend the cubelets that lie on the panchromatic facets of the cube embedded at the centre of TT, all the way to the ends of TT. From the boundary conditions on the colouring of box BB in New variant high-D Tucker, and the way ff is constructed above, their colours are equal and opposite.

Remark: The x{\bf x}, x{\bf x}^{\prime} in Proposition 6.7 will “approach each other” as τ0\tau\rightarrow 0. That is, they correspond to sequences of Consensus-halving cuts where the left-hand cut in the c-e region “wraps around” to the right-hand side of the c-e region. Proposition 6.7 may thus seem to create Borsuk-Ulam directions that are in conflict with each other as we cross from facet D0D_{0} to D1D_{1}, but in fact the flip of labels in Consensus-halving that occurs when we move from D0D_{0} to D1D_{1} will mean that they are in agreement with each other.

We consider the case where the set of pCp^{C} points in DD represented by the solution SCHS_{CH} to ICHI_{CH}, contains points on opposite sides of the facets of DD that have been identified with each other. Proposition 6.7 tells us that colour-regions are adjacent to colour-regions having the opposite colour. We need to verify that for a pair x,x{\bf x},{\bf x}^{\prime} of points that are close together but have opposite colours (due to lying in such a pair of colour-regions) the same (and not opposite) feedback is provided to the c-e agents. (So, in contrast with a pair of opposite-colour points that represent a solution, whose feedback to the c-e agents cancel each other out.)

In reasoning about these elements x,xD{\bf x},{\bf x}^{\prime}\in D, it is helpful to depart from our convention that the label-sequence begins with A+A_{+}, and suppose that for x{\bf x}^{\prime}, the label-sequence begins with AA_{-}. Suppose x,x{\bf x},{\bf x}^{\prime} have corresponding circuit-encoders Ci,CiC_{i},C_{i^{\prime}} and assume that CiC_{i} and CiC_{i^{\prime}} receive reliable inputs, recalling that only nn circuit-encoders may fail to receive reliable inputs. Notice that if x{\bf x} causes a blanket-sensor agent bi,jb_{i,j} to be active in direction A+A_{+}, then x{\bf x}^{\prime} typically causes bi,jb_{i^{\prime},j} to be active in direction A+A_{+} also (the over-represented label is fed back to c-e agent aja_{j}).

In the case that no blanket-sensor agents are active, if x,x{\bf x},{\bf x}^{\prime} receive opposite colours from Ci,CiC_{i},C_{i^{\prime}}, then, reverting to our convention that the shared label-sequence begins with A+A_{+}, we note that their reference-sensor agents get opposite labels, which causes CiC_{i} and CiC_{i^{\prime}} to agree with each other.

Remark. For intuition, it is possibly helpful to think about the move from x{\bf x} to x{\bf x}^{\prime} in terms of operations on the coordinate-encoding cuts. At x{\bf x}, there is a cut on the right-hand side of the c-e region, and in moving to x{\bf x}^{\prime} we move that cut to the left-hand side. If we move the cut while leaving the labels of the c-e region unchanged (apart from at the ends) we expect the circuit to behave as before, but since we have switched the roles of labels A+A_{+} and AA_{-}, the feedback to agents a1,,ana_{1},\ldots,a_{n} gets inverted. We re-invert this feedback by reversing the colour, and hence the output of ff^{\prime}. This is very similar (in fact, an nn-dimensional analogue) to the handling of the “wrap-around points” in the sequence via the interpretation in terms of the virtual cuts in .

Further work

What is the computational complexity of kk-thief Necklace-splitting, for kk not a power of 2? As discussed in , the proof that it is a total search problem, does not seem to boil down to the PPA principle. Right now, we do not even even know if it belongs to PTFNP .

Interestingly, Papadimitriou in (implicitly) also defined a number of computational complexity classes related to PPA, namely PPA-pp, for a parameter p2p\geq 2. PPA-pp is defined with respect to an input bipartite graph and a given vertex with degree which is not a multiple of pp, and the goal is to find another vertex with degree which is not a multiple of pp (it follows that PPA=PPA-22). This was done in the context of classifying the computational problem related to Chévalley’s Theorem from number theory, and it was proven that for prime pp, Chevally mod pp is in PPA-pp . Given the discussion above, it could possibly be the case that the the principle associated with Necklace-splitting for kk-thieves is the PPA-kk principle instead.

What about the computational hardness of the problem? Is 3-thief Necklace-splitting hard for PPA? At first glance, it seems like a more complicated problem, but there this is not obvious; for example, there is no way to cause the third thief to be a dummy agent and therefore a straightforward reduction is unlikely. However, it is worth mentioning here that the computational equivalence between ε\varepsilon-Consensus-halving and Necklace-splitting that was proven in is actually established between the Necklace Splitting problem for any kk and the corresponding approximate 1/k1/k-Division problem, a generalization of ε\varepsilon-Consensus-halving (see ); a PPA-hardness or PPA-membership result for k>2k>2 for the latter problem would imply a corresponding result for Necklace-splitting with k>2k>2. En route to either of these results, a possible approach that was suggested in , would be to define an appropriate generalisation of Tucker’s Lemma and prove it constructively (see Section 8 in ).

We have left open the questions of whether ε\varepsilon-Consensus-halving remains PPA-complete for constant ε\varepsilon, and whether Discrete Ham Sandwich remains PPA-complete when coordinates of points are given in unary. Recall that for the former problem, a PPAD-hardness result is known from ; it would be quite interesting to settle this, to verify whether it is possible for the precision parameter to play such an important role in the problem classification.

In classifying a problem as polynomial-time solvable versus NP-complete, this is usually seen as a statement about its computational (in)tractability. The distinction between PPAD-completeness and PPA-completeness is one of expressive power: we believe that PPAD-complete problems are hard, meanwhile PPA-complete problems are “at least as hard”, but of course are still in NP. The expressive power of totality principles that underpin TFNP problems is a topic of enduring interest ; note also the related work on Bounded Arithmetic discussed in . Our results highlight the distinction between computational (in)tractability and expressive power. In analysing the relationships between these complexity classes, it may be fruitful to focus on expressive power.

Finally, initiates an interesting experimental study of path-following algorithms for 2-thief Necklace-splitting, obtaining positive results when the number of bead colours is not too large. However, path-following seems to be inapplicable for, say, 3 thieves. The Necklace-splitting problem may constitute an interesting class of challenge-instances for SAT-solvers, now that it is known to be a very hard total search problem.

We thank Alex Hollender for detailed and insightful proof-reading of earlier versions of this paper.

References

APPENDIX

Appendix A Details from [29].

In this section, we include several details from that are being used (or extended) in the present paper as well.

First, the ability to detect the position of the cuts in the c-e region and feed this information to the circuit lies in the presence of gadgets developed in and used in , referred to as “bit detection gadgets” in . A bit detection gadget consists typically of two thin and dense valuation blocks of relatively large height and relatively small length, situated next to each other (e.g., see the rightmost set of value-blocks in Figure 16 or Figure 20, top). These value-blocks constitute most of the agent’s valuation over the related interval. The point of these gadgets is that if the discrepancy between A+A_{+} and AA_{-} is (significantly) in the favour of one against the other, there will be a cut intersecting one of the two valuation blocks; which block is intersected will correspond to a 0/10/1 value, i.e. a bit that indicates the “direction” of the discrepancy in the two labels.

These gadgets are used in several parts of the reduction, e.g.,

at the value-blocks of the sensor agents (Definition 13) in the region RiR_{i} (see Figure 16) that assumes the right or left position depending on whether their small value-blocks in the c-e region are labelled A+A_{+}or AA_{-},

in the encoding of the circuit CVTC_{VT} of New variant high-D Tucker using the circuit-encoding agents CiC_{i} (also see Section 5.1),

at the value-blocks of the blanket-sensor agents (Definition 14) in the region RiR_{i}, with the difference that in that case, a small value-block (of value 9κ/109\kappa/10) lies between the two thin and dense valuation blocks of the bit detection gadget (see Figure 17).

Sensor and blanket sensor agents

The formal definitions of the sensor agents and the blanket sensor agents were given in the main text, see Definition 13 and Definition 14 respectively. Here, we explain in more detail how these agents make use of the bit-detection gadgets (which lie in the region RiR_{i}) to detect the positions of the cuts (for the sensor agents) or to detect large discrepancies on the volumes of the two labels in the c-e region (for the blanket sensor agents).

Starting from the sensor agents, recall that each such agent of SiCiS_{i}\subset C_{i} has a small value-block (of value 1/101/10) in the c-e region and its remaining value (9/109/10) lies in the circuit encoding region and particularly, in the sub-region RiR_{i} (recall that the circuit-encoding region RR is partitioned into sub-regions RiR_{i}, one for each circuit encoder CiC_{i}, where most of the gadgetry of the encoder lies). In particular, the sensor agent has two thin blocks of value 9/209/20 in the c-e region and this is precisely the bit detection gadget of the agent, as described in the previous subsection - see Figure 20, top, for an illustration. If the value-block on the left-hand side (in the c-e region) is labelled AA_{-}, then the cut on the right-hand side intersects the rightmost value-block (i.e., “jumps” to the right) and if it is A+A_{+}, then it “jumps” to the left. This information is then passed on to the next level of circuit encoding agents, those that implement the pre-processing unit of the circuit (see Section 5.1.1) and the subsection following this one. In , we referred to this information as “raw data” (although the cut extraction mechanism there had to be more elaborate, due to the inversely-exponential precision). The pre-processing unit is responsible for converting the raw data into appropriate inputs for the circuit CVTC_{VT}, which encode coordinate of points on the Möbius-simplex. These inputs are then “propagated” through the encoding of the circuit CVTC_{VT}, to produce the appropriate labels at the output gates gjg_{j}, j±[n]j\in\pm[n], as described in th following subsection.

The blanket sensor agents use very similar bit detection gadgets in their outputs (i.e., in their value-blocks in region RiR_{i}), but between their thin and dense value-blocks, they have an additional small value-blocks (the block of value 9κ/109\kappa/10 in Definition 14). This is because the blanket sensor agents needs to be able to assume three states: “excess of A+A_{+}”, “excess of AA_{-}” and “(approximately) balanced labels”. The latter option corresponds to the cut associated with the blanket sensor agent intersecting the middle value-block (therefore not “jumping” to either side), whereas the other two options correspond to the cut “jumping” to either the right or the left side, where the choice depends on the over-represented label and the parity of the index of the blanket sensor agent. It is straightforward (as before) to interpret these positions as boolean values. The main idea in is that if the blanket sensor agents are active, then this information will “over-ride” the circuit CVTC_{VT} and generate an imbalance of labels in the feedback provided to the c-e agents directly, essentially bypassing the output gates of CVTC_{VT}. We use the same principle here, but since we now have many blanket sensor agents, extra care must be taken to make sure that no artificial solutions are introduced when colouring the domain. The details on how the input from the blanket-sensors affects the feedback to the c-e agents are presented in Section 5.1.3.

Before we explain how the circuit of New variant high-D Tucker is encoded using the circuit encoders CiC_{i}, we present the main building blocks for simulating boolean circuits based on bit-detection gadget, first presented in and later adapted and used in .

Consider a boolean gate that is an AND, an OR, or a NOT gate, denoted gg_{\land}, gg_{\lor} and g¬g_{\neg} respectively. Let in1in_{1}, in2in_{2} and outout be intervals such that in1=in2=out=1|in_{1}|=|in_{2}|=|out|=1. We will encode these gates using gate gadgets, shown in Figure 18 (from ).

Note that the gadget corresponding to the NOT gate only has one input, whereas the gadgets for the AND and OR gates have two inputs. In the interval outout, each gadget has two bit detection gadgets - in the case of the NOT gate these are even, but in the case of the AND and OR gates, they are uneven. Also note that for the inputs, as well as the output of the NOT gate, the label on the left-hand side of the cut is A+A_{+} and the label on the right-hand side will be AA_{-}, whereas for the outputs to the OR and AND gate, the label on the left-hand side of the cut is AA_{-} and the label on the right hand side is A+A_{+} . This can be achieved with the appropriate use of parity gadgets, i.e., simple valuation blocks that force cuts to lie in specific positions, only to change the parity of the cut sequence (see for more details).

As we mentioned earlier, the bit-detection gagdets allow us to extract boolean values corresponding to the positions of the cuts in the c-e region - this is achieved via the use of the sensor agents. In the next step, these values (referred earlier as the “raw data”) are supplied into a gadget that is referred to as the pre-processing circuit in . The role of this circuit is to convert this information to coordinates of points on the Möbius-simplex, which then will go through a coordinate transformation (see Section 5.2) and will be used as inputs to the encoding of CVTC_{VT}. The pre-processing circuit can be implemented using the boolean gate gadgets described above. The same is true for the actual circuit CVTC_{VT} as well, which can be simulated using the same set of gadgets, using the principle described in Figure 19 (from ). The outputs of the pre-preprocessing circuit are inputed to the input gates of CVTC_{VT} and their outputs, in turn, are inputted to the gates on the next level and this process carries on until the output gates of CVTC_{VT}.

Unreliable circuits and the stray cut

In the main text, we mentioned that we are guaranteed that at most nn cuts will lie in the c-e region (Observation 4.1) and at least n1n-1 cuts will lie in the c-e region, as otherwise some blanket sensor agent would be active (Proposition 4.4). If n1n-1 cuts lie in the c-e region, this means that one of the nn c-e cuts has moved into the circuit encoding region; following , we will refer to that cut as a stray cut. As we highlighted in , the presence of a stray cut may have the following two consequences.

It intersects the circuit-encoding region RiR_{i} of some circuit encoder CiC_{i}, for i{1,,pC}i\in\{1,\ldots,p^{C}\}.

It flips the parity of the circuit encoders CiC_{i}, with Ri<cR_{i}<c, where cc is the position of the stray cut in Ri1R_{i-1}. In other words, if the first cut in RiR_{i} was expecting to see A+A_{+} on its left-hand side, it now sees AA_{-} and vice-versa.

The first effect is not much of a problem, we will simply assume that the corresponding circuit is unreliable. As explained in the main text, an unreliable circuit can have an arbitrary (or even adversarial) effect on the volume of A+A_{+}and AA_{-}supplied as feedback to the c-e agents, but since there will be at most 11 unreliable circuit, its effect will be “outvoted” by the remaining reliable circuits (circuits that do not receive reliable inputs will also be outvoted by those that do, since there are at most nn of them, see Observation 4.2).

The parity flip that the stray cut induces on other circuit-encoders however seems potentially more worrisome., since it could flip the output of the sensor agents that detect the positions of the cuts. This is taken care of by the presence of the reference sensor agent (Definition 21) which achieves the disorientation of the domain. As explained in Section 5.1.1, the outputs of circuit CiC_{i} are taken with reference to the value s1,1s_{1,1} of the reference senso agent, in the sense that after simulating CVTC_{VT} we take the exclusive-or with s1,1s_{1,1}; this can be implemented in the circuit using the boolean gate gadgets explained above, in a manner similar to (via the use of the XOR sub-circuit, see Section 4.4.2 in ). This allows us to use a similar “double-negative lemma” as the one we used in (see Lemma 5.4).

Appendix B Membership in PPA

We show that Discrete Ham Sandwich belongs to PPA, which appears to be folklore that has not, to out knowledge, been written down. (In fact, in the problem was claimed to belong to PPAD, which is now seen to be incorrect subject to \mboxPPAD\mboxPPA\mbox{{PPAD}}\not=\mbox{{PPA}}, by the result in ). Since Theorem 2.3 reduces from 2-thief Necklace-splitting to Discrete Ham Sandwich, it follows immediately that 2-thief Necklace-splitting belongs to PPA. In Section B.2 we go a bit further for Necklace-splitting: we show that Necklace-splitting belongs to PPA whenever the number of thieves is a power of 2.

We use Freund and Todd’s construction of an undirected graph with known degree-1 vertex, based on a “special triangulation” of a high-dimensional L1L_{1} ball (i.e. a high-dimensional octahedron, also known as the crosspolytope ). Given an instance II of Discrete Ham Sandwich, we show how to construct a suitable triangulation. II contains nn sets of points {S1,,Sn}\{S_{1},\ldots,S_{n}\} in nn-dimensional space; we assume coordinates are represented as fractions whose numerators and denominators are give via standard binary expansions. (Recall that we leave it as an open problem whether Discrete Ham Sandwich remains PPA-hard for points presented in unary. Of course, the “in PPA” result follows immediately for that restricted version.)

Based on II we identify an exponentially-large collection of “candidate hyperplanes” as follows, which contains a solution to II. A candidate hyperplane HH is represented by a gradient vector gHg_{H} whose coordinates are assumed to be normalised to 1 (L1L_{1} norm; their absolute values sum to 1). Given gHg_{H}, HH is obtained by using binary search to efficiently find a hyperplane with gradient gHg_{H} that bisects the union of the sets SiS_{i}; it is then easy to check whether HH is a solution. Note that there exists an integer NN whose binary expansion has length polynomial in I|I| such that entries of some solution gHg_{H} can be assumed to be multiples of 1/N1/N.

Fix a point p\mboxIRnp\in\mbox{{IR}}^{n} such that pp is not contained in any candidate solution to II, e.g. p=(1/2N,1,,1)p=(1/2N,1,\ldots,1). Given any HH, the “positive” side of HH is the side that contains pp. Assume we chose NN large enough so that if two hyperplanes HH and HH^{\prime} have gradients gHg_{H} and gHg_{H^{\prime}} whose coordinates differ by at most 1/N1/N, then for any point xx in II where they disagree, xx lies in HH or HH^{\prime} (and does not lie strictly on the positive side of one and the negative side of the other).

For any HH, label it as follows. Find the set SiS_{i} that is most unevenly split by HH breaking ties lexicographically. Label HH with ii if most of SiS_{i} lies on the positive side, otherwise label HH with i-i. Each hyperplane HH has a gHg_{H} that is a point on the L1L_{1}-norm sphere Sn1S^{n-1}. By construction, antipodal points receive opposite labels. We can use these as the vertices of a special triangulation of the octahedral ball BnB^{n}, in which the origin is used as an additional point and is connected to all the points on Sn1S^{n-1} that correspond to candidate solutions. The graph defined in is a degree 2 undirected graph with a single known degree-1 vertex (the origin), and for which any other degree-1 vertex represents a pair of hyperplanes that bisect all the SiS_{i}.

B.2 Necklace-splitting is in PPA, if the number of thieves is a power of 2

The version of the problem with two thieves, 2-thief Necklace-splitting, belongs to PPA since we reduced it to Discrete Ham Sandwich which is shown in Section B.1 to belong to PPA. We can extend the membership to PPA for kk-Necklace-splitting, when kk is a power of 22, by using the argument of Proposition 3.2 of Alon (here, we take both kk and ll to be 22). In particular, we will reduce Necklace-splitting to 44-Necklace-splitting.

We start from an instance of 44-Necklace-splitting (with 44 thieves) and we regard it as an instance of Necklace-splitting (with 22 thieves), which we solve using an algorithm for the latter problem. The solution is a sequence of intervals defined by the endpoints of the necklace and nn cuts, each belonging to one of the two collections (corresponding to the two thieves), such that each collection contains exactly half of the beads of each colour. Then, we set the beads that lie in intervals belonging to each collection aside and form two new instances of Necklace-splitting (essentially by “gluing” the different sub-intervals of the same collection together); note that each new instance will have an even number of beads of each colour, since we initial number of beads from each colour was a multiple of 44. Then we run the algorithm again on the resulting instances of Necklace-splitting to obtain a partition into 44 collections (22 for each individual instance), which consitutes a partition of the 44-Necklace-splitting into 44 collections according to the definition of the problem. If nn is the number of colours, the total number of cuts is (at most) 3n3n, and therefore this partition is a solution to 44-Necklace-splitting.

The above is a Turing reduction, which can be extended straightforwardly to the case of kk a power of 2. We can convert such a reduction into a many-one reduction by applying Theorem 6.1 of Buss and Johnson , which shows that PPA and some related complexity classes are closed under Turing reductions.