On the Geometry of Differential Privacy

Moritz Hardt, Kunal Talwar

Introduction

The problem of Privacy-preserving data analysis has attracted a lot of attention in recent years. Several databases, e.g. those held by the Census Bureau, contain private data provided by individuals, and protecting the privacy of those individuals is an important concern. Differential Privacy is a rigorous notion of privacy that allows statistical analysis of sensitive data while providing strong privacy guarantees even in the presence of an adversary armed with arbitrary auxiliary information. We refer the reader to the survey of Dwork [Dwo08] and the references therein for further motivation and background information.

We consider the following general setting: A database is represented by a vector xnx\in\Re^{n}. The queries that the analyst may ask are linear combinations of the entries of xx. More precisely, a multidimensional query is a map F ⁣:ndF\colon\Re^{n}\rightarrow\Re^{d}, and we will restrict ourselves to linear maps FF with coefficients in the interval $.Thus. ThusFisais ad\times nmatrixwithentriesinmatrix with entries in.Inthiswork,weassumethroughoutthat. In this work, we assume throughout thatd\leqslant n.Amechanismisarandomizedalgorithmwhichholdsadatabase. A mechanism is a randomized algorithm which holds a databasex\in\Re^{n},receivesaquery, receives a queryF\colon\Re^{n}\to\Re^{d}andanswerswithsomeand answers with somea\in\Re^{d}.Informally,wesayamechanismsatisfiesdifferentialprivacyinthissettingifthedensitiesoftheoutputdistributionsoninputsInformally, we say a mechanism satisfies differential privacy in this setting if the densities of the output distributions on inputsx,x^{\prime}\in\Re^{n}withwith\|x-x^{\prime}\|_{1}\leqslant 1arepointwisewithinanare point wise within an\exp(\varepsilon)multiplicativefactorofeachother.Hereandinthefollowing,multiplicative factor of each other. Here and in the following,\varepsilon>0isaparameterthatmeasuresthestrengthoftheprivacyguarantee(smalleris a parameter that measures the strength of the privacy guarantee (smaller\varepsilonbeingastrongerguarantee).TheerrorofamechanismistheexpectedEuclideandistancebetweenthecorrectanswerbeing a stronger guarantee). The error of a mechanism is the expected Euclidean distance between the correct answerFxandtheactualanswerand the actual answera.$

In this work, we use methods from convex geometry to determine a nearly optimal trade-off between privacy and error. We will see a lower bound on how much error any differentially private mechanism must add. And we present a mechanism whose error nearly matches this lower bound.

As mentioned, the above setup is fairly general. To illustrate it and facilitate comparison with previous work, we will describe some specific instantiations below.

Suppose we have a database y[n]Ny\in[n]^{N}, containing private information about NN individuals. We can think of each individual as belonging to one of nn types. The database yy can then naturally be translated to a histogram xnx\in\Re^{n}, i.e., xix_{i} counts the number of individuals of type ii. Note that in the definition of differential privacy, we require the mechanism to be defined for all xnx\in\Re^{n} and demand that the output distributions be close whenever xx11\|x-x^{\prime}\|_{1}\leqslant 1. This is a stronger requirement than asserting this property only for integer vectors xx and xx^{\prime}. It only makes our upper bounds stronger. For the lower bounds, this strengthening allows us to ignore the discretization issues that would arise in the usual definition. However, our lower bounds can be extended for the usual definition for small enough ε\varepsilon and large enough NN (see Appendix B). Now, our upper bound holds for any linear query on the histogram. This includes some well-studied and natural classes of queries. For instance, contingency tables (see, e.g., [BCD+07]) are linear queries on the histogram.

Private bits.

In the setting looked at by Dinur and Nissim [DN03], the database y{0,1}Ny\in\{0,1\}^{N} consists of one private bit for each individual and each query ask for the number of 11’s amongst a (random) subset on [N][N]. Given dd such queries, one can define n2dn\leqslant 2^{d} types of individuals, depending on the subset of the queries that ask about an individual. The vector yy then maps to a histogram xx in the natural way with xix_{i} denoting the number of individuals of type ii with their private bit set to 11. Our results then imply a lower bound of Ω(d/ε)\Omega(d/\varepsilon) per answer for any ε\varepsilon-differentially private mechanism. This improves on the Ω(d)\Omega(\sqrt{d}) bound for d=Nd=N from [DN03] for a weaker privacy definition (blatant non-privacy). A closely related rephrasing is to imagine each individual having dd private {0,1}\{0,1\} attributes so that n=2dn=2^{d}. The dd queries that ask for the 11-way marginals of the input naturally map to a matrix FF and Theorem 1.1 implies a lower bound of Ω(d/ε)\Omega(d/\varepsilon) noise per marginal for such queries.

One can also look at xx itself as a database where each individuals private data is in $;inthissettingthedimensionofthedata; in this setting the dimension of the datanequalsthenumberofindividualsequals the number of individualsN$. Our results lead to better upper bounds for this setting.

1 Our results

Recall, the term error refers to the expected Euclidean distance between the output of the mechanism and the correct answer to the query FF.

As it turns out, when FF is a random Bernoulli ±1\pm 1 matrix our upper bound matches the lower bound up to constant factors. In this case, KK is a random polytope and its volume and average Euclidean norm have been determined rather recently. Specifically, we apply a volume lower bound of Litvak et al. [LPRN05], and an upper bound on the average Euclidean norm due to Klartag and Kozma [KK09]. Quantitatively, we obtain the following theorem.

Let ε>0\varepsilon>0 and dn/2.d\leqslant n/2. Then, for almost all matrices F{1,1}d×nF\in\{-1,1\}^{d\times n},

any ε\varepsilon-differentially private mechanism MM has error Ω(d/ε)min{d,log(n/d)}\Omega(d/\varepsilon)\cdot\min\{\sqrt{d},\sqrt{\log(n/d)}\}.

the KK-norm mechanism is ε\varepsilon-differentially private with error O(d/ε)min{d,log(n/d)}.O(d/\varepsilon)\cdot\min\{\sqrt{d},\sqrt{\log(n/d)}\}.

We remark that Litvak et al. also give an explicit construction of a mapping FF realizing the lower bound.

Notice that the bound in the previous theorem differs from the lower bound by a factor of LK.L_{K}. A central conjecture in convex geometry, sometimes referred to as the “Hyperplane Conjecture” or “Slicing Conjecture” (see [KK09] for further information) states that LK=O(1).L_{K}=O(1).

Unfortunately, in general the polytope KK could be very far from isotropic. In this case, both our volume-based lower bound and the KK-norm mechanism can be quite far from optimal. We give a recursive variant of our mechanism and a natural generalization of our volume-based lower bound which are nearly optimal even if KK is non-isotropic.

While we restricted our theorems to Fd×nF\in^{d\times n}, they apply more generally to any linear mapping F.F.

Our mechanism is an instantiation of the exponential mechanism and involves sampling random points from rather general high-dimensional convex bodies. This is why our mechanism is not efficient as it is. However, we can use rapidly mixing geometric random walks for the sampling step. These random walks turn out to approach the uniform distribution in a metric that is strong enough for our purposes. It will follow that both of our mechanisms can be implemented in polynomial time.

The mechanisms given in Theorem 1.2 and Theorem 1.5 can be implemented in time polynomial in n,1/εn,1/\varepsilon such that the stated error bound remains the same up to constant factors, and the mechanism achieves ε\varepsilon-differential privacy.

2 Previous Work

Queries of the kind described above have (total) sensitivity dd, and hence the work of Dwork et al. [DMNS06] shows that adding Laplace noise with parameter d/εd/\varepsilon to each entry of FxFx ensures ε\varepsilon-differential privacy. Moreover, adding Laplace noise to the histogram xx itself leads to another private mechanism. Thus such questions can be answered with noise min(d/ε,n/ε,N)\min(d/\varepsilon,\sqrt{n}/\varepsilon,N) per entry of FxFx. Some specific classes of queries can be answered with smaller error. Nissim, Raskhodnikova and Smith [NRS07] show that one can add noise proportional to a smoothed version of the local sensitivity of the query, which can be much smaller than the global sensitivity for some non-linear queries. Blum, Ligett and Roth [BLR08] show that it is possible to release approximate counts for all concepts in a concept class CC on {0,1}m\{0,1\}^{m} with error O((N2mVCDim(C)/ε)13)O((N^{2}m{\rm VCDim}(C)/\varepsilon)^{\frac{1}{3}}), where VCDim(C){\rm VCDim}(C) is the VC dimension of the concept class. Their bounds are incomparable to ours, and in particular their improvements over the Laplacian mechanism kick in when the number of queries is larger than the size of the database (a range of parameters we do not consider). Feldman et al. [FFKN09] construct private core sets for the kk-median problem, enabling approximate computation of the kk-median cost of any set of kk facilities in d\Re^{d}. Private mechanisms with small error, for other classes of queries have also been studied in several other works, see e.g. [BDMN05, BCD+07, MT07, CM08, GLM+10].

3 Overview and organization of the paper

In this section we will give a broad overview of our proof and outline the remainder of the paper.

Section 2 contains some preliminary facts and definitions. Specifically, we describe a linear program that defines the optimal mechanism for any set of queries. This linear program (also studied in [GRS09] for the one-dimensional case) is exponential in size, but in principle, given any query and error function, can be used to compute the best mechanism for the given set of queries. Moreover, dual solutions to this linear program can be used to prove lower bounds on the error. However, the asymptotic behavior of the optimum value of these programs for multi-dimensional queries was not understood prior to this work. Our lower bounds can be reinterpreted as dual solutions to the linear program. The upper bounds give near optimal primal solutions. Also, our results lead to a polynomial-time approximation algorithm for the optimum when FF is linear.

Indeed, using several results from convex geometry, we observe that our lower and upper bounds match up to constant factors when FF is drawn at random from {1,1}d×n\{-1,1\}^{d\times n}. As it turns out the polytope KK can be interpreted as the symmetric convex hull of the row vectors of F.F. When FF is a random matrix, KK is a well-studied random polytope. Some recent results on random polytopes give us suitable lower bounds on the volume and upper bounds on the average Euclidean norm. More generally, our bounds are tight whenever KK is in isotropic position (as pointed out in Section 6). This condition intuitively gives a relation between volume and average distance from the origin. Our bounds are actually only tight up to a factor of LK,L_{K}, the isotropic constant of K.K. A well-known conjecture from convex geometry, known as the Hyperplane Conjecture or Slicing Conjecture, implies that LK=O(1).L_{K}=O(1).

The problem is that when FF is not drawn at random, KK could be very far from isotropic. In this case, the KK-norm mechanism by itself might actually perform poorly. We thus give a recursive variant of the KK-norm mechanism in Section 7 which can handle non-isotropic bodies. Our approach is based on analyzing the covariance matrix of KK in order to partition KK into parts on which our earlier mechanism performs well. Assuming the Hyperplane conjecture, we derive bounds on the error of our mechanism that are optimal to within polylogarithmic factors.

Some complications arise, since we need to repeat the privacy and optimality analysis of our mechanisms in the presence of approximation errors (such as an approximate covariance matrix and an approximate separation oracle for KK). The details can be found in Section 8.

We would like to thank Frank McSherry, Aaron Roth, Katrina Ligett, Indraneel Mukherjee, Nikhil Srivastava for several useful discussions, and Adam Smith for discussions and comments on a previous version of the paper.

Preliminaries

1 Differential Privacy

A mechanism MM is a family of probability measures M={μx ⁣:xn}M=\{\mu_{x}\colon x\in\Re^{n}\} where each measure μx\mu_{x} is defined on d\Re^{d}. A mechanism is called ε\varepsilon-differentially private, if for all x,ynx,y\in\Re^{n} such that xy11\|x-y\|_{1}\leqslant 1, we have supSdμx(S)μy(S)exp(ε),\sup_{S\subseteq\Re^{d}}\frac{\mu_{x}(S)}{\mu_{y}(S)}\leqslant\exp(\varepsilon), where the supremum runs over all measurable subsets SdS\subseteq\Re^{d}.

A common weakening of ε\varepsilon-differential privacy is the following notion of approximate privacy.

A mechanism is called δ\delta-approximate ε\varepsilon-differentially private, if for all x,ynx,y\in\Re^{n} such that μx(S)exp(ε)μy(S)+δ\mu_{x}(S)\leqslant\exp(\varepsilon)\mu_{y}(S)+\delta for all measurable subsets SnS\subseteq\Re^{n} wheneverxy11\|x-y\|_{1}\leqslant 1,

The definition of privacy is transitive in the following sense.

If MM is an ε\varepsilon-differentially private mechanism and x,ynx,y\in\Re^{n} satisfy xy1k\|x-y\|_{1}\leqslant k, then for measurable SdS\subseteq\Re^{d} we have μx(S)μy(S)exp(εk).\frac{\mu_{x}(S)}{\mu_{y}(S)}\leqslant\exp(\varepsilon k).

We will consider mappings FF which possess the Lipschitz property, supxB1nFx1d.\sup_{x\in B_{1}^{n}}\|Fx\|_{1}\leqslant d. In this case, we will say that FF has sensitivity dd.

Our goal is to show trade-offs between privacy and error. The following standard upper bound, usually called the Laplacian mechanism, is known.

When it comes to approximate privacy, the so-called Gaussian mechanism provides the following guarantee.

2 Isotropic Position

We say a convex body KdK\subseteq\Re^{d} is in isotropic position with isotropic constant LKL_{K} if for every unit vector vdv\in\Re^{d},

For every convex body KdK\subseteq\Re^{d}, there is a volume-preserving linear transformation TT such that TKTK is in isotropic position.

For an arbitrary convex body KK, its isotropic constant LKL_{K} can then be defined to be LTKL_{TK} where TT brings LL to isotropic position. It is known (e.g. [MP89]) that TT is unique up to an orthogonal transformation and thus this is well-defined.

We refer the reader to the paper of Milman and Pajor [MP89], as well as the extensive survey of Giannopoulos [Gia03] for a proof of this fact and other facts regarding the isotropic constant.

3 Gamma Distribution

4 Linear Programming Characterization

A mechanism is specified by a distribution μx\mu_{x} on R\mathcal{R} for every xDx\in\mathcal{D}. Assume for simplicity that D\mathcal{D} and R\mathcal{R} are both finite. Thus a mechanism is fully defined by real numbers μ(x,a)\mu(x,a), where μ(x,a)\mu(x,a) is the probability that the mechanism outputs answer aRa\in\mathcal{R} on databases xDx\in\mathcal{D}. The constraints on μ\mu for an ε\varepsilon-differentially private mechanism are given by

The expected error (under any given prior over databases) is then a linear function of the variables μ(x,a)\mu(x,a) and can be optimized. Similarly, the worse case (over databases) expected error can be minimized, and we will concentrate on this measure for the rest of the paper. However these linear programs can be prohibitive in size. Moreover, it is not a priori clear how one can use this formulation to understand the asymptotic behavior of the error of the optimum mechanism.

Our work leads to a constant approximation to the optimum of this linear program when FF is a random in {1,+1}d×n\{-1,+1\}^{d\times n} and an O(log3/2d)O(\log^{3/2}d)-approximation otherwise.

Lower bounds via volume estimates

In this section we show that lower bounds on the volume of the convex body FB1ndFB_{1}^{n}\subseteq\Re^{d} give rise to lower bounds on the error that any private mechanism must have with respect to FF.

A set of points YdY\subseteq\Re^{d} is called a rr-packing if yy2r\|y-y^{\prime}\|_{2}\geqslant r for any y,yY,yyy,y^{\prime}\in Y,y\neq y^{\prime}.

We will now assume that M={μx ⁣:xn}M=\{\mu_{x}\colon x\in\Re^{n}\} is an ε\varepsilon-differentially private mechanism with error cddR/εcd\sqrt{d}R/\varepsilon and lead this to a contradiction for small enough c>0c>0. For this we set λ=d/2ε\lambda=d/2\varepsilon. By the assumption on the error, Markov’s inequality implies that for all xXx\in X, we have μx(Bx)12,\mu_{x}(B_{x})\geqslant\tfrac{1}{2}, where BxB_{x} is a ball of radius 2cddR/ε=4cλRd2cd\sqrt{d}R/\varepsilon=4c\lambda R\sqrt{d} centered at FxFx. Since Y=FXY=FX is an Ω(λRd)\Omega(\lambda R\sqrt{d})-packing, the balls {Bx:xX}\{B_{x}:x\in X\} are disjoint for small enough constant c>0c>0.

Since x1λ\|x\|_{1}\leqslant\lambda, it follows from ε\varepsilon-differential privacy with Fact 2.3 that

Since the balls BxB_{x} are pairwise disjoint,

for d2d\geqslant 2. We have thus obtained a contradiction. ∎

Let ε>0\varepsilon>0 and suppose F ⁣:ndF\colon\Re^{n}\to\Re^{d} is a linear map and let K=FB1nK=FB_{1}^{n}. Furthermore, let PP denote the orthogonal projection operator of a kk-dimensional subspace of d\Re^{d} for some 1kd.1\leqslant k\leqslant d. Then, every ε\varepsilon-differentially private mechanism MM must have

Note that a differentially private answer aa to FF can be projected down to a (differentially private) answer PaPa to PFPF and PP is norm 11 operator. ∎

where the supremum is taken over all kk and all kk-dimensional orthogonal projections PP.

Our lower bound used the fact that the mechanism is defined on all vectors xdx\in\Re^{d}. In Appendix B, we show how the lower bound can be extended when restricting the domain of the mechanism to integer vectors x[N]n,x\in[N]^{n}, where distance is measured in the Hamming metric.

1 Lower bounds for small number of queries

As shown previously, the task of proving lower bounds on the error of private mechanisms reduces to analyzing the volume of FB1n.FB_{1}^{n}. When dlognd\leqslant\log n this is a straightforward task.

This lower bound shows that the standard upper bound from Theorem 2.6 is, in fact, optimal when dlog(n).d\leqslant\log(n).

The K𝐾K-norm mechanism

In this section we describe a new differentially private mechanism, which we call the KK-norm mechanism.

Given a linear map F ⁣:ndF\colon\Re^{n}\to\Re^{d} and ε>0\varepsilon>0, we let K=FB1nK=FB_{1}^{n} and define the mechanism KM(F,d,ε)={μx ⁣:xn}{\bf KM}(F,d,\varepsilon)=\{\mu_{x}\colon x\in\Re^{n}\} so that each measure μx\mu_{x} is given by the probability density function

defined over d.\Re^{d}. Here ZZ denotes the normalization constant

A more concrete view of the mechanism is provided by Figure 2 and justified in the next remark.

We can sample from the distribution μx\mu_{x} as follows:

Indeed, if aFxK=R\|a-Fx\|_{K}=R, then the distribution of aa as above follows the probability density function

which is in agreement with (5). That is, g(a)=f(a).g(a)=f(a).

The next theorem shows that the KK-norm mechanism is indeed differentially private. Moreover, we can express its error in terms of the expected distance from the origin of a random point in K.K.

When p=1p=1, Γ(d+1+p)Γ(d+1)=d+1.\frac{\Gamma(d+1+p)}{\Gamma(d+1)}=d+1.

Privacy follows from the fact that the mechanism is a special case of the exponential mechanism [MT07]. For completeness, we repeat the argument.

Suppose that x11\|x\|_{1}\leqslant 1. It suffices to show that for all ada\in\Re^{d}, the densities of μ0\mu_{0} and μx\mu_{x} are within multiplicative exp(ε)\exp(\varepsilon), i.e.,

where in the first inequality we used the triangle inequality for K\|\cdot\|_{K}. In the second step we used that xB1nx\in B_{1}^{n} and hence FxFB1n=KFx\in FB_{1}^{n}=K which means FxK1.\|Fx\|_{K}\leqslant 1.

Hence, the mechanism satisfies ε\varepsilon-differential privacy. ∎

Matching bounds for random queries

In this section, we will show that our upper bound matches our lower bound when FF is a random query. A key observation is that FB1nFB_{1}^{n} is the symmetric convex hull of nn (random) points {v1,,vn}d\{v_{1},\dots,v_{n}\}\subseteq\Re^{d}, i.e., the convex hull of {±v1,,±vn}\{\pm v_{1},\dots,\pm v_{n}\}, where vidv_{i}\in\Re^{d} is the iith column of FF. The symmetric convex hull of random points has been studied extensively in the theory of random polytopes. A recent result of Litvak, Pajor, Rudelson and Tomczak-Jaegermann [LPRN05] gives the following lower bound on the volume of the convex hull.

Let 2dn2d2d\leqslant n\leqslant 2^{d} and let FF denote a random d×nd\times n Bernoulli matrix. Then,

with probability 1exp(Ω(dβn1β))1-\exp(-\Omega(d^{\beta}n^{1-\beta})) for any β(0,12).\beta\in(0,\frac{1}{2}). Furthermore, there is an explicit construction of nn points in {1,1}d\{-1,1\}^{d} whose convex hull achieves the same volume.

We are mostly interested in the range where ndlogdn\gg d\log d in which case the theorem was already proved by Giannopoulos and Hartzoulaki [GH02] (up to a weaker bound in the probability and without the explicit construction).

The bound in (7) is tight up to constant factors. A well known result [BF88] shows that the volume of the convex hull of any nn points on the sphere in d\Re^{d} of radius d\sqrt{d} is bounded by

Notice, that in our case K=FB1nBddB2dK=FB_{1}^{n}\subseteq B_{\infty}^{d}\subseteq\sqrt{d}B_{2}^{d} and in fact the vertices of KK are points on the (d1)(d-1)-dimensional sphere of radius d\sqrt{d}. However, equation (7) states that the normalized volume of the random polytope KK will be proportional to the volume of the Euclidean ball of radius log(n/d)\sqrt{\log(n/d)} rather than d.\sqrt{d}. When dlognd\gg\log n, this means that the volume of KK will be tiny compared to the volume of the infinity ball BdB_{\infty}^{d}. By combining the volume lower bound with Theorem 3.3, we get the following lower bound on the error of private mechanisms.

Let ε>0\varepsilon>0 and 0<dn/20<d\leqslant n/2. Then, for almost all matrices F{1,1}d×nF\in\{-1,1\}^{d\times n}, every ε\varepsilon-differentially private mechanism MM must have

We use this paragraph to point out that our lower bound immediately implies a separation between approximate and exact differential privacy.

Theorem 2.7 gives a mechanism providing δ\delta-approximate ε\varepsilon-differential privacy with error o(ε1log(n/d))o(\varepsilon^{-1}\sqrt{\log(n/d)}) as long as δ1/no(1).\delta\geqslant 1/n^{o(1)}. Our lower bound in Theorem 5.2 on the other hand states that the error of any ε\varepsilon-differentially private mechanism must be Ω(ε1log(n/d))\Omega(\varepsilon^{-1}\sqrt{\log(n/d)}) (assuming dlog(n)d\gg\log(n)). We get the strongest separation when dlog(n)d\leqslant\log(n) and δ\delta is constant. In this case, our lower bound is a factor d\sqrt{d} larger than the upper bound for approximate differential privacy.

2 Upper bound on average Euclidean norm

Let FF be a random d×nd\times n Bernoulli matrix and put K=FB1nK=FB_{1}^{n}. Then, there is a constant C>0C>0 so that with probability greater than 1CeO(n)1-Ce^{-O(n)},

An application of Jensen’s inequality thus gives us the following corollary.

Let ε>0\varepsilon>0 and 0<dn/20<d\leqslant n/2. Then, for almost all matrices F{1,1}d×nF\in\{-1,1\}^{d\times n}, the mechanism KM(F,d,ε){\rm KM}(F,d,\varepsilon) is ε\varepsilon-differentially private with error at most

Approximately isotropic bodies

The following definition is a relaxation of nearly isotropic position used in literature (e.g., [KLS97])

We say a convex body KdK\subseteq\Re^{d} is in cc-approximately isotropic position if for every unit vector vdv\in\Re^{d},

The results of Klartag and Kozma [KK09] referred to in the previous section show that the symmetric convex hull nn random points from the dd-dimensional hypercube are in O(1)O(1)-approximately isotropic position and have LK=O(1)L_{K}=O(1). More generally, the KK-norm mechanism can be shown to be approximately optimal whenever KK is nearly isotropic.

We can see that the previous upper bound is tight up to a factor of cLKcL_{K}. Estimating LKL_{K} for general convex bodies is a well-known open problem in convex geometry. The best known upper bound for a general convex body KdK\subseteq\Re^{d} is LKO(d1/4)L_{K}\leqslant O(d^{1/4}) due to Klartag [Kla06], improving over the estimate LKO(d1/4logd)L_{K}\leqslant O(d^{1/4}\log d) of Bourgain from ’91. The conjecture is that LK=O(1)L_{K}=O(1).

There exists C>0C>0 such that for every dd and every convex set KdK\subseteq\Re^{d}, LK<CL_{K}<C.

Assuming this conjecture we get matching bounds for approximately isotropic convex bodies.

Let ε>0.\varepsilon>0. Assuming the hyperplane conjecture, for every Fd×nF\in^{d\times n} such that K=FB1nK=FB_{1}^{n} is cc-approximately isotropic, the KK-norm mechanism KM(F,d,ε){\rm KM}(F,d,\varepsilon) is ε\varepsilon-differentially private with error at most

Non-isotropic bodies

In this section, we will design a recursive mechanism that can handle such non-isotropic convex bodies. To this end, we will need to introduce a few more notions from convex geometry.

Having defined the covariance matrix, we can describe a recursive mechanism for the case when KK is not in isotropic position. The idea of the mechanism is to act differently on different eigenspaces of the covariance matrix. Specifically, the mechanism will use a lower-dimensional version of KM(F,d,ε){\bf KM}(F,d^{\prime},\varepsilon) on subspaces corresponding to few large eigenvalues.

The image of PUFP_{U}F above is a dd^{\prime}-dimensional subspace of d.\Re^{d}. We assume that in the recursive call NIM(PUF,d,ε){\rm NIM}(P_{U}F,d^{\prime},\varepsilon), the KK-norm mechanism is applied to a basis of this subspace. However, formally the output is a dd-dimensional vector.

To analyze our mechanism, first observe that the recursive calls terminate after at most logd\log d steps. For each recursive step m{0,,logd}m\in\{0,\dots,\log d\}, let ama_{m} denote the distribution over the output of the KmK_{m}-norm mechanism in step 3. Here, KmK_{m} denotes the dmd_{m}-dimensional body given in step m.m.

The mechanism NIM(F,d,ε){\rm NIM}(F,d,\varepsilon) satisfies (εlogd)(\varepsilon\log d)-differential privacy.

We claim that for every step m{0,,logd}m\in\{0,\dots,\log d\}, the distribution over ama_{m} is ε\varepsilon-differentially private. Notice that this claim implies the lemma, since the joint distribution of a0,a1,,ama_{0},a_{1},\dots,a_{m} is εlog(d)\varepsilon\log(d)-differentially private. In particular, this is true for the final output of the mechanism as it is a function of a0,,am.a_{0},\dots,a_{m}.

The error analysis of our mechanism requires more work. In particular, we need to understand how the volume of PUKP_{U}K compares to the norm of PVa.P_{V}a. As a first step we will analyze the volume of PUK.P_{U}K.

2 Volume in eigenspaces of the covariance matrix

Our goal in this section is to express the volume of KK in eigenspaces of the covariance matrix in terms of the eigenvalues of the covariance matrix. This will be needed in the analysis of our mechanism for non-isotropic bodies.

We start with a formula for the volume of central sections of isotropic bodies. This result can be found in [MP89].

Let KdK\subseteq\Re^{d} be an isotropic body of unit volume. Let EE denote a kk-dimensional subspace for 1kd1\leqslant k\leqslant d. Then,

Here, BKB_{K} is an explicitly defined isotropic convex body.

Observe that the PKPK contains EKE\cap K since PP is the identity on E.E.

We cannot immediately use these results since they only apply to isotropic bodies and we are specifically dealing with non-isotropic bodies. The trick is to apply the previous results after transforming KK into an isotropic body while keeping track how much this transformation changed the volume.

As a first step, the following lemma relates the volume of projections of an arbitrary convex body KK to the volume of projections of TKTK for some linear mapping TT.

Let KdK\subseteq\Re^{d} be a symmetric convex body. Let TT be a linear map which has eigenvectors u1,,udu_{1},\ldots,u_{d} with eigenvalues λ1,,λd\lambda_{1},\ldots,\lambda_{d}. Let 1kd1\leqslant k\leqslant d and suppose E=span{u1,u2,,uk},E={\rm span}\{u_{1},u_{2},\dots,u_{k}\}, Denote by PP be the projection operator onto the subspace E.E. Then,

For simplicity, we assume that the eigenvectors of TT are the standard basis vectors e1,,ede_{1},\ldots,e_{d}; this is easily achieved by applying a rotation to KK. Now, it is easy to verify that P=PT1T=SPTP=PT^{-1}T=SPT where S=diag(λ11,λ21,,λk1,0,,0)S=\rm{diag}(\lambda_{1}^{-1},\lambda_{2}^{-1},\ldots,\lambda_{k}^{-1},0,\dots,0). Thus we can write

Before we can finish our discussion, we will need the fact that the isotropic constant of KK can be expressed in terms of the determinant of MK.M_{K}.

Let KdK\subseteq\Re^{d} be a convex body of unit volume. Then,

We conclude with the following Proposition 7.7.

Let KdK\subseteq\Re^{d} be a symmetric convex body. Let MkM_{k} have eigenvectors u1,,udu_{1},\ldots,u_{d} with eigenvalues σ1,,σd\sigma_{1},\ldots,\sigma_{d}. Let 1kd21\leqslant k\leqslant\lceil\frac{d}{2}\rceil with and suppose E=span{u1,u2,,uk},E={\rm span}\{u_{1},u_{2},\dots,u_{k}\}, Denote by PP be the projection operator onto the subspace E.E. Then,

where αK\alpha_{K} is Ω(1/d14)\Omega(1/d^{\frac{1}{4}}). Moreover, assuming the Hyperplane conjecture, αKΩ(1)\alpha_{K}\geqslant\Omega(1).

Since λTK\lambda TK is in isotropic position and has unit volume, Corollary 7.4 implies that

Thus the required inequality holds with an additional λkdk\lambda^{-\frac{k}{d-k}} term. By assumption on kk, kdk\frac{k}{d-k} is at most 22. Moreover, λ=LK1/dd1/d2\lambda=L_{K}^{1/d}\leqslant d^{1/d}\leqslant 2, so that this additional term is a constant. As discussed above, αK\alpha_{K} is Ω(d14)\Omega(d^{-\frac{1}{4}}) by [Kla06], and Ω(1)\Omega(1) assuming the Hyperplane Conjecture 6.3. Hence the claim. ∎

3 Arguing near optimality of our mechanism

Our next lemma shows that the expected squared Euclidean error added by our algorithm in each step is bounded by the square of the optimum. We will first need the following fact.

Let KdK\subseteq\Re^{d} be a centered convex body. Let σ1σ2σd\sigma_{1}\geqslant\sigma_{2}\geqslant\dots\geqslant\sigma_{d} denote the eigenvalues of MKM_{K} with a corresponding orthonormal eigenbasis u1,,ud.u_{1},\dots,u_{d}. Then, for all 1id1\leqslant i\leqslant d,

Let aa denote the random variable returned by the KK-norm mechanism in step (3) in the above description of NIM(F,d,ε){\rm NIM}(F,d,\varepsilon). Then,

For simplicity, we will assume that dd is even and hence dd=d.d-d^{\prime}=d^{\prime}. The analysis of the KK-norm mechanism (Theorem 4.3 with p=2p=2) shows that the random variable aa returned by the KK-norm mechanism in step (3) satisfies

Since σdσd+1\sigma_{d^{\prime}}\geqslant\sigma_{d^{\prime}+1}, it follows that

The case of odd dd is similar except that we define KK^{\prime} to be the projection onto the first d+1d^{\prime}+1 eigenvectors. ∎

We have to sum up the error over all recursive calls of the mechanism. To this end, let PVmamP_{V_{m}}a_{m} denote the output of the KK-norm mechanism ama_{m} in step mm projected to the corresponding subspace VmV_{m}. Also, let ada\in\Re^{d} denote the final output of our mechanism. We then have,

Let ε>0\varepsilon>0. Suppose F ⁣:ndF\colon\Re^{n}\to\Re^{d} is a linear map. Further, assume the hyperplane conjecture. Then, there is an ε\varepsilon-differentially private mechanism MM with error

Efficient implementation of our mechanism

Membership in KK can be decided efficiently.

KK is bounded, in the sense that B2dKdB2dB_{2}^{d}\subseteq K\subseteq dB_{2}^{d}.

Both conditions are naturally satisfied in our case where K=FB1nK=FB_{1}^{n} for some Fd×nF\in^{d\times n}. Indeed, KBddB2dK\subseteq B_{\infty}^{d}\subseteq\sqrt{d}B_{2}^{d} and we may always assume that B2dKB_{2}^{d}\subseteq K, for instance, by considering K=K+B2dK^{\prime}=K+B_{2}^{d} rather than KK. This will only increase the noise level by 11 in Euclidean distance. Notice that KK^{\prime} is convex. In order to implement the membership oracle for KK, we need to be able to decide for a given ada\in\Re^{d}, whether there exists an xB1nx\in B_{1}^{n} such that Fx=aFx=a. These constraints can be encoded using a linear program. In the case of KK^{\prime} this can be done using convex programming [GLS94].

The mixing time of the Grid walk is usually quantified in terms of the total variation (or L1L_{1}) distance between the random walk and its stationary distribution. The stationary distribution of the grid Walk is the uniform distribution over LK{\cal L}\cap K. Standard arguments show that an L1L_{1}-bound gives us δ\delta-approximate ε\varepsilon-differential privacy where δ\delta can be made exponentially small in polynomial time. In order to get exact privacy (δ=0\delta=0) we instead need a multiplicative guarantee on the density of the random walk at each point in K.K.

In Appendix A, we show that the Grid Walk actually satisfies mixing bounds in the relative LL_{\infty}-metric which gives us the following theorem. We also need to take care of the fact that the stationary distribution is a priori not uniform over K.K. A solution to this problem is shown in the appendix as well.

MM^{\prime} is O(ε)O(\varepsilon)-differentially private,

We conclude that the Grid walk gives us an efficient implementation of our mechanism which achieves the same error bound (up to constants) and ε\varepsilon-differential privacy.

The runtime stated in Theorem 8.1 depends only upon dd and ε1.\varepsilon^{-1}. The polynomial dependence on nn only comes in when implementing the separation oracle for KK as described earlier. Since we think of dd as small compared to nn, the exact runtime of our algorithm heavily depends upon how efficiently we can implement the separation oracle.

In the non-isotropic case we additionally need to compute the subspaces UU and VV to project onto (Step 2 of the algorithm). Note that these subspaces themselves depend only on the query FF and not on the database xx. Thus these can be published and the mechanism maintains its privacy for an arbitrary choice of subspaces UU and VV. The choice of U,VU,V in Section 7 depended on the covariance matrix MM, which we do not know how to compute exactly. We next describe a method to choose UU and VV that is efficient such that the resulting mechanism has essentially the same error. The sampling from KK can then be replaced by approximate sampling as in the previous subsection, resulting in a polynomial-time differentially private mechanism with small error.

First note that for any id+1i\geqslant d^{\prime}+1

Generalizations of our mechanism

Moreover, in cases when one does not have a good handle on KK itself, one can use any convex body KK^{\prime} containing KK.

Local Sensitivity.

Nissim, Raskhodnikova and Smith [NRS07] define smooth sensitivity and show that one can design approximately differentially private mechanism that add noise proportional to the smooth sensitivity of the query. This can be significant improvement when the local sensitivity is much smaller than the global sensitivity. Notice that such queries are necessarily non-linear. We point out that one can define a local sensitivity analogue of the KK-norm mechanism by considering the polytopes Kx=conv{F(x)F(x)dist(x,x) ⁣:xD}K_{x}={\rm conv}\left\{\frac{F(x^{\prime})-F(x)}{\mathit{dist}(x,x^{\prime})}\colon x^{\prime}\in\mathcal{D}\right\} and adapting the techniques of [NRS07] accordingly.

References

In this section, we sketch the proof of Theorem 8.1. We will be interested in the mixing properties of Markov chains over some measured state space Ω.\Omega. We will need to compare probability measures μ,ν\mu,\nu over the space Ω.\Omega.

The relative LL_{\infty}-distance is defined as

For a Markov chain PP, we will be interested in the mixing time in the \infty-metric. That is the smallest number tt such that Pt/π1ε.\|P_{t}/\pi-1\|_{\infty}\leqslant\varepsilon. Here, PtP_{t} is the distribution of PP at step tt and π\pi denotes the stationary distribution of P.P. The relevance of the \infty-norm for our purposes is given by the following fact.

Suppose M={μx}xnM=\{\mu_{x}\}_{x\in\Re^{n}} is an ε\varepsilon-differentially private mechanism MM and suppose M={μx}xnM^{\prime}=\{\mu_{x}^{\prime}\}_{x\in\Re^{n}} satisfies max{μx/μx1,μx/μx1}ε\max\{\|\mu_{x}/\mu_{x}^{\prime}-1\|_{\infty},\|\mu_{x}^{\prime}/\mu_{x}-1\|_{\infty}\}\leqslant\varepsilon for some 0ε10\leqslant\varepsilon\leqslant 1 and all xnx\in\Re^{n}. Then, MM^{\prime} is 3ε3\varepsilon-differentially private.

where we used that 1+εeε1+\varepsilon\leqslant e^{\varepsilon} for 0ε1.0\leqslant\varepsilon\leqslant 1.

Now, let x,xx,x^{\prime} satisfy xx11\|x-x^{\prime}\|_{1}\leqslant 1. By the previous inequality, we have

In the last inequality, we used the assumption that MM is ε\varepsilon-differentially private. Hence, we have shown that MM^{\prime} is 3ε3\varepsilon-differentially private. ∎

Finally, the bound on the moments of the Gamma distribution from Fact 2.10 implies that the expected running time of this algorithm is polynomial in d,ε1d,\varepsilon^{-1}.

Appendix B Lower bounds for Differential Privacy with respect to Hamming Distance

We can then repeat the proof of theorem 3.3, with minor modifications to handle the non-negative integer constraint.

Let ε>0\varepsilon>0 and suppose F{1,1}d×nF\in\{-1,1\}^{d\times n} is a linear map and let K=FB1nK=FB_{1}^{n}. Then, every ε\varepsilon-differentially private mechanism MM for computing G(w)=Fx(w)G(w)=Fx(w) must have

Now we come up with a similar set XZ+nX^{\prime}\in Z_{+}^{n}. For each xXx\in X, we round each xix_{i} randomly up or down, i.e. x^i=xi\hat{x}_{i}=\lceil x_{i}\rceil, with probability (xixi)(x_{i}-\lfloor x_{i}\rfloor), and xi\lfloor x_{i}\rfloor otherwise. It is easy to check that E[x^]=xE[\hat{x}]=x. so that with probability 2/32/3, x^13x1|\hat{x}|_{1}\leqslant 3|x|_{1}. Moreover, E[Fx^]=FxE[F\hat{x}]=Fx and each random choice can change Fx^\|F\hat{x}\| by at most d\sqrt{d}. Thus martingale concentration results imply that with probability 2/32/3, Fx^Fx2dn\|F\hat{x}-Fx\|\leqslant 2\sqrt{dn}. Thus there exists a choice of x^\hat{x} so that both these events happen. Let vv denote the vector (d/2ε,d/2ε,,d/2ε)(\lceil d/2\varepsilon\rceil,\lceil d/2\varepsilon\rceil,\ldots,\lceil d/2\varepsilon\rceil) and set x=x^+vx^{\prime}=\hat{x}+v. This defines our set XX^{\prime} which is easily seen to be in Z+nZ_{+}^{n}. In fact, Xv+(d/2ε)B1nX^{\prime}\subseteq v+(\lceil d/2\varepsilon\rceil)B_{1}^{n}. Moreover, for ε<CRd/32n\varepsilon<CRd/32\sqrt{n}, FXFX^{\prime} is a CRdd/8εCRd\sqrt{d}/8\varepsilon-packing.

Now assume that M={μx ⁣:xn}M=\{\mu_{x}\colon x\in\Re^{n}\} is an ε\varepsilon-differentially private mechanism with error CRdd/32εCRd\sqrt{d}/32\varepsilon and lead this to a contradiction. By the assumption on the error, Markov’s inequality implies that for all xXx\in X^{\prime}, μx(Bx)12,\mu_{x}(B_{x})\geqslant\tfrac{1}{2}, where BxB_{x} is a ball of radius CRdd/16εCRd\sqrt{d}/16\varepsilon centered at FxFx. By the packing property above, the balls {Bx:xX}\{B_{x}:x\in X\} are disjoint.

Since xv1(d/2ε)\|x^{\prime}-v\|_{1}\leqslant(d/2\varepsilon), it follows from ε\varepsilon-differential privacy with Fact 2.3 that

Since the balls BxB_{x} are pairwise disjoint,

for d2d\geqslant 2. We have thus obtained a contradiction. ∎

Translating the lower bound from Theorem 5.2 to this setting, we get

Let ε(0,(c(d/n))min{d,log(n/d)})\varepsilon\in(0,(c\sqrt{(d/n)})\cdot\min\{\sqrt{d},\sqrt{\log(n/d)}\}) for a universal constant cc and let dlognd\leqslant\log n. Then there exists a linear map F{1,1}d×nF\in\{-1,1\}^{d\times n} such that every ε\varepsilon-differentially private mechanism MM for computing G(w)=Fx(w)G(w)=Fx(w) must have

We remark that this lower bound holds for N=Ω(nd/ε)N=\Omega(nd/\varepsilon).

Appendix C Dilated Ball containment

Let AA be a convex body in d\Re^{d} such that B2dA+rB2dB_{2}^{d}\subseteq A+rB_{2}^{d} for some r<1r<1. Then a dilation (1r)B2d(1-r)B_{2}^{d} is contained in AA.

Let zdz\in\Re^{d} be a unit vector. Suppose that z=(1r)z∉Az^{\prime}=(1-r)z\not\in A. Then by the Separating Hyperplane theorem (see, e.g., [BV04]), there is a hyperplane HH separating zz^{\prime} from AA. Thus there is a unit vector ww and a scalar bb such that z,wb=0\langle z^{\prime},w\rangle-b=0 and u,wb0\langle u,w\rangle-b\leqslant 0 for all uAu\in A. Let v=z+rwv=z^{\prime}+rw. Then by triangle inequality, v1\|v\|\leqslant 1. Moreover,

This however contradicts the assumption that that vB2dA+rB2dv\in B_{2}^{d}\subseteq A+rB_{2}^{d}. Since zz was arbitrary, the lemma is proved. ∎