Dimensionality and the stability of the Brunn-Minkowski inequality

Ronen Eldan, Bo`az Klartag

Introduction

The Brunn-Minkowski inequality states, in one of its normalizations, that

The literature contains various stability estimates for the Brunn-Minkowski inequality, which imply that when there is almost-equality in (1), then KK and TT are almost-translates of each other. Such estimates appear in Diskant , in Groemer , and in Figalli, Maggi and Pratelli . We recommend Osserman for a general survey on the stability of geometric inequalities.

The present stability estimates do not seem to imply much about the proximity of KK to a translate of TT under the assumption (2). Only if the constant “55” in (2) is replaced by something like 1+1/n1+1/n or so, then the results of Figalli, Maggi and Pratelli can yield meaningful information. The goal of this note is to raise the possibility that the stability of the Brunn-Minkowski inequality actually improves as the dimension increases. In particular, we would like to deduce from (2) that

for a family of non-negative functions pp, when the dimension nn is high. Here, bKb_{K} and bTb_{T} denote the barycenters of KK and TT respectively. Furthermore, in some non-trivial cases we may conclude (3) even when the constant “55” in (2) is replaced by an expression that grows with the dimension, such as logn\log n or nαn^{\alpha} for a small universal constant α>0\alpha>0.

In this note we take the first steps towards a dimension-sensitive stability theory of the Brunn-Minkowski inequality. First, let us focus on the simplest case in which p(x)p(x) in (3) is a quadratic polynomial. In fact, we are interested mainly in expressions related to the quadratic form

Let p(x)=pK(x)p(x)=p_{K}(x) be the inertia form of KK defined in (4) and (5). Then,

Here C,α1,α2>0C,\alpha_{1},\alpha_{2}>0 are universal constants, and bK,bTb_{K},b_{T} are the barycenters of K,TK,T respectively.

See Theorem 4.5 below for explicit bounds on the universal constants α1,α2\alpha_{1},\alpha_{2} from Theorem 1.1. Our interest in the inertia form pKp_{K} stems from the central limit theorem for convex sets, see for background reading. As we shall explain in Proposition 6.4 below, Theorem 1.1 implies the bound

where σn\sigma_{n} is the thin shell parameter from , C>0C>0 is a universal constant and α1>0\alpha_{1}>0 is the constant from Theorem 1.1. In fact, Theorem 4.5 and (51) below show that the inequality (7) is essentially an equivalence. Consequently, the universal constant α1\alpha_{1} from Theorem 1.1 is intimately connected with the thin shell parameter σn\sigma_{n}. The question of whether σn\sigma_{n} is bounded by a universal constant is currently one of the central problems in high-dimensional convex geometry.

In fact, the assumption that φ\varphi is 11-Lipschitz may typically be weakened. For instance, when φ\varphi is convex or concave, it is well-known that

(in order to use (8) we also need a crude estimate for Tx2dμT(x)\int_{T}|x|^{2}d\mu_{T}(x), hence we applied Corollary 2.4 to obtain such an estimate). In view of (11) and Proposition 6.4 below, we match (up to logarithmic factors) the best bounds for the width of the thin spherical shell for unconditional convex bodies proven in .

The structure of the remainder of this note is as follows: In the next section we establish some well-known facts about one-dimensional log-concave measures. In Section 3 we prove Theorem 1.1 and in Section 4 we prove Theorem 1.2. Section 5 is dedicated to attaining some inequalities related to one-dimensional transportation of measure. In Section 6, using these inequalities, we prove Theorem 1.3.

Background about log-concave densities on the line

A nice characterization of log-concavity that we learned from Bobkov is that μ\mu is log-concave if and only if the function

is a concave function. This characterization lies at the heart of the proof of the following Poincaré-type inequality which appears as Corollary 4.3 in Bobkov :

Let μ\mu be a log-concave probability measure on the real line, and set

for the variance of μ\mu. Then for any smooth function ff with fdμ=0\int fd\mu=0,

Further information about log-concave densities on the line is provided by the following standard lemma.

f(t)Cσexp(ctb/σ)\displaystyle f(t)\leq\frac{C}{\sigma}\exp(-c|t-b|/\sigma); and

If tbcσ|t-b|\leq c\sigma, then f(t)cσ\displaystyle f(t)\geq\frac{c}{\sigma}.

Proof: Part (a) is the content of Lemma 3.2 in Bobkov . In order to prove (b), we show that for some t0b+c0σt_{0}\geq b+c_{0}\sigma,

with c0=1/(10C)c_{0}=1/(10C), C1=c1log(10C/c)C_{1}=c^{-1}\log(10C/c) where here c,Cc,C are the constants from part (a). Indeed, if there is no such t0t_{0}, then from (a),

in contradiction to Grünbaum’s inequality (see, e.g., [4, Lemma 3.3]). By symmetry, there exists some t1bc0σt_{1}\leq b-c_{0}\sigma with

From log-concavity, f(t)1/(10C1σ)f(t)\geq 1/(10C_{1}\sigma) for t[t1,t0]t\in[t_{1},t_{0}], and (b) is proven since [t1,t0][bc0σ,b+c0σ][t_{1},t_{0}]\supseteq[b-c_{0}\sigma,b+c_{0}\sigma].

The following lemma is essentially a one-dimensional, functional version of Theorem 1.1. The Lemma states, roughly, that if the supremum-convolution of two log-concave probability densities has a bounded integral, then their respective variances cannot be too far from each other.

Let X,YX,Y be random variables with corresponding densities fX,fYf_{X},f_{Y} and variances σX2,σY2\sigma_{X}^{2},\sigma_{Y}^{2}. Assume that fXf_{X} and fYf_{Y} are log-concave. Define

a supremum-convolution of fXf_{X} and fYf_{Y}. Then,

Proof: The function hh is clearly measurable (it is even log-concave). It follows from Lemma 2.2(b) that there exist intervals IX,IYI_{X},I_{Y} such that

Combining this with (13), we learn that there exists an interval IZI_{Z} with Length(IZ)c(σX+σY)/2Length(I_{Z})\geq c(\sigma_{X}+\sigma_{Y})/2 such that,

In order to prove (14), it suffices to show that

Denote the respective densities of X,YX,Y by fX,fYf_{X},f_{Y}. The Prékopa-Leindler theorem (see, e.g., the first pages of Pisier ) implies that fXf_{X} and fYf_{Y} are log-concave. Furthermore, using the Prékopa-Leindler theorem again we derive,

Plugging this into lemma 2.3 we deduce (15).

Next, for a measure μ\mu and measurable sets A,BA,B with 0<μ(A)<0<\mu(A)<\infty define

Thus the probability measure μA\mu|_{A} is the conditioning of μ\mu to the set AA. Clearly, for a log-concave measure μ\mu and an interval II, the measure μI\mu|_{I} remains log-concave.

Proof: It is enough to prove the lemma for J1,J2J_{1},J_{2} being rays. Denote by II the interior of the support of μ\mu, and by ρ\rho the density of μ\mu. Abbreviate Φ(t)=μ((,t]), μt=μ(,t]\Phi(t)=\mu\left((-\infty,t]\right),\ \mu_{t}=\mu|_{(-\infty,t]} and set

To prove the lemma, it suffices to show that v(t)0v^{\prime}(t)\geq 0 for any tt, or equivalently, that

Deriving a stability estimate from the central limit theorem for convex sets

A second ingredient will be a calculation which shows that the integral of the supremum-convolution of two Gaussian densities whose covariance matrix is a multiple of the identity, becomes very large when their respective covariances are not close to one another. This will imply that when Voln((K+T)/2)Vol_{n}((K+T)/2) is not large, the covariance matrices of both marginals are roughly the same multiple of the identity. Therefore the inertia forms of KK and TT must have had roughly the same trace (the trace of the matrix will determine the multiple of the identity).

for all xEx\in E with xnc4|x|\leq n^{c_{4}}. Here, C,c1,c2,c3,c4>0C,c_{1},c_{2},c_{3},c_{4}>0 are universal constants.

It can be seen directly from the proof in that the constants in Theorem 3.1 may be selected to be c1,c2,c3=130,c4=160,C=500c_{1},c_{2},c_{3}=\frac{1}{30},c_{4}=\frac{1}{60},C=500. Other constants would imply different universal constants in Theorem 1.1. We shall need the following elementary lemma:

for α=1/a\alpha=\sqrt{1/a} and also for α=a\alpha=a, where c>0c>0 is a universal constant.

Proof: First we prove the lemma for α=a\alpha=a. Note that for 0<a40<a\leq 4,

The case where α=1/a\alpha=\sqrt{1/a} follows as min{(1/a1)2,1}10min{(a1)2,1}\min\{(\sqrt{1/a}-1)^{2},1\}\leq 10\min\{(a-1)^{2},1\}.

The following lemma is the second ingredient in our proof of Theorem 1.1 described above. The essence of the lemma is that the integral of the supremum-convolution of two spherically-symmetric Gaussian densities must be quite large when the covariances are not close to each other.

whenever x10αk|x|\leq 10\alpha\sqrt{k}. Assume that hh is measurable. Then,

We would like to find ss which maximizes the right-hand side in (20). We select s=t(a1)/(a+1)s=t(a-1)/(a+1) and verify that when t<5(1+a)k/a|t|<5\sqrt{(1+a)k/a} we have s+t10k|s+t|\leq 10\sqrt{k} and st10αk|s-t|\leq 10\alpha\sqrt{k}. We conclude that for any t<5(1+a)k/a|t|<5\sqrt{(1+a)k/a},

for some universal constants C,C1>1C,C_{1}>1.

Proof: Clearly, we may assume that the sequence {λi}\{\lambda_{i}\} is non-decreasing. Translating gg, we may assume that the barycenter of gg is at the origin. Let XX and YY be random vectors that are distributed according to the laws f,gf,g, respectively. Fix 0<δ<10<\delta<1. Consider the subspace EE spanned by {ei;λi1δ}\{e_{i};\lambda_{i}-1\geq\delta\}, where {ei}\{e_{i}\} is an orthonormal basis of eigenvectors corresponding the the eigenvalues {λi}\{\lambda_{i}\}. Denote d=dimEd=\dim E and assume that d2d\geq 2. Since the λi\lambda_{i}’s are in increasing order, the subspace EE has the form,

for some 1i0n1\leq i_{0}\leq n. Write j0=ni02j_{0}=\left\lfloor\frac{n-i_{0}}{2}\right\rfloor and V2=λi0+j0V^{2}=\lambda_{i_{0}+j_{0}}. Now, fix 1jj01\leq j\leq j_{0}. Define,

Inspect the function f(θ)=Cov(g)vj(θ),vj(θ)f(\theta)=\langle Cov(g)v_{j}(\theta),v_{j}(\theta)\rangle. We have f(0)=λi0+j0jV2f(0)=\lambda_{i_{0}+j_{0}-j}\leq V^{2} and f(1)=λi0+j0+jV2f(1)=\lambda_{i_{0}+j_{0}+j}\geq V^{2}. By continuity, there exists a certain 0θj10\leq\theta_{j}\leq 1 for which

Equation (21) and the fact that e1,,ene_{1},\ldots,e_{n} are orthonormal eigenvectors imply that for every vFv\in F, one has Cov(g)v,v=V2\langle Cov(g)v,v\rangle=V^{2}. Moreover, dimF=j012d1\dim F=j_{0}\geq\frac{1}{2}d-1. We now apply Theorem 3.1 which claims that if dCd\geq C, then there exists a subspace GFG\subset F with dimG=d1/40\dim G=\lfloor d^{1/40}\rfloor such that

On the other hand, we may use the Prekopá-Leindler inequality as in (16) above, and deduce that

Consequently, under the assumption that dCd\geq C,

Since V1+δ1+δ/3V\geq\sqrt{1+\delta}\geq 1+\delta/3, we conclude

By repeating the argument, with the subspace {ei;λi1δ}\{e_{i};\lambda_{i}-1\leq-\delta\} replacing the subspace EE, we conclude the proof.

Proof of Theorem 1.1: By applying affine transformations to both KK and TT, we can assume that both bodies have the origin as their barycenter, and that pK(x)=x2p_{K}(x)=|x|^{2} while pT(x)=ixi2/λip_{T}(x)=\sum_{i}x_{i}^{2}/\lambda_{i}. By Lemma 3.4,

for any 0<δ<10<\delta<1. Since λiCR4\lambda_{i}\leq CR^{4} for all ii, as follows from Corollary 2.4, then

where C,α1,α2>0C,\alpha_{1},\alpha_{2}>0 are universal constants. To obtain (6), note that

Remark: When KK in Theorem 1.1 is isotropic, we actually prove in (24) that

where AHS2=Trace(AtA)\|A\|_{HS}^{2}=Trace(A^{t}A) is the square of the Hilbert-Schmidt norm of the matrix AA.

Obtaining stability estimates using a transportation argument

the covariance matrix. Finally, we normalize this density by defining

A theorem of Brenier asserts that a convex solution to the above equation on the domain Supp(f)={x;f(x)>0}Supp(f)=\{x;f(x)>0\} exists. The regularity theory developed by Caffarelli implies that the convex function φ\varphi is smooth. For precise definitions and properties, see . The map F=φF=\nabla\varphi pushes forward the measure whose density is ff to the measure whose density is gg, and is referred to as the Brenier map between the two measures. The matrix F(x)\nabla F(x) is positive-definite since it has a positive determinant and it is the Hessian matrix of a convex function.

Remark. The Knothe map, used in Section 6, is in some sense a limiting case of the Brenier map. See .

The following lemma contains the central idea of this section.

where D=D(f,g)D=D(f,g) and Λ\Lambda is a random variable distributed uniformly in $$.

Proof: By a standard approximation argument we may assume that ff and gg are sufficiently smooth. Denote D=D(f,g)D=D(f,g) and L(λ,x)=L(f,g)(λ,x)L(\lambda,x)=L(f,g)(\lambda,x). Furthermore, define,

Using the fact that LL is log-concave, we obtain

A simple calculation shows that the Jacobian of M(λ,x)M(\lambda,x) is

By changing variables using M1M^{-1} and applying (30) and (31), we calculate

Applying the change of variables xD1/2(xb(f,g))x\to D^{-1/2}(x-b(f,g)) completes the proof.

Combining this with the above lemma yields

In view of (33), we would like to have a lower bound for v(x,y)v(x,y) in terms of x2y2|x|^{2}-|y|^{2} and in terms of xy|x-y|. The following lemma serves this purpose.

and h(λ)=f(λ)g(λ)h(\lambda)=f(\lambda)-g(\lambda). Then h(1λ)=h(λ)h(1-\lambda)=h(\lambda) hence COV(g(Λ),h(Λ))=0COV(g(\Lambda),h(\Lambda))=0. Consequently,

Combining (35), (36) and (37) completes the proof.

Proof of Theorem 1.2: Write b=b(f,g)b=b(f,g) and D=D(f,g)D=D(f,g). Substituting the result of Lemma 4.2 into (33) yields

Let X,YX,Y be the random vectors whose densities are f,gf,g respectively. By the definition of the transportation distance,

where the transportation distance between random vectors is defined to be the distance between the corresponding distribution measures. The fact that ff and gg have barycenters at the origin implies

The Cauchy-Schwartz inequality together with (38), (39) and (40) yield,

where DOP=sup0xD(x)/x\|D\|_{OP}=\sup_{0\neq x}|D(x)|/|x| is the operator norm of DD. From the remark to Corollary 2.4 we conclude that

The function λKλ(f,g)\lambda\mapsto K_{\lambda}(f,g) is log-concave and it is bounded from below by one, according to the Prékopa-Leindler inequality. Therefore,

The rest of this section aims at a better understanding of the exponents in Theorem 1.1. The next lemma exploits the second summand in our basic estimate (41).

Proof: We use the notation of the proof of Theorem 1.2. In order to establish (42), we fix α>0\alpha>0, and assume that

Consequently, in order to establish (42), it suffices to show that for some universal constant C>0C>0,

In view of (41), the last inequality will be concluded if we only manage to show,

The above fact follows from an application of Lemma 3.4 with δ=1/2\delta=1/2 and from the assumption that K1/2(f,g)exp(nc1)K_{1/2}(f,g)\leq\exp(n^{c_{1}}). Equation (42) is established, and the proof of (43) is analogous. The proof of the lemma is thus complete.

so that σnτnnκ\sigma_{n}\leq\tau_{n}n^{\kappa}. Note that the thin-shell conjecture implies that κ=0\kappa=0 and τn<C\tau_{n}<C. We apply the estimate from the previous lemma for various marginals of our nn-dimensional measures, and obtain:

where C,C1>0C,C_{1}>0 are some universal constants.

Proof: The bound (46) follows directly from the remark to Corollary 2.4. In order to establish (47), denote by {ei}\{e_{i}\} the orthonormal basis of eigenvectors corresponding to the eigenvalues {λi}\{\lambda_{i}\}. Define

Let EE be the subspace with the larger dimension among these two subspaces. Then k=dimEi/2k=\dim E\geq i/2. Denote by i0i_{0} the maximal jj for which ejEe_{j}\in E. Then ki0ik\leq i_{0}\leq i. According to our assumption, dim(E)(log(2R))C1/2\dim(E)\geq(\log(2R))^{C_{1}}/2, and hence we may apply Lemma 4.3 in the subspace EE. Denote by fEf_{E} and gEg_{E} the marginals of ff and gg to the subspace EE. Using (42) and (43) for fEf_{E} and gEg_{E} we obtain

where we used the fact that K(f,g)K1/2(f,g)2=R2K(f,g)\leq K_{1/2}(f,g)^{2}=R^{2} as well as the Prékopa-Leindler inequality which implies that Kλ(fE,gE)Kλ(f,g)K_{\lambda}(f_{E},g_{E})\leq K_{\lambda}(f,g) for any λ(0,1)\lambda\in(0,1).

The next theorem demonstrates that the exponent α1\alpha_{1} in Theorem 1.1 may be made arbitrarily close to 1/2κ1/2-\kappa, thus complementing the inequality (7) which goes in the opposite direction. This provides yet another piece of evidence for the close relationship between the thin shell problem and the stability of the Brunn-Minkowski inequality in high dimensions.

where IdId is the identity matrix. Consequently,

Proof: We may clearly assume that Cov(μT)Cov(\mu_{T}) is a diagonal matrix whose diagonal is λ1,,λn\lambda_{1},\ldots,\lambda_{n}, where the sequence {λi1}\{|\lambda_{i}-1|\} is non-increasing. Since our measures are log-concave, then we may use Lemma 4.4 and calculate

The bound (50) follows. In order to deduce (51) from (50), argue as in (25) above. The proof is complete.

Transportation in one dimension

For j=1,2j=1,2, the map Φj1\Phi_{j}^{-1} pushes forward the uniform measure on $toto\mu_{j}.Themonotonetransportationmapbetween. The monotone transportation map between\mu_{1}andand\mu_{2}$ is the continuous, non-decreasing function

defined for tSupp(μ1)t\in Supp(\mu_{1}). Observe that

Furthermore, FF is differentiable in Supp(μ1)Supp(\mu_{1}) and

Additionally, it is well-known (see, e.g., Villani’s book ) that

where FF is the monotone transportation map between μ1\mu_{1} and μ2\mu_{2} and C>0C>0 is a universal constant.

We begin the proof of Proposition 5.1 with the following crude lemma.

Let μ1\mu_{1} and μ2\mu_{2} be probability measures on the real line.

If μ1\mu_{1} and μ2\mu_{2} are even, then

If μ1,μ2\mu_{1},\mu_{2} are supported on [A,)[A,\infty) and [B,)[B,\infty) respectively, and have non-increasing densities, then

Proof: Denote by δ0\delta_{0} the Dirac measure at the origin. Assume that μ0\mu_{0} and μ1\mu_{1} are even. By the triangle inequality for the transportation metric,

Let δA,δB,δe\delta_{A},\delta_{B},\delta_{e} be the Dirac measures supported on A,B,eA,B,e respectively. By the triangle inequality,

Therefore, by using W2(μ1,μ2)W2(μ1,δA)+W2(δA,δB)+W2(δB,μ2)W_{2}(\mu_{1},\mu_{2})\leq W_{2}(\mu_{1},\delta_{A})+W_{2}(\delta_{A},\delta_{B})+W_{2}(\delta_{B},\mu_{2}),

Proof of Proposition 5.1: Use (52), the definition of FF, and the fact that Φ11\Phi_{1}^{-1} pushes forward the uniform measure on $toto\mu_{1}$, in order to obtain

Recall that when μj\mu_{j} is a log-concave measure, the function ρj(Φj1(t))\rho_{j}(\Phi_{j}^{-1}(t)) is concave on $.Denote. DenoteI_{j}(t)=\rho_{j}(\Phi_{j}^{-1}(t))forforj=1,2.Then. ThenI_{1}andandI_{2}areconcave,nonnegativefunctionsonare concave, non-negative functions on,withthepropertythat, with the property thatI_{j}(t)=I_{j}(1-t)foranyfor anyt\in.Thesetwofunctionsarethereforecontinuouson. These two functions are therefore continuous on(0,1),increasingon, increasing on[0,1/2],anddecreasingon, and decreasing on[1/2,1].Let. Let\varepsilon>0$ be such that

Suppose first that ε>1/10\varepsilon>1/10. In this case, from part (i) of lemma 5.2,

So whenever ε>1/10\varepsilon>1/10, the inequality (54) holds trivially for a sufficiently large universal constant C>0C>0.

From now on, we restrict attention to the case where ε1/10\varepsilon\leq 1/10. We divide the rest of the proof into several steps.

Step 1: Let us prove that there exists a universal constant C>0C>0 such that

Once we prove (57), the desired bound (56) follows from (55). We thus focus on the proof of (57). Suppose that t1(0,1/2]t_{1}\in(0,1/2] satisfies I1(t1)>4I2(t1)I_{1}(t_{1})>4I_{2}(t_{1}). We will show that in this case

If I1(t)>2I2(t)I_{1}(t)>2I_{2}(t) for all t(0,t1)t\in(0,t_{1}), then t1ε2t_{1}\leq\varepsilon^{2} according to (55). Thus (58) holds true in this case. Otherwise, there exists 0<t<t10<t<t_{1} with I1(t)2I2(t)I_{1}(t)\leq 2I_{2}(t). Let t0t_{0} be the supremum over all such tt. Since I1I_{1} and I2I_{2} are continuous and non-decreasing on (0,t1](0,t_{1}], then

Since I1I_{1} is concave, non-decreasing and non-negative on [0,t1][0,t_{1}], then necessarily t0<t1/2t_{0}<t_{1}/2. We conclude that I1(t)>2I2(t)I_{1}(t)>2I_{2}(t) for any t[t1/2,t1]t\in[t_{1}/2,t_{1}]. From (55) it follows that t12ε2t_{1}\leq 2\varepsilon^{2}. Therefore (58) is proven in all cases. By symmetry, we conclude (57), and the proof of (56) is complete.

Step 2: For any 0TΦ11(12ε2)0\leq T\leq\Phi_{1}^{-1}(1-2\varepsilon^{2}) we have

where the last inequality is the content of Step 1. Denote ν=μ1[T,T]\nu=\mu_{1}|_{[-T,T]}, an even log-concave probability measure. According to Lemma 2.5, we have Var(ν)Var(μ1)σVar(\nu)\leq Var(\mu_{1})\leq\sigma. Note that the function F(t)tF(t)-t is odd, hence its ν\nu-average its zero. Using the Poincaré-type inequality in Lemma 2.1, we see that for any 0TΦ11(12ε2)0\leq T\leq\Phi_{1}^{-1}(1-2\varepsilon^{2}),

Step 3: Let T1=Φ11(13ε2)T_{1}=\Phi_{1}^{-1}(1-3\varepsilon^{2}) and let T2=Φ11(12ε2)T_{2}=\Phi_{1}^{-1}(1-2\varepsilon^{2}). We use (59) and conclude that there exists T1TT2T_{1}\leq T\leq T_{2} with

Denote ν1=μ1[T,)\nu_{1}=\mu_{1}|_{[T,\infty)} and ν2=μ2[F(T),)\nu_{2}=\mu_{2}|_{[F(T),\infty)}. These are log-concave probability densities with Var(ν1)+Var(ν2)σ2Var(\nu_{1})+Var(\nu_{2})\leq\sigma^{2}. Note that we have, owing to (59),

In order to prove the lemma it remains to show that W2(ν1,ν2)2Cσ2.W_{2}(\nu_{1},\nu_{2})^{2}\leq C\sigma^{2}. But in view of (60), the latter is a direct consequence of part (ii) in lemma 5.2: Since T,F(T)>0T,F(T)>0, then the log-concave densities of ν1\nu_{1} and ν2\nu_{2} are non-increasing. This completes the proof.

We thus view the function hh as a refined variant of the supremum-convolution of ff and gg. The following proposition is a stability estimate for the Prékopa-Leindler inequality in one dimension. It may be viewed as the transportation-metric version of the L1L^{1}-stability estimates from Ball and Böröczky .

where the function hh is defined via (61) and C>0C>0 is a universal constant.

Proof: Multiplying the functions ff and gg by positive constants, if necessary, we may assume that f=g=1\int f=\int g=1. Indeed, neither the left-hand side nor the right-hand side of (54) is changed under such normalization. Let FF be the monotone transportation map between μf\mu_{f} and μg\mu_{g} and as before, S(x)=(F(x)+x)/2S(x)=(F(x)+x)/2 for xSupp(μf)x\in Supp(\mu_{f}). Applying the change of variables y=S(x)y=S(x) we see that

According to (52), we have F(x)g(F(x))=f(x)F^{\prime}(x)g(F(x))=f(x) for any xx in the support of μf\mu_{f}. Since gg is log-concave, it does not vanish in Supp(μg)Supp(\mu_{g}), and hence F(x)0F^{\prime}(x)\neq 0 for any xSupp(μf)x\in Supp(\mu_{f}). Therefore,

where we used Lemma 3.2(ii) in the last passage. Since f=1\int f=1, then

We may thus apply Proposition 5.1 and deduce that

Unconditional Convex Bodies

where C>0C>0 is a universal constant and μf,μg\mu_{f},\mu_{g} are the probability measures with densities f,gf,g respectively.

The main tool in the proof of Theorem 6.1 is the Knothe map from , which we define next. Let M,f,gM,f,g be as in Theorem 6.1. Then the support of μg\mu_{g} is a convex set, and gg does not vanish in Supp(μg)Supp(\mu_{g}). The Knothe map between μf\mu_{f} and μg\mu_{g} is the continuous function F=(F1,,Fn):Supp(μf)Supp(μg)F=(F_{1},\ldots,F_{n}):Supp(\mu_{f})\rightarrow Supp(\mu_{g}) for which

For any jj, the function Fj(x1,,xn)F_{j}(x_{1},\ldots,x_{n}) actually depends only on the variables x1,,xjx_{1},\ldots,x_{j}. We may thus speak of Fj(x1,,xj)F_{j}(x_{1},\ldots,x_{j}).

For any jj and for any fixed x1,,xj1x_{1},\ldots,x_{j-1}, the function Fj(x1,,xj)F_{j}(x_{1},\ldots,x_{j}) is non-decreasing in xjx_{j}.

It may be proven by induction on nn (see ) that the Knothe map between μf\mu_{f} and μg\mu_{g} exists, and that in fact, the three requirements above determine the function FF completely. Denoting λj(x)=Fj(x)/xj0\lambda_{j}(x)=\left.\partial F_{j}(x)\right/\partial x_{j}\geq 0, it follows from property (b) that

for any xSupp(μ1)x\in Supp(\mu_{1}), where JF(x)J_{F}(x) is the Jacobian of the map FF. Below we will also use the fact that the map xx+F(x)x\mapsto x+F(x), defined for xSupp(μf)x\in Supp(\mu_{f}), is one-to-one, as follows from properties (b) and (c). Set

and let fn1,gn1f_{n-1},g_{n-1} be the densities of the probability measures π(μf),π(μg)\pi_{*}(\mu_{f}),\pi_{*}(\mu_{g}), respectively. Then fn1f_{n-1} and gn1g_{n-1} are unconditional and log-concave. Write Tn=F=(F1,,Fn)T_{n}=F=(F_{1},\ldots,F_{n}) for the Knothe map between μf\mu_{f} and μg\mu_{g}, and set

Then Tn1T_{n-1} is the Knothe map between π(μf)\pi_{*}(\mu_{f}) and π(μg)\pi_{*}(\mu_{g}). Observe that for fixed (x1,,xn1)π(Supp(μf))(x_{1},\ldots,x_{n-1})\in\pi(Supp(\mu_{f})), the map

is the monotone transportation map between the probability densities proportional to

for (z1,,zn1)=Tn1(x1,,xn1)(z_{1},\ldots,z_{n-1})=T_{n-1}(x_{1},\ldots,x_{n-1}). For i=n1,ni=n-1,n we set

which is a one-to-one, continuous function, defined for xSupp(μf)x\in Supp(\mu_{f}) when i=ni=n and for xπ(Supp(μf))x\in\pi\left(Supp(\mu_{f})\right) when i=n1i=n-1. According to (65) and to property (b), the Jacobian JSi(x)J_{S_{i}}(x) of the map SiS_{i} satisfies

Since SiS_{i} is one-to-one, then V(fi,gi)V(f_{i},g_{i}) is a well-defined function on a subset of QiQ^{i}. We extend V(fi,gi)V(f_{i},g_{i}) to the entire QiQ^{i} by setting it to be zero outside its original domain of definition.

Let φ:Qn1[0,)\varphi:Q^{n-1}\rightarrow[0,\infty) be a measurable function. Then,

Proof: We use (65) for the Knothe map Tn1T_{n-1} to conclude that

where we used (66) and (67) in the last passage. The map Sn1S_{n-1} is one-to-one in the support of fn1f_{n-1}. Changing variables z=Sn1(y)z=S_{n-1}(y) we obtain

The following lemma will serve as the induction step in the proof of Theorem 6.1.

where C>0C>0 is a universal constant (in fact, it is the same constant as in Proposition 5.3).

In order to prove the lemma, it therefore suffices to show that

Recall that tFn(y,t)t\mapsto F_{n}(y,t) is the monotone transportation map between the even, log-concave probability measures supported on [M,M][-M,M], whose densities are proportional to tf(y,t)t\mapsto f(y,t) and sg(Tn1(y),s)s\mapsto g(T_{n-1}(y),s). The variance of an even measure supported on [M,M][-M,M] cannot exceed M2M^{2}. We may therefore use Proposition 5.3, together with (53), to conclude that for any yπ(Supp(μf))y\in\pi(Supp(\mu_{f})),

In particular, the right-hand side of (70) is non-negative. We use the definition (67) and integrate with respect to yy. This yields:

where the last passage is legal according to Lemma 6.2. The desired estimate (69) follows, and the proof is complete.

Proof of Theorem 6.1: We will prove by induction on the dimension nn that

where CC is the constant from Lemma 6.3. The case n=1n=1 follows from Proposition 5.3 and from the fact that the variance of an even measure supported on [M,M][-M,M] cannot exceed M2M^{2}. We assume that (71) is proven for dimension n1n-1 and proceed with the proof for dimension nn. Apply the induction hypothesis for the unconditional, log-concave probability densities fn1,gn1f_{n-1},g_{n-1} and conclude that

and (71) is proven for dimension nn, hence for all dimensions. Using (71) and the fact that V(f,g)H(f,g)V(f,g)\leq H(f,g), the theorem follows by the definition of transportation distance.

The uniform measure on a convex body is a prime example for a log-concave measure. Consequently, we may deduce Theorem 1.3 from Theorem 6.1 by using a crude “cut with a big cube” argument. The logarithmic factor of Theorem 1.3 may be an artifact of this clumsy procedure.

Proof of Theorem 1.3: Let 0γ1/20\leq\gamma\leq 1/2 be a parameter to be specified later on. For α,β>0\alpha,\beta>0 we denote

According to Corollary 2.4, we have Cov(μT)CR4Cov(\mu_{T})\leq CR^{4}. Using Lemma 2.2 and a union bound, we deduce that

We now select α\alpha and β\beta so that

Denote by μK1\mu_{K}^{1} the uniform probability measure on KαK_{\alpha} and similarly for TT. By elementary properties of the transportation metric W2W_{2}, it follows that

where Diam(K)=supx,yKxyDiam(K)=\sup_{x,y\in K}|x-y| is the diameter of KK. It is well-known (see ) that Diam(K)CnCov(μK)OPDiam(K)\leq Cn\sqrt{\|Cov(\mu_{K})\|_{OP}} and therefore,

Note that μK1\mu_{K}^{1} and μT1\mu_{T}^{1} satisfy the requirements of Theorem 6.1 with M=max{α,β}lognM=\max\{\alpha,\beta\}\cdot\log n. Denote f(x)=1Kα(x)/Voln(Kα),g(x)=1Tβ(x)/Voln(Tβ)f(x)=1_{K_{\alpha}}(x)/Vol_{n}(K_{\alpha}),g(x)=1_{T_{\beta}}(x)/Vol_{n}(T_{\beta}). Then,

From Theorem 6.1 and (75) we conclude that

All that remains is to select γ\gamma. In the case where Rn2R\leq n^{2}, we choose

and deduce the desired bound (10) from (76). In the case where Rn2R\geq n^{2}, we select γ=1/2\gamma=1/2 and still deduce (10). The theorem is thus proven for all cases.

for any s>0s>0 with Voln(Ks)/Voln(K)[1/8,7/8]Vol_{n}(K_{s})/Vol_{n}(K)\in[1/8,7/8]. Then,

Proof: Standard bounds on the distribution of polynomials on high-dimensional convex sets (see Bourgain or Nazarov, Sodin and Volberg ) reduce the desired inequality (78) to the estimate

In order to prove (79), select a>0a>0 such that Voln(Ka)=Voln(K)/4Vol_{n}(K_{a})=Vol_{n}(K)/4. From (77),

For the upper bound, let s<ts<t be such that Voln(Ks)=3Voln(K)/4Vol_{n}(K_{s})=3Vol_{n}(K)/4 and Voln(Kt)=7Voln(K)/8Vol_{n}(K_{t})=7Vol_{n}(K)/8. Then, from (77),

Hence, maxxKsx2n1+13A\max_{x\in K_{s}}\frac{|x|^{2}}{n}\leq 1+13A, or equivalently,

It is now clear that (79) follows from (80) and (81).

References