On the infimum convolution inequality

Rafał Latała, Jakub Onufry Wojtaszczyk

Introduction and Notation

In the seminal paper , B. Maurey introduced the so called property (τ)(\tau) for a probability measure μ{\mu} with a cost function φ\varphi (see Definition 2.1 below) and established a very elegant and simple proof of Talagrand’s two level concentration for the product exponential distribution νn\nu^{n} using (τ)(\tau) for this distribution and an appropriate cost function ww.

It is natural to ask what other pairs (μ,φ)(\mu,\varphi) have property (τ)(\tau)? As any μ\mu satisfies (τ)(\tau) with φ0\varphi\equiv 0, one will rather ask how big a cost function can one take. In this paper we study the probability measures μ\mu that have property (τ)(\tau) with respect to the largest (up to a multiplicative factor) possible convex cost function Λμ\Lambda^{\star}_{\mu}. This bound comes from checking property (τ)(\tau) for linear functions. We say a measure satisfies the infimum convolution inequality (ICIC for short) if the pair (μ,Λμ)(\mu,\Lambda^{\star}_{\mu}) satisfies τ\tau.

It turns out that such an optimal infimum convolution inequality has very strong consequences. It gives the best possible concentration behaviour, governed by the so–called LpL_{p}-centroid bodies (Corollary 3.10). This, in turn, implies in particular a weak–strong moment comparison (Proposition 3.12), the Central Limit Theorem of Klartag and the tail estimates estimates of Paouris (Proposition 3.15). We believe that ICIC holds for any log–concave probability measure, which is the main motivation for this paper.

Organization of the paper. This section, apart from the above introduction, defines the notation used throughout the paper. The second section is devoted to studying the general properties of the inequality ICIC. In subsection 2.1 we recall the definition of property (τ)(\tau) and its ties to concentration from . In subsection 2.2 we study the opposite implication — what additional assumptions one needs to infer (τ)(\tau) from concentration inequalities. In subsection 2.3 we show that Λμ\Lambda^{\star}_{\mu} is indeed the largest possible cost function and define the inequality ICIC. In subsection 2.4 we show that product log–concave measures satisfy ICIC.

In the third section we give more attention to the concentration inequalities tied to ICIC. In subsection 3.1 we show the connection to Zp\mathcal{Z}_{p} bodies. In subsection 3.2 we continue in this vein with the additional assumption our measure is α\alpha–regular. In subsection 3.3 we show how ICIC implies a comparison of weak and strong moments and the results of and .

In the fourth section we give a modification of the two–level concentration for the exponential measure, in which for sets lying far away from the origin only an enlargement by tB1ntB_{1}^{n} is used. This will be used in the fifth section, which focuses on the uniform measure on the BpnB_{p}^{n} ball. In subsection 5.1 we define and study two rather standard transports of measure used further on. In subsection 5.2 we use these transports along with the concentration from section 4 and a Cheeger inequality from to give a proof of ICIC for p2p\leq 2. In section 5.3 we show a proof of ICIC for p2p\geq 2 and a proof of the log–Sobolev inequality for p2p\geq 2.

We conclude with a few possible extensions of the results of the paper in the sixth section.

The letters c,Cc,C denote absolute numerical constants, which may change from line to line. By c(p),C(p)c(p),C(p) we mean constants dependent on pp (or, formally, a family of absolute constants indexed by pp), these also may change from line to line. Other letters, in particular greek letters, denote constants fixed for a given proof or section. For any sets of positive real numbers aia_{i} and bib_{i}, iIi\in I, by aibia_{i}\sim b_{i} we mean there exist absolute numerical constants c,C>0c,C>0 such that cai<bi<Caica_{i}<b_{i}<Ca_{i} for any iIi\in I. Similarly for collections of sets AiA_{i} and BiB_{i} by AiBiA_{i}\sim B_{i} we mean cAiBiCAicA_{i}\subset B_{i}\subset CA_{i} for any iIi\in I, where again c,C>0c,C>0 are absolute numerical constants. By p\sim_{p} we mean the constants above can depend on pp.

Infimum convolution inequality

denotes the infimum convolution of ff and gg.

The following two easy observations are almost immediate (c.f. ):

If pairs (μi,φi)(\mu_{i},\varphi_{i}), i=1,,ki=1,\ldots,k have property (τ)(\tau) and φ(x1,,xk)=φ1(x1)++φk(xk)\varphi(x_{1},\ldots,x_{k})=\varphi_{1}(x_{1})+\ldots+\varphi_{k}(x_{k}), then the couple (i=1kμi,φ)(\otimes_{i=1}^{k}\mu_{i},\varphi) also has property (τ)(\tau).

Then the pair (μT1,ψ)(\mu\circ T^{-1},\psi) has property (τ)(\tau).

Maurey noticed that property (τ)(\tau) implies μ(A+Bφ(t))1μ(A)1et,\mu(A+B_{\varphi}(t))\geq 1-\mu(A)^{-1}e^{-t}, where

We will need a slight modification of this estimate.

Property (τ)(\tau) for (φ,μ)(\varphi,\mu) implies for any Borel set AA and t0t\geq 0,

Thus from property (τ)(\tau) for ff we have

from which, extracting the condition upon μ(A+Bφ(t)){\mu}(A+B_{\varphi}(t)) by direct calculation, we get (2).

Let ft(p):=etp/((et1)p+1)f_{t}(p):=e^{t}p/((e^{t}-1)p+1), notice that ftf_{t} is increasing in pp and for pet/2/2p\leq e^{-t/2}/2,

hence ft(p)>min(et/2p,1/2)f_{t}(p)>\min(e^{t/2}p,1/2) and (3) follows. Moreover for p1/2p\geq 1/2

Let F(x)=ν(,x]F(x)=\nu(-\infty,x] and gt(p)=F(F1(p)+t)g_{t}(p)=F(F^{-1}(p)+t). Previous calculations show that for t,p>0t,p>0, ft(p)gt/2(p)f_{t}(p)\geq g_{t/2}(p) if F1(p)+t/20F^{-1}(p)+t/2\leq 0 or F1(p)0F^{-1}(p)\geq 0. Since gt+s=gtgsg_{t+s}=g_{t}\circ g_{s} and ft+s=ftfsf_{t+s}=f_{t}\circ f_{s}, we get that ft(p)gt/2(p)f_{t}(p)\geq g_{t/2}(p) for all t,p>0t,p>0, hence (2) implies (5). ∎

The main theorem of states that ν\nu satisfies (τ\tau) with a sufficiently chosen cost function.

Let w(x)=136x2w(x)=\frac{1}{36}x^{2} for x4|x|\leq 4 and w(x)=29(x2)w(x)=\frac{2}{9}(|x|-2) otherwise. Then the pair (νn,i=1nw(xi))(\nu^{n},\sum_{i=1}^{n}w(x_{i})) has property (τ)(\tau).

Theorem 2.5 together with Proposition 2.4 immediately gives the following two-level concentration:

that was first established (with different universal, rather large constants) by Talagrand .

2 From concentration to property (τ)𝜏(\tau)

Proposition 2.4 shows that property (τ)(\tau) implies concentration, the next result presents the first approach to the converse implication.

Then the pair (μ,136φ(β))(\mu,\frac{1}{36}\varphi(\frac{\cdot}{\beta})) has property (τ)(\tau). In particular if φ\varphi is convex, symmetric and φ(0)=0\varphi(0)=0 then (7) implies property (τ)(\tau) for (μ,φ(36β))(\mu,\varphi(\frac{\cdot}{36\beta})).

To finish the proof of the first assertion, by Theorem 2.5 it is enough to show that

Since the set A(gw,u)A(g\Box w,u) is a halfline, it is enough to prove that

Let us fix x1x_{1} and x2x_{2} with g(x1)+w(x2)<ug(x_{1})+w(x_{2})<u and take s1>g(x1)s_{1}>g(x_{1}) s2=w(x2)s_{2}=w(x_{2}) with s1+s2<us_{1}+s_{2}<u. Put A:=A(f,s1)A:=A(f,s_{1}), then μ(A)=ν(A(g,s1))ν(,x1]\mu(A)=\nu(A(g,s_{1}))\geq\nu(-\infty,x_{1}]. By the definition of ww it easily follows that x2max{6s2,9s2}x_{2}\leq\max\{6\sqrt{s_{2}},9s_{2}\}, hence by (7), μ(A+βBφ(36s2))ν(,x1+x2].\mu(A+\beta B_{\varphi}(36s_{2}))\geq\nu(-\infty,x_{1}+x_{2}]. Since

The last part of the statement immediately follows since any symmetric convex function φ\varphi is radius-wise nondecreasing and if additionally φ(0)=0\varphi(0)=0, then φ(x/36)φ(x)/36\varphi(x/36)\leq\varphi(x)/36 for any xx. ∎

The next proposition shows that inequalities (3) and (4) are strongly related.

The following two conditions are equivalent for any Borel set KK and γ>1\gamma>1,

and we get the contradiction with (10). ∎

Let us fix the set AA with μ(A)=ν(,x]\mu(A)=\nu(-\infty,x]. Notice that A+2K=A+K+KA+KA+2K=A+K+K\supset A+K. If x+t0x+t\leq 0, then μ(A+K)>etμ(A)=ν(,x+t]\mu(A+K)>e^{t}\mu(A)=\nu(-\infty,x+t]. If x0x\geq 0, Proposition 2.7 gives

Finally, if x0x+tx\leq 0\leq x+t, we get μ(A+K)1/2=ν(,0]\mu(A+K)\geq 1/2=\nu(-\infty,0], hence by the previous case,

Corollary 2.8 shows that if the cost function φ\varphi is symmetric and convex, condition (7) (with 2β2\beta instead of β\beta) for t1t\geq 1 is implied by the following:

To treat the case t1t\leq 1 we will need Cheeger’s version of the Poincaré inequality.

It is not hard to check that Cheeger’s inequality (cf. [6, Theorem 2.1]) implies

Finally, we may summarize this section with the following statement.

Suppose that the cost function φ\varphi is convex, symmetric with φ(0)=0\varphi(0)=0 and 1φ(x)(αx)21\wedge\varphi(x)\leq(\alpha|x|)^{2} for all xx. If the measure μ\mu satisfies Cheeger’s inequality with the constant β=1/δ\beta=1/\delta and the condition (11) is satisfied for all t1t\geq 1 and C=γC=\gamma then (μ,φ(/C))(\mu,\varphi(\cdot/C)) has property (τ)(\tau) with the constant C=36min{2γ,αδ}C=36\min\{2\gamma,\alpha\delta\}.

Notice that αBφ(t)tB2n\alpha B_{\varphi}(t)\supset\sqrt{t}B_{2}^{n} for all t<1t<1, hence Cheeger’s inequality implies that condition (7) holds for t<1t<1 with C=αδC=\alpha\delta. Therefore (7) holds for all t0t\geq 0 with C=min{2γ,αδ}C=\min\{2\gamma,\alpha\delta\} and the assertion follows by Corollary 2.6. ∎

3 Optimal cost functions

A natural question arises: what other pairs (μ,φ)(\mu,\varphi) have property (τ\tau)? First we have to choose the right cost function. To do this let us recall the following definitions.

The Legendre transform of any function is a convex function. If ff is convex and lower semi-continuous, then LLf=f\mathcal{L}\mathcal{L}f=f, and otherwise LLff\mathcal{L}\mathcal{L}f\leq f. In general, if fgf\geq g, then LfLg\mathcal{L}f\leq\mathcal{L}g. The Legendre transform satisfies L(Cf)(x)=CLf(x/C)\mathcal{L}(Cf)(x)=C\mathcal{L}f(x/C) and if g(x)=f(x/C)g(x)=f(x/C), then Lg(x)=Lf(Cx)\mathcal{L}g(x)=\mathcal{L}f(Cx). For this and other properties of L\mathcal{L}, cf. . The Legendre transform has been previously used in the context of convex geometry, see for instance and .

The function Λμ\Lambda^{\star}_{\mu} plays a crucial role in the theory of large deviations cf. .

Take f(x)=x,vf(x)=\left\langle x,v\right\rangle. Then

where the last equality uses the fact that μ{\mu} is symmetric. Thus by taking the logarithm we get Lφ(v)2Λμ(v)\mathcal{L}\varphi(v)\geq 2\Lambda_{\mu}(v), and by applying the Legendre transform we obtain φ(v)=LLφ(v)2Λμ(v/2).\varphi(v)=\mathcal{L}\mathcal{L}\varphi(v)\leq 2\Lambda^{\star}_{\mu}(v/2). The inequality 2Λμ(v/2)Λμ(v)2\Lambda^{\star}_{\mu}(v/2)\leq\Lambda^{\star}_{\mu}(v) follows by the convexity of Λμ\Lambda_{\mu}^{\star}. ∎

The above remark motivates the following definition.

which substituted into (13) gives the thesis. ∎

Direct calculation shows that Λν(x)=ln(1x2)\Lambda_{\nu}(x)=-\ln(1-x^{2}) for x<1|x|<1 and

Since a/2aln(1+a/2)aa/2\leq a-\ln(1+a/2)\leq a for a0a\geq 0, we get 12(1+x21)Λν(x)1+x21\frac{1}{2}(\sqrt{1+x^{2}}-1)\leq\Lambda_{\nu}^{\star}(x)\leq\sqrt{1+x^{2}}-1. Finally

The last statement follows by Theorem 2.5, since min((x/9)2,x/9)w(x)\min((x/9)^{2},|x|/9)\leq w(x). ∎

4 Logaritmically concave product measures

It is easy to check that for any measure μ{\mu} with a full–dimensional support there exists a linear map LL such that μL1{\mu}\circ L^{-1} is isotropic.

The next theorem (with a different universal, but rather large constant) may be deduced from the results of Gozlan . We give the following, relatively short proof for the sake of completeness.

then g(x)ex/ag(0)g(x)\leq e^{-x/a}g(0) for x>ax>a and g(x)ex/ag(0)g(x)\geq e^{-x/a}g(0) for x[0,a)x\in[0,a). Therefore

Notice that T(0)=1/(2g(0))4T^{\prime}(0)=1/(2g(0))\leq 4, thus by concavity of TT, Tx4xTx\leq 4x for x0x\geq 0. Moreover for x0x\geq 0, h(Tx)=x+ln2h(Tx)=x+\ln 2.

where w(x)w(x) is as in Theorem 2.5. We have two cases. i) Tx16Tx\leq 16, then

We expect that in fact a more general fact holds.

Concentration inequalities.

Sets Zp(μK){\cal Z}_{p}(\mu_{K}) for p1p\geq 1, when μK\mu_{K} is the uniform disribution on the convex body KK are called LpL_{p}-centroid bodies of KK, their properties were investigated in .

Let us take vZp(μ)v\in{\cal Z}_{p}(\mu), we need to show that Λμ(v/(21/pe))p\Lambda_{\mu}^{\star}(v/(2^{1/p}e))\leq p, that is

i) β21/pep\beta\leq 2^{1/p}ep. Then, since Λμ(u)u,xdμ(x)=0\Lambda_{\mu}(u)\geq\int\langle u,x\rangle d\mu(x)=0,

Hence Λμ(21/pepu/β)p\Lambda_{\mu}(2^{1/p}epu/\beta)\geq p and Λμ(u)β21/pepΛμ(21/pepu/β)β21/pe\Lambda_{\mu}(u)\geq\frac{\beta}{2^{1/p}ep}\Lambda_{\mu}(2^{1/p}epu/\beta)\geq\frac{\beta}{2^{1/p}e}. Therefore

Using the symmetry and isotropicity of μ\mu, we get

where to get the last inequality we used ln(1+x)x\ln(1+x)\leq x for x>1x>-1. ∎

2 α𝛼\alpha-regular measures.

To establish inlusions opposite to those in the previous subsection, we introduce the following property:

If μ\mu is α\alpha-regular for some α1\alpha\geq 1, then for any p2p\geq 2,

Take any v4eαZp(μ)v\notin 4e\alpha{\cal Z}_{p}(\mu), then we may find uMp(μ)u\in{\cal M}_{p}(\mu) such that v,u>4eα\left\langle v,u\right\rangle>4e\alpha and obtain

If μ\mu is symmetric, isotropic α\alpha-regular for some α1\alpha\geq 1, then

We have by the symmetry, isotropicity and regularity of μ\mu,

so Λμ(u)α2e2u2/2\Lambda_{\mu}(u)\leq\alpha^{2}e^{2}|u|^{2}/2 for αeu1\alpha e|u|\leq 1. Thus Λμ(u)min{u2αe,u22α2e2}\Lambda_{\mu}^{*}(u)\geq\min\{\frac{|u|}{2\alpha e},\frac{|u|^{2}}{2\alpha^{2}e^{2}}\} for all uu. ∎

We always have for pqp\geq q, Mp(μ)Mq(μ){\cal M}_{p}(\mu)\subset{\cal M}_{q}(\mu) and Zq(μ)Zp(μ){\cal Z}_{q}(\mu)\subset{\cal Z}_{p}(\mu). If the measure μ\mu is α\alpha-regular, then Mq(μ)αpqMp(μ){\cal M}_{q}(\mu)\subset\frac{\alpha p}{q}{\cal M}_{p}(\mu) and Zp(μ)αpqZq(μ){\cal Z}_{p}(\mu)\subset\frac{\alpha p}{q}{\cal Z}_{q}(\mu) for pq2p\geq q\geq 2. Moreover for any symmetric measure μ\mu, Λμ(0)=0\Lambda_{\mu}^{*}(0)=0, hence by the convexity of Λμ\Lambda_{\mu}^{*}, Bq(μ)Bp(μ)pqBq(μ)B_{q}(\mu)\subset B_{p}(\mu)\subset\frac{p}{q}B_{q}(\mu) for all pq>0p\geq q>0.

Symmetric log–concave measures are 1-regular.

so it is enough to show that the function f(x):=1x(Γ(x+1))1/xf(x):=\frac{1}{x}(\Gamma(x+1))^{1/x} is nonincreasing on [2,)[2,\infty). Binet’s form of the Stirling formula (cf. [1, Theorem 1.6.3]) gives

where μ(x)=0arctg(t/x)(e2πt1)1dt\mu(x)=\int_{0}^{\infty}{\rm arctg}(t/x)(e^{2\pi t}-1)^{-1}dt is decreasing function. Thus

is indeed nonincreasing on [2,)[2,\infty). ∎

This definition is motivated by the following Corollary:

By Remark 3.7, Proposition 2.4 and the definition of Bp(μ)B_{p}(\mu) we have

so the first part of the statement immediately follows by Proposition 3.5.

By Proposition 2.7 this implies property (11). Additionally Λμ\Lambda^{\star}_{{\mu}} is convex, symmetric and Λμ(0)=0\Lambda^{\star}_{\mu}(0)=0. Finally, from Proposition 3.3 we have min{1,Λμ(u)}u2\min\{1,\Lambda^{\star}_{\mu}(u)\}\leq|u|^{2}. Thus, from Proposition 2.9 we get the second part of the statement.

By Proposition 2.7 in the definition 3.9 we could use the equivalent condition μ(A+βZp(μ))min{epμ(A),1/2}\mu(A+\beta{\cal Z}_{p}(\mu))\geq\min\{e^{p}\mu(A),1/2\}. The next proposition shows that for log-concave measures these conditions are satisfied for large pp and for small sets.

Using a standard volumetric estimate for any r>0r>0 we may choose SMr(μ)S\subset{\cal M}_{r}(\mu) with #S5n\#S\leq 5^{n} such that Mr(μ)uS(u+12Mr(μ)){\cal M}_{r}(\mu)\subset\bigcup_{u\in S}(u+\frac{1}{2}{\cal M}_{r}(\mu)). Then for t>0t>0,

Let μ(A)=eq\mu(A)=e^{-q}, we will consider two cases.

i) pmax{q,cn}p\geq\max\{q,cn\}. Then by Remark 3.7,

so A30c1Zp(μ)A\cap 30c^{-1}{\cal Z}_{p}(\mu)\neq\emptyset, hence 0A+30c1Zp(μ)0\in A+30c^{-1}{\cal Z}_{p}(\mu) and

The previous facts motivate the following.

Proposition 3.10 shows that Conjecture 1 implies Conjecture 2. Both hypotheses would be equivalent provided that the following conjecture of Kannan, Lovász and Simonovits holds.

There exists an absolute constant CC such that any symmetric isotropic log–concave probability measure satisfies Cheeger’s inequality with constant 1/C1/C.

3 Comparison of weak and strong moments

where \|\cdot\|_{*} denotes the norm dual to \|\cdot\|.

hence xM+tmp\|x\|\leq M+tm_{p} for xA+tZp(μ)x\in A+t{\cal Z}_{p}(\mu). Thus for tpt\geq p,

Under the assumptions of Proposition 3.12 by the triangle inequality we get for γ=4αβ\gamma=4\alpha\beta,

Notice that x22dμ=n\int\|x\|_{2}^{2}d\mu=n and u2=u2\|u\|_{2}^{*}=\|u\|_{2}. Hence i) follows directly from (19) with p=q=2p=q=2. Moreover (19) with q=2q=2 implies

by the α\alpha-regularity and isotropicity of μ\mu. ∎

Property i) plays the crucial role in the Klartag proof of the central limit theorem for convex bodies . Paouris showed that moments of the Euclidean norm for symmetric isotropic log-concave measures are bounded by C(p+n)C(p+\sqrt{n}). Thus Conjecture 4 would imply both Klartag CLT (with the optimal speed of convergence) and Paouris concentration.

We conclude this section with the estimate that shows comparison of weak and strong moments for any probability measure and p>n/Cp>n/C.

As in the proof of Proposition 3.11 we can find u1,,uNu_{1},\ldots,u_{N} with ui1\|u_{i}\|_{*}\leq 1, N5nN\leq 5^{n} such that x2maxiNui,x\|x\|\leq 2\max_{i\leq N}\left\langle u_{i},x\right\rangle for all xx. Then

Modified Talagrand concentration for exponential measure

In this section we show that for a set lying far from the origin Talagrand’s two level concentration for the exponential measure may be somewhat improved, namely (for sufficiently large tt) it is enough to enlarge the set by tB1ntB_{1}^{n} instead of tB1n+tB2ntB_{1}^{n}+\sqrt{t}B_{2}^{n}.

If ut>0u\geq t>0 then for any i{1,,n}i\in\{1,\ldots,n\} we have

Obviously we may assume that i=1i=1 and unu\leq n. Let A1:=AnB1n{x ⁣:x1u}A_{1}:=A\cap nB_{1}^{n}\cap\{x\colon x_{1}\geq u\} and B:={xB1n:x1i2xi}B:=\{x\in B_{1}^{n}:x_{1}\geq\sum_{i\geq 2}|x_{i}|\}. From the definition of BB and A1A_{1} we have A1tBnB1nA_{1}-tB\subset nB_{1}^{n}. On the other hand B={x:x11/2+i2xi1/2}B=\{x:|x_{1}-1/2|+\sum_{i\geq 2}|x_{i}|\leq 1/2\}, so B=2nB1n=(2r1,n)n|B|=2^{-n}|B_{1}^{n}|=(2r_{1,n})^{-n}. Thus

Then we easily check that tB/(1s)=A1/s|tB/(1-s)|=|A_{1}/s|. Since A1{xnB1n ⁣:x1t}A_{1}\subset\{x\in nB_{1}^{n}\colon x_{1}\geq t\} we get A11/n(nt)/r1,n|A_{1}|^{1/n}\leq(n-t)/r_{1,n} and s2(nt)/(2nt)s\leq 2(n-t)/(2n-t). Now we can use the Brunn-Minkowski inequality to get

Notice that A1+tB1n{x ⁣:x1ut}A_{1}+tB_{1}^{n}\subset\{x\colon x_{1}\geq u-t\}, so we obtain

A similar result (although with a constant multiplicative factor) can be obtained using the same technique and more calculations for n1/pBpnn^{1/p}B_{p}^{n} instead of nB1nnB_{1}^{n} for pp\in.

If ut>0u\geq t>0 then for any i{1,,n}i\in\{1,\ldots,n\} we have

and also P1(A)+B1n+kP1(A+B1n)P^{-1}(A)+B_{1}^{n+k}\subset P^{-1}(A+B_{1}^{n}). From Lemma 4.1 we have

Let At=A+tB1nA_{t}=A+tB_{1}^{n}. By Lemma 4.3 we get for any s0s\geq 0 and any ii:

To get the assertion it is enough to take the sum over all ii and notice that the function f(y):=(yt)+2f(y):=(\sqrt{y}-t)_{+}^{2} is convex on [0,)[0,\infty), hence

Then Ak+tB1n{x ⁣:5tn+t(2k3)x<5tn+t(2k+1)}A_{k}+tB_{1}^{n}\subset\{x\colon 5t\sqrt{n}+t(2k-3)\leq|x|<5t\sqrt{n}+t(2k+1)\}, hence

From Proposition 4.4 applied for AkA_{k} we have

In particular (\refl1enl)(\ref{l1_enl}) holds if A(50nB2n+tB1n)=A\cap(50\sqrt{n}B_{2}^{n}+tB_{1}^{n})=\emptyset.

Let AkA_{k} denote A+10kB1nA+10kB_{1}^{n} for k=0,1,k=0,1,\ldots. If for any 0kt/100\leq k\leq t/10 we have νn(Ak50nB2n)νn(A)/2\nu^{n}(A_{k}\cap 50\sqrt{n}B_{2}^{n})\geq\nu^{n}(A)/2, the thesis is proved. Thus we assume otherwise. Let Ak:=Ak50nB2nA_{k}^{\prime}:=A_{k}\setminus 50\sqrt{n}B_{2}^{n}. From Lemma 4.5 we have

By a simple induction we get νn(Ak)e2kνn(A)\nu^{n}(A_{k})\geq e^{2k}\nu^{n}(A) for any kt/10k\leq t/10. Thus we get

where the last part follows from Stirling’s formula. Thus rp,nn1/p{r_{p,n}}\sim n^{1/p}.

where fp(t)=t2f_{p}(t)=t^{2} for t<1t<1 and fp(t)=tpf_{p}(t)=t^{p} for t1t\geq 1.

We shall use the facts proved in Section 3 to approximate Bt(νp)B_{t}({\nu_{p}}). Note that νp{\nu_{p}} is log-concave (as its density is log-concave) and symmetric. It is 1–regular from Proposition 3.8. Also

Thus Zt(νp)[t1/p,t1/p]Z_{t}({\nu_{p}})\sim[-t^{1/p},t^{1/p}] for t1|t|\geq 1, so by Propositions 3.2 and 3.5, Bt(νp)[t1/p,t1/p]B_{t}({\nu_{p}})\sim[-t^{1/p},t^{1/p}]. Hence, for all t0t\geq 0 we have {x ⁣:fp(x)t}{x:Λνp(x)t}\{x\colon f_{p}(|x|)\leq t\}\sim\{x:\Lambda^{\star}_{\nu_{p}}(x)\leq t\}, so Λνp(t/C)fp(t)Λνp(Ct)\Lambda^{\star}_{\nu_{p}}(t/C)\leq f_{p}(t)\leq\Lambda^{\star}_{{\nu_{p}}}(Ct). As Λνp\Lambda^{\star}_{\nu_{p}} is symmetric, the proof is finished. ∎

For t<1t<1 we use Propositions 3.3 and 3.6. Both μp,n{\mu_{p,n}} and νpn{\nu_{p}^{n}} are symmetric, log–concave measures, and both can be rescaled as in Proposition 5.1 to be isotropic, thus Bt(μp,n)tB2nBt(νpn)B_{t}({\mu_{p,n}})\sim\sqrt{t}B_{2}^{n}\sim B_{t}({\nu_{p}^{n}}).

Lemma 6 from gives (after rescaling by rp,n{r_{p,n}}),

It is not hard to verify that Bt(μp,n)rp,nBpnB_{t}({\mu_{p,n}})\sim{r_{p,n}}B_{p}^{n} for tnt\geq n.

We are now going to investigate two transports of measure. They will combine to transport a measure with known concentration properties (νn\nu^{n} or ν2n\nu_{2}^{n}, that is the exponential or Gaussian measure) to the uniform measure μp,n{\mu}_{p,n}. We will investigate the contractive properties of these transports with respect to various norms. Our motivation is the following:

Let us prove the first statement, the second proof is almost identical. Suppose U(x)U(A)+δ1/pt1/pBpnU(x)\in U(A)+\delta^{1/p}t^{1/p}B_{p}^{n}. Then there exists yAy\in A such that U(x)U(y)ppδt.\|U(x)-U(y)\|_{p}^{p}\leq\delta t. From the assumption we have txyqqt\geq\|x-y\|_{q}^{q}, which means xA+t1/qBqnx\in A+t^{1/q}B_{q}^{n}, and U(x)U(A+t1/qBqn)U(x)\in U(A+t^{1/q}B_{q}^{n}). ∎

Let us first show the following simple estimate.

Then f(0)=0f(0)=0 and f(u)=euuq(12u/q+2/q)0f^{\prime}(u)=e^{-u}u^{q}(1-2u/q+2/q)\geq 0 for 0uq/20\leq u\leq q/2. ∎

Now we are ready to state the basic properties of Tp,nT_{p,n}.

i) The map Tp,nT_{p,n} transports the probability measure νpn\nu_{p}^{n} onto the measure μp,n{\mu}_{p,n}. ii) For all t>0t>0 we have etp/nt2γpfp,n(t)te^{-t^{p}/n}t\leq 2\gamma_{p}f_{p,n}(t)\leq t and fp,n(t)(2γp)11f_{p,n}^{\prime}(t)\leq(2\gamma_{p})^{-1}\leq 1. iii) For any t>0t>0, 0fp,n(t)/tfp,n(t)min{1,2ptp/n}0\leq f_{p,n}(t)/t-f_{p,n}^{\prime}(t)\leq\min\{1,2pt^{p}/n\}. iv) The function tfp,n(t)/tt\mapsto f_{p,n}(t)/t is decreasing on (0,)(0,\infty) and for any s,t>0s,t>0,

The definition of Tp,nT_{p,n} directly implies i). Differentiation of (22) gives

which, when the nn-th root is taken, give the first part of ii).

For the second part of ii) we use (23) and the estimate above to get

To show iii) first notice that by (23) and ii),

thus fp,n(t)/tfp,n(t)0f_{p,n}(t)/t-f_{p,n}^{\prime}(t)\geq 0. Moreover by ii), fp,n(t)/tfp,n(t)fp,n(t)/t1f_{p,n}(t)/t-f_{p,n}^{\prime}(t)\leq f_{p,n}(t)/t\leq 1, so we may assume that 2ptp/n12pt^{p}/n\leq 1. By (22) and Lemma 5.7 we obtain

Thus using again (23) and part ii) we get

By iii) we get (fp,n(t)/t)0(f_{p,n}(t)/t)^{\prime}\leq 0, which proves the first part of iv). For the second part suppose that s>t>0s>t>0, then

The next Proposition may be also deduced (with different constant) from the more general fact proved in .

Assume s:=xpt:=yps:=\|x\|_{p}\geq t:=\|y\|_{p}, we apply Proposition 5.8 and get

Let s=xps=\|x\|_{p} and t=ypt=\|y\|_{p}, we use Proposition 5.8 as in the proof of Proposition 5.9, and the Hölder inequality,

The second transport we will use is a simple product transport which transports the measure νpn\nu_{p}^{n} onto νqn\nu_{q}^{n}. We shall be particularly interested in the cases p=1p=1 and p=2p=2, but most of the results can be stated in the more general setting.

Note that wp,q1=wq,pw_{p,q}^{-1}=w_{q,p} and (Wp,qn)1=Wq,pn(W_{p,q}^{n})^{-1}=W_{q,p}^{n}. Differentiating equality (24) we get

We will prove that wp,qw_{p,q} behaves very much like xp/qx^{p/q} for large xx, and is more or less linear for small xx. We begin with the bound for q=1q=1.

For p1p\geq 1 we have i) vp(x)xp+ln(pγpxp1)v_{p}(x)\geq x^{p}+\ln(p\gamma_{p}x^{p-1}) and vp(x)pxp1v_{p}^{\prime}(x)\geq px^{p-1} for x0x\geq 0, ii) vp(x)e+xp+ln(pγpxp1)v_{p}(x)\leq e+x^{p}+\ln(p\gamma_{p}x^{p-1}) and vp(x)eepxp1v_{p}^{\prime}(x)\leq e^{e}px^{p-1} for x1x\geq 1, iii) vp(x)vp(y)21pxyp|v_{p}(x)-v_{p}(y)|\geq 2^{1-p}|x-y|^{p}.

Note that γ1=1\gamma_{1}=1. We have for x0x\geq 0,

and for x1x\geq 1, since (1+r/p)per1+er(1+r/p)^{p}\leq e^{r}\leq 1+er for rr\in, we get

Notice that by (25), vp(x)=exp+vp(x)/γpv_{p}^{\prime}(x)=e^{-x^{p}+v_{p}(x)}/\gamma_{p}, hence we may estimate vpv_{p}^{\prime} using the just derived bounds on vpv_{p}.

The lower bound on vpv_{p}^{\prime} yields vp(x)vp(y)xyp|v_{p}(x)-v_{p}(y)|\geq|x-y|^{p} for x,y0x,y\geq 0. The same estimate holds for x,y0x,y\leq 0, since vpv_{p} is odd. Finally for x0yx\geq 0\geq y we have

i) For pq1p\geq q\geq 1, wp,q(x)xp/q|w_{p,q}(x)|\geq|x|^{p/q} and wp,q(x)γqγp12w_{p,q}^{\prime}(x)\geq\frac{\gamma_{q}}{\gamma_{p}}\geq\frac{1}{2}. ii) For p2p\geq 2, wp,2(x)18pxp/21.w_{p,2}^{\prime}(x)\geq\frac{1}{8}\sqrt{p}|x|^{p/2-1}.

Since the function wp,qw_{p,q} is odd, we may and will assume that x0x\geq 0.

i) We have by the monotonicity of up/q1u^{p/q-1} on [0,)[0,\infty),

thus wp,q(x)q/pxw_{p,q}(x)^{q/p}\geq x and wp,q(x)xp/qw_{p,q}(x)\geq x^{p/q}. Formula (25) gives wp,q(x)γq/γp1/2w_{p,q}^{\prime}(x)\geq\gamma_{q}/\gamma_{p}\geq 1/2.

ii) We begin by the following Gaussian tail estimate for z>0z>0:

We have equality when zz{\rightarrow}\infty, and direct calculation shows the derivative of the left–hand–side is no larger than the derivative of the right–hand–side.

Let κ:=4π\kappa:=4\sqrt{\pi}, we will now show that for all x>0x>0 and p2p\geq 2,

Suppose on the contrary that wp,2(x)<up(x)w_{p,2}(x)<u_{p}(x) for some p2p\geq 2 and x>0x>0. Note that by i) we have wp,2γ2/γpγ2=π/2w_{p,2}^{\prime}\geq\gamma_{2}/\gamma_{p}\geq\gamma_{2}=\sqrt{\pi}/2. Thus up(x)u_{p}(x) is equal to the second part of the maximum. This in particular implies that x2/3x\geq 2/3, since for x<2/3x<2/3 we have

Therefore up(x)πx/21/3u_{p}(x)\geq\sqrt{\pi}x/2\geq 1/\sqrt{3}. Now by (26), (24) and (27),

After simplifying this gives up(x)>pxp/2u_{p}(x)>\sqrt{p}x^{p/2}. Hence

which is impossible. This condratiction shows that (28) holds.

Thus we have wp,2(x)up(x)w_{p,2}(x)\geq u_{p}(x) and by (25) we obtain

By taking up(x)=max{πx/2,(xp+ln(pxp/21/(κlnp)))+}u_{p}(x)=\max\{\sqrt{\pi}x/2,\sqrt{(x^{p}+\ln(px^{p/2-1}/(\kappa\ln p)))_{+}}\} for sufficiently large κ\kappa and estimating carefully one may arrive at the bound wp,2(x)C1pxp/21/lnpw_{p,2}^{\prime}(x)\geq C^{-1}px^{p/2-1}/\ln p. One cannot, however, receive a bound of the order of pxp/21px^{p/2-1}.

Property i) follows from the definition of wq,pw_{q,p} and Wq,pnW_{q,p}^{n}. Since wq,p=wp,q1w_{q,p}=w_{p,q}^{-1} we get ii) by Lemma 5.13 i). Property iii) is a direct consequence of ii).

The above inequality together with ii) gives iv) and iv) yields v). ∎

Now we define a transport from the exponential measure νn\nu^{n} to μp,n{\mu_{p,n}} for p2p\geq 2:

This transport satisfies the following bound:

Let s=W1,pn(x)ps=\|W_{1,p}^{n}(x)\|_{p}. By direct calculation we get

Since w1,p=wp,11w_{1,p}=w_{p,1}^{-1}, Proposition 5.13 i) implies w1,p(xj)2|w_{1,p}^{\prime}(x_{j})|\leq 2, while by Proposition 5.8 we have fp,n(s)/s1f_{p,n}(s)/s\leq 1. Thus the first summand can be bounded by 22.

For the second summand note that by Proposition 5.8 iii),

Moreover, W1,pn(x)2n1/21/ps\|W_{1,p}^{n}(x)\|_{2}\leq n^{1/2-1/p}s by the Hölder inequality and

Let ui(t)=(y1,y2,,yi1,t,zi+1,zi+2,,zn)u_{i}(t)=(y_{1},y_{2},\ldots,y_{i-1},t,z_{i+1},z_{i+2},\ldots,z_{n}) for i=1,,ni=1,\ldots,n. Note that ui(yi)=ui+1(zi+1)u_{i}(y_{i})=u_{i+1}(z_{i+1}), u1(z1)=zu_{1}(z_{1})=z and un(yn)=yu_{n}(y_{n})=y, hence

Let si(t):=w1,p(ui(t))ps_{i}(t):=\|w_{1,p}(u_{i}(t))\|_{p}. By vector–valued integration and (29) we get

As in the proof of Proposition 5.17 we show that

To deal with the sum of aia_{i}’s we notice that, since fp,n(s)/s1f_{p,n}(s)/s\leq 1 and w1,p(x)0w_{1,p}^{\prime}(x)\geq 0,

If xytB1n+t1/2B2nx-y\in tB_{1}^{n}+t^{1/2}B_{2}^{n} for some t>0t>0, then for all p2p\geq 2, Sp,nxSp,ny10(t1/2B2nt1/pBpn)S_{p,n}x-S_{p,n}y\in 10(t^{1/2}B_{2}^{n}\cap t^{1/p}B_{p}^{n}).

Let us fix x,yx,y with xytB1n+t1/2B2nx-y\in tB_{1}^{n}+t^{1/2}B_{2}^{n}. By Proposition 5.15 iv),

By Hölder’s inequality Sp,nxSp,ny2n1/21/pSp,nxSp,nyp8t1/2\|S_{p,n}x-S_{p,n}y\|_{2}\leq n^{1/2-1/p}\|S_{p,n}x-S_{p,n}y\|_{p}\leq 8t^{1/2} for tnt\geq n.

Assume now that tnt\leq n. Let zz be such that xzt1/2B2nx-z\in t^{1/2}B_{2}^{n} and zytB1nz-y\in tB_{1}^{n}. Then Sp,nxSp,nz4t1/2B2nS_{p,n}x-S_{p,n}z\in 4t^{1/2}B_{2}^{n} by Proposition 5.17 and W1,pnzW1,pny22t\|W_{1,p}^{n}z-W_{1,p}^{n}y\|_{2}\leq 2\sqrt{t} by Proposition 5.15 v). Thus by Proposition 5.18,

Hence Sp,nxSp,ny10t1/2B2nS_{p,n}x-S_{p,n}y\in 10t^{1/2}B_{2}^{n}. ∎

The last function we define transports the Gaussian measure ν2n\nu_{2}^{n} to μp,n{\mu_{p,n}} for p2p\geq 2.

The first summand is bounded by 2 as in the proof of Proposition 5.17. Since w2,p=wp,21w_{2,p}=w_{p,2}^{-1} we get by Lemma 5.13 ii)

We start with the version of Theorem 4.6 for νp\nu_{p}.

νpn(A+20t1/pBpn)etνpn(A){\nu_{p}^{n}}(A+20t^{1/p}B_{p}^{n})\geq e^{t}{\nu_{p}^{n}}(A) or

{\nu_{p}^{n}}\big{(}(A+20t^{1/p}B_{p}^{n})\cap 100\sqrt{n}B_{2}^{n}\big{)}\geq\frac{1}{2}{\nu_{p}^{n}}(A).

We will use the transport W1,pnW_{1,p}^{n} from νn{\nu^{n}} to νpn{\nu_{p}^{n}}. Proposition 5.15 v) gives W1,pn(x)W1,pn(y)pp2pxy1.\|W_{1,p}^{n}(x)-W_{1,p}^{n}(y)\|_{p}^{p}\leq 2^{p}\|x-y\|_{1}. By Remark 5.5 this means that A+2(10t)1/pBpnW1,pn(Wp,1n(A)+10tB1n)A+2(10t)^{1/p}B_{p}^{n}\supset W_{1,p}^{n}(W_{p,1}^{n}(A)+10tB_{1}^{n}). Let us fix t1t\geq 1 and apply Theorem 4.6 to Wp,1n(A)W_{p,1}^{n}(A) and 10t10t. If the second case occurs, we have

If the first case of Theorem 4.6 occurs, then due to Proposition 5.15 iii) we have W1,pn(x)22x2\|W_{1,p}^{n}(x)\|_{2}\leq 2\|x\|_{2}, so 2αB2nW1,pn(αB2n)2\alpha B_{2}^{n}\supset W_{1,p}^{n}(\alpha B_{2}^{n}) for any α>0\alpha>0. Thus

Corollary 5.2 gives Bs(νpn)C(s1/pBpn+s1/2B2n)B_{s}({\nu_{p}^{n}})\subset C(s^{1/p}B_{p}^{n}+s^{1/2}B_{2}^{n}) for s>0s>0. By Corollary 2.19, νpn{\nu_{p}^{n}} satisfies IC(48)IC(48), which, due to Proposition 2.4 implies νpn(A+48B2t(νpn))min{1/2,etνpn(A)}\nu_{p}^{n}(A+48B_{2t}({\nu_{p}^{n}}))\geq\min\{1/2,e^{t}{\nu_{p}^{n}}(A)\} for any Borel set AA. Thus we have

where in the last step we use the Stirling approximation and CC as always denotes a universal constant. Thus it is enough to take c(α)<(Cα)1c(\alpha)<(C\alpha)^{-1}. ∎

By Propositions 2.7, 3.11, 3.5 and 5.3 it is enough to show

for 1tn1\leq t\leq n and μp,n(A)en\mu_{p,n}(A)\geq e^{-n}.

Recall that Tp,nT_{p,n} denotes the map transporting νpn{\nu_{p}^{n}} to μp,n{\mu_{p,n}}. Apply Lemma 5.22 to Tp,n1(A)T_{p,n}^{-1}(A) and tt. If the first case occurs, we have

Proposition 5.9 gives Tp,nxTp,nyp2xyp\|T_{p,n}x-T_{p,n}y\|_{p}\leq 2\|x-y\|_{p}, thus by Remark 5.5,

Hence we may assume that the second case of Lemma 5.22 holds, that is

In particular νpn(A)en/2{\nu_{p}^{n}}(A^{\prime})\geq e^{-n}/2. Let

We apply Lemma 5.23 for AA^{\prime\prime} and 4t4t to get

Proposition 5.9, Remark 5.5 and the definitions of AA^{\prime} and AA^{\prime\prime} yield

Putting the four estimates together, we can write

which gives (33) in the second case and ends the proof. ∎

A recent result of S. Sodin ([19, Theorem 1]) states (after rescaling from BpnB_{p}^{n} to rp,nBpn{r_{p,n}}B_{p}^{n}) that

3 The easy case – p≥2𝑝2p\geq 2

This case will follow easily from the exponential case and the facts from subsection 5.1.

By Propositions 2.7, 3.11, 3.5 and 5.3, Theorem 5.27 yield the following.

For any p2p\geq 2 and n1n\geq 1 the measure μp,n{\mu_{p,n}} satisfies Cheeger’s inequality (12) with the constant 1/201/20.

Again we shall transport this result from the exponential measure. By Cheeger’s inequality holds for νn\nu^{n} with the constant κ=1/(26)\kappa=1/(2\sqrt{6}), thus by Proposition 5.17 μp,n{\mu_{p,n}} satisfies (12) with the constant κ/41/20\kappa/4\geq 1/20. ∎

As in the proof of Corollary 5.26 we show that Theorem 5.29 and Corollary 5.28 imply infimum convolution inequality for μp,n{\mu_{p,n}}, p2p\geq 2. Adding the two results together we get

We conclude this section with the proof of logaritmic Sobolev–type inequality for μp,n{\mu_{p,n}}.

In particular there exists a universal constant CC such that

Concluding Remarks

Following the proof of Proposition 3.12 we also get for all p2p\geq 2,

References