On the role of Convexity in Isoperimetry, Spectral-Gap and Concentration

Emanuel Milman

Introduction

The first way is by means of an isoperimetric inequality. Recall that Minkowski’s (exterior) boundary measure of a Borel set AΩA\subset\Omega, which we denote here by μ+(A)\mu^{+}(A), is defined as:

A very useful isoperimetric inequality was considered by Cheeger (and in a more general form, independently by V. G. Maz’ya ):

The space (Ω,d,μ)(\Omega,d,\mu) is said to satisfy Cheeger’s isoperimetric inequality if:

The best possible constant DD above is denoted by DChe=DChe(Ω,d,μ)D_{Che}=D_{Che}(\Omega,d,\mu).

A second way to measure the interplay between dd and μ\mu is given by functional inequalities. Let F=F(Ω,d)\mathcal{F}=\mathcal{F}(\Omega,d) denote the space of functions which are Lipschitz on every ball in (Ω,d)(\Omega,d) - we will call such functions “Lipschitz-on-balls” - and let fFf\in\mathcal{F}. We will consider functional inequalities which measure the relation between fLp(μ)\left\|f\right\|_{L_{p}(\mu)} and fLq(μ)\left\|\left|\nabla f\right|\right\|_{L_{q}(\mu)}, for 0<p,q0<p,q\leq\infty (more general Orlicz norms will be treated in ). Here, the effect of the metric dd is via the Riemannian metric gg which is used to measure f:=g(f,f)1/2\left|\nabla f\right|:=g(\nabla f,\nabla f)^{1/2}, although more general ways exist to define f\left|\nabla f\right| in the non manifold setting. Of course if ff is constant there is no sense to compare against fLq(μ)=0\left\|\left|\nabla f\right|\right\|_{L_{q}(\mu)}=0, so we will need to exclude these cases. To this end, we will require that either the expectation EμfE_{\mu}f or median MμfM_{\mu}f of ff are 0. Here Eμf=fdμE_{\mu}f=\int fd\mu and MμfM_{\mu}f is a value so that μ(fMμf)1/2\mu(f\geq M_{\mu}f)\geq 1/2 and μ(fMμf)1/2\mu(f\leq M_{\mu}f)\geq 1/2.

A well known example of a functional inequality was studied by Poincaré:

The space (Ω,d,μ)(\Omega,d,\mu) is said to satisfy Poincaré’s inequality if:

The best possible constant DD above is denoted by DPoin=DPoin(Ω,d,μ)D_{Poin}=D_{Poin}(\Omega,d,\mu).

It is well known (e.g. ) that under appropriate smoothness assumptions, Poincaré’s inequality is equivalent to the existence of a spectral gap of an appropriate Laplacian operator Δg,μ-\Delta_{g,\mu} on (M,g)(M,g) associated to the measure μ\mu with corresponding boundary conditions on its support. When μ\mu is uniform on a domain Ω(M,g)\Omega\subset(M,g), Δg,μ\Delta_{g,\mu} coincides with the usual Laplace-Beltrami operator Δg\Delta_{g} with Neumann boundary conditions on Ω\Omega. The first non-trivial eigenvalue of Δg,μ-\Delta_{g,\mu} (the “spectral gap”) is then precisely DPoin2(Ω,d,μ)D_{Poin}^{2}(\Omega,d,\mu).

A third way to measure the relation between dd and μ\mu is given by concentration inequalities. These measure how tightly 11-Lipschitz functions are concentrated about their mean, by providing a quantitative estimate on the tail decay μ(fEμft)\mu(|f-E_{\mu}f|\geq t). A typical situation is given by the following example:

The space (Ω,d,μ)(\Omega,d,\mu) is said to have exponential concentration if:

Fixing c=ec=e, the best possible constant DD above is denoted by DExp=DExp(Ω,d,μ)D_{Exp}=D_{Exp}(\Omega,d,\mu). The best constant for a specific ff is denoted by DExp(f)D_{Exp}(f).

It is known that the three examples mentioned above are arranged in a hierarchy. It was shown by Cheeger , and in a more general form, independently by Maz’ya (see also ), that Cheeger’s isoperimetric inequality always implies Poincaré’s inequality (or spectral gap):

DPoinDChe/2D_{Poin}\geq D_{Che}/2 (“Cheeger’s inequality”).

The fact that Poincaré’s inequality implies exponential concentration was first shown by M. Gromov and V. Milman in the Riemannian setting, and subsequently by other authors in other settings as well (e.g. , see and the references therein):

There exists a universal numeric constant c>0c>0 such that DExpcDPoinD_{Exp}\geq cD_{Poin}.

2 Reversing the Hierarchy

In both cases, we will say that “our convexity assumptions are fulfilled”. More generally, we present the following definition:

We will say that our smooth convexity assumptions are fulfilled if:

dd denotes the induced geodesic distance on (M,g)(M,g).

dμ=exp(ψ)dvolMd\mu=\exp(-\psi)dvol_{M}, ψC2(M)\psi\in C^{2}(M), and as tensor fields on MM:

We will say that our convexity assumptions are fulfilled if μ\mu can be approximated in total-variation by measures {μm}\left\{\mu_{m}\right\} so that (Ω,d,μm)(\Omega,d,\mu_{m}) satisfy our smooth convexity assumptions.

The condition (1.1) is the well-known Curvature-Dimension condition CD(0,)CD(0,\infty), introduced by Bakry and Émery in their influential paper (in the more abstract framework of diffusion generators). Here RicgRic_{g} denotes the Ricci curvature tensor and HessgHess_{g} denotes the second covariant derivative. When the Ricci tensor satisfies a slightly relaxed condition RicgKgRic_{g}\geq-Kg, K0K\geq 0, it was first shown by Buser that the implication in Theorem 1.1 can be reversed. We only quote the K=0K=0 case, which in our setting reads:

If μ\mu is uniform on a closed nn-dimensional manifold (M,g)(M,g) and Ricg0Ric_{g}\geq 0 then DChecDPoinD_{Che}\geq cD_{Poin}, where c>0c>0 is a universal numeric constant.

The fact that the constant cc above does not depend on the dimension nn is quite remarkable. Buser’s theorem was recently further generalized by M. Ledoux (following the method developed by Bakry–Ledoux ) to the Bakry-Émery abstract setting. Again, we only quote the CD(0,)CD(0,\infty) case:

Under our smooth convexity assumptions DChecDPoinD_{Che}\geq cD_{Poin}, where c>0c>0 is a universal numeric constant.

3 Main Theorem

How about reversing the implication in Theorem 1.2 under our convexity assumptions? This is one of the statements in our Main Theorem below. A second statement, which is much more surprising, concerns a very weak type of concentration inequality, which we introduce:

The space (Ω,d,μ)(\Omega,d,\mu) is said to satisfy First-Moment concentration if:

The best possible constant DD above is denoted by DFM=DFM(Ω,d,μ)D_{FM}=D_{FM}(\Omega,d,\mu).

Clearly, by the Markov-Chebyshev inequality, First-Moment concentration implies linear tail-decay:

and decay slightly faster than linear implies (integrating by parts) First-Moment concentration. The First-Moment concentration is clearly a-priori much weaker than exponential concentration. Our Main Theorem, first announced in , asserts that under our convexity assumptions, not only is First-Moment concentration equivalent to exponential concentration, but in fact also to the a-priori stronger inequalities of Poincaré and Cheeger:

Under our convexity assumptions, the following statements are equivalent:

Cheeger’s isoperimetric inequality (with DCheD_{Che}).

Exponential concentration inequality (with DExpD_{Exp}).

First Moment concentration inequality (with DFMD_{FM}).

The equivalence is in the sense that the constants above satisfy DCheDPoinDExpDFMD_{Che}\simeq D_{Poin}\simeq D_{Exp}\simeq D_{FM}.

Here and below, ABA\simeq B means that C1BAC2BC_{1}B\leq A\leq C_{2}B, with Ci>0C_{i}>0 some universal numerical constants, independent of any other parameter, and in particular the dimension nn. We will see in Section 4 that the use of the First-Moment is not essential in Statement (4); we may have required any arbitrarily slow uniform tail decay, instead of linear decay. In other words, if:

where α\alpha decays to 0 arbitrarily slow, we can deduce under our convexity assumptions that Lipschitz functions have in fact much faster exponential tail decay (with rate depending solely on α\alpha), and in addition the stronger inequalities of Poincaré and Cheeger, as above. In this sense, our result extends the well-known Kahane-Khinchine type inequalities in Convexity Theory (e.g. consequences of Borell’s Lemma , see for an overview) stating that linear functionals have comparable moments, ensuring exponential tail decay, to the same statement for the “worst” 11-Lipschitz function (see Remark 4.4).

The Main Theorem may also be interpreted as stating that under our convexity assumptions, there exists a single 11-Lipschitz function ff whose level sets on average attain the minimum (up to constants) in Cheeger’s isoperimetric inequality (see Section 4). In fact, one may choose this function to be of the form f(x)=d(x,A)f(x)=d(x,A), where AA is some set with μ(A)1/2\mu(A)\geq 1/2. This is expressed in the following reformulation of the Main Theorem:

Under our convexity assumptions on (Ω,d,μ)(\Omega,d,\mu):

Equivalently, this is tantamount to saying that under our convexity assumptions, it is only necessary to use test functions of the form f(x)=d(x,A)f(x)=d(x,A) when testing (up to a universal numeric constant) for the spectral gap DPoin2D_{Poin}^{2} in Poincaré’s inequality. Clearly, without any further assumptions, all of the above statements are in general false.

4 Applications to Spectral Gap of Convex Domains

where c>0c>0 is some universal numeric constant.

Here Vol denotes the Lebesgue measure. In particular, we see that:

Note that K,LK,L satisfying the above condition can be very different geometrically (consider for instance a Euclidean ball of radius 11 and its intersection with a centered slab of width 10/n10/\sqrt{n}), and yet share essentially the same spectral gap. Also note that our stability result holds with respect to all possible Euclidean structures |\cdot| simultaneously, since the assumption in the left-hand side above is independent of the Euclidean structure.

We also observe that the quantitative dependence on vK,vLv_{K},v_{L} in (1.4) is essentially best possible: the logarithmic dependence on 1/vL1/v_{L} is (up to numeric constants) optimal, and the quadratic dependence on vKv_{K} cannot be improved beyond linear (and is in fact optimal in some restricted range, see Example 5.6). In addition, Theorem 1.7 implies that when 1aLKLb\frac{1}{a}L\subset K\subset Lb with a,b1a,b\geq 1, ab1+cnab\leq 1+\frac{c}{n}, then DChe(K)DChe(L)D_{Che}(K)\simeq D_{Che}(L). In fact, when ab1+snab\leq 1+\frac{s}{n} with 1sn1\leq s\leq n, we obtain in Corollary 5.3 the best possible (up to numeric constants) quantitative bounds on DChe(K)/DChe(L)D_{Che}(K)/D_{Che}(L) as a function of ss (see Example 5.7). To the best of our knowledge, no quantitative bounds on the stability of DCheD_{Che} for convex domains under convex perturbations were previously known. Completely analogous stability results hold for log-concave probability measures as well (see Theorem 5.5). Another useful result which we deduce from our Main Theorem is that Cheeger’s constant is preserved under maps which are not necessarily Lipschitz, but rather Lipschitz on average (see Theorem 5.9).

5 Ingredients in Proof of Main Theorem

All of the four statements in our Main Theorem 1.5 can be equivalently (up to universal constants) rewritten using a single unified framework in terms of (p,q)(p,q) Poincaré inequalities:

The space (Ω,d,μ)(\Omega,d,\mu) is said to satisfy a (p,q)(p,q) Poincaré inequality if:

The best possible constant DD above is denoted by Dp,q=Dp,q(Ω,d,μ)D_{p,q}=D_{p,q}(\Omega,d,\mu).

We prefer to use the median MμM_{\mu} in our definition for reasons which will become apparent in Section 2. It is known and easy to establish that DPoinD2,2D_{Poin}\simeq D_{2,2}, DChe=D1,1D_{Che}=D_{1,1}, DFMD1,D_{FM}\simeq D_{1,\infty}, so our Main Theorem can be restated as the claim that all (p,q)(p,q) Poincaré inequalities in the range 1pq1\leq p\leq q\leq\infty are equivalent under our convexity assumptions (see Theorem 2.4).

The convexity assumptions are used in an essential way in the proof of the Main Theorem in several separate places. First, we employ the CD(0,)CD(0,\infty) condition via the semi-group gradient estimates used by Ledoux in his proof of Theorem 1.4. Contrary to previous approaches, which could only deduce isoperimetric information from functional inequalities with a fLq(μ)\left\|\left|\nabla f\right|\right\|_{L_{q}(\mu)} term with q=2q=2 (see [8, p. 3] and the references therein), we can handle arbitrary q1q\geq 1 (and although we do not pursue this direction here, more general Orlicz norms too). To demonstrate that our estimates are sharp, we remark that the isoperimetric inequalities we obtain are in fact equivalent (up to universal constants) to the (p,q)(p,q) Poincaré inequalities used to derive them. This is summarized in Theorem 2.9, which generalizes Theorems 1.1, 1.2, 1.3 and 1.4 above into a single unified framework. Using this, we deduce from the First-Moment inequality (p=1,q=p=1,q=\infty above) that:

To deduce Cheeger’s isoperimetric inequality from (1.5), we need to use our convexity assumptions for the second time. We employ the following series of results in Riemannian Geometry, due to numerous groups of authors , who proved them under increasingly general conditions. A detailed survey of these results may be found in the Appendix. We learned about these results from the PhD Thesis of V. Bayle , which was referenced to us by Sasha Sodin, to whom we are indebted. In the formulation below, we use a slightly more general notion of smooth convexity assumptions, which is defined in Section 6.

Under our generalized smooth convexity assumptions, the isoperimetric profile I=I(Ω,d,μ)I=I_{(\Omega,d,\mu)} is concave on (0,1)(0,1). Moreover, when μ\mu is in addition uniform on Ω(M,g)\Omega\subset(M,g), then In/(n1)I^{n/(n-1)} is concave on $,where, wherenisthedimensionofis the dimension ofM$.

It is not hard to show (see Section 6) that the isoperimetric profile II is continuous under very general assumptions. It then follows by a general argument (e.g. Corollary 6.5) that II must be symmetric about the point 1/21/2. Hence, the concavity of II implies that DChe=2I(1/2)D_{Che}=2I(1/2) under our convexity assumptions. It is then immediate to deduce Cheeger’s isoperimetric inequality from (1.5). In fact, a stronger statement can be deduced when μ\mu is uniform on Ω\Omega (see Remark 2.11).

A final ingredient in the proof is an approximation argument to handle non-smooth densities, which are typical in applications as well as essential for handling uniform measures on bounded domains (with possibly non-smooth boundaries). Contrary to many results in Convexity Theory, where approximation arguments are standard, easy and usually omitted, the isoperimetric profile and the Cheeger constant are delicate objects, which in general are not stable under approximation in the natural total-variation metric (see Section 6). We therefore employ our convexity assumptions one last time, and provide in Section 6 a careful argument for deducing the Main Theorem 1.5 without any smoothness assumptions, and a different approximation procedure for extending Theorem 1.8, which in particular applies to the entire class of log-concave measures in Euclidean space.

The rest of this work is organized as follows. In Section 2, we reformulate the Main Theorem in terms of an equivalence between (p,q)(p,q) Poincaré inequalities, and using Theorem 1.8, reduce it to the statement of Theorem 2.9. The semi-group argument for proving Theorem 2.9 is described in Section 3. Further interpretations and an extension of the Main Theorem are described in Section 4. Applications for the spectral gap under our convexity assumptions are described in Section 5. We conclude with an approximation argument for disposing of our smoothness assumptions in Section 6, and an Appendix describing in more detail the results summarized in the statement of Theorem 1.8.

Acknowledgements. I would like to thank Professor Gideon Schechtman and the Weizmann Institute of Science where this research project commenced during the last months of my PhD studies. I would also like to thank Professor Jean Bourgain and the Institute for Advanced Study for providing the perfect research environment. Most especially, I would like to thank Sasha Sodin for his invaluable help - acquainting me with capacities, suggesting to look at the PhD Thesis of Bayle and Ledoux’s semi-group argument, countless other references, many informative conversations and comments on this manuscript. I am also grateful to Franck Barthe for his kind hospitality, remarks and advice, Bo’az Klartag for several references, discussions and insightful remarks, Professors Sergey Bobkov and Michel Ledoux for their comments and advice, and Professor David Jerison for several interesting conversations and suggestions. My gratitude also extends to the anonymous referees for their extraordinary careful reading and helpful suggestions, which greatly improved the presentation of this work. Finally, I would also like to thank Professors David Jerison, Erwin Lutwak, Assaf Naor, Vladimir Pestov and Santosh Vempala for their invitations to give talks on this work in its early development.

(p,q)𝑝𝑞(p,q) Poincaré Inequalities

We start by rewriting some of the statements of the Main Theorem 1.5.

Let N(μ)N(\mu) denote an Orlicz norm associated to the Young function NN. Then:

Note that 1N(μ)=1/N1(1)\left\|1\right\|_{N(\mu)}=1/N^{-1}(1). First, by Jensen’s inequality (applied twice):

Next, we may assume that MμfEμfM_{\mu}f\geq E_{\mu}f (otherwise exchange ff by f-f). By the Markov-Chebyshev inequality:

We conclude by noting that N1(2)N1(1)2\frac{N^{-1}(2)}{N^{-1}(1)}\leq 2 since NN is convex. ∎

The last lemma implies that we can pass back and forth between using the median MμM_{\mu} and the expectation EμE_{\mu} when excluding constant functions in our functional inequalities, at the expense of losing a universal constant. We therefore see that Poincaré’s inequality is equivalent (up to constants) to the inequality:

(and in fact in this case one clearly has DPoinDPoinMD_{Poin}\geq D^{M}_{Poin}). The next lemma, due to Maz’ya and Federer and Fleming (see also for a careful derivation), rewrites Cheeger’s isoperimetric inequality in functional form:

Cheeger’s isoperimetric inequality (with DCheD_{Che}) holds iff:

It is easy to show that Cheeger’s isoperimetric inequality is recovered by applying (2.2) to Lipschitz functions which approximate χA\chi_{A}, the characteristic function of a Borel set AA, in an appropriate sense. Conversely, the co-area formula, which for general metric probability spaces becomes an inequality (see ), implies for fFf\in\mathcal{F} with Mμf=0M_{\mu}f=0:

Since for a 1-Lipschitz function ff, fL(μ)1\left\|\left|\nabla f\right|\right\|_{L_{\infty}(\mu)}\leq 1, our First-Moment inequality is clearly equivalent to:

in the sense that DFMDFMMD_{FM}\simeq D^{M}_{FM} where DFMMD^{M}_{FM} is the best constant above.

The above functional reformulations remain valid for general metric probability spaces (Ω,d,μ)(\Omega,d,\mu), in which case we interpret f\left|\nabla f\right| for any fFf\in\mathcal{F} as the following Borel function:

(and we define it as 0 if xx is an isolated point - see [19, pp. 184,189] for more details).

With the above reformulations (2.1), (2.2), (2.3) serving as motivation, the reasons behind our definition of (p,q)(p,q) Poincaré inequalities in the Introduction are now clear. Note that DChe=D1,1D_{Che}=D_{1,1}, DPoinM=D2,2D^{M}_{Poin}=D_{2,2} and DFMM=D1,D^{M}_{FM}=D_{1,\infty}. We can now restate our Main Theorem 1.5 as follows:

Under our convexity assumptions, all (p,q)(p,q) Poincaré inequalities are equivalent in the range 1pq1\leq p\leq q\leq\infty. More precisely, for any other 1pq1\leq p^{\prime}\leq q^{\prime}\leq\infty:

In fact, a more precise dependence on pp and pp^{\prime} may be obtained in some cases. For instance, clearly Dp,qDp,qD_{p^{\prime},q^{\prime}}\geq D_{p,q} if ppp^{\prime}\leq p and qqq^{\prime}\geq q without any further convexity assumptions (by Jensen’s inequality), so we see that the First-Moment inequality ((1,)(1,\infty) case) is the weakest among all (p,q)(p,q) Poincaré inequalities in the above range. Another immediate observation is given by:

Let 0<pp0<p\leq p^{\prime}\leq\infty and 0<qq0<q\leq q^{\prime}\leq\infty be such that:

Then without any further convexity assumptions, Dp,qppDp,qD_{p^{\prime},q^{\prime}}\geq\frac{p}{p^{\prime}}D_{p,q}.

Let gFg\in\mathcal{F} denote a function with Mμg=0M_{\mu}g=0. Define f=sign(g)gp/pf=\text{sign}(g)|g|^{p^{\prime}/p}, and apply the (p,q)(p,q) Poincaré inequality to ff. Clearly Mμf=0M_{\mu}f=0, so we obtain by Hölder’s inequality:

Maz’ya–Cheeger inequality: DPoinDChe/2D_{Poin}\geq D_{Che}/2.

Gromov–Milman inequality: DExpcDPoinD_{Exp}\geq cD_{Poin}.

Since DPoinD2,2D_{Poin}\simeq D_{2,2}, we conclude by Proposition 2.5 that Dp,pcDPoin/pD_{p,p}\geq cD_{Poin}/p for every 2p2\leq p\leq\infty. Let ff be a 1-Lipschitz function. It is elementary to show (e.g. ) that 1/DExp(f)1/D_{Exp}(f) is equivalent (to within universal constants) to fEμfΨ1(μ)\left\|f-E_{\mu}f\right\|_{\Psi_{1}(\mu)}, and that gΨ1(μ)\left\|g\right\|_{\Psi_{1}(\mu)} is in turn equivalent to supp1gLp(μ)/p\sup_{p\geq 1}\left\|g\right\|_{L_{p}(\mu)}/{p}. Employing Lemma 2.1 and using the (p,p)(p,p) Poincaré inequalities:

since ff was assumed 1-Lipschitz. Taking supremum on all such functions ff, we obtain the conclusion. ∎

The exact same proof shows that DExpcrDr,rD_{Exp}\geq c_{r}D_{r,r}, for arbitrary r1r\geq 1.

We have seen that passing from (p,q)(p,q) to (p,q)(p^{\prime},q^{\prime}) is manageable if qqq^{\prime}\geq q (perhaps under some additional assumptions on p,pp,p^{\prime}) without any convexity assumptions. Unfortunately, we are interested in the case q<qq^{\prime}<q, for which an analogous statement to Proposition 2.5 is simply false without any additional assumptions (counter examples are easy to construct, as in the Introduction). Our first ingredient in the proof of Theorem 2.4 states that our convexity assumptions already suffice to extend Proposition 2.5 to the case q<qq^{\prime}<q, p<pp^{\prime}<p:

Let 0<p0<p\leq\infty, 1q1\leq q\leq\infty, and set r=1+1p1qr=1+\frac{1}{p}-\frac{1}{q}. Assume that 12r2\frac{1}{2}\leq r\leq 2. Then under our smooth convexity assumptions, the following statements are equivalent:

where the best constants Dp,qD_{p,q} and DrD^{\prime}_{r} above satisfy:

for some universal constants c1,c2>0c_{1},c_{2}>0. In fact, the direction (2)(1)(2)\Rightarrow(1) holds for pqp\geq q without any convexity assumptions.

Note that when p=q=2p=q=2, the direction (2)(1)(2)\Rightarrow(1) reduces (up to constants) to Theorem 1.1 (Maz’ya–Cheeger inequality), and the direction (1)(2)(1)\Rightarrow(2) to the Buser–Ledoux Theorems 1.3,1.4. A generalization of Theorem 2.9 involving general Orlicz norms will be derived in .

Let 0<r10<r\leq 1. Without any convexity assumptions, the (1/r,1)(1/r,1) Poincaré inequality:

is equivalent to the following isoperimetric inequality:

The proof of (2)(1)(2)\Rightarrow(1) is thus complete.

Before proceeding to the proof of the direction (1)(2)(1)\Rightarrow(2) (this will be the focus of the next section), let us recall how Theorem 2.9 coupled with Theorem 1.8 conclude the proof of Theorem 2.4 and hence of our Main Theorem 1.5:

By an approximation argument we develop in Section 6, it is enough to prove the theorem under our smooth convexity assumptions.

By Jensen’s inequality, D1,Dp,qD_{1,\infty}\geq D_{p,q} in the range 1pq1\leq p\leq q\leq\infty. Employing our (smooth) convexity assumptions, the direction (1)(2)(1)\Rightarrow(2) of Theorem 2.9 implies:

Using our (smooth) convexity assumptions for the second time, Theorem 1.8 asserts that II is concave on (0,1)(0,1). Since II is also symmetric about 1/21/2 (see Corollary 6.5), we immediately deduce that:

which is exactly Cheeger’s isoperimetric inequality, and is identical to stating D1,1c2D1,D_{1,1}\geq\frac{c}{2}D_{1,\infty}. Using Proposition 2.5 and Jensen’s inequality if necessary, we can pass from this to an arbitrary (p,q)(p^{\prime},q^{\prime}) inequality in the range 1pq1\leq p^{\prime}\leq q^{\prime}\leq\infty. ∎

Note that when μ\mu is the uniform measure on Ω\Omega, Theorem 1.8 in fact ensures that Inn1I^{\frac{n}{n-1}} is concave, so we may deduce from (2.5) that in fact:

Proposition 2.10 implies that the latter isoperimetric inequality is equivalent to a (nn1,1)(\frac{n}{n-1},1) Poincaré inequality. Hence, it is clear that in this case, both our Main Theorem 1.5 and Theorem 2.4 can be strengthened.

The Semi-Group Argument

In this section, we prove the direction (1)(2)(1)\Rightarrow(2) of Theorem 2.9. Our proof closely follows Ledoux’s proof of Theorem 1.4.

Given a smooth complete oriented connected Riemannian manifold Ω=(M,g)\Omega=(M,g) equipped with a probability measure μ\mu with density dμ=exp(ψ)dvolMd\mu=\exp(-\psi)dvol_{M}, ψC2(M)\psi\in C^{2}(M), we define the associated Laplacian Δ(Ω,μ)\Delta_{(\Omega,\mu)} by:

where ΔΩ\Delta_{\Omega} is the usual Laplace-Beltrami operator on Ω\Omega. Δ(Ω,μ)\Delta_{(\Omega,\mu)} acts on B(Ω)\mathcal{B}(\Omega), the space of bounded smooth real-valued functions on Ω\Omega. Let (Pt)t0(P_{t})_{t\geq 0} denote the semi-group associated to the diffusion process with infinitesimal generator Δ(Ω,μ)\Delta_{(\Omega,\mu)} (cf. ), characterized by the following system of second order differential equations:

For each t0t\geq 0, Pt:B(Ω)B(Ω)P_{t}:\mathcal{B}(\Omega)\rightarrow\mathcal{B}(\Omega) is a bounded linear operator and its action naturally extends to the entire Lp(μ)L_{p}(\mu) spaces (p1p\geq 1). We collect several elementary properties of these operators:

Pt(f)pPt(fp)\left|P_{t}(f)\right|^{p}\leq P_{t}(\left|f\right|^{p}) for all p1p\geq 1.

The following crucial dimension-free reverse Poincaré inequality was shown by Bakry and Ledoux in [5, Lemma 4.2], extending Ledoux’s approach for proving Buser’s Theorem (see also [5, Lemma 2.4], [56, Lemma 5.1]). It may also be interpreted as a weak, dimension-free, form of the Li–Yau parabolic gradient inequality .

Assume that the following Bakry-Émery Curvature-Dimension condition holds on Ω\Omega:

Then for any t0t\geq 0 and fB(Ω)f\in\mathcal{B}(\Omega), we have:

In fact, the proof of this lemma is very general and extends to the abstract framework of diffusion generators, as developed by Bakry and Émery . We comment that in the Riemannian setting, it is known (see also ) that the gradient estimate of Lemma 3.1 is preserved when restricting to a locally convex domain (as defined in the Appendix) with smooth boundary; we refer to Sturm [83, Proposition 4.15] for a general statement about closedness of the Bakry-Émery Curvature-Dimension condition in an arbitrary metric probability space. The above lemma therefore holds under more general conditions, namely when μ\mu is supported on a locally convex domain Ω(M,g)\Omega\subset(M,g) with C2C^{2} boundary, and dμΩ=exp(ψ)dvolMΩd\mu|_{\Omega}=\exp(-\psi)dvol_{M}|_{\Omega}, ψC2(Ω)\psi\in C^{2}(\overline{\Omega}). In this case, ΔΩ\Delta_{\Omega} in (3.1) denotes the Neumann Laplacian on Ω\Omega, B(Ω)\mathcal{B}(\Omega) denotes the space of bounded smooth real-valued functions on Ω\Omega satisfying Neumann’s boundary condition on Ω\partial\Omega, and Lemma 3.1 remains valid.

Our convexity assumptions are that K=0K=0 in Lemma 3.1, and this is what we will henceforth assume. It is clear that our results in this section may be extended to the case of K>0K>0, but we do not pursue this direction in this work.

From Lemma 3.1, it is immediate that for any 2q2\leq q\leq\infty:

and using q=q=\infty, Ledoux easily deduces the following dual statement [56, (5.5)]:

First, our assumption on the range of rr implies that by applying Proposition 2.5 if necessary, we may assume that p1,q2p\geq 1,q\geq 2 at the expense of an additional universal constant appearing in (2.4). An additional universal constant will appear on account of Lemma 2.1, with which we pass to EμE_{\mu} instead of MμM_{\mu} in (1), so our assumption now reads:

Let AA denote an arbitrary Borel set in Ω\Omega, and let χA,ε(x):=(11εdg(x,A))0\chi_{A,\varepsilon}(x):=(1-\frac{1}{\varepsilon}d_{g}(x,A))\vee 0 denote a continuous approximation in Ω\Omega to the characteristic function χA\chi_{A} of AA. Clearly:

Applying Corollary 3.2 to functions in B(Ω)\mathcal{B}(\Omega) which approximate χA,ε\chi_{A,\varepsilon} (in say W1,1(Ω,μ)W^{1,1}(\Omega,\mu)) and passing to the limit inferior as ε0\varepsilon\rightarrow 0, it follows that:

We start by rewriting the right hand side above as:

Note that by Hölder’s inequality (recall that p1p\geq 1) and our assumption (3.5):

Using (3.3) (recall that q2q\geq 2) to estimate Pt(χA)Lq(μ)\left\|\left|\nabla P_{t}(\chi_{A})\right|\right\|_{L_{q}(\mu)}, we conclude that:

We may now optimize on tt. Using the rough estimate:

for s1s\geq 1, we evaluate (3.6) at time:

where r=21/q1/p=1+1/p1/qr=2-1/q-1/p^{*}=1+1/p-1/q. Since r2r\leq 2, this concludes the proof. ∎

As evident from the proof, for deducing the direction (1)(2)(1)\Rightarrow(2) of Theorem 2.9, the definition of smooth convexity assumptions given in the Introduction may be extended to encompass the more general case treated in this section. Moreover, it is possible to provide an approximation argument for deducing this direction without any smoothness assumptions. We provide the argument in and omit it here, since it is not required for the results of this work.

Interpretations and Extensions

In this section, we provide some further interpretations and extensions of our Main Theorem, which will also be needed for the applications of the next section. We assume throughout this section that our convexity assumptions on (Ω,d,μ)(\Omega,d,\mu) are satisfied.

Lemma 2.2 demonstrates that if AA is a set with μ(A)1/2\mu(A)\leq 1/2 on which the minimal ratio DChe=μ+(A)/μ(A)D_{Che}=\mu^{+}(A)/\mu(A) in Cheeger’s isoperimetric inequality is attained (or nearly attained), then the function f=χAf=\chi_{A} (or the sequence of Lipschitz functions which approximate it) attains the same (nearly) minimal ratio

among all functions fFf\in\mathcal{F} with Mμf=0M_{\mu}f=0. Clearly χA\chi_{A} (or its approximating sequence) is far from being 11-Lipschitz. If on the other hand we define:

which is a 11-Lipschitz function, it is not clear that it will have a small ratio in (4.1). Our Main Theorem 1.5 (together with Lemma 2.1) states that under our convexity assumptions, any 11-Lipschitz function f0f_{0} on (Ω,d)(\Omega,d) with Mμf0=0M_{\mu}f_{0}=0 which is (essentially) optimal in the First-Moment inequality (say f0dμ1/(3DFMM)\int|f_{0}|d\mu\geq 1/(3D^{M}_{FM})), also essentially minimizes the ratio in (4.1). Moreover, using the co-area formula as in Lemma 2.2 and applying our Main Theorem, we have:

from which we also see that the ratio μ+(At)/min(μ(At),1μ(At))\mu^{+}(A_{t})/\min(\mu(A_{t}),1-\mu(A_{t})) for the “average” level set AtA_{t} of f0f_{0} is essentially DCheD_{Che}, the smallest possible.

Theorem 1.6 from the Introduction states that f0f_{0} as above may in fact be chosen to be of the form (4.2).

Given a Borel set AΩA\subset\Omega with μ(A)1/2\mu(A)\geq 1/2, we denote gA(x)=d(x,A)g_{A}(x)=d(x,A). Clearly gAg_{A} is 11-Lipschitz and MμgA=0M_{\mu}g_{A}=0, so one direction follows immediately by Lemma 2.2:

For the other direction, we employ our Main Theorem (and Lemma 2.1):

where the infimum is over all 11-Lipschitz functions ff on (Ω,d)(\Omega,d) with Mμf=0M_{\mu}f=0. Denoting A1={f0},A2={f0}A_{1}=\left\{f\leq 0\right\},A_{2}=\left\{f\geq 0\right\}, we have μ(Ai)1/2\mu(A_{i})\geq 1/2, i=1,2i=1,2. By continuity of ff, fA10f|_{\partial A_{1}}\equiv 0, fA20f|_{\partial A_{2}}\equiv 0 (even though it is possible that A1A2\partial A_{1}\neq\partial A_{2}), and since it is 11-Lipschitz:

The next proposition will prove to be very useful for the applications of the next section. We start with some notations. Given a Borel function ff on a Borel probability space (Ω,μ)(\Omega,\mu) and δ\delta\in, let us denote by Qδ(f)=Qμ,δ(f)Q_{\delta}(f)=Q_{\mu,\delta}(f) the δ\delta-quantile of ff:

Let us also recall an inequality due to Paley and Zygmund (see also [46, Chapter 2]), which in its simplest form reads as follows:

Let ff denote a Borel function on Ω\Omega, and assume that:

Then for any θ(0,1)\theta\in(0,1), denoting ε(θ)=(1θ)2/D2\varepsilon(\theta)=(1-\theta)^{2}/D^{2}, one has Q1ε(θ)(f)θfL1(μ)Q_{1-\varepsilon(\theta)}(|f|)\geq\theta\left\|f\right\|_{L_{1}(\mu)}.

Let f0f_{0} denote a 11-Lipschitz function with either Mμf0=0M_{\mu}f_{0}=0 and f0L1(μ)1/(2DFMM)\left\|f_{0}\right\|_{L_{1}(\mu)}\geq 1/(2D^{M}_{FM}) or Eμf0=0E_{\mu}f_{0}=0 and f0L1(μ)1/(2DFM)\left\|f_{0}\right\|_{L_{1}(\mu)}\geq 1/(2D_{FM}). Then:

for some universal constants C0>0C_{0}>0 and 0<ε0<10<\varepsilon_{0}<1.

Proceeding as in Corollary 2.7, and using Lemma 2.1 and the Main Theorem:

for some universal constant D0>0D_{0}>0, and (4.4) follows by Lemma 4.1 (with θ=1/2\theta=1/2). Note that our convexity assumptions necessarily imply that f0L1(μ)<\left\|f_{0}\right\|_{L_{1}(\mu)}<\infty (see Lemma 6.13), so the appeal to Lemma 4.1 is indeed legitimate. ∎

An arbitrarily slow uniform tail decay condition (1.3) implies any of the statements of the Main Theorem 1.5, with DChe,DPoin,DExp,DFMD_{Che},D_{Poin},D_{Exp},D_{FM} depending solely on α\alpha. Moreover, EμfE_{\mu}f in (1.3) may be replaced by MμfM_{\mu}f.

Given a 11-Lipschitz function f0f_{0} satisfying either of the assumptions of Proposition 4.2, these and (4.4) imply that:

Consequently, the tail decay condition (1.3) (whether stated with EμfE_{\mu}f or MμfM_{\mu}f) ensures that max(DFM,DFMM)1/(4α1(ε0))>0\max(D_{FM},D^{M}_{FM})\geq 1/(4\alpha^{-1}(\varepsilon_{0}))>0, so by Lemma 2.1 the First-Moment concentration inequality is satisfied, from which the other statements of the Main Theorem follow. ∎

Applications to Spectral Gap of Convex Domains

First, we would like to obtain a stability result for DChe(Ω)D_{Che}(\Omega) (or equivalently DPoin(Ω)D_{Poin}(\Omega)) for perturbations of Ω\Omega. Clearly, without any further assumptions, there can be no such result (as seen by adding arbitrarily small “necks” to Ω\Omega as in the Introduction), so we restrict our attention to convex domains. In this case, our Main Theorem 1.5 asserts that this is equivalent to obtaining a stability result for DFM(Ω)D_{FM}(\Omega), which is much easier. To obtain the best quantitative bounds, we will also employ DExp(Ω)D_{Exp}(\Omega).

Let f0f_{0} denote a 11-Lipschitz function on LL with MλLf0=0M_{\lambda_{L}}f_{0}=0 so that f0dλL1/(2DFMM(L))\int|f_{0}|d\lambda_{L}\geq 1/(2D^{M}_{FM}(L)). Since LL is convex, we may clearly extend f0f_{0} to a 11-Lipschitz function on KK, say by defining f1=f0(ProjLx)f_{1}=f_{0}(\text{Proj}_{L}x). Here ProjLx\text{Proj}_{L}x denotes the unique (by convexity) yy in LL so that d(x,L)=d(x,y)d(x,L)=d(x,y). We may assume that EλKf10E_{\lambda_{K}}f_{1}\geq 0 (otherwise exchange f0f_{0} with f0-f_{0}). Note that we can estimate EλKf1E_{\lambda_{K}}f_{1} as follows:

By Proposition 4.2, there exists some universal ε0>0\varepsilon_{0}>0 so that f0L1(λL)QλL,1ε0(f0)\left\|f_{0}\right\|_{L_{1}(\lambda_{L})}\leq Q_{\lambda_{L},1-\varepsilon_{0}}(|f_{0}|). Using this, the ratio between the volumes of LL and KK, the triangle inequality, the Markov-Chebyshev inequality and the estimate on EλKf1E_{\lambda_{K}}f_{1} in (5.1), we evaluate:

where C0>0C_{0}>0 is some universal constant. Using Lemma 2.1 and (2.3), the assertion follows. ∎

Note that for any 1/2<p11/2<p\leq 1 and in fact even without assuming that LL is convex:

Indeed, since KK is convex, by Theorem 1.8 (more precisely, its extension to non-smooth domains or densities given by Theorem 6.10 and Corollaries 6.11,6.12) we know that DChe(K)=2I(K,,λK)(1/2)D_{Che}(K)=2I_{(K,\left|\cdot\right|,\lambda_{K})}(1/2). Given a Borel set AA with λK(A)=1/2\lambda_{K}(A)=1/2, we have:

By the assumption in (5.2), 112pλL(A)12p1-\frac{1}{2p}\leq\lambda_{L}(A)\leq\frac{1}{2p}, and from this we easily deduce the conclusion in (5.2). Iterating this using a sequence of intermediate convex bodies (here we already need to use that LL is convex) L=L0L1Lm=KL=L_{0}\subset L_{1}\subset\ldots\subset L_{m}=K so that Vol(Li)/Vol(Li+1)v1/m>1/2\textnormal{Vol}\left(L_{i}\right)/\textnormal{Vol}\left(L_{i+1}\right)\geq v^{1/m}>1/2 (for example, assuming 0L0\in L, choose Li=(1+ri)LKL_{i}=(1+r_{i})L\cap K for appropriate ri0r_{i}\geq 0), we obtain that:

Taking the limit as mm\rightarrow\infty yields the claimed power of 22 (even without any additional numerical constant!). ∎

then applying Lemma 5.2, the Main Theorem 1.5 and Lemma 5.1, we obtain:

for some universal constants ci>0c_{i}>0, concluding the proof of Theorem 1.7. Of course a similar upper bound on DChe(K)D_{Che}(K) is obtained by interchanging the roles of K,LK,L.

In Convexity Theory, many interesting ways are known to cut a convex body KK so that its volume is preserved up to a constant (e.g. by slabs, parallelepipeds, balls etc…). We see that all of these preserve (up to a constant) DChe(K)D_{Che}(K) (equivalently, the spectral gap DPoin2(K)D_{Poin}^{2}(K)). A useful way to measure the distance between two convex bodies is given by the following variant on the usual geometric distance:

so by passing from the outer to the inner body (in which case our estimates are logarithmic), we deduce:

for some 1sC1n1\leq s\leq C_{1}n, where C1>0C_{1}>0 is some universal constant, then:

where C2>0C_{2}>0 is another universal constant.

Denoting a,ba,b the best constants in (5.4) and applying Lemma 5.1:

and since bdG(K,L)C1+1b\leq d_{G}(K,L)\leq C_{1}+1, the assertion follows. ∎

appearing in the assumptions of both lemmas has a clear and intuitive geometric meaning.

Lemmas 5.1 and 5.2 remain valid for absolutely continuous log-concave probability measures μK,μL\mu_{K},\mu_{L} (replacing respectively K,LK,L), if the condition (5.5) in the assumption is replaced by the condition:

and DChe(Ω),DFM(Ω),DExp(Ω)D_{Che}(\Omega),D_{FM}(\Omega),D_{Exp}(\Omega) are replaced by DChe(μΩ),DFM(μΩ),DExp(μΩ)D_{Che}(\mu_{\Omega}),D_{FM}(\mu_{\Omega}),D_{Exp}(\mu_{\Omega}) (Ω=K,L\Omega=K,L) in the corresponding conclusion.

Identical to the proof of the original lemmas; the only minor point is the construction of intermediate measures μLi\mu_{L_{i}} in the proof of Lemma 5.2, which may be defined e.g. by μLi=ηLiηLi\mu_{L_{i}}=\frac{\eta_{L_{i}}}{|\eta_{L_{i}}|}, dηLidx(x)=min((1+ri)dμLdx(x1+ri),dμKdx(x))\frac{d\eta_{L_{i}}}{dx}(x)=\min((1+r_{i})\frac{d\mu_{L}}{dx}(\frac{x}{1+r_{i}}),\frac{d\mu_{K}}{dx}(x)), for appropriate ri>0r_{i}>0 (assuming the origin is in the interior of the support of μL\mu_{L}). ∎

The analogue of Theorem 1.7 may then be conveniently formulated using the total-variation metric:

with c(ε)=cε2/log(1+1/ε)c(\varepsilon)=c^{\prime}\varepsilon^{2}/\log(1+1/\varepsilon) and c>0c^{\prime}>0 a universal constant.

Let μ0\mu_{0} denote the measure whose density is min(dμ1dx,dμ2dx)\min(\frac{d\mu_{1}}{dx},\frac{d\mu_{2}}{dx}), and note that dTV(μ1,μ2)=1μ0d_{TV}(\mu_{1},\mu_{2})=1-|\mu_{0}|. Denoting by μ3\mu_{3} the (log-concave) probability measure μ0μ0\frac{\mu_{0}}{|\mu_{0}|}, since dμidxμ0dμ3dx\frac{d\mu_{i}}{dx}\geq|\mu_{0}|\frac{d\mu_{3}}{dx}, i=1,2i=1,2, we may apply Lemma 5.4 and the Main Theorem to pass from μ1\mu_{1} to μ3\mu_{3} to μ2\mu_{2} as in (5.3), concluding the proof. ∎

2 Optimality of Stability

To the best of our knowledge, no quantitative results on the stability of DCheD_{Che} or DPoinD_{Poin} for convex domains with respect to volume preserving perturbations or geometric distance were previously known. Moreover, we claim that the bounds obtained in Theorem 1.7 (or (5.3)) are optimal (up to numeric constants) with respect to vLv_{L} and close to optimal with respect to vKv_{K} (note that the dependence is logarithmic in the former yet quadratic in the latter; in other words, the deterioration in the Cheeger constant when passing from an outer convex body to an inner one is genuinely different than when passing from the inner one outward). This is witnessed by the following:

It is known (see Subsection 5.5) that DChe(Qm)DChe(B1m)1D_{Che}(Q^{m})\simeq D_{Che}(B_{1}^{m})\simeq 1, so by the 1-1-homogeneity of DCheD_{Che}, it follows that DChe(Kk)1D_{Che}(K_{k})\simeq 1 and DChe(Lk)1kD_{Che}(L_{k})\simeq\frac{1}{k}. Denoting vk=Vol(Lk)Vol(Kk)v_{k}=\frac{\textnormal{Vol}\left(L_{k}\right)}{\textnormal{Vol}\left(K_{k}\right)}, since log1/vkk\log 1/v_{k}\simeq k, we conclude that:

uniformly for all k=2,,n1k=2,\ldots,n-1. So one cannot expect better than logarithmic dependence on 1/v1/v (at least when vexp(n)v\geq\exp(-n)), which coincides with the estimate given by Lemma 5.1.

On the other hand (as is well-known), if we set L=QnL=Q^{n} and K=Qn1×tQ1K=Q^{n-1}\times tQ^{1} a circumscribing box with t>1t>1, since DChe(K)1/tD_{Che}(K)\simeq 1/t in that range, it is clear that the quadratic dependence on vv in Lemma 5.2 cannot be improved beyond linear. Although we do not know whether the optimal bound is, up to a constant, closer to the linear or quadratic asymptotic, we comment that for very small perturbations (i.e. vv very close to 1), it is possible to show that the exact quadratic bound in Lemma 5.2 is optimal (in this range of vv, we of course do not allow any additional numerical constants).

The next example (which is similar yet different from the previous one) shows that the bounds in Corollary 5.3 are optimal too (up to numeric constants), as a function of ss in the stated range.

Since clearly LsKsL_{s}\subset K_{s}, it remains to note that (1srn)KsLs(1-\frac{s}{r_{n}})K_{s}\subset L_{s}, so dG(Ks,Ls)1snd_{G}(K_{s},L_{s})-1\simeq\frac{s}{n}. By interchanging the roles of Ks,LsK_{s},L_{s} appropriately, we observe that the estimates on DChe(K)/DChe(L)D_{Che}(K)/D_{Che}(L) in Corollary 5.3 are sharp both from above and from below.

It is easy to adapt the proofs of Lemma 5.1 and consequently Corollary 5.3 to obtain even sharper quantitative bounds (up to universal constants) on the stability of DCheD_{Che} for specific convex bodies, such as the Euclidean ball B2nB_{2}^{n}. For instance, in the latter case, one obtains that if dG(K,B2n)1+snd_{G}(K,B_{2}^{n})\leq 1+\frac{s}{n} for 1sC1n1\leq s\leq C_{1}n, then:

This is an improvement over Corollary 5.3 and known to be sharp for s=ns=n (folklore).

It is well known and immediate to see that isoperimetric inequalities are preserved under 11-Lipschitz mappings. Given two metric probability spaces (X,dX,μ)(X,d_{X},\mu) and (Y,dY,ν)(Y,d_{Y},\nu), recall that a Borel map T:(X,dX)(Y,dY)T:(X,d_{X})\rightarrow(Y,d_{Y}) is said to push forward μ\mu onto ν\nu, if ν(A)=μ(T1(A))\nu(A)=\mu(T^{-1}(A)) for every Borel set AYA\subset Y. This is equivalent to requiring that for any Borel function gg on (Y,dY)(Y,d_{Y}):

This will be denoted by T(μ)=νT_{*}(\mu)=\nu. The following is then immediate from the definitions:

The following result states that when our convexity assumptions hold for the target space, as far as Cheeger’s isoperimetric inequality is concerned, one need not require that TT be Lipschitz on the entire space, but rather just on average. We would like to thank Bo’az Klartag for a fruitful discussion regarding this point.

Assume that (Y,dY,ν)(Y,d_{Y},\nu) verifies our convexity assumptions and that T(μ)=νT_{*}(\mu)=\nu for some Lipschitz-on-balls map TT. Then:

Here DTop(x)\left\|DT\right\|_{op}(x) denotes the local Lipschitz constant of TT at xx:

When TT is smooth and X,YX,Y are linear spaces, this coincides with the operator norm of the usual derivative matrix DTDT at xx.

First, rewrite Cheeger’s isoperimetric inequality on (X,dX,μ)(X,d_{X},\mu) in functional form (Lemma 2.2):

Using this, we estimate the First-Moment constant on (Y,dY,ν)(Y,d_{Y},\nu). Given a 11-Lipschitz function gg on (Y,dY)(Y,d_{Y}), clearly gTg\circ T is Lipschitz-on-balls on (X,dX)(X,d_{X}), hence in F(X,dX)\mathcal{F}(X,d_{X}). We then have by the definition of push-forward and our assumption (5.6):

We conclude by our Main Theorem (and Lemma 2.1), which imply that DChe(Y,dY,ν)cDFMM(Y,dY,ν)D_{Che}(Y,d_{Y},\nu)\geq cD^{M}_{FM}(Y,d_{Y},\nu) under our convexity assumptions on (Y,dY,ν)(Y,d_{Y},\nu). ∎

In this subsection, we easily recover some previously known estimates on the Cheeger constant of convex domains in a single framework and extend some results to the Riemannian setting. We begin with the following stimulating conjecture from :

Here σ1(μ)2\sigma_{1}(\mu)^{2} denotes the largest eigenvalue of the symmetric covariance matrix Σ(μ)\Sigma(\mu) of μ\mu:

We will write σ1(K)\sigma_{1}(K) for σ1(λK)\sigma_{1}(\lambda_{K}).

Although the KLS conjecture is far from being resolved, some general lower bounds on DCheD_{Che} are known, but these produce dimension-dependent results. We will see that our Main Theorem easily reproduces these bounds.

The following result in the Euclidean setting is due to Payne and Weinberger . This was generalized to the Riemannian setting by Li and Yau . We refer to the Appendix for missing definitions.

If K(M,g)K\subset(M,g) is a locally convex bounded domain with smooth boundary and Ricg0Ric_{g}\geq 0, then:

where diam denotes the diameter and dgd_{g} the induced geodesic distance. In fact, when (M,g)(M,g) is Euclidean space the constant 2 above may be omitted.

Ledoux’s Theorem 1.4 implies that the same lower bound (up to an additional constant) holds for DChe(K,dg,λK)D_{Che}(K,d_{g},\lambda_{K}). In the Euclidean case, this was strengthened in :

To obtain this result, Kannan, Lovász and Simonovits developed a geometric localization technique (which in fact can be traced back to the work of M. Gromov and V. Milman ). As pointed out to us by Sasha Sodin, it is interesting to note that this technique uses some geometric properties of Euclidean space and does not generalize to other Riemannian manifolds (except in special cases, like that of the Euclidean Sphere, as in the work of Gromov–Milman). Our method, on the other hand, does allow us to state the following generalization of Theorem 5.11 to the Riemannian setting, which also improves over Theorem 5.10:

Assume that (Ω,d,μ)(\Omega,d,\mu) satisfies our convexity assumptions. Then:

As usual, we just need to bound DFM(Ω,d,μ)D_{FM}(\Omega,d,\mu). Let ff denote a 11-Lipschitz function on (Ω,d)(\Omega,d). Then for any x0Ωx_{0}\in\Omega, applying the triangle inequality twice:

and the claim follows by our Main Theorem. ∎

An alternative approach to localization for proving isoperimetric inequalities was developed by Bobkov in the Euclidean setting. Bobkov’s approach was extended by Barthe and subsequently by Barthe and Kolesnikov . This approach is based on the Prékopa–Leindler inequality (e.g. ), or equivalently, on optimal transportation, which have both been recently generalized to the Riemannian-with-density-setting by Cordero-Erausquin, McCann and Schmuckenschläger . Using these tools we expect that it should be possible to provide an alternative proof of Theorem 5.12 following Bobkov’s approach, but as pointed out to us by one of the referees, this has yet to be accomplished. We would like to thank the referee for his comments regarding our original simpleminded remark in this direction.

We would like to mention another bound on DCheD_{Che} obtained in using the localization method.

where θB(x)\theta_{B}(x) denotes the longest symmetric interval contained in BB and centered at xx, and c>0c>0 is a universal constant.

We have recently managed to derive this result using our Main Theorem, but this will be described elsewhere. Instead, we would like to show how this bound may be used to recover a result of Bobkov ; in fact, the bound we deduce is formally stronger than Bobkov’s. Bobkov employs the localization method as well, but then relies on some nice trick involving moment inequalities for polynomials in the log-concave setting. Our argument, on the other hand, is more geometric. Independently of our proof, we heard about a similar idea for bounding the boundary measure of large sets from Santosh Vempala (using localization as well).

where VarμVar_{\mu} denotes the variance with respect to μ\mu.

Without loss of generality, we may assume that x0=0x_{0}=0; for general x0x_{0} the claimed bound follows by translating μ\mu. Let E:=EμxE:=E_{\mu}|x|, S:=(Varμx)1/2S:=(Var_{\mu}|x|)^{1/2}, and denote:

By Chebyshev’s inequality, μ(B)3/4\mu(B)\geq 3/4, so if we define μ0:=μB/μ(B)\mu_{0}:=\mu|_{B}/\mu(B), it follows that dTV(μ,μ0)1/4d_{TV}(\mu,\mu_{0})\leq 1/4. Hence DChe(μ)DChe(μ0)D_{Che}(\mu)\simeq D_{Che}(\mu_{0}) by Theorem 5.5. Assume that E2SE\geq 2S, otherwise the support of μ0\mu_{0} has diameter bounded by 8S8S, and one can conclude as in Theorem 5.12. We now employ Theorem 5.14 to bound DChe(μ0)D_{Che}(\mu_{0}):

The crucial geometric observation is that for the Euclidean ball BB:

It remains to plug this into (5.8) and evaluate the resulting expression using integration by parts and Chebyshev’s inequality. We leave it as an exercise to conclude that:

for some universal constant c>0c^{\prime}>0. This bound is in fact formally better than Bobkov’s bound (by several applications of Hölder’s inequality), but using some standard results in Convexity Theory, it is in fact equivalent in the interesting situations. ∎

This is immediate from the results of Schechtman and Zinn , who showed that DExpD_{Exp} of these bodies is bounded from below by a universal constant. The result then follows from our Main Theorem (in fact, we only need a bound on DFMD_{FM}). ∎

Another family of convex bodies for which the KLS conjecture is almost confirmed, is that of unconditional convex bodies KK, i.e. convex bodies for which (x1,,xn)K(x_{1},\ldots,x_{n})\in K iff (±x1,,±xn)K(\pm x_{1},\ldots,\pm x_{n})\in K. It was recently shown by Bo’az Klartag that if KK is an unconditional body with σ1(K)=1\sigma_{1}(K)=1 then DChe(K)c/lognD_{Che}(K)\geq c/\log n, for some universal constant c>0c>0. To obtain this result, Klartag employed Theorem 1.7 to pass to an unconditional body contained inside the cube (Clogn)n(C\log n)^{n}, and then used some symmetry properties of the Laplacian’s eigenfunctions to conclude his result. In fact, one can just use Theorem 1.8 on the concavity of the isoperimetric profile (in the form of Lemma 5.2) for this application.

We conclude this section by stating the known dimension dependent bounds on DChe(K)D_{Che}(K) for non-degenerate convex bodies KK (in the sense that σ1(K)=1\sigma_{1}(K)=1).

It is known in this case that diam(K)cn\textnormal{diam}(K)\leq cn (by a simple volume estimate). Theorem 5.10 (together with Theorem 1.4) then gives DChe(K)c/nD_{Che}(K)\geq c/n. The first KLS bound (Theorem 5.11) improves this to DChe(K)c/nD_{Che}(K)\geq c/\sqrt{n}, since:

Bobkov’s bound (Theorem 5.15) is always at least as good as the first KLS bound (up to a constant), since (using the bound derived in the proof together with a standard application of Borell’s lemma ):

for some universal constant C>0C>0. We see that whenever some non-trivial information on Varμ(xx0)Var_{\mu}(\left|x-x_{0}\right|) is known, Bobkov’s bound is strictly better. Such a remarkable result was proved by Bo’az Klartag , allowing him to deduce a Central-Limit type result for the class of convex bodies (and more generally, log-concave measures). Klartag’s improved estimate in reads:

Approximation Argument

In this section, we develop an approximation argument for extending the following theorems to non-necessarily smooth densities (or boundaries) in our convexity assumptions:

Theorem 1.8 on the concavity of the isoperimetric profile.

We will develop different procedures for extending each of these theorems.

We begin by extending our definition of smooth convexity assumptions (we refer to the Appendix for the definition of locally convex).

We will say that our generalized smooth convexity assumptions are fulfilled if:

ΩM\Omega\subset M is a locally convex domain with C2C^{2} boundary.

dd denotes the induced geodesic distance on (M,g)(M,g).

dμ=exp(ψ)dvolMΩd\mu=\exp(-\psi)dvol_{M}|_{\Omega}, ψC2(Ω)\psi\in C^{2}(\overline{\Omega}), and as tensor fields on Ω\Omega:

This definition was already used in the statement of Theorem 1.8 on the concavity of the isoperimetric profile. The smoothness assumptions in the above definition are used in an essential way in the proof of this theorem to deduce the existence and regularity of the isoperimetric minimizers, which are otherwise false. This permits the use of variational methods from Riemannian Geometry, consequently obtaining a second-order differential inequality which the isoperimetric profile must satisfy (see the Appendix for more details). Nevertheless, the restriction to smooth densities and domains still seems like a technical artifact of the proofs. Some authors have suggested various methods to remove these smoothness assumptions (see e.g. Morgan and Bayle [12, Chapter 4]), but unfortunately these are not well suited for our purposes. We therefore attempt to use a different approximation argument for extending Theorem 1.8 to a more general setting.

At first glance, it is tempting to believe that the isoperimetric profile of (Ω,d,μ)(\Omega,d,\mu) should be stable under approximating the measure μ\mu by measures μm\mu_{m} in, say, total-variation distance. However, the profile is in fact not even pointwise continuous under arbitrary approximation in total-variation. To see this, consider the measures μm\mu_{m} which are uniform on the set [1/21/m,1/2+1/m]\setminus[1/2-1/m,1/2+1/m], and converge to μ\mu, the uniform measure on $.Clearly. ClearlyI_{\mu_{m}}(1/2)=0foreveryfor everym\geq 3,eventhough, even thoughI_{\mu}(1/2)=1$. So one must take care when specifying the approximation.

We say that a sequence of Borel probability measures {μm}\left\{\mu_{m}\right\} tends to μ\mu from above if {μm}\left\{\mu_{m}\right\} converges to μ\mu in total-variation and in addition there exists a sequence {cm}\left\{c_{m}\right\} which tends to 1, so that μm(A)μ(A)/cm\mu_{m}(A)\geq\mu(A)/c_{m} for any Borel set AA.

Let (Ω,d)(\Omega,d) be a metric space and let {μm}\left\{\mu_{m}\right\} be a sequence of Borel probability measures on (Ω,d)(\Omega,d) which tends to μ\mu from above. Then for any t(0,1)t\in(0,1):

Denote I=I(Ω,d,μ)I=I_{(\Omega,d,\mu)} and Im=I(Ω,d,μm)I_{m}=I_{(\Omega,d,\mu_{m})} for short. Let ε>0\varepsilon>0. Then there exists m0m_{0} such that for all mm0m\geq m_{0}, μ(B)μm(B)<ε\left|\mu(B)-\mu_{m}(B)\right|<\varepsilon for any Borel set BB. Let δ>0\delta>0, then for every mm0m\geq m_{0} there exist a Borel set BmB_{m} such that:

Taking the limit as mm\rightarrow\infty and subsequently ε,δ0\varepsilon,\delta\rightarrow 0, we obtain the assertion. ∎

We say that a sequence of Borel probability measures {μm}\left\{\mu_{m}\right\} tends to μ\mu from within if μm=μAm/μ(Am)\mu_{m}=\mu|_{A_{m}}/\mu(A_{m}) for some sequence of Borel sets AmA_{m} such that μ(Am)1\mu(A_{m})\rightarrow 1, and in addition μ+(Am)0\mu^{+}(A_{m})\rightarrow 0.

Let (Ω,d)(\Omega,d) be a metric space and let {μm}\left\{\mu_{m}\right\} be a sequence of Borel probability measures on (Ω,d)(\Omega,d) which tends to μ\mu from within. Then for any t(0,1)t\in(0,1):

We continue with the same assumptions and notations as in the proof of the previous lemma and definition. In our case, we may assume that BmAmB_{m}\subset A_{m}. Then:

Taking the limit as mm\rightarrow\infty and subsequently ε,δ0\varepsilon,\delta\rightarrow 0, we obtain the assertion. ∎

Next, we recall the definition of qq-capacity (we will only require the case q=1q=1). Capacities were introduced in the 1960’s by Maz’ya , Federer and Fleming , and were used by Bobkov and Houdré in . We follow a variation on the definition given in (for general qq), which was extended by Barthe, Cattiaux and Roberto (with q=2q=2) in (after being introduced in ). We conform to the definition implicitly used by Sodin in and Sodin and the author in .

Given a metric probability space (Ω,d,μ)(\Omega,d,\mu), 0<q<0<q<\infty and 0ab10\leq a\leq b\leq 1, we denote:

where the infimum is on all Φ:Ω\Phi:\Omega\rightarrow which are Lipschitz-on-balls (recall the definition of Φ\left|\nabla\Phi\right| given in Remark 2.3).

The following proposition encapsulates the connection between 11-capacity and the isoperimetric profile I=I(Ω,d,μ)I=I_{(\Omega,d,\mu)}. The proof is very much along the lines of the proof of Lemma 2.2, so we will omit it here; the reader is referred to Sodin [81, Proposition A] for an elementary derivation (note the slight difference in our formulation). We only remark that it suffices to use Lipschitz functions Φ\Phi in the definition of capacity above for the purpose of this proposition.

Since obviously Cap1(a,b)=Cap1(1b,1a)\textnormal{Cap}_{1}(a,b)=\textnormal{Cap}_{1}(1-b,1-a), it follows that:

Letting bb converge to aa, and replacing a,ba,b with 1b,1a1-b,1-a, we obtain:

If II is lower semi-continuous at tt and 1t1-t, t(0,1)t\in(0,1), then I(t)=I(1t)I(t)=I(1-t).

Let (Ω,d)(\Omega,d) be a metric space and let {μm}\left\{\mu_{m}\right\} be a sequence of Borel probability measures on (Ω,d)(\Omega,d) which converges in the total-variation norm to μ\mu. Assume in addition that I(Ω,d,μm)I_{(\Omega,d,\mu_{m})} are concave on (0,1)(0,1). Then for any t(0,1)t\in(0,1):

As usual, denote I=I(Ω,d,μ)I=I_{(\Omega,d,\mu)} and Im=I(Ω,d,μm)I_{m}=I_{(\Omega,d,\mu_{m})} for short. Let t(0,1)t\in(0,1) and small ε>0\varepsilon>0 be given, and let Φ:(Ω,d)\Phi:(\Omega,d)\rightarrow denote a Lipschitz function so that:

For any small δ>0\delta>0, there exists an m0m_{0} so that for any mm0m\geq m_{0}:

We conclude by Proposition 6.4 and the concavity of ImI_{m} that:

Since Φ\Phi is Lipschitz (hence Φ|\nabla\Phi| is bounded), and {μm}\left\{\mu_{m}\right\} converge to μ\mu in total-variation, we can pass to the limit as mm\rightarrow\infty:

Taking infimum on all such Φ\Phi as above and using Proposition 6.4 again, we obtain:

Taking the limit of ε,δ\varepsilon,\delta to 0, we obtain the desired conclusion. ∎

It is clear from the proof that the concavity condition may be seriously relaxed (e.g. to equicontinuity), and the regularity condition on ImI_{m} obtained in Lemma 6.9 below may also be used.

Combining the last three lemmas we immediately obtain:

Let (Ω,d)(\Omega,d) be a metric space, let {μm}\left\{\mu_{m}\right\} be a sequence of Borel probability measures on (Ω,d)(\Omega,d) which converges in the total-variation norm to μ\mu, and assume that I(Ω,d,μm)I_{(\Omega,d,\mu_{m})} are all concave on (0,1)(0,1). If in addition {μm}\left\{\mu_{m}\right\} tend to μ\mu from above or from within, then for any t(0,1)t\in(0,1):

In particular, if I(Ω,d,μ)I_{(\Omega,d,\mu)} is in addition lower semi-continuous, we have (pointwise):

The following lemma, which extends the argument given by Gallot in [34, Lemma 6.2] for compact manifolds with uniform density, provides a sufficient condition for the isoperimetric profile to be continuous.

Let Ω=(M,g)\Omega=(M,g) denote an nn-dimensional (n2n\geq 2) smooth complete oriented connected Riemannian manifold and let dd denote the induced geodesic distance. Let μ\mu denote an absolutely continuous measure with respect to volMvol_{M}, such that its density is bounded from above on every ball (but not necessarily from below, nor do we assume it is continuous). Then I=I(Ω,d,μ)I=I_{(\Omega,d,\mu)} is absolutely continuous on $,andinfactislocallyofHo¨lderexponent, and in fact is locally of Hölder exponent\frac{n-1}{n}$.

By Lebesgue’s Theorem, we know for almost every xMx\in M (with respect to volMvol_{M}),

where BM(x,R)B_{M}(x,R) denotes the ball in MM of radius RR around xx, VolM\textnormal{Vol}_{M} denotes the Riemannian volume on MM (and by abuse of notation the induced volume on any submanifold as well), and μ(C)\mu_{\infty}(\mathcal{C}) denotes the upper bound on the density of μ\mu on a compact set CM\mathcal{C}\subset M. By Rauch’s Comparison Theorem, for any such compact set C\mathcal{C} (and in particular a singleton), there exists a εC<1/2\varepsilon_{\mathcal{C}}<1/2 so that for any xCx\in\mathcal{C} and ε<εC\varepsilon<\varepsilon_{\mathcal{C}}:

where BnB^{n} and Sn1S^{n-1} denote the Euclidean unit ball and sphere, respectively, and Vol denotes Euclidean volume. Therefore as t0t\rightarrow 0:

where Cn,μC_{n,\mu} depends on nn and μ\mu only. Since clearly I(0)=I(1)=0I(0)=I(1)=0, this takes care of the continuity at and 11.

Given 0<θ<10<\theta<1, set Rθ=g(θ/2)+1R_{\theta}=g(\theta/2)+1, εθ=εBM(x0,Rθ+1)\varepsilon_{\theta}=\varepsilon_{B_{M}(x_{0},R_{\theta}+1)}, and μ(θ)=μ(BM(x0,Rθ+1))\mu_{\infty}(\theta)=\mu_{\infty}(\overline{B_{M}(x_{0},R_{\theta}+1)}). Let KθK_{\theta} denote the (possibly negative) lower bound on the sectional curvature of KK on BM(x0,Rθ)B_{M}(x_{0},R_{\theta}). Rauch’s Theorem also implies that:

where MKM_{K} denotes the simply connected model space with constant curvature KK, VolMK\textnormal{Vol}_{M_{K}} denotes the volume on MKM_{K} and BMK(R)B_{M_{K}}(R) is any ball in MKM_{K} of radius RR.

Given a set AMA\subset M with θ=μ(A)>0\theta=\mu(A)>0, note that by Fubini’s Theorem, (6.3) and the definition of gg, for any ε<εθ<1\varepsilon<\varepsilon_{\theta}<1:

We conclude from (\refeq:control2)(\ref{eq:control2}) and (\refeq:control1)(\ref{eq:control1}) that given any AMA\subset M with 0<θ=μ(A)<10<\theta=\mu(A)<1 and ε<εθ\varepsilon<\varepsilon_{\theta}, there exists an xBM(x0,Rθ)x\in B_{M}(x_{0},R_{\theta}) such that:

Now let 0<s<t<10<s<t<1 be close enough such that there exists an ε1<εt\varepsilon_{1}<\varepsilon_{t} such that:

By definition, for any η>0\eta>0, there exists a set AA such that μ(A)=t\mu(A)=t and μ+(A)I(t)+η\mu^{+}(A)\leq I(t)+\eta. By (6.7) there exists an xBM(x0,Rt)x\in B_{M}(x_{0},R_{t}) such that μ(ABM(x,ε1))s\mu(A\setminus B_{M}(x,\varepsilon_{1}))\leq s, and since μ\mu is absolutely continuous, it follows that there exists an ε2ε1\varepsilon_{2}\leq\varepsilon_{1} such that μ(ABM(x,ε2))=s\mu(A\setminus B_{M}(x,\varepsilon_{2}))=s. Therefore:

where we have used (6.2) and (6.4) in the last inequality. Sending η\eta to 0 and plugging in (6.8), we conclude that for some constant CnC_{n} which depends on nn:

To get the inequality in the other direction, we require that 0<s<t<10<s<t<1 are close enough so that ε1<ε1s\varepsilon_{1}<\varepsilon_{1-s} in addition satisfies:

Now let AMA\subset M be such that μ(A)=s\mu(A)=s and μ+(A)I(s)+η\mu^{+}(A)\leq I(s)+\eta. Applying (6.7) for the set MAM\setminus A, we find an xBM(x0,R1s)x\in B_{M}(x_{0},R_{1-s}) and ε2ε1\varepsilon_{2}\leq\varepsilon_{1} such that μ(ABM(x,ε2))=t\mu(A\cup B_{M}(x,\varepsilon_{2}))=t. Repeating the above argument then gives:

Since ff is monotone, this concludes the proof. ∎

Our approximation argument is now clear. Given a measure μ\mu in the setting of Lemma 6.9, we know that its isoperimetric profile II is continuous. Assume that μ\mu can be approximated from above or from within by measures {μm}\left\{\mu_{m}\right\} satisfying our generalized smooth convexity assumptions. By Theorem 1.8, the corresponding profiles {Im}\left\{I_{m}\right\} (and when the densities are uniform, also the renormalized profiles {Imn/(n1)}\{I_{m}^{n/(n-1)}\}) are concave, and so applying Proposition 6.8, we deduce the pointwise convergence of ImI_{m} to II, which clearly preserves concavity. We therefore deduce:

Let Ω=(M,g)\Omega=(M,g) denote an nn-dimensional (n2n\geq 2) smooth complete oriented connected Riemannian manifold and let dd denote the induced geodesic distance. For each m1m\geq 1, let {μm}\left\{\mu_{m}\right\} denote a sequence of Borel probability measures on ΩmΩ\Omega_{m}\subset\Omega so that (Ωm,d,μm)(\Omega_{m},d,\mu_{m}) satisfies our generalized smooth convexity assumptions. Assume that {μm}\left\{\mu_{m}\right\} tends to an absolutely continuous Borel probability measure μ\mu from above or from within, and denote Im=I(Ωm,d,μm)I_{m}=I_{(\Omega_{m},d,\mu_{m})} and I=I(Ω,d,μ)I=I_{(\Omega,d,\mu)}. Then ImII_{m}\rightarrow I pointwise and consequently II is concave on $.Moreover,ifeach. Moreover, if each\mu_{m}isuniformoveris uniform over\Omega_{m},then, thenI^{n/(n-1)}isalsoconcaveonis also concave on$.

The argument has already been sketched. We only remark that it is not hard to verify the validity of the assumptions of Lemma 6.9 on μ\mu, as the limit of {μm}\left\{\mu_{m}\right\} as above (see e.g. [64, Remark 6.2]). ∎

Approximate Ω\Omega from outside by smooth convex domains using standard methods (see e.g. ). Note that Ωε\Omega_{\varepsilon} will only guarantee C1C^{1} smoothness. ∎

The case n=1n=1 follows from Theorem A.4 in the Appendix. For the case n2n\geq 2, we will need to approximate μ\mu from above and within by a sequence of smooth log-concave probability measures. Since we did not find a standard reference for this, we outline the argument.

First, assume that the support BB of μ\mu is compact. Approximate μ\mu by smooth log-concave probability measures {νε}\left\{\nu_{\varepsilon}\right\} in total-variation distance using standard methods (e.g. convolution with a Gaussian mollifier). Now define ηε,δ\eta_{\varepsilon,\delta} to be the dilatation of νε\nu_{\varepsilon} given by ηε,δ(A)=νε(x0+(1+δ)(Ax0))\eta_{\varepsilon,\delta}(A)=\nu_{\varepsilon}(x_{0}+(1+\delta)(A-x_{0})) for all Borel sets AA, where x0x_{0} is a point in the interior of BB (another possibility would be to use sup-convolution with a small Gaussian). It is then not hard to check that for a suitable subsequence, ηε,δ(ε)\eta_{\varepsilon,\delta(\varepsilon)} tends to μ\mu from above, from which the assertion follows by Theorem 6.10.

In case the support of μ\mu is not compact, we repeat the above argument for the truncated measures μr=μrB2n/μ(rB2n)\mu_{r}=\mu|_{rB^{n}_{2}}/\mu(rB^{n}_{2}), where B2nB^{n}_{2} denotes the Euclidean unit-ball. Note that μ+(rB2n)0\mu^{+}(rB^{n}_{2})\rightarrow 0 as rr\rightarrow\infty by the co-area formula:

Hence {μr}\left\{\mu_{r}\right\} tends to μ\mu from within, and so by Theorem 6.10 the claim now follows for arbitrary log-concave measures. ∎

2 Stability of First-Moment Concentration

Up to now, we have only concluded the Main Theorem 1.5 under our smooth convexity assumptions. We now describe how to extend these assumptions to our general convexity assumptions.

Indeed, assume that μ\mu can be approximated in total-variation by measures {μm}\left\{\mu_{m}\right\} with density exp(ψm)\exp(-\psi_{m}) such that ψmC2(M)\psi_{m}\in C^{2}(M) and Ricg+Hessgψm0Ric_{g}+Hess_{g}\psi_{m}\geq 0 on Ω=(M,g)\Omega=(M,g). We would like to show that our Main Theorem, stating that DChe(Ω,d,μ)cDFM(Ω,d,μ)D_{Che}(\Omega,d,\mu)\geq cD_{FM}(\Omega,d,\mu) for some universal constant c>0c>0, still holds. It is immediate to deduce from Lemma 6.6 that:

and using our Main Theorem for the smooth measures μm\mu_{m} (and Lemma 2.1), we deduce that:

for some universal constant c>0c>0. The First Moment constant is particularly easy to handle, since there is no fLq\left\|\left|\nabla f\right|\right\|_{L_{q}} term which needs to be controlled. The following lemma, which is an adaptation of a classical lemma of C. Borell from the Euclidean case to the Riemannian-manifold-with-density setting, enables us to reduce to the case that {μm}\left\{\mu_{m}\right\} are all supported on some compact set:

Let x0Mx_{0}\in M and R>0R>0 be such that θ=μm(B(x0,R))>1/2\theta=\mu_{m}(B(x_{0},R))>1/2. Then:

Given this lemma, it is easy to proceed as follows. Fix x0Ωx_{0}\in\Omega and R>0R>0 so that μ(B(x0,R))3/4\mu(B(x_{0},R))\geq 3/4. Then for some m0m_{0} and all mm0m\geq m_{0}, we have μm(B(x0,R))2/3\mu_{m}(B(x_{0},R))\geq 2/3, and hence by the lemma we conclude that:

Let fmf_{m} denote the 11-Lipschitz functions on Ω\Omega so that Mμmfm=0M_{\mu_{m}}f_{m}=0 and 1/DFMM(Ω,d,μm)=fmdμm1/D^{M}_{FM}(\Omega,d,\mu_{m})=\int|f_{m}|d\mu_{m} (we assume without loss of generality that the supremum is achieved). Since fmf_{m} are continuous, Mμmfm=0M_{\mu_{m}}f_{m}=0 and μm(B(x0,R))>1/2\mu_{m}(B(x_{0},R))>1/2, there must exist a xmB(x0,R)x_{m}\in B(x_{0},R) so that fm(xm)=0f_{m}(x_{m})=0. Since fmf_{m} are 11-Lipschitz, it follows that for any t1t\geq 1:

Hence, given ε>0\varepsilon>0, there exists a t1t\geq 1 so that:

But since our Lipschitz functions fmf_{m} are uniformly bounded on B(x0,tR)B(x_{0},tR) by (t+1)R(t+1)R (by passing through xmx_{m} as before), the convergence of {μm}\left\{\mu_{m}\right\} to μ\mu in total-variation implies:

Finally, we note that for mm large enough, by the Markov-Chebyshev inequality (we assume here without loss of generality that Mμfm0M_{\mu}f_{m}\geq 0):

so Mμfm3/DFMM(Ω,d,μ)\left|M_{\mu}f_{m}\right|\leq 3/D^{M}_{FM}(\Omega,d,\mu). Combining everything together, we deduce that for mm large enough:

Since ε>0\varepsilon>0 was arbitrary, we conclude that:

This concludes the proof, since as usual, we may pass from DFMMD^{M}_{FM} to DFMD_{FM} using Lemma 2.1.

For completeness, we provide a proof of Lemma 6.13, using the following remarkable generalization of the Prékopa-Leindler inequality (e.g. ) due to Cordero-Erausquin, McCann and Schmuckenschläger (generalizing their own result from ). Given x,yMx,y\in M and ss\in, define:

Indeed, if this is not so, there would exist a zMz\in M so that:

which would imply that d(y,x0)<tRd(y,x_{0})<tR, a contradiction. Hence, (6.9) implies that the functions f=χB(x0,R)f=\chi_{B(x_{0},R)}, g=χMB(x0,tR)g=\chi_{M\setminus B(x_{0},tR)} and h=χMB(x0,R)h=\chi_{M\setminus B(x_{0},R)} satisfy the assumption of Theorem 6.14 with s=2t+1s=\frac{2}{t+1}. Theorem 6.14 then implies that:

and the conclusion of the lemma follows. ∎

Appendix

In the Appendix, we provide more details regarding the statement and ideas underlying the proof of Theorem 1.8 from the Introduction, as it plays an essential role in our argument. In the statement of this theorem, we have summarized a series of results in Riemannian Geometry concerning the concavity of the isoperimetric profile, which were proved under increasingly general convexity assumptions. An essential ingredient in the proofs of these results is provided by Geometric Measure Theory, which guarantees the existence and regularity of the isoperimetric minimizers, and permits the use of a variational argument to deduce the concavity of the profile.

First, we survey the case where the metric space (Ω,d)(\Omega,d) is given by a bounded domain (connected open set) with C2C^{2} boundary in a smooth complete oriented connected nn-dimensional (n2n\geq 2) Riemannian manifold (M,g)(M,g) along with the induced geodesic distance dd in MM, and the probability measure μ\mu is given by the restriction to Ω\Omega of the Riemannian volume form volMvol_{M} on MM, normalized so that μ(Ω)=1\mu(\Omega)=1. We summarize for completeness some remarkable results provided by Geometric Measure Theory about the existence and regularity of isoperimetric minimizers in the case we are considering, and refer to the books of Federer , Morgan , Giusti and Burago and Zalgaller for further information.

For any t(0,1)t\in(0,1), there exists an open isoperimetric minimizer AA of measure tt for the isoperimetric problem on (Ω,d,μ)(\Omega,d,\mu) as above. The boundary Σ=AΩ\Sigma=\overline{\partial A\cap\Omega} can be written as a disjoint union of a regular part Σr\Sigma_{r} and a set of singularities Σs\Sigma_{s}, with the following properties:

ΣrΩ\Sigma_{r}\cap\Omega is a smooth, embedded hypersurface of constant mean curvature.

Σr\Sigma_{r} meets Ω\partial\Omega orthogonally.

Σs\Sigma_{s} is a closed set of Hausdorff co-dimension not smaller than 8. This result is sharp.

For all the results to be described, it is essential that the Hausdorff co-dimension of the singular part of the boundary is large (although typically knowing that it is greater than 3 is sufficient). This approach was used by M. Gromov in his influential generalization of P. Lévy’s isoperimetric inequality ,[39, Appendix C]. The negligible singular part permits to consider a normal variation of the regular part, and from there on one may continue by using the readily available tools from Riemannian Geometry to calculate the first and second variations of volume and area. Before proceeding, we remark that most results we will mention deduce that the isoperimetric profile satisfies a second order differential inequality under more general convexity assumptions than stated (e.g. a negative lower bound on the Ricci curvature), and provide a characterization of the equality case as well.

The first convexity assumption which we add is that the Ricci curvature tensor RicgRic_{g} of (M,g)(M,g) be non-negative. When MM is a closed manifold and Ω=M\Omega=M, and under the additional assumption that all isoperimetric minimizers are smooth submanifolds (this is always the case when n7n\leq 7), it was shown by Bavard and Pansu that II is concave on $.Infact,theseauthorsattributethesamestatementwithouttheassumptiononthesmoothnessoftheisoperimetricminimizerstoBeˊrard,BessonandGallot.ThiswasalsoformallyverifiedbyMorganandJohnson[73,Section2.1andProposition3.3].Gallotin[34,Corollary6.6]showedthatinfacttherenormalizedprofile. In fact, these authors attribute the same statement without the assumption on the smoothness of the isoperimetric minimizers to Bérard, Besson and Gallot. This was also formally verified by Morgan and Johnson [73, Section 2.1 and Proposition 3.3]. Gallot in [34, Corollary 6.6] showed that in fact the renormalized profileI^{n/(n-1)}$ is concave in this case. This result captures the right dependence of the dimension in the exponent.

A domain Ω(M,g)\Omega\subset(M,g) is said to be locally convex, if all geodesics in MM tangent to Ω\partial\Omega are locally outside of Ω\Omega. By a result of Bishop , in case that Ω\Omega has C2C^{2} boundary, this is equivalent to requiring that the second fundamental form of Ω\partial\Omega with respect to the normal pointing into Ω\Omega be positive semi-definite on all of Ω\partial\Omega.

We summarize the above results in the following:

Let (M,g)(M,g) be a smooth complete oriented connected Riemannian manifold of dimension n2n\geq 2 with non-negative Ricci curvature, and let Ω\Omega denote a locally convex bounded domain in (M,g)(M,g). Let dd denote the induced geodesic distance in (M,g)(M,g) and μ\mu the restriction to Ω\Omega of the canonical volume form volMvol_{M} on MM, normalized so that μ(Ω)=1\mu(\Omega)=1. Assume in addition that Ω\Omega has C2C^{2} smooth boundary. Then the isoperimetric profile I=I(Ω,d,μ)I=I_{(\Omega,d,\mu)} is a concave function on $.Moreover,sois. Moreover, so isI^{n/(n-1)}$.

A.2 Manifolds with densities

As before, let (M,g)(M,g) denote an nn-dimensional (n2n\geq 2) smooth complete oriented connected Riemannian manifold with induced geodesic distance dd. In addition, let ψC2(M)\psi\in C^{2}(M) be such that dμ=exp(ψ)dvolMd\mu=\exp(-\psi)dvol_{M} is a probability measure on MM. Since the influential work of Bakry and Émery in the abstract framework of diffusion generators, it is known that a natural convexity condition on a manifold with density, which replaces the condition Ricg0Ric_{g}\geq 0 in the uniform density case, is to require the following CD(0,)CD(0,\infty) Curvature-Dimension condition:

Let Ω=(M,g)\Omega=(M,g) and d,μd,\mu as above. Assume that (A.1) holds on Ω\Omega. Then I=I(Ω,d,μ)I=I_{(\Omega,d,\mu)} is a concave function on $$.

This theorem was proved by Bayle in under the assumption that MM is a closed manifold. It was noted (without explanation) by Morgan [71, Corollary 9] that the same proof applies for a general complete manifold, as long as it has finite μ\mu-measure. Indeed, Bayle’s argument remains exactly the same; the only point one needs to check is the existence and regularity of isoperimetric minimizers in the manifold with density setting. The argument goes as follows: it was shown by Morgan in [70, Remark 3.10] that given a complete smooth Riemannian manifold with positive density ρCk(M)\rho\in C^{k}(M) (k0k\geq 0), if there exists an area minimizing current then its boundary is necessarily CkC^{k} regular outside a set of Hausdorff codimension at least 8. As explained e.g. in , the existence of an area minimizing current is guaranteed by the local compactness Theorem for currents (see ), as soon as the μ\mu-measure of MM is finite, which is always the case in our setting. Since the minimizing current is regular by the previous result, it follows that the usual notion of weighted area (i.e. Minkowski boundary measure) and the weighted area of a current coincide, and hence there exists a regular minimizer of Minkowski boundary measure.

The assumption that MM has finite mass is essential for the existence of minimizers, otherwise one may construct counterexamples (see or [12, p. 51]). It is also essential that the density be continuous, otherwise minimizers need not necessarily exist (consider the density 14χ×+χ[14,1]×\frac{1}{4}\chi_{\times}+\chi_{[\frac{1}{4},1]\times} on ×\times).

We remark that the same existence and regularity argument works for manifolds with a smooth boundary. Let Ω(M,g)\Omega\subset(M,g) be a domain (connected open set) with C2C^{2} boundary, let dd be the geodesic distance induced by (M,g)(M,g), and let dμ=exp(ψ)dvolMΩd\mu=\exp(-\psi)dvol_{M}|_{\Omega} with ψC2(Ω)\psi\in C^{2}(\overline{\Omega}) so that μ(Ω)=1\mu(\Omega)=1. One can easily check that the argument of Grüter on the constant curvature of the regular part of the boundary and the orthogonality still applies, with a minor change in the conclusion. We summarize this in the following:

For any t(0,1)t\in(0,1), there exists an open isoperimetric minimizer AA of measure tt for the isoperimetric problem on (Ω,d,μ)(\Omega,d,\mu) as above. The boundary Σ=AΩ\Sigma=\overline{\partial A\cap\Omega} can be written as a disjoint union of a regular part Σr\Sigma_{r} and a set of singularities Σs\Sigma_{s}, with the following properties:

ΣrΩ\Sigma_{r}\cap\Omega is a C2C^{2} smooth, embedded hypersurface of constant generalized mean curvature, defined as:

where HΣr(x)H_{\Sigma_{r}}(x) denotes the usual mean curvature of Σr\Sigma_{r} in the direction of the unit normal νΣr(x)\nu_{\Sigma_{r}}(x) pointing into AA (i.e. the trace of the second fundamental form divided by (n1)(n-1)), for xΣrΩx\in\Sigma_{r}\cap\Omega.

Σr\Sigma_{r} meets Ω\partial\Omega orthogonally (even in the presence of a density).

Σs\Sigma_{s} is a closed set of Hausdorff co-dimension not smaller than 8.

It is then a (tedious) exercise to follow the proof of Sternberg and Zumbrun and Bayle (see also ) and to deduce the following extension of Theorem A.2:

Let Ω(M,g)\Omega\subset(M,g) be a locally convex domain with C2C^{2} boundary, and let dd,μ\mu as above. Assume that (A.1) holds on Ω\Omega. Then I=I(Ω,d,μ)I=I_{(\Omega,d,\mu)} is a concave function on $$.

In the one-dimensional case n=1n=1, it was shown by S. Bobkov that all of the above theorems hold as well (here there is no point to consider a general manifold):

Bobkov showed that in this case, the minimizing sets are always given by half-lines, from which it is immediate that I(t)=min(FF1(t),FF1(1t))I(t)=\min(F^{\prime}\circ F^{-1}(t),F^{\prime}\circ F^{-1}(1-t)), where F(s)=μ(,s)F(s)=\mu(-\infty,s). Using that μ\mu is log-concave, direct differentiation reveals that II is concave. Note that the case n=1n=1 is special since II may be discontinuous at and 11, but this has absolutely no consequences to our applications.

References