On the role of Convexity in Isoperimetry, Spectral-Gap and Concentration
Emanuel Milman
Introduction
The first way is by means of an isoperimetric inequality. Recall that Minkowski’s (exterior) boundary measure of a Borel set , which we denote here by , is defined as:
A very useful isoperimetric inequality was considered by Cheeger (and in a more general form, independently by V. G. Maz’ya ):
The space is said to satisfy Cheeger’s isoperimetric inequality if:
The best possible constant above is denoted by .
A second way to measure the interplay between and is given by functional inequalities. Let denote the space of functions which are Lipschitz on every ball in - we will call such functions “Lipschitz-on-balls” - and let . We will consider functional inequalities which measure the relation between and , for (more general Orlicz norms will be treated in ). Here, the effect of the metric is via the Riemannian metric which is used to measure , although more general ways exist to define in the non manifold setting. Of course if is constant there is no sense to compare against , so we will need to exclude these cases. To this end, we will require that either the expectation or median of are 0. Here and is a value so that and .
A well known example of a functional inequality was studied by Poincaré:
The space is said to satisfy Poincaré’s inequality if:
The best possible constant above is denoted by .
It is well known (e.g. ) that under appropriate smoothness assumptions, Poincaré’s inequality is equivalent to the existence of a spectral gap of an appropriate Laplacian operator on associated to the measure with corresponding boundary conditions on its support. When is uniform on a domain , coincides with the usual Laplace-Beltrami operator with Neumann boundary conditions on . The first non-trivial eigenvalue of (the “spectral gap”) is then precisely .
A third way to measure the relation between and is given by concentration inequalities. These measure how tightly -Lipschitz functions are concentrated about their mean, by providing a quantitative estimate on the tail decay . A typical situation is given by the following example:
The space is said to have exponential concentration if:
Fixing , the best possible constant above is denoted by . The best constant for a specific is denoted by .
It is known that the three examples mentioned above are arranged in a hierarchy. It was shown by Cheeger , and in a more general form, independently by Maz’ya (see also ), that Cheeger’s isoperimetric inequality always implies Poincaré’s inequality (or spectral gap):
(“Cheeger’s inequality”).
The fact that Poincaré’s inequality implies exponential concentration was first shown by M. Gromov and V. Milman in the Riemannian setting, and subsequently by other authors in other settings as well (e.g. , see and the references therein):
There exists a universal numeric constant such that .
2 Reversing the Hierarchy
In both cases, we will say that “our convexity assumptions are fulfilled”. More generally, we present the following definition:
We will say that our smooth convexity assumptions are fulfilled if:
denotes the induced geodesic distance on .
, , and as tensor fields on :
We will say that our convexity assumptions are fulfilled if can be approximated in total-variation by measures so that satisfy our smooth convexity assumptions.
The condition (1.1) is the well-known Curvature-Dimension condition , introduced by Bakry and Émery in their influential paper (in the more abstract framework of diffusion generators). Here denotes the Ricci curvature tensor and denotes the second covariant derivative. When the Ricci tensor satisfies a slightly relaxed condition , , it was first shown by Buser that the implication in Theorem 1.1 can be reversed. We only quote the case, which in our setting reads:
If is uniform on a closed -dimensional manifold and then , where is a universal numeric constant.
The fact that the constant above does not depend on the dimension is quite remarkable. Buser’s theorem was recently further generalized by M. Ledoux (following the method developed by Bakry–Ledoux ) to the Bakry-Émery abstract setting. Again, we only quote the case:
Under our smooth convexity assumptions , where is a universal numeric constant.
3 Main Theorem
How about reversing the implication in Theorem 1.2 under our convexity assumptions? This is one of the statements in our Main Theorem below. A second statement, which is much more surprising, concerns a very weak type of concentration inequality, which we introduce:
The space is said to satisfy First-Moment concentration if:
The best possible constant above is denoted by .
Clearly, by the Markov-Chebyshev inequality, First-Moment concentration implies linear tail-decay:
and decay slightly faster than linear implies (integrating by parts) First-Moment concentration. The First-Moment concentration is clearly a-priori much weaker than exponential concentration. Our Main Theorem, first announced in , asserts that under our convexity assumptions, not only is First-Moment concentration equivalent to exponential concentration, but in fact also to the a-priori stronger inequalities of Poincaré and Cheeger:
Under our convexity assumptions, the following statements are equivalent:
Cheeger’s isoperimetric inequality (with ).
Exponential concentration inequality (with ).
First Moment concentration inequality (with ).
The equivalence is in the sense that the constants above satisfy .
Here and below, means that , with some universal numerical constants, independent of any other parameter, and in particular the dimension . We will see in Section 4 that the use of the First-Moment is not essential in Statement (4); we may have required any arbitrarily slow uniform tail decay, instead of linear decay. In other words, if:
where decays to 0 arbitrarily slow, we can deduce under our convexity assumptions that Lipschitz functions have in fact much faster exponential tail decay (with rate depending solely on ), and in addition the stronger inequalities of Poincaré and Cheeger, as above. In this sense, our result extends the well-known Kahane-Khinchine type inequalities in Convexity Theory (e.g. consequences of Borell’s Lemma , see for an overview) stating that linear functionals have comparable moments, ensuring exponential tail decay, to the same statement for the “worst” -Lipschitz function (see Remark 4.4).
The Main Theorem may also be interpreted as stating that under our convexity assumptions, there exists a single -Lipschitz function whose level sets on average attain the minimum (up to constants) in Cheeger’s isoperimetric inequality (see Section 4). In fact, one may choose this function to be of the form , where is some set with . This is expressed in the following reformulation of the Main Theorem:
Under our convexity assumptions on :
Equivalently, this is tantamount to saying that under our convexity assumptions, it is only necessary to use test functions of the form when testing (up to a universal numeric constant) for the spectral gap in Poincaré’s inequality. Clearly, without any further assumptions, all of the above statements are in general false.
4 Applications to Spectral Gap of Convex Domains
where is some universal numeric constant.
Here Vol denotes the Lebesgue measure. In particular, we see that:
Note that satisfying the above condition can be very different geometrically (consider for instance a Euclidean ball of radius and its intersection with a centered slab of width ), and yet share essentially the same spectral gap. Also note that our stability result holds with respect to all possible Euclidean structures simultaneously, since the assumption in the left-hand side above is independent of the Euclidean structure.
We also observe that the quantitative dependence on in (1.4) is essentially best possible: the logarithmic dependence on is (up to numeric constants) optimal, and the quadratic dependence on cannot be improved beyond linear (and is in fact optimal in some restricted range, see Example 5.6). In addition, Theorem 1.7 implies that when with , , then . In fact, when with , we obtain in Corollary 5.3 the best possible (up to numeric constants) quantitative bounds on as a function of (see Example 5.7). To the best of our knowledge, no quantitative bounds on the stability of for convex domains under convex perturbations were previously known. Completely analogous stability results hold for log-concave probability measures as well (see Theorem 5.5). Another useful result which we deduce from our Main Theorem is that Cheeger’s constant is preserved under maps which are not necessarily Lipschitz, but rather Lipschitz on average (see Theorem 5.9).
5 Ingredients in Proof of Main Theorem
All of the four statements in our Main Theorem 1.5 can be equivalently (up to universal constants) rewritten using a single unified framework in terms of Poincaré inequalities:
The space is said to satisfy a Poincaré inequality if:
The best possible constant above is denoted by .
We prefer to use the median in our definition for reasons which will become apparent in Section 2. It is known and easy to establish that , , , so our Main Theorem can be restated as the claim that all Poincaré inequalities in the range are equivalent under our convexity assumptions (see Theorem 2.4).
The convexity assumptions are used in an essential way in the proof of the Main Theorem in several separate places. First, we employ the condition via the semi-group gradient estimates used by Ledoux in his proof of Theorem 1.4. Contrary to previous approaches, which could only deduce isoperimetric information from functional inequalities with a term with (see [8, p. 3] and the references therein), we can handle arbitrary (and although we do not pursue this direction here, more general Orlicz norms too). To demonstrate that our estimates are sharp, we remark that the isoperimetric inequalities we obtain are in fact equivalent (up to universal constants) to the Poincaré inequalities used to derive them. This is summarized in Theorem 2.9, which generalizes Theorems 1.1, 1.2, 1.3 and 1.4 above into a single unified framework. Using this, we deduce from the First-Moment inequality ( above) that:
To deduce Cheeger’s isoperimetric inequality from (1.5), we need to use our convexity assumptions for the second time. We employ the following series of results in Riemannian Geometry, due to numerous groups of authors , who proved them under increasingly general conditions. A detailed survey of these results may be found in the Appendix. We learned about these results from the PhD Thesis of V. Bayle , which was referenced to us by Sasha Sodin, to whom we are indebted. In the formulation below, we use a slightly more general notion of smooth convexity assumptions, which is defined in Section 6.
Under our generalized smooth convexity assumptions, the isoperimetric profile is concave on . Moreover, when is in addition uniform on , then is concave on $nM$.
It is not hard to show (see Section 6) that the isoperimetric profile is continuous under very general assumptions. It then follows by a general argument (e.g. Corollary 6.5) that must be symmetric about the point . Hence, the concavity of implies that under our convexity assumptions. It is then immediate to deduce Cheeger’s isoperimetric inequality from (1.5). In fact, a stronger statement can be deduced when is uniform on (see Remark 2.11).
A final ingredient in the proof is an approximation argument to handle non-smooth densities, which are typical in applications as well as essential for handling uniform measures on bounded domains (with possibly non-smooth boundaries). Contrary to many results in Convexity Theory, where approximation arguments are standard, easy and usually omitted, the isoperimetric profile and the Cheeger constant are delicate objects, which in general are not stable under approximation in the natural total-variation metric (see Section 6). We therefore employ our convexity assumptions one last time, and provide in Section 6 a careful argument for deducing the Main Theorem 1.5 without any smoothness assumptions, and a different approximation procedure for extending Theorem 1.8, which in particular applies to the entire class of log-concave measures in Euclidean space.
The rest of this work is organized as follows. In Section 2, we reformulate the Main Theorem in terms of an equivalence between Poincaré inequalities, and using Theorem 1.8, reduce it to the statement of Theorem 2.9. The semi-group argument for proving Theorem 2.9 is described in Section 3. Further interpretations and an extension of the Main Theorem are described in Section 4. Applications for the spectral gap under our convexity assumptions are described in Section 5. We conclude with an approximation argument for disposing of our smoothness assumptions in Section 6, and an Appendix describing in more detail the results summarized in the statement of Theorem 1.8.
Acknowledgements. I would like to thank Professor Gideon Schechtman and the Weizmann Institute of Science where this research project commenced during the last months of my PhD studies. I would also like to thank Professor Jean Bourgain and the Institute for Advanced Study for providing the perfect research environment. Most especially, I would like to thank Sasha Sodin for his invaluable help - acquainting me with capacities, suggesting to look at the PhD Thesis of Bayle and Ledoux’s semi-group argument, countless other references, many informative conversations and comments on this manuscript. I am also grateful to Franck Barthe for his kind hospitality, remarks and advice, Bo’az Klartag for several references, discussions and insightful remarks, Professors Sergey Bobkov and Michel Ledoux for their comments and advice, and Professor David Jerison for several interesting conversations and suggestions. My gratitude also extends to the anonymous referees for their extraordinary careful reading and helpful suggestions, which greatly improved the presentation of this work. Finally, I would also like to thank Professors David Jerison, Erwin Lutwak, Assaf Naor, Vladimir Pestov and Santosh Vempala for their invitations to give talks on this work in its early development.
(p,q)𝑝𝑞(p,q) Poincaré Inequalities
We start by rewriting some of the statements of the Main Theorem 1.5.
Let denote an Orlicz norm associated to the Young function . Then:
Note that . First, by Jensen’s inequality (applied twice):
Next, we may assume that (otherwise exchange by ). By the Markov-Chebyshev inequality:
We conclude by noting that since is convex. ∎
The last lemma implies that we can pass back and forth between using the median and the expectation when excluding constant functions in our functional inequalities, at the expense of losing a universal constant. We therefore see that Poincaré’s inequality is equivalent (up to constants) to the inequality:
(and in fact in this case one clearly has ). The next lemma, due to Maz’ya and Federer and Fleming (see also for a careful derivation), rewrites Cheeger’s isoperimetric inequality in functional form:
Cheeger’s isoperimetric inequality (with ) holds iff:
It is easy to show that Cheeger’s isoperimetric inequality is recovered by applying (2.2) to Lipschitz functions which approximate , the characteristic function of a Borel set , in an appropriate sense. Conversely, the co-area formula, which for general metric probability spaces becomes an inequality (see ), implies for with :
Since for a 1-Lipschitz function , , our First-Moment inequality is clearly equivalent to:
in the sense that where is the best constant above.
The above functional reformulations remain valid for general metric probability spaces , in which case we interpret for any as the following Borel function:
(and we define it as 0 if is an isolated point - see [19, pp. 184,189] for more details).
With the above reformulations (2.1), (2.2), (2.3) serving as motivation, the reasons behind our definition of Poincaré inequalities in the Introduction are now clear. Note that , and . We can now restate our Main Theorem 1.5 as follows:
Under our convexity assumptions, all Poincaré inequalities are equivalent in the range . More precisely, for any other :
In fact, a more precise dependence on and may be obtained in some cases. For instance, clearly if and without any further convexity assumptions (by Jensen’s inequality), so we see that the First-Moment inequality ( case) is the weakest among all Poincaré inequalities in the above range. Another immediate observation is given by:
Let and be such that:
Then without any further convexity assumptions, .
Let denote a function with . Define , and apply the Poincaré inequality to . Clearly , so we obtain by Hölder’s inequality:
Maz’ya–Cheeger inequality: .
Gromov–Milman inequality: .
Since , we conclude by Proposition 2.5 that for every . Let be a 1-Lipschitz function. It is elementary to show (e.g. ) that is equivalent (to within universal constants) to , and that is in turn equivalent to . Employing Lemma 2.1 and using the Poincaré inequalities:
since was assumed 1-Lipschitz. Taking supremum on all such functions , we obtain the conclusion. ∎
The exact same proof shows that , for arbitrary .
We have seen that passing from to is manageable if (perhaps under some additional assumptions on ) without any convexity assumptions. Unfortunately, we are interested in the case , for which an analogous statement to Proposition 2.5 is simply false without any additional assumptions (counter examples are easy to construct, as in the Introduction). Our first ingredient in the proof of Theorem 2.4 states that our convexity assumptions already suffice to extend Proposition 2.5 to the case , :
Let , , and set . Assume that . Then under our smooth convexity assumptions, the following statements are equivalent:
where the best constants and above satisfy:
for some universal constants . In fact, the direction holds for without any convexity assumptions.
Note that when , the direction reduces (up to constants) to Theorem 1.1 (Maz’ya–Cheeger inequality), and the direction to the Buser–Ledoux Theorems 1.3,1.4. A generalization of Theorem 2.9 involving general Orlicz norms will be derived in .
Let . Without any convexity assumptions, the Poincaré inequality:
is equivalent to the following isoperimetric inequality:
The proof of is thus complete.
Before proceeding to the proof of the direction (this will be the focus of the next section), let us recall how Theorem 2.9 coupled with Theorem 1.8 conclude the proof of Theorem 2.4 and hence of our Main Theorem 1.5:
By an approximation argument we develop in Section 6, it is enough to prove the theorem under our smooth convexity assumptions.
By Jensen’s inequality, in the range . Employing our (smooth) convexity assumptions, the direction of Theorem 2.9 implies:
Using our (smooth) convexity assumptions for the second time, Theorem 1.8 asserts that is concave on . Since is also symmetric about (see Corollary 6.5), we immediately deduce that:
which is exactly Cheeger’s isoperimetric inequality, and is identical to stating . Using Proposition 2.5 and Jensen’s inequality if necessary, we can pass from this to an arbitrary inequality in the range . ∎
Note that when is the uniform measure on , Theorem 1.8 in fact ensures that is concave, so we may deduce from (2.5) that in fact:
Proposition 2.10 implies that the latter isoperimetric inequality is equivalent to a Poincaré inequality. Hence, it is clear that in this case, both our Main Theorem 1.5 and Theorem 2.4 can be strengthened.
The Semi-Group Argument
In this section, we prove the direction of Theorem 2.9. Our proof closely follows Ledoux’s proof of Theorem 1.4.
Given a smooth complete oriented connected Riemannian manifold equipped with a probability measure with density , , we define the associated Laplacian by:
where is the usual Laplace-Beltrami operator on . acts on , the space of bounded smooth real-valued functions on . Let denote the semi-group associated to the diffusion process with infinitesimal generator (cf. ), characterized by the following system of second order differential equations:
For each , is a bounded linear operator and its action naturally extends to the entire spaces (). We collect several elementary properties of these operators:
for all .
The following crucial dimension-free reverse Poincaré inequality was shown by Bakry and Ledoux in [5, Lemma 4.2], extending Ledoux’s approach for proving Buser’s Theorem (see also [5, Lemma 2.4], [56, Lemma 5.1]). It may also be interpreted as a weak, dimension-free, form of the Li–Yau parabolic gradient inequality .
Assume that the following Bakry-Émery Curvature-Dimension condition holds on :
Then for any and , we have:
In fact, the proof of this lemma is very general and extends to the abstract framework of diffusion generators, as developed by Bakry and Émery . We comment that in the Riemannian setting, it is known (see also ) that the gradient estimate of Lemma 3.1 is preserved when restricting to a locally convex domain (as defined in the Appendix) with smooth boundary; we refer to Sturm [83, Proposition 4.15] for a general statement about closedness of the Bakry-Émery Curvature-Dimension condition in an arbitrary metric probability space. The above lemma therefore holds under more general conditions, namely when is supported on a locally convex domain with boundary, and , . In this case, in (3.1) denotes the Neumann Laplacian on , denotes the space of bounded smooth real-valued functions on satisfying Neumann’s boundary condition on , and Lemma 3.1 remains valid.
Our convexity assumptions are that in Lemma 3.1, and this is what we will henceforth assume. It is clear that our results in this section may be extended to the case of , but we do not pursue this direction in this work.
From Lemma 3.1, it is immediate that for any :
and using , Ledoux easily deduces the following dual statement [56, (5.5)]:
First, our assumption on the range of implies that by applying Proposition 2.5 if necessary, we may assume that at the expense of an additional universal constant appearing in (2.4). An additional universal constant will appear on account of Lemma 2.1, with which we pass to instead of in (1), so our assumption now reads:
Let denote an arbitrary Borel set in , and let denote a continuous approximation in to the characteristic function of . Clearly:
Applying Corollary 3.2 to functions in which approximate (in say ) and passing to the limit inferior as , it follows that:
We start by rewriting the right hand side above as:
Note that by Hölder’s inequality (recall that ) and our assumption (3.5):
Using (3.3) (recall that ) to estimate , we conclude that:
We may now optimize on . Using the rough estimate:
for , we evaluate (3.6) at time:
where . Since , this concludes the proof. ∎
As evident from the proof, for deducing the direction of Theorem 2.9, the definition of smooth convexity assumptions given in the Introduction may be extended to encompass the more general case treated in this section. Moreover, it is possible to provide an approximation argument for deducing this direction without any smoothness assumptions. We provide the argument in and omit it here, since it is not required for the results of this work.
Interpretations and Extensions
In this section, we provide some further interpretations and extensions of our Main Theorem, which will also be needed for the applications of the next section. We assume throughout this section that our convexity assumptions on are satisfied.
Lemma 2.2 demonstrates that if is a set with on which the minimal ratio in Cheeger’s isoperimetric inequality is attained (or nearly attained), then the function (or the sequence of Lipschitz functions which approximate it) attains the same (nearly) minimal ratio
among all functions with . Clearly (or its approximating sequence) is far from being -Lipschitz. If on the other hand we define:
which is a -Lipschitz function, it is not clear that it will have a small ratio in (4.1). Our Main Theorem 1.5 (together with Lemma 2.1) states that under our convexity assumptions, any -Lipschitz function on with which is (essentially) optimal in the First-Moment inequality (say ), also essentially minimizes the ratio in (4.1). Moreover, using the co-area formula as in Lemma 2.2 and applying our Main Theorem, we have:
from which we also see that the ratio for the “average” level set of is essentially , the smallest possible.
Theorem 1.6 from the Introduction states that as above may in fact be chosen to be of the form (4.2).
Given a Borel set with , we denote . Clearly is -Lipschitz and , so one direction follows immediately by Lemma 2.2:
For the other direction, we employ our Main Theorem (and Lemma 2.1):
where the infimum is over all -Lipschitz functions on with . Denoting , we have , . By continuity of , , (even though it is possible that ), and since it is -Lipschitz:
The next proposition will prove to be very useful for the applications of the next section. We start with some notations. Given a Borel function on a Borel probability space and , let us denote by the -quantile of :
Let us also recall an inequality due to Paley and Zygmund (see also [46, Chapter 2]), which in its simplest form reads as follows:
Let denote a Borel function on , and assume that:
Then for any , denoting , one has .
Let denote a -Lipschitz function with either and or and . Then:
for some universal constants and .
Proceeding as in Corollary 2.7, and using Lemma 2.1 and the Main Theorem:
for some universal constant , and (4.4) follows by Lemma 4.1 (with ). Note that our convexity assumptions necessarily imply that (see Lemma 6.13), so the appeal to Lemma 4.1 is indeed legitimate. ∎
An arbitrarily slow uniform tail decay condition (1.3) implies any of the statements of the Main Theorem 1.5, with depending solely on . Moreover, in (1.3) may be replaced by .
Given a -Lipschitz function satisfying either of the assumptions of Proposition 4.2, these and (4.4) imply that:
Consequently, the tail decay condition (1.3) (whether stated with or ) ensures that , so by Lemma 2.1 the First-Moment concentration inequality is satisfied, from which the other statements of the Main Theorem follow. ∎
Applications to Spectral Gap of Convex Domains
First, we would like to obtain a stability result for (or equivalently ) for perturbations of . Clearly, without any further assumptions, there can be no such result (as seen by adding arbitrarily small “necks” to as in the Introduction), so we restrict our attention to convex domains. In this case, our Main Theorem 1.5 asserts that this is equivalent to obtaining a stability result for , which is much easier. To obtain the best quantitative bounds, we will also employ .
Let denote a -Lipschitz function on with so that . Since is convex, we may clearly extend to a -Lipschitz function on , say by defining . Here denotes the unique (by convexity) in so that . We may assume that (otherwise exchange with ). Note that we can estimate as follows:
By Proposition 4.2, there exists some universal so that . Using this, the ratio between the volumes of and , the triangle inequality, the Markov-Chebyshev inequality and the estimate on in (5.1), we evaluate:
where is some universal constant. Using Lemma 2.1 and (2.3), the assertion follows. ∎
Note that for any and in fact even without assuming that is convex:
Indeed, since is convex, by Theorem 1.8 (more precisely, its extension to non-smooth domains or densities given by Theorem 6.10 and Corollaries 6.11,6.12) we know that . Given a Borel set with , we have:
By the assumption in (5.2), , and from this we easily deduce the conclusion in (5.2). Iterating this using a sequence of intermediate convex bodies (here we already need to use that is convex) so that (for example, assuming , choose for appropriate ), we obtain that:
Taking the limit as yields the claimed power of (even without any additional numerical constant!). ∎
then applying Lemma 5.2, the Main Theorem 1.5 and Lemma 5.1, we obtain:
for some universal constants , concluding the proof of Theorem 1.7. Of course a similar upper bound on is obtained by interchanging the roles of .
In Convexity Theory, many interesting ways are known to cut a convex body so that its volume is preserved up to a constant (e.g. by slabs, parallelepipeds, balls etc…). We see that all of these preserve (up to a constant) (equivalently, the spectral gap ). A useful way to measure the distance between two convex bodies is given by the following variant on the usual geometric distance:
so by passing from the outer to the inner body (in which case our estimates are logarithmic), we deduce:
for some , where is some universal constant, then:
where is another universal constant.
Denoting the best constants in (5.4) and applying Lemma 5.1:
and since , the assertion follows. ∎
appearing in the assumptions of both lemmas has a clear and intuitive geometric meaning.
Lemmas 5.1 and 5.2 remain valid for absolutely continuous log-concave probability measures (replacing respectively ), if the condition (5.5) in the assumption is replaced by the condition:
and are replaced by () in the corresponding conclusion.
Identical to the proof of the original lemmas; the only minor point is the construction of intermediate measures in the proof of Lemma 5.2, which may be defined e.g. by , , for appropriate (assuming the origin is in the interior of the support of ). ∎
The analogue of Theorem 1.7 may then be conveniently formulated using the total-variation metric:
with and a universal constant.
Let denote the measure whose density is , and note that . Denoting by the (log-concave) probability measure , since , , we may apply Lemma 5.4 and the Main Theorem to pass from to to as in (5.3), concluding the proof. ∎
2 Optimality of Stability
To the best of our knowledge, no quantitative results on the stability of or for convex domains with respect to volume preserving perturbations or geometric distance were previously known. Moreover, we claim that the bounds obtained in Theorem 1.7 (or (5.3)) are optimal (up to numeric constants) with respect to and close to optimal with respect to (note that the dependence is logarithmic in the former yet quadratic in the latter; in other words, the deterioration in the Cheeger constant when passing from an outer convex body to an inner one is genuinely different than when passing from the inner one outward). This is witnessed by the following:
It is known (see Subsection 5.5) that , so by the -homogeneity of , it follows that and . Denoting , since , we conclude that:
uniformly for all . So one cannot expect better than logarithmic dependence on (at least when ), which coincides with the estimate given by Lemma 5.1.
On the other hand (as is well-known), if we set and a circumscribing box with , since in that range, it is clear that the quadratic dependence on in Lemma 5.2 cannot be improved beyond linear. Although we do not know whether the optimal bound is, up to a constant, closer to the linear or quadratic asymptotic, we comment that for very small perturbations (i.e. very close to 1), it is possible to show that the exact quadratic bound in Lemma 5.2 is optimal (in this range of , we of course do not allow any additional numerical constants).
The next example (which is similar yet different from the previous one) shows that the bounds in Corollary 5.3 are optimal too (up to numeric constants), as a function of in the stated range.
Since clearly , it remains to note that , so . By interchanging the roles of appropriately, we observe that the estimates on in Corollary 5.3 are sharp both from above and from below.
It is easy to adapt the proofs of Lemma 5.1 and consequently Corollary 5.3 to obtain even sharper quantitative bounds (up to universal constants) on the stability of for specific convex bodies, such as the Euclidean ball . For instance, in the latter case, one obtains that if for , then:
This is an improvement over Corollary 5.3 and known to be sharp for (folklore).
It is well known and immediate to see that isoperimetric inequalities are preserved under -Lipschitz mappings. Given two metric probability spaces and , recall that a Borel map is said to push forward onto , if for every Borel set . This is equivalent to requiring that for any Borel function on :
This will be denoted by . The following is then immediate from the definitions:
The following result states that when our convexity assumptions hold for the target space, as far as Cheeger’s isoperimetric inequality is concerned, one need not require that be Lipschitz on the entire space, but rather just on average. We would like to thank Bo’az Klartag for a fruitful discussion regarding this point.
Assume that verifies our convexity assumptions and that for some Lipschitz-on-balls map . Then:
Here denotes the local Lipschitz constant of at :
When is smooth and are linear spaces, this coincides with the operator norm of the usual derivative matrix at .
First, rewrite Cheeger’s isoperimetric inequality on in functional form (Lemma 2.2):
Using this, we estimate the First-Moment constant on . Given a -Lipschitz function on , clearly is Lipschitz-on-balls on , hence in . We then have by the definition of push-forward and our assumption (5.6):
We conclude by our Main Theorem (and Lemma 2.1), which imply that under our convexity assumptions on . ∎
In this subsection, we easily recover some previously known estimates on the Cheeger constant of convex domains in a single framework and extend some results to the Riemannian setting. We begin with the following stimulating conjecture from :
Here denotes the largest eigenvalue of the symmetric covariance matrix of :
We will write for .
Although the KLS conjecture is far from being resolved, some general lower bounds on are known, but these produce dimension-dependent results. We will see that our Main Theorem easily reproduces these bounds.
The following result in the Euclidean setting is due to Payne and Weinberger . This was generalized to the Riemannian setting by Li and Yau . We refer to the Appendix for missing definitions.
If is a locally convex bounded domain with smooth boundary and , then:
where diam denotes the diameter and the induced geodesic distance. In fact, when is Euclidean space the constant 2 above may be omitted.
Ledoux’s Theorem 1.4 implies that the same lower bound (up to an additional constant) holds for . In the Euclidean case, this was strengthened in :
To obtain this result, Kannan, Lovász and Simonovits developed a geometric localization technique (which in fact can be traced back to the work of M. Gromov and V. Milman ). As pointed out to us by Sasha Sodin, it is interesting to note that this technique uses some geometric properties of Euclidean space and does not generalize to other Riemannian manifolds (except in special cases, like that of the Euclidean Sphere, as in the work of Gromov–Milman). Our method, on the other hand, does allow us to state the following generalization of Theorem 5.11 to the Riemannian setting, which also improves over Theorem 5.10:
Assume that satisfies our convexity assumptions. Then:
As usual, we just need to bound . Let denote a -Lipschitz function on . Then for any , applying the triangle inequality twice:
and the claim follows by our Main Theorem. ∎
An alternative approach to localization for proving isoperimetric inequalities was developed by Bobkov in the Euclidean setting. Bobkov’s approach was extended by Barthe and subsequently by Barthe and Kolesnikov . This approach is based on the Prékopa–Leindler inequality (e.g. ), or equivalently, on optimal transportation, which have both been recently generalized to the Riemannian-with-density-setting by Cordero-Erausquin, McCann and Schmuckenschläger . Using these tools we expect that it should be possible to provide an alternative proof of Theorem 5.12 following Bobkov’s approach, but as pointed out to us by one of the referees, this has yet to be accomplished. We would like to thank the referee for his comments regarding our original simpleminded remark in this direction.
We would like to mention another bound on obtained in using the localization method.
where denotes the longest symmetric interval contained in and centered at , and is a universal constant.
We have recently managed to derive this result using our Main Theorem, but this will be described elsewhere. Instead, we would like to show how this bound may be used to recover a result of Bobkov ; in fact, the bound we deduce is formally stronger than Bobkov’s. Bobkov employs the localization method as well, but then relies on some nice trick involving moment inequalities for polynomials in the log-concave setting. Our argument, on the other hand, is more geometric. Independently of our proof, we heard about a similar idea for bounding the boundary measure of large sets from Santosh Vempala (using localization as well).
where denotes the variance with respect to .
Without loss of generality, we may assume that ; for general the claimed bound follows by translating . Let , , and denote:
By Chebyshev’s inequality, , so if we define , it follows that . Hence by Theorem 5.5. Assume that , otherwise the support of has diameter bounded by , and one can conclude as in Theorem 5.12. We now employ Theorem 5.14 to bound :
The crucial geometric observation is that for the Euclidean ball :
It remains to plug this into (5.8) and evaluate the resulting expression using integration by parts and Chebyshev’s inequality. We leave it as an exercise to conclude that:
for some universal constant . This bound is in fact formally better than Bobkov’s bound (by several applications of Hölder’s inequality), but using some standard results in Convexity Theory, it is in fact equivalent in the interesting situations. ∎
This is immediate from the results of Schechtman and Zinn , who showed that of these bodies is bounded from below by a universal constant. The result then follows from our Main Theorem (in fact, we only need a bound on ). ∎
Another family of convex bodies for which the KLS conjecture is almost confirmed, is that of unconditional convex bodies , i.e. convex bodies for which iff . It was recently shown by Bo’az Klartag that if is an unconditional body with then , for some universal constant . To obtain this result, Klartag employed Theorem 1.7 to pass to an unconditional body contained inside the cube , and then used some symmetry properties of the Laplacian’s eigenfunctions to conclude his result. In fact, one can just use Theorem 1.8 on the concavity of the isoperimetric profile (in the form of Lemma 5.2) for this application.
We conclude this section by stating the known dimension dependent bounds on for non-degenerate convex bodies (in the sense that ).
It is known in this case that (by a simple volume estimate). Theorem 5.10 (together with Theorem 1.4) then gives . The first KLS bound (Theorem 5.11) improves this to , since:
Bobkov’s bound (Theorem 5.15) is always at least as good as the first KLS bound (up to a constant), since (using the bound derived in the proof together with a standard application of Borell’s lemma ):
for some universal constant . We see that whenever some non-trivial information on is known, Bobkov’s bound is strictly better. Such a remarkable result was proved by Bo’az Klartag , allowing him to deduce a Central-Limit type result for the class of convex bodies (and more generally, log-concave measures). Klartag’s improved estimate in reads:
Approximation Argument
In this section, we develop an approximation argument for extending the following theorems to non-necessarily smooth densities (or boundaries) in our convexity assumptions:
Theorem 1.8 on the concavity of the isoperimetric profile.
We will develop different procedures for extending each of these theorems.
We begin by extending our definition of smooth convexity assumptions (we refer to the Appendix for the definition of locally convex).
We will say that our generalized smooth convexity assumptions are fulfilled if:
is a locally convex domain with boundary.
denotes the induced geodesic distance on .
, , and as tensor fields on :
This definition was already used in the statement of Theorem 1.8 on the concavity of the isoperimetric profile. The smoothness assumptions in the above definition are used in an essential way in the proof of this theorem to deduce the existence and regularity of the isoperimetric minimizers, which are otherwise false. This permits the use of variational methods from Riemannian Geometry, consequently obtaining a second-order differential inequality which the isoperimetric profile must satisfy (see the Appendix for more details). Nevertheless, the restriction to smooth densities and domains still seems like a technical artifact of the proofs. Some authors have suggested various methods to remove these smoothness assumptions (see e.g. Morgan and Bayle [12, Chapter 4]), but unfortunately these are not well suited for our purposes. We therefore attempt to use a different approximation argument for extending Theorem 1.8 to a more general setting.
At first glance, it is tempting to believe that the isoperimetric profile of should be stable under approximating the measure by measures in, say, total-variation distance. However, the profile is in fact not even pointwise continuous under arbitrary approximation in total-variation. To see this, consider the measures which are uniform on the set , and converge to , the uniform measure on $I_{\mu_{m}}(1/2)=0m\geq 3I_{\mu}(1/2)=1$. So one must take care when specifying the approximation.
We say that a sequence of Borel probability measures tends to from above if converges to in total-variation and in addition there exists a sequence which tends to 1, so that for any Borel set .
Let be a metric space and let be a sequence of Borel probability measures on which tends to from above. Then for any :
Denote and for short. Let . Then there exists such that for all , for any Borel set . Let , then for every there exist a Borel set such that:
Taking the limit as and subsequently , we obtain the assertion. ∎
We say that a sequence of Borel probability measures tends to from within if for some sequence of Borel sets such that , and in addition .
Let be a metric space and let be a sequence of Borel probability measures on which tends to from within. Then for any :
We continue with the same assumptions and notations as in the proof of the previous lemma and definition. In our case, we may assume that . Then:
Taking the limit as and subsequently , we obtain the assertion. ∎
Next, we recall the definition of -capacity (we will only require the case ). Capacities were introduced in the 1960’s by Maz’ya , Federer and Fleming , and were used by Bobkov and Houdré in . We follow a variation on the definition given in (for general ), which was extended by Barthe, Cattiaux and Roberto (with ) in (after being introduced in ). We conform to the definition implicitly used by Sodin in and Sodin and the author in .
Given a metric probability space , and , we denote:
where the infimum is on all which are Lipschitz-on-balls (recall the definition of given in Remark 2.3).
The following proposition encapsulates the connection between -capacity and the isoperimetric profile . The proof is very much along the lines of the proof of Lemma 2.2, so we will omit it here; the reader is referred to Sodin [81, Proposition A] for an elementary derivation (note the slight difference in our formulation). We only remark that it suffices to use Lipschitz functions in the definition of capacity above for the purpose of this proposition.
Since obviously , it follows that:
Letting converge to , and replacing with , we obtain:
If is lower semi-continuous at and , , then .
Let be a metric space and let be a sequence of Borel probability measures on which converges in the total-variation norm to . Assume in addition that are concave on . Then for any :
As usual, denote and for short. Let and small be given, and let denote a Lipschitz function so that:
For any small , there exists an so that for any :
We conclude by Proposition 6.4 and the concavity of that:
Since is Lipschitz (hence is bounded), and converge to in total-variation, we can pass to the limit as :
Taking infimum on all such as above and using Proposition 6.4 again, we obtain:
Taking the limit of to 0, we obtain the desired conclusion. ∎
It is clear from the proof that the concavity condition may be seriously relaxed (e.g. to equicontinuity), and the regularity condition on obtained in Lemma 6.9 below may also be used.
Combining the last three lemmas we immediately obtain:
Let be a metric space, let be a sequence of Borel probability measures on which converges in the total-variation norm to , and assume that are all concave on . If in addition tend to from above or from within, then for any :
In particular, if is in addition lower semi-continuous, we have (pointwise):
The following lemma, which extends the argument given by Gallot in [34, Lemma 6.2] for compact manifolds with uniform density, provides a sufficient condition for the isoperimetric profile to be continuous.
Let denote an -dimensional () smooth complete oriented connected Riemannian manifold and let denote the induced geodesic distance. Let denote an absolutely continuous measure with respect to , such that its density is bounded from above on every ball (but not necessarily from below, nor do we assume it is continuous). Then is absolutely continuous on $\frac{n-1}{n}$.
By Lebesgue’s Theorem, we know for almost every (with respect to ),
where denotes the ball in of radius around , denotes the Riemannian volume on (and by abuse of notation the induced volume on any submanifold as well), and denotes the upper bound on the density of on a compact set . By Rauch’s Comparison Theorem, for any such compact set (and in particular a singleton), there exists a so that for any and :
where and denote the Euclidean unit ball and sphere, respectively, and Vol denotes Euclidean volume. Therefore as :
where depends on and only. Since clearly , this takes care of the continuity at and .
Given , set , , and . Let denote the (possibly negative) lower bound on the sectional curvature of on . Rauch’s Theorem also implies that:
where denotes the simply connected model space with constant curvature , denotes the volume on and is any ball in of radius .
Given a set with , note that by Fubini’s Theorem, (6.3) and the definition of , for any :
We conclude from and that given any with and , there exists an such that:
Now let be close enough such that there exists an such that:
By definition, for any , there exists a set such that and . By (6.7) there exists an such that , and since is absolutely continuous, it follows that there exists an such that . Therefore:
where we have used (6.2) and (6.4) in the last inequality. Sending to 0 and plugging in (6.8), we conclude that for some constant which depends on :
To get the inequality in the other direction, we require that are close enough so that in addition satisfies:
Now let be such that and . Applying (6.7) for the set , we find an and such that . Repeating the above argument then gives:
Since is monotone, this concludes the proof. ∎
Our approximation argument is now clear. Given a measure in the setting of Lemma 6.9, we know that its isoperimetric profile is continuous. Assume that can be approximated from above or from within by measures satisfying our generalized smooth convexity assumptions. By Theorem 1.8, the corresponding profiles (and when the densities are uniform, also the renormalized profiles ) are concave, and so applying Proposition 6.8, we deduce the pointwise convergence of to , which clearly preserves concavity. We therefore deduce:
Let denote an -dimensional () smooth complete oriented connected Riemannian manifold and let denote the induced geodesic distance. For each , let denote a sequence of Borel probability measures on so that satisfies our generalized smooth convexity assumptions. Assume that tends to an absolutely continuous Borel probability measure from above or from within, and denote and . Then pointwise and consequently is concave on $\mu_{m}\Omega_{m}I^{n/(n-1)}$.
The argument has already been sketched. We only remark that it is not hard to verify the validity of the assumptions of Lemma 6.9 on , as the limit of as above (see e.g. [64, Remark 6.2]). ∎
Approximate from outside by smooth convex domains using standard methods (see e.g. ). Note that will only guarantee smoothness. ∎
The case follows from Theorem A.4 in the Appendix. For the case , we will need to approximate from above and within by a sequence of smooth log-concave probability measures. Since we did not find a standard reference for this, we outline the argument.
First, assume that the support of is compact. Approximate by smooth log-concave probability measures in total-variation distance using standard methods (e.g. convolution with a Gaussian mollifier). Now define to be the dilatation of given by for all Borel sets , where is a point in the interior of (another possibility would be to use sup-convolution with a small Gaussian). It is then not hard to check that for a suitable subsequence, tends to from above, from which the assertion follows by Theorem 6.10.
In case the support of is not compact, we repeat the above argument for the truncated measures , where denotes the Euclidean unit-ball. Note that as by the co-area formula:
Hence tends to from within, and so by Theorem 6.10 the claim now follows for arbitrary log-concave measures. ∎
2 Stability of First-Moment Concentration
Up to now, we have only concluded the Main Theorem 1.5 under our smooth convexity assumptions. We now describe how to extend these assumptions to our general convexity assumptions.
Indeed, assume that can be approximated in total-variation by measures with density such that and on . We would like to show that our Main Theorem, stating that for some universal constant , still holds. It is immediate to deduce from Lemma 6.6 that:
and using our Main Theorem for the smooth measures (and Lemma 2.1), we deduce that:
for some universal constant . The First Moment constant is particularly easy to handle, since there is no term which needs to be controlled. The following lemma, which is an adaptation of a classical lemma of C. Borell from the Euclidean case to the Riemannian-manifold-with-density setting, enables us to reduce to the case that are all supported on some compact set:
Let and be such that . Then:
Given this lemma, it is easy to proceed as follows. Fix and so that . Then for some and all , we have , and hence by the lemma we conclude that:
Let denote the -Lipschitz functions on so that and (we assume without loss of generality that the supremum is achieved). Since are continuous, and , there must exist a so that . Since are -Lipschitz, it follows that for any :
Hence, given , there exists a so that:
But since our Lipschitz functions are uniformly bounded on by (by passing through as before), the convergence of to in total-variation implies:
Finally, we note that for large enough, by the Markov-Chebyshev inequality (we assume here without loss of generality that ):
so . Combining everything together, we deduce that for large enough:
Since was arbitrary, we conclude that:
This concludes the proof, since as usual, we may pass from to using Lemma 2.1.
For completeness, we provide a proof of Lemma 6.13, using the following remarkable generalization of the Prékopa-Leindler inequality (e.g. ) due to Cordero-Erausquin, McCann and Schmuckenschläger (generalizing their own result from ). Given and , define:
Indeed, if this is not so, there would exist a so that:
which would imply that , a contradiction. Hence, (6.9) implies that the functions , and satisfy the assumption of Theorem 6.14 with . Theorem 6.14 then implies that:
and the conclusion of the lemma follows. ∎
Appendix
In the Appendix, we provide more details regarding the statement and ideas underlying the proof of Theorem 1.8 from the Introduction, as it plays an essential role in our argument. In the statement of this theorem, we have summarized a series of results in Riemannian Geometry concerning the concavity of the isoperimetric profile, which were proved under increasingly general convexity assumptions. An essential ingredient in the proofs of these results is provided by Geometric Measure Theory, which guarantees the existence and regularity of the isoperimetric minimizers, and permits the use of a variational argument to deduce the concavity of the profile.
First, we survey the case where the metric space is given by a bounded domain (connected open set) with boundary in a smooth complete oriented connected -dimensional () Riemannian manifold along with the induced geodesic distance in , and the probability measure is given by the restriction to of the Riemannian volume form on , normalized so that . We summarize for completeness some remarkable results provided by Geometric Measure Theory about the existence and regularity of isoperimetric minimizers in the case we are considering, and refer to the books of Federer , Morgan , Giusti and Burago and Zalgaller for further information.
For any , there exists an open isoperimetric minimizer of measure for the isoperimetric problem on as above. The boundary can be written as a disjoint union of a regular part and a set of singularities , with the following properties:
is a smooth, embedded hypersurface of constant mean curvature.
meets orthogonally.
is a closed set of Hausdorff co-dimension not smaller than 8. This result is sharp.
For all the results to be described, it is essential that the Hausdorff co-dimension of the singular part of the boundary is large (although typically knowing that it is greater than 3 is sufficient). This approach was used by M. Gromov in his influential generalization of P. Lévy’s isoperimetric inequality ,[39, Appendix C]. The negligible singular part permits to consider a normal variation of the regular part, and from there on one may continue by using the readily available tools from Riemannian Geometry to calculate the first and second variations of volume and area. Before proceeding, we remark that most results we will mention deduce that the isoperimetric profile satisfies a second order differential inequality under more general convexity assumptions than stated (e.g. a negative lower bound on the Ricci curvature), and provide a characterization of the equality case as well.
The first convexity assumption which we add is that the Ricci curvature tensor of be non-negative. When is a closed manifold and , and under the additional assumption that all isoperimetric minimizers are smooth submanifolds (this is always the case when ), it was shown by Bavard and Pansu that is concave on $I^{n/(n-1)}$ is concave in this case. This result captures the right dependence of the dimension in the exponent.
A domain is said to be locally convex, if all geodesics in tangent to are locally outside of . By a result of Bishop , in case that has boundary, this is equivalent to requiring that the second fundamental form of with respect to the normal pointing into be positive semi-definite on all of .
We summarize the above results in the following:
Let be a smooth complete oriented connected Riemannian manifold of dimension with non-negative Ricci curvature, and let denote a locally convex bounded domain in . Let denote the induced geodesic distance in and the restriction to of the canonical volume form on , normalized so that . Assume in addition that has smooth boundary. Then the isoperimetric profile is a concave function on $I^{n/(n-1)}$.
A.2 Manifolds with densities
As before, let denote an -dimensional () smooth complete oriented connected Riemannian manifold with induced geodesic distance . In addition, let be such that is a probability measure on . Since the influential work of Bakry and Émery in the abstract framework of diffusion generators, it is known that a natural convexity condition on a manifold with density, which replaces the condition in the uniform density case, is to require the following Curvature-Dimension condition:
Let and as above. Assume that (A.1) holds on . Then is a concave function on $$.
This theorem was proved by Bayle in under the assumption that is a closed manifold. It was noted (without explanation) by Morgan [71, Corollary 9] that the same proof applies for a general complete manifold, as long as it has finite -measure. Indeed, Bayle’s argument remains exactly the same; the only point one needs to check is the existence and regularity of isoperimetric minimizers in the manifold with density setting. The argument goes as follows: it was shown by Morgan in [70, Remark 3.10] that given a complete smooth Riemannian manifold with positive density (), if there exists an area minimizing current then its boundary is necessarily regular outside a set of Hausdorff codimension at least 8. As explained e.g. in , the existence of an area minimizing current is guaranteed by the local compactness Theorem for currents (see ), as soon as the -measure of is finite, which is always the case in our setting. Since the minimizing current is regular by the previous result, it follows that the usual notion of weighted area (i.e. Minkowski boundary measure) and the weighted area of a current coincide, and hence there exists a regular minimizer of Minkowski boundary measure.
The assumption that has finite mass is essential for the existence of minimizers, otherwise one may construct counterexamples (see or [12, p. 51]). It is also essential that the density be continuous, otherwise minimizers need not necessarily exist (consider the density on ).
We remark that the same existence and regularity argument works for manifolds with a smooth boundary. Let be a domain (connected open set) with boundary, let be the geodesic distance induced by , and let with so that . One can easily check that the argument of Grüter on the constant curvature of the regular part of the boundary and the orthogonality still applies, with a minor change in the conclusion. We summarize this in the following:
For any , there exists an open isoperimetric minimizer of measure for the isoperimetric problem on as above. The boundary can be written as a disjoint union of a regular part and a set of singularities , with the following properties:
is a smooth, embedded hypersurface of constant generalized mean curvature, defined as:
where denotes the usual mean curvature of in the direction of the unit normal pointing into (i.e. the trace of the second fundamental form divided by ), for .
meets orthogonally (even in the presence of a density).
is a closed set of Hausdorff co-dimension not smaller than 8.
It is then a (tedious) exercise to follow the proof of Sternberg and Zumbrun and Bayle (see also ) and to deduce the following extension of Theorem A.2:
Let be a locally convex domain with boundary, and let , as above. Assume that (A.1) holds on . Then is a concave function on $$.
In the one-dimensional case , it was shown by S. Bobkov that all of the above theorems hold as well (here there is no point to consider a general manifold):
Bobkov showed that in this case, the minimizing sets are always given by half-lines, from which it is immediate that , where . Using that is log-concave, direct differentiation reveals that is concave. Note that the case is special since may be discontinuous at and , but this has absolutely no consequences to our applications.