Thin shell implies spectral gap up to polylog via a stochastic localization scheme
Ronen Eldan
Introduction
is the -extension of . The main point of this paper is to find an upper bound for the constant,
Where runs over all isotropic log-concave measures and runs over all smooth enough functions with and is some universal constant.
In this note we will show that, up to a small correction, the above is implied by a seemingly weaker hypothesis.
Next, we would like to formulate the thin-shell conjecture. Let satisfy
There exists a universal constant such that,
An application of (3) with the function shows that the thin-shell conjecture is weaker than the KLS conjecture.
The first nontrivial bound for was given by Klartag in [K1], who showed that . Several improvements have been introduced around the same method, see e.g [K2] and [Fl1]. The best known bound for at the time of this note is due to Guedon and E. Milman, in [Gu-M], extending previous works of Klartag, Fleury and Paouris, who show that . The thin-shell conjecture was shown to be true for several specific classes of convex bodies, such as bodies with a symmetry for coordinate reflections (Klartag, [K3]) and certain random bodies (Fleury, [Fl2]).
It was found by Sudakov, [Sud], that the parameter is highly related to almost-gaussian behaviour of certain marginals of a convex body, a fact now known as the central limit theorem for convex sets [K1]. This theorem asserts that most of the one-dimensional marginals of an isotropic, log-concave random vector are approximately gaussian in the sense that the Kolmogorov distance to the standard gaussian distribution of a typical marginal has roughly the order of magnitude of . Therefore the conjectured bound (5) actually concerns the quality of the gaussian approximation to the marginals of high-dimensional log-concave measures. The first theorem of this note reads,
There exists a constant such that for all ,
Plugging the results of this paper into the currently best known bound for (proven in [Gu-M]), , it follows that
This slightly improves the previous bound, , which is a corollary of [Gu-M] and [Bo].
In [EK1], B. Klartag and the author have found a connection between the thin-shell hypothesis and another well known conjecture related to convex bodies, known as the hyperplane conjecture. The methods of this paper share some common lines with the methods in [EK1]. In a very recent paper of K.Ball and V.H. Nguyen, [BN], a connection between the KLS conjecture and the hyperplane conjecture that applies for individual log-concave measures has also been established. They show that the isotropic constant of a log concave measure which attains a spectral gap is bounded by a constant which depends exponentially on the spectral gap.
Compare this result with the result in [Bo]. Bobkov’s theorem states that for any log-concave random vector and any smooth function , one has
Under the thin-shell hypothesis, Bobkov’s theorem gives .
The bound in theorem 1.1 will rely on the following intermediate constant which corresponds to a slightly stronger thin shell bound. Define,
There exists a constant such that for all ,
Theorem 1.1 will be a consequence of the above lemma along with,
There exists a constant such that for all ,
The constant satisfies the following bound:
We move on to the second result of this paper, a stability result for the Brunn-Minkowski Inequality. The Brunn-Minkowski inequality states, in one of its normalizations, that
When there is an almost-equality in (7), and are almost translates of each other in a certain sense (which varies between different estimates). Estimates of this form, often referred to as stability estimates, appear in Diskant [Dis], in Groemer [Groe], and in Figalli, Maggi and Pratelli [FMP1, FMP2], Segal [Seg].
The result [FMP2], which is essentially the strongest result in its category, and other existing stability estimates share a common thing: the bounds become worse as the dimension increases. In a recent paper, [EK2], Klartag and the author suggested that the correct bounds might actually become better as the dimension increases, as demonstrated by certain results. The estimates presented here may be viewed as a continuation of this line of research.
In order to formulate our result, we define the two constants
so that . Note that the thin-shell conjecture implies and . Our main estimate reads,
For every there exists a constant such that the following holds: Let be convex bodies whose volume is and whose barycenters lie at the origin. Suppose that the covariance matrix of the uniform measure on is equal to for a constant . Denote,
It follows from theorem 1.4 in [EK2] that the above estimate is true with
If , then the result we prove here weaker than the one in [EK2]. However, under the thin shell hypothesis, the result of this paper becomes stronger, and is in fact tight up to the term . This tightness is demonstrated, for instance, by taking and to be the unit cube and a unit cube truncated by a ball of radius and normalized to be isotropic.
Using the bound in [Gu-M], the theorem gives
Note that if the assumption (9) is dropped, even if the covariance matrices of and are assumed to be equal, the best corresponding bound would be as demonstrated, for example, by a cube and a ball.
The above bound complements, in some sense, the result proven in [FMP1], which reads,
for some choice of , where denotes the symmetric difference between the sets. Unlike the result presented in this paper, the result in [FMP1] gives much more information as the expression approaches zero. On the other hand the result presented here already gives some information when .
The structure of this paper is as follows: In section 2, we construct a stochastic localization scheme which will be the main ingredient our proofs. In section 3, we establish a bound for the covariance matrix of the measure throughout the localization process, which will be essential for its applications. In section 4, we prove theorem 1.1 and in section 5 we prove theorem 1.2 and its corollaries. In section 6 we tie some loose ends.
Acknowledgements I owe this work to countless useful discussions I have had with my supervisor, Bo’az Klartag, through which I learnt the vast part of what I know about the subject, as well as about related topics, and for which I am grateful. I would also like to thank Vitali and Emanuel Milman and Boris Tsirelson for inspiring discussions and for their useful remarks on a preliminary version of this note. Finally, I would like to thank the anonymous referee for doing a tremendous job reviewing a preliminary version of this paper, thanks to his/her ideas the proofs are significantly simpler, shorter and more comprehensible.
A stochastic localization scheme
The assumption (10) ensures that , and are smooth functions of . Let be a standard Wiener process and consider the following system of stochastic differential equations:
Taking into account the fact that the functions are smooth and that is positive definite for all , we can use a standard existence and uniqueness theorem (see e.g., [Ok], section 5.2) to ensure the existence and uniqueness of a solution in some interval , where is an almost-surely positive random variable. Next, we construct a 1-parameter family of functions by defining,
so that and are the barycenter and the covariance matrix of the function .
The following lemma may shed some light on this construction.
The function satisfies the following set of equations:
Equation (11) clearly implies that . Let denote the quadratic variation of the process . We have,
Applying Itô’s formula one last time yields,
In view of the above lemma it can be seen that, in some sense, the above is just the continuous version of the following iterative process: at every time step, multiply the function by a linear function equal to at the barycenter, whose gradient has a random direction distributed uniformly on the ellipsoid of inertia. This construction may also be thought of as a variant of the Brownian motion on the Riemannian manifold constructed in [EK1].
Rather than defining the process through equations (11) and (12), one may alternatively define it directly with the infinite system of stochastic differential equations in formula (13). In this case, the existence and uniqueness of the solution can be shown using [KX, Theorem 5.2.2, page 159] (however, some extra work is needed in order to show that the conditions of this theorem hold).
In the remainder of this note, most of the calculations involving the process will use the formula (13) rather than the formulas (11) and (12). The remaining part of this section is dedicated to analyzing some basic properties of . We begin with:
In order to prove (i), we will first need the following technical lemma:
For every dimension , there exists a constant such that,
The proof of this lemma is postponed section 6. Proof of lemma 2.4: To prove (i), we have to make sure that does not blow up. To this end, define . By continuity, . Equation (12) suggests that is log-concave for all . The fact that will be proven below. We start by showing that both (ii) and (iii) hold for any . We first calculate, using (13),
with probability 1. The last equality follows from the definition of as the barycenter of the measure . We conclude (ii). We continue with proving (iii). To do this, fix some and write,
which is clearly an isotropic probability density. Let us inspect . We have, using (13),
which proves (iii). We are left with showing that . To see this, write,
where is the constant from lemma 2.5. Note that, by continuity, is well-defined and almost-surely positive. When time comes, we may define as in (15), and continue running the process on the function as above. We repeat this every time hits the value , thus generating the hitting times . Lemma 2.5 suggests that,
which implies that, almost surely, for infinitely many values of . Thus, almost surely, and so . Part (iv) follows immediately from formula (13). The lemma is proven.
where the third equality follows from the defition of , which implies,
One of the crucial points, when using this localization scheme, will be to show that the barycenter of the measure does not move too much throughout the process. For this, we would like to attain upper bounds on the eigenvalues of the matrix . We start with a simple observation: Equation (12) shows that the measure is log-concave with respect to the measure . The following result, which is well-known to experts, shows that measures which possess this property attain certain concentration inequalities.
where is the -extension of , defined in the previous section. (ii) For all ,
Proof: Denote the density of by . Let be the complement of , where the constant will be chosen later on. Define,
Note that for and , we have . Thus, by the parallelogram law,
Since the function is assumed to be convex, we obtain
Now, using the Prekopa-Leindler theorem, we attain
Clearly, a large enough choice of the constant gives (i). To prove (ii), we define,
and take . An application on (i) on the set gives,
Part (ii) of the proposition is a direct consequence of the last two equations.
Plugging (12) into part (ii) of this theorem gives,
By our assumption (10) we deduce that is bounded by , which immediately gives
The bound (18) will be far from sufficient for our needs, and the next section is dedicated to attaining a better upper bound. However, it is good enough to show that the barycenter, , converges in distribution to the density . Indeed, (18) implies that
It is interesting to compare this construction with the construction by Lehec in [Leh]. In both cases, a certain Itô process converges to a given log-concave measure. In the result of Lehec, the convergence is ensured by applying a certain adapted drift, while here, it is ensured by adjusting the covariance matrix of the process.
We end this section with a simple calculation in which we analyze the process in the simple case that is the standard Gaussian measure. While the calculation will not be necessary for our proofs, it may provide the reader a better understanding of the process. Define,
According to formula (12), the function takes the form,
Recall that . It follows that,
In the previous section we saw that the covariance matrix of the , , satisfies (18). The goal of this section is to give a better bound, which holds also for small . Namely, we want to prove:
Before we move on to the proof, we will establish some simple properties of the matrix . Our first task is to find the differential of process . We have, using Itô’s formula with equation (13),
Let us try to understand each of this terms. The second term is,
Recall that by (16), , which gives,
Plugging equations (23), (24) and (25) together gives,
The vectors satisfy the following bounds: (i) For all , for some universal constant . (ii) For all , .
Proof: Since for all , we have
Using this with (30) establishes (i). Next, by the definition of , we have for all ,
We are now ready to prove the main proposition of the section. Proof of proposition 3.1: We fix a positive integer whose value will be chosen later, and define,
Since is a smooth function of the coefficients , which are Itô processes (assuming that the basis is fixed), itself is also an Itô process. Fix some . Our next goal will be to find . To that end, define to be the set of -tuples, , such that for all and such that . It is easy to verify that,
The second type of term will contain exactly two off-diagonal entries, and due to the symmetry of the matrix and the constraint , it has the form:
where and . Keeping in mind that , we calculate,
We may clearly assume , which implies that for and for all values of , one has
Inspect the equation (32). For every , the expansion on the right hand side contains exactly one term of the first type, and for every distinct with , it contains terms of the second type (or otherwise, for all choices such that , it contains terms of this type). Using (33) and (34), we conclude
where in the last inequality we used the part (ii) of lemma 3.2. A well-known property of Itô processes is existence and uniqueness of the decomposition , where is a local martingale and is an adapted process of locally bounded variation. In the last equation, we attained,
Next, we use the unique decomposition where is a local martingale, is an adapted process of locally bounded variation and . According to Itô’s formula and formula (36),
Choosing to be a large enough universal constant, , yields
(where we used the fact that ). Using (37), we attain
for some universal constant . We now use Itô’s formula again, this time with formula (35), to get
The last two equations and the legitimate assumption that give,
for some universal constant . Define the event as the complement of the event in the equation above,
Clearly, whenever the event holds, we have,
Our next task is to bound the norm for larger values of . To this end, recall the bound (17). Recalling that , and applying (17) gives,
By the definition of and by (38), it follows that whenever holds one has,
where . Equations (39) and (40) imply,
Proposition 2.6 gives an immediate corollary to part (iii) of proposition 3.1:
where is the -extension of , defined in the introduction.
Thin shell implies spectral gap
Our goal in this section is to show that,
for some universal constants , where and is the -extension of . The idea is quite simple. Define , the localization of constructed in section 2, and fix . By the martingale property of the localization, we have,
Corollary 3.3 suggests that if is large enough, the right term can be bounded from below if we only manage to bound the integral away from 0 and from 1. Define,
In view of the above, we would like to prove:
There exists a universal constant such that,
Define . By Itô’s formula,
Plugging the last two equations together gives,
The lemma follows from an application of Chebyshev’s inequality.
The last ingredient needed for our proof is a theorem of E. Milman, [Mil2, Theorem 2.1]. The following is a weaker formulation of this theorem which will be suitable for us:
Note that equation (48) is the exact type of inequality defining the constant in equation (2). We are now ready to prove the main proposition of this section. Proof of proposition 1.7: Let be the constant from lemma 4.1. Denote,
The result now follows directly from an application of theorem 4.2.
In the above proof, we used E. Milman’s result in order to reduce the theorem to the case where is exactly , as well as to attain an isoperimetric inequality from a certain concentration inequality for distance functions. Alternatively, we may have replaced propsition 2.6 with an essentially stronger result due to Bakry-Emery, proven in [BE] (see also Gross, [Gros1]). Their result, which relies on the hypercontractivity principle, asserts that a density of the form (12) actually possesses a respective Cheeger constant. Using this fact, we may have directly bounded from below the surface area of any set with respect to the measure whose density is .
The proof of lemma 1.6 is in section 6. Along with this lemma, we have established theorem 1.1.
Stability of the Brunn-Minkowski inequality
The following lemma is a variant of lemma 6.5 from [EK2].
and let be the eigenvalues of such that the order of is decreasing. Then,
where are universal constants.
Our main ideas in this section are contained in the following lemma:
Proof: As explained in the beginning of the section, we will couple between the measures and in means of coupling between the processes and . To that end, we define, as in (13),
is the covariance matrix of . As usual denote . Next, we define,
and denote . Finally, we ”interpolate” between the two processes by defining,
where is the same constant as in (20). Finally, denote . By proposition 3.1 and equation (54), for all . Define a stopping time by the equation,
Now, thanks to formula (19), we can take large enough (and deterministic) such that,
is the covariance matrix of . Therefore,
The first two terms are martingale. We use the unique decomposition
where is a local martingale and is an adapted process of locally bounded variation. We get,
where . Our next task is to use lemma 5.1 to bound under the assumption f. We start by denoting the eigenvalues of the matrix by , in decreasing order, and the eigenvalues of the matrix by , also in decreasing order. By theorem 1 in [T],
Fix some constant , whose value will be chosen later. For now, we assume that . Using Hölder’s inequality, we calculate,
where . Recall that , which gives,
Take such that . Equations (61) and (62) give,
Finally, using equations (56) and (58), we conclude,
In the above lemma, if we replace the assumption that is isotropic by the assumption that are log-concave with respect to the Gaussian measure, then following the same lines of proof while using proposition 2.6, one may improve the bound (52) and get,
We move on to the proof of theorem 1.2. Proof of theorem 1.2: Let be convex bodies of volume such that the covariance matrix of is . Fix . Define,
so both and are probability measures and is isotropic. We have,
where . Denote,
It follows from Markov’s inequality and from (64) and (65) that,
Finally, taking gives
Tying up loose ends
Let and let be a subspace of dimension . Denote and . By definition of ,
Using the last inequality and applying Cauchy-Schwartz gives,
Next, in order to provide the reader with a better understanding of the constant , we introduce two new constants. First, define
where the supremum runs over all isotropic log-concave random vectors, , and all quadratic forms . Next, define
where runs over all isotropic log-concave measures and runs over all ellipsoids with .
There exist universal constants such that
where runs over all symmetric matrices. Let be a symmetric matrix. Fix coordinates under which is diagonal, and write and . Define . We have,
We suspect that there exists a universal constant such that , but we are unable to prove that assertion.
We move on to the proof of lemma 2.5. Proof of lemma 2.5: Throughout the proof, all the constants may depend only on the dimension . Recall that is assumed to be isotropic and log-concave. It is well-known that there exist two constants , such that
(see for example [LV, Theorem 5.14]). Define . It is also well-known (see for example [LV, Lemma 5.7]) that there exist two constants such that
which implies that whenever and is positive semi-definite,
It follows that for all and (in the sense of positive matrices), one has
for some constant . Define the stopping times,
so the lemma would be concluded if we manage to show that
for some constant . Define the event . Whenever holds, we have the following: First, using (67),
Recall that . It follows that
By taking in the last equation, we learn that whenever holds, so
Therefore, it is enough to prove that for some . Furthermore, in the following we are able to assume that . To that end, consider the defining equation (11) and use Itô’s formula to attain
Define the process by the equations,
Using (68), we deduce that whenever , one has
Define . Note that, by (71),
Another application of (68), this time with the assumption (10) gives,
for all , and for a constant . By plugging (73) and (74) into (70), we learn that whenever holds, one has