Local Circular Law for Random Matrices

Paul Bourgade, Horng-Tzer Yau, Jun Yin

Introduction

A considerable literature about random matrices focuses on Hermitian or symmetric matrices with independent entries. These models are paradigms for local eigenvalues statistics of many random Hamiltonians, as envisioned by Wigner. The study of non-Hermitian random matrices goes back to Ginibre, then in Princeton and motivated by Wigner. Ginibre’s viewpoint on the problem was described as follows [Gin1965]:

Apart from the intrinsic interest of the problem, one may hope that the methods and results will provide further insight in the cases of physical interest or suggest as yet lacking applications.

This phenomenon is the non-Hermitian counterpart of the semicircular law for Wigner random Hermitian matrices, and the quarter circular limit for Marchenko-Pastur random covariance matrices.

In the case of real Gaussian entries, the join distribution of the eigenvalues is more complicated but still integrable, allowing Edelman [Ede1997] to prove the limiting circular law as well; for more precise asymptotic properties of the real Ginibre ensemble, see [ForNag2007, Sin2007, BorSin2009]. We note also that the (right) eigenvalues of the quaternionic Ginibre ensemble were recently shown to converge to a (non-uniform) measure on the unit ball of the quaternions field [BenCha2011].

From this formula, it is clear that the small eigenvalues of the Hermitian matrix (Xz)(Xz)(X^{*}-z^{*})(X-z) play a special role due to the logarithmic singularity at . The key question is to estimate the smallest eigenvalues of (Xz)(Xz)(X^{*}-z^{*})(X-z), or in other words, the smallest singular values of (Xz)(X-z). This problem was not treated in [Gir1984], but the gap was remedied in a series of papers. First Bai [Bai1997] was able to treat the logarithmic singularity assuming bounded density and bounded high moments for the entries of the matrix (see also [BaiSil2006]). Lower bounds on the smallest singular values were given in Rudelson, Vershynin [Rud2008, RudVer2008], and subsequently Tao, Vu [TaoVu2008], Pan, Zhou [PanZho2010] and Götze, Tikhomirov [GotTik2010] weakened the moments and smoothness assumptions for the circular law, till the optimal \mboxL2\mbox{L}^{2} assumption, under which the circular law was proved in [TaoVuKri2010].

The purpose of this paper is to prove a local version of the circular law, up to the optimal scale N1/2+εN^{-1/2+{\varepsilon}} (see Section 2 for a precise statement). Below this scale, detailed local statistics will be important and that is beyond the scope of the current paper. The main tool of this paper is a detailed analysis of the self-consistent equations of the Green functions

Finally, we remark that the local circular law demonstrates that the eigenvalue distribution in the unit disk is extremely “uniform”. If the eigenvalues are distributed in the unit disk by a uniform statistics or any other statistics with summable decay of correlations, then there will be big holes or some clusterings of eigenvalues in the disk. While the usual circular law does not rule out these phenomena, the local law established in this paper does. This implies that the eigenvalue statistics cannot be any probability laws with summable decay of correlations

The local circular law

We first introduce some notations. Let XX be an N×NN\times N matrix with independent centered entries of variance N1N^{-1}. The matrix elements can be either real or complex, but for the sake of simplicity we will consider real entries in this paper. Denote the eigenvalues of XX by μj\mu_{j}, j=1,,Nj=1,\ldots,N. We will use the following notion of stochastic domination which simplifies the presentation of the results and their proofs.

Let W=(WN)N1W=(W_{N})_{N\geqslant 1} be family a random variables and Ψ=(ΨN)N1\Psi=(\Psi_{N})_{N\geqslant 1} be deterministic parameters. We say that WW is stochastically dominated by Ψ\Psi if for any σ>0\sigma>0 and D>0D>0 we have

for sufficiently large NN. We denote this stochastic domination property by

In this paper, we will assume that the probability distributions for the matrix elements have the uniform subexponential decay property, i.e.,

for some constant ϑ>0\vartheta>0 independent of NN. This condition can of course be weakened to an hypothesis of boundedness on sufficiently high moments, but the error estimates in the following Theorem would be weakened as well. We now state our local circular law, which holds up to the optimal scale N1/2+εN^{-1/2+{\varepsilon}}.

Let XX be an N×NN\times N matrix with independent centered entries of variance N1N^{-1}. Suppose that the probability distributions of the matrix elements satisfy the uniformly subexponentially decay condition (2.1). We assume that for some fixed τ>0\tau>0, for any NN we have τz01τ1\tau\leqslant||z_{0}|-1|\leqslant\tau^{-1} (z0z_{0} can depend on NN). Let ff be a smooth non-negative function which may depend on NN, such that fC\|f\|_{\infty}\leqslant C, fNC\|f^{\prime}\|_{\infty}\leqslant N^{C} and f(z)=0f(z)=0 for zC|z|\geqslant C, for some constant CC independent of NN. Let fz0(z)=N2af(Na(zz0))f_{z_{0}}(z)=N^{2a}f(N^{a}(z-z_{0})) be the approximate delta function obtained from rescaling ff to the size order NaN^{-a} around z0z_{0}. We denote by DD the unit disk. Then for any a(0,1/2]a\in(0,1/2],

Hermitization and local Green function estimate

In the following, we will use the notation

where II is the identity operator. Let λj(z)\lambda_{j}(z) be the jj-th eigenvalue (in the increasing ordering) of YzYzY^{*}_{z}Y_{z}. We will generally omit the zz-dependence in these notations. Thanks to the Hermitization technique of Girko [Gir1984], the first step in proving the local circular law is to understand the local statistics of eigenvalues of YzYzY^{*}_{z}Y_{z}, for zz strictly inside the unit circle. In this section, we first recall some well-known facts about the Stieltjes transform of the empirical measure of eigenvalues of YzYzY^{*}_{z}Y_{z}. We then present the key estimate concerning the Green function of YzYzY^{*}_{z}Y_{z} in almost optimal spectral windows. This result will be used later on to prove a local version of the circular law.

. Define the Green function of YzYzY^{*}_{z}Y_{z} and its trace by

We will also need the following version of the Green function later on:

As we will see, with high probability m(w,z)m(w,z) converges to mc(w,z)m_{\rm c}(w,z) pointwise, as NN\to\infty where mc(w,z)m_{\rm c}(w,z) is the unique solution of

with positive imaginary part (see Section 3 in [GotTik2010] for the existence and uniqueness of such a solution). The limit mc(w,z)m_{\rm c}(w,z) is the Stieltjes transform of a density ρc(x,z)\rho_{\rm c}(x,z) and we have

whenever η>0\eta>0. The function ρc(x,z)\rho_{\rm c}(x,z) is the limiting eigenvalue density of the matrix YzYzY^{*}_{z}Y_{z} (cf. Lemmas 4.2 and 4.3 in [Bai1997]). Let

Note that λ\lambda_{-} has the same sign as z1|z|-1. The following two propositions summarize the properties of ρc\rho_{\rm c} and mcm_{\rm c} that we will need to understand the main results in this section. They will be proved in Appendix A. In the following, we use the notation ABA\sim B when cBAc1BcB\leqslant A\leqslant c^{-1}B, where c>0c>0 is independent of NN.

The limiting density ρc\rho_{\rm c} is compactly supported and the following properties regarding ρc\rho_{\rm c} hold.

The support of ρc(x,z)\rho_{\rm c}(x,z) is [max{0,λ},λ+][\max\{0,\lambda_{-}\},\lambda_{+}].

As xλ+x\to\lambda_{+} from below, the behavior of ρc(x,z)\rho_{\rm c}(x,z) is given by ρc(x,z)λ+x.\rho_{\rm c}(x,z)\sim\sqrt{\lambda_{+}-x}.

For any ε>0{\varepsilon}>0, if max{0,λ}+εxλ+ε\max\{0,\lambda_{-}\}+{\varepsilon}\leqslant x\leqslant\lambda_{+}-{\varepsilon}, then ρc(x,z)1\rho_{\rm c}(x,z)\sim 1.

Near max{0,λ}\max\{0,\lambda_{-}\}, the behavior of ρc(x,z)\rho_{\rm c}(x,z) can be classified as follows.

If z1+τ|z|\geqslant 1+\tau for some fixed τ>0\tau>0, then λ>ε(τ)>0\lambda_{-}>{\varepsilon}(\tau)>0 and ρc(x,z)\mathds1x>λxλ\rho_{\rm c}(x,z)\sim\mathds{1}_{x>\lambda_{-}}\sqrt{x-\lambda_{-}}.

If z1τ|z|\leqslant 1-\tau for some fixed τ>0\tau>0, then λ<ε(τ)<0\lambda_{-}<-{\varepsilon}(\tau)<0 and ρc(x,z)1/x\rho_{\rm c}(x,z)\sim 1/\sqrt{x}.

All of the estimates in this proposition are uniform in z<1τ|z|<1-\tau, or τ1z1+τ\tau^{-1}\geqslant|z|\geqslant 1+\tau for fixed τ>0\tau>0.

The preceding Proposition implies that, uniformly in ww in any compact set,

Moreover, the following estimates on mc(w,z)m_{\rm c}(w,z) hold.

If z1+τ|z|\geqslant 1+\tau for some fixed τ>0\tau>0, then mc1m_{\rm c}\sim 1 for ww in any compact set.

If z1τ|z|\leqslant 1-\tau for some fixed τ>0\tau>0, then mcw1/2m_{\rm c}\sim|w|^{-1/2} for ww in any compact set.

2 Concentration estimate of the Green function up to the optimal scale.

We now state precisely the estimate regarding the convergence of mm to mcm_{\rm c}. Since the matrix YzYzY^{*}_{z}Y_{z} is symmetric, we will follow the approach of [ErdYauYin2010Adv]. We will use extensively the following definition of high probability events.

Let ζ>0\zeta>0. We say that an NN-dependent event Ω\Omega holds with ζ\zeta-high probability if there is some constant CC such that

For α0\alpha\geqslant 0, define the zz-dependent set

Suppose τz1τ1\tau\leqslant||z|-1|\leqslant\tau^{-1} for some τ>0\tau>0 independent of NN. Then for any ζ>0\zeta>0, there exists Cζ>0C_{\zeta}>0 such that the following event holds with ζ\zeta-high probability:

Moreover, the individual matrix elements of the Green function satisfy, with ζ\zeta-high probability,

For z<1|z|<1, we have λ<0\lambda_{-}<0 (see Proposition 3.1), so in this case we define κ:=Eλ+\kappa:=|E-\lambda_{+}|.

There exists τ0>0\tau_{0}>0 such that for any ττ0\tau\leqslant\tau_{0} if z1τ|z|\leqslant 1-\tau and wτ1|w|\leqslant\tau^{-1} then the following properties concerning mcm_{\rm c} hold. All constants in the following estimates depend on τ\tau.

Eλ+E\geqslant\lambda_{+} and wλ+τ|w-\lambda_{+}|\geqslant\tau. We have

wλ+τ|w-\lambda_{+}|\leqslant\tau (Notice that there is no restriction on whether Eλ+E\leqslant\lambda_{+} or not ). We have

wτ|w|\geqslant\tau, wλ+τ|w-\lambda_{+}|\geqslant\tau and Eλ+E\leqslant\lambda_{+}. We have

Here Case 1 covers the regime where Eλ+E\geqslant\lambda_{+} and ww is far away from λ+\lambda_{+}. Case 2 concerns the regime that ww is near λ+\lambda_{+}, while Case 3 is for ww is near the origin. Finally Case 4 is for ww not covered by the first three cases.

There exists τ0>0\tau_{0}>0 such that for any ττ0\tau\leqslant\tau_{0}, if z1+τ|z|\geqslant 1+\tau and wτ1|w|\leqslant\tau^{-1} then the following properties concerning mcm_{\rm c} hold. All constants in the following estimates depend on τ\tau. Recall from (3.2) that λ=(α3)38(α1)>0\lambda_{-}=\frac{(\alpha-3)^{3}}{8(\alpha-1)}>0.

Eλ+E\geqslant\lambda_{+} and wλ+τ|w-\lambda_{+}|\geqslant\tau. We have

EλE\leqslant\lambda_{-} and wλτ|w-\lambda_{-}|\geqslant\tau. We have

wτ|w|\geqslant\tau, wλ+τ|w-\lambda_{+}|\geqslant\tau and λEλ+\lambda_{-}\leqslant E\leqslant\lambda_{+}. We have

Here Case 1 covers the regime Eλ+E\geqslant\lambda_{+} and ww is far away from λ+\lambda_{+}. Case 2 concerns the regime EλE\leqslant\lambda_{-} and ww is far away from λ\lambda_{-}. Case 3 is for ww near λ±\lambda_{\pm}. Finally Case 4 is for ww not covered by the first three cases.

The following lemma concerns the two cases covered in Lemmas 4.1 and 4.2, i.e., zz is either strictly inside or outside of the unit disk.

There exists τ0>0\tau_{0}>0 such that for any ττ0\tau\leqslant\tau_{0} if either the conditions z1τ|z|\leqslant 1-\tau and wτ1|w|\leqslant\tau^{-1} hold or the conditions z1+τ|z|\geqslant 1+\tau, wτ1|w|\leqslant\tau^{-1}, Reωλ/5\operatorname{Re}\omega\geqslant\lambda_{-}/5 hold, then we have the following three bounds concerning mcm_{\rm c} (all constants in the following estimates depend on τ\tau):

Proof of Theorem 2.2, local circular law in the bulk

Our main tool in this section will be Theorem 3.4, which critically uses the hypothesis z1τ||z|-1|\geqslant\tau: when zz is on the unit circle the self-consistent equation (which is a fixed point equation for the function g(m)=(1+wm(1+m)2)/(z21)g(m)=(1+wm(1+m)^{2})/(|z|^{2}-1) see (6.21) later in this paper) becomes unstable

We follow Girko’s idea [Gir1984] of Hermitization, which can be reformulated as the following identity (see e.g. [GuiKriZei2009]): for any smooth FF

We will use the notation z=z(ξ)=z0+Naξz=z(\xi)=z_{0}+N^{-a}\xi. Choosing F=fz0F=f_{z_{0}} defined in Theorem 2.2 and changing the variable to ξ\xi, we can rewrite the identity (5.1) as

Recall that λj(z)\lambda_{j}(z)’s are the ordered eigenvalues of YzYzY_{z}^{*}Y_{z}, and define γj(z)\gamma_{j}(z) as the classical location of λj(z)\lambda_{j}(z), i.e.

Thanks to Proposition 3.1, one can check that uniformly in z<1τ|z|<1-\tau, and also in the domain 1+τzτ11+\tau\leqslant|z|\leqslant\tau^{-1} (τ>0\tau>0), for any δ>0\delta>0 we have

It is known, by Lemma 4.4 of [Bai1997], that

Combining (5.4) and (5.5), we have proved (2.2) provided that we can prove (5.3). To prove (5.3), we need the following rigidity estimate which is a consequence of Theorem 3.4.

Suppose τz1τ1\tau\leqslant||z|-1|\leqslant\tau^{-1} for some τ>0\tau>0 independent of NN. Then for any ζ>0\zeta>0, there exists Cζ>0C_{\zeta}>0 such that the following event holds with ζ\zeta-high probability: for any φCζ<j<NφCζ\varphi^{C_{\zeta}}<j<N-\varphi^{C_{\zeta}} we have

First, with (3.5) and the definition (3.4), for any ζ\zeta there exists Cζ>0C_{\zeta}>0 such that

holds with with ζ\zeta-high probability. It also implies that for η=φCζN1mc1\eta=\varphi^{C_{\zeta}}N^{-1}|m_{\rm c}|^{-1},

and define ηj0\eta_{j}\geqslant 0 as the smallest positive solution of

To prove (5.6), we choose qq to be supported in [E1,E2][E_{1},E_{2}] such that q(x)=1q(x)=1 if x[E1+η1,E2η2]x\in[E_{1}+\eta_{1},E_{2}-\eta_{2}] and qC(ηi)1|q^{\prime}|\leqslant C(\eta_{i})^{-1}, qC(ηi)2|q^{\prime\prime}|\leqslant C(\eta_{i})^{-2} if xEiηi|x-E_{i}|\leqslant\eta_{i}. We now claim that

Combining (5.12) and (5.11), we have for any 1jN1\leqslant j\leqslant N,

which implies (5.6) with CζC_{\zeta} in (5.6) replaced by 2Cζ2C_{\zeta}.

It remains to prove (5.12). Since qq and χ\chi are real, with Δm=mmc\Delta m=m-m_{\rm c}

using (3.5) and that on the support of χ\chi^{\prime} is in 1η1/21\geqslant|\eta|\geqslant 1/2.

For the second term in the r.h.s. of (5.13), with qCηi2|q^{\prime\prime}|\leqslant C\eta_{i}^{-2}, (5.9) and (5.10), we obtain

We now integrate the third term in (5.13) by parts first in EE, then in η\eta (and use the Cauchy-Riemann equation EIm(Δm)=ηRe(Δm)\frac{\partial}{\partial E}\operatorname{Im}(\Delta m)=-\frac{\partial}{\partial\eta}\operatorname{Re}(\Delta m)) so that

We therefore can bound the third term in (5.13) with absolute value by

where the last term can be bounded as the first term in r.h.s. of (5.13). By using (5.9) we have

where we used ηiNC\eta_{i}\geqslant N^{-C}. Together with (5.14) and (5.15), we obtain (5.12) and complete the proof of (5.6).

Now we prove (5.7). Using (5.2) and Proposition 3.1, we have

Combining (5.18) with (5.6), we obtain (5.7).

For (5.8), the proof is similar to the above reasoning, but simpler: in this case γj1\gamma_{j}\sim 1 for jN/2j\leqslant N/2. For jN/2j\geqslant N/2, γj\gamma_{j} is bounded as (5.17), and one can check if 1+τzτ11+\tau\leqslant|z|\leqslant\tau^{-1}, Proposition 3.1, we have

We return to the proof of the local circular law, Theorem 2.2. We now only need to prove (5.3) from Lemma 5.1. From (5.7) and (5.8), we have

Notice that, for large enough CC, there is a constant c>0c>0 such that for any jj we have

with probability larger than 1exp(Nc)1-\exp({{-N^{c}}}) (for this elementary fact, one can for example see that the entries of XX are smaller that 11 with probability greater than 1ϑ1eNϑ1-\vartheta^{-1}e^{-N^{\vartheta}} by the subexponential decay assumption (2.1) and then use λj=TrYY\sum\lambda_{j}=\operatorname{Tr}Y^{*}Y), so together with the above bounds on logλj(z)logγj(z)\left|\log\lambda_{j}(z)-\log\gamma_{j}(z)\right| this proves that for any ζ>0\zeta>0, there exists Cζ>0C_{\zeta}>0 such that

with ζ\zeta-high probability. Furthermore, one can see that or estimates hold uniformly for zz’s in this region.

On the other hand, the following important Lemma 5.2 holds, concerning the smallest eigenvalue. It implies that

holds uniformly for zz in any fixed compact set. It is easy to check that for any δ>0\delta>0, for large enough NN,

Hence we can extend the summation in (5.19) to all j1j\geqslant 1, which gives (5.3) and completes the proof of Theorem 2.2.

Under the same assumptions of Theorem 2.2,

holds uniformly for zz in any fixed compact set.

where uk(z)u_{k}(z) is a unit vector independent of X(z)ekX(z)e_{k}. By conditioning on uk(z)u_{k}(z), the result of this lemma is straightforward since the matrix entries have a density. ∎

Weak local Green function estimate

In this section, we make a first step towards Theorem 3.4, with a weaker version of it, stated hereafter.

This theorem will be proved in the subsequent subsections.

By definition, m(,)=mm^{(\emptyset,\emptyset)}=m. Since the eigenvalues of YYY^{*}Y and YYYY^{*} are the same except the zero eigenvalue, it is easy to check that

For i,jki,j\neq k ( i=ji=j is allowed) we have

By the row-column reflection symmetry, we only need to prove those formulas involving GG. We first prove (6.4). In [ErdYauYin2010PTRF]-[ErdYauYin2010Adv], was proved a lemma concerning Green functions of matrices and their minors. This lemma is stated as Lemma B.2 in Appendix B. Let

We now prove (6.5). Recall the rank one perturbation formula

where v{\bf{v}} is a row vector and v{\bf{v}}^{*} is its Hermitian conjugate. Together with

By (6.3), (6.10) holds for mG(i,)m_{\mathcal{G}}^{(i,\emptyset)} as well. Similar arguments can be used to prove (6.6) for mG(i,j)m_{\mathcal{G}}^{(i,j)}, mG(i,j)m_{G}^{(i,j)} and the general cases. This completes the proof of Lemma 6.3. ∎

The next step is to derive equations between the matrix and its minors. The main results are stated as the following Lemma 6.5. We first need the following definition.

By the row-column reflection symmetry, we only need to prove the GG part of this lemma. Furthermore, for simplicity, we prove the case T=T=\emptyset, the general case can be proved in the same way.

We first prove (6.11). Let H=YYH=Y^{*}Y. Similarly to (6.7) and (6.8), we define G[i]G^{[i]} and H[i]H^{[i]}. Then using (B.2) and (6.9), we have

From the definition of HH, we have hik=yiykh_{ik}={\bf{y}}_{i}^{*}{\bf{y}}_{k}. Then

We now prove (6.12). As above, using now (B.3), we have

Then using (6.16) again, we obtain (6.12). ∎

2 The self-consistent equation and its stability.

We now derive the self-consistent equation for m(w)m(w) and its stability estimates. Following [ErdYauYin2010Adv], we introduce the following control parameter:

The quantity mc1Ψ|m_{\rm c}|^{-1}\Psi will be our controlling small parameter in this paper.

Suppose z1τ|z|\leqslant 1-\tau for some τ>0\tau>0. Then there exists a small constant α>0\alpha>0 independent of NN such that if the estimate

holds for some wC|w|\leqslant C on a set AA in the probability space of matrix elements for XX, then in the set AA we have with ζ\zeta-high probability

By (4.15), (4.16) and (6.20), for z1t|z|\leqslant 1-t the following inequalities hold on the set AA:

Furthermore, using (6.22), (4.15), (4.16), (6.20) and (3.1), we have in the set AA

Moreover, we have from (C.1) that with ζ\zeta-high probability in AA

where we have used (6.22) and mcw1/2|m_{c}|\sim|w|^{-1/2}. Together with (6.23), we thus have with ζ\zeta-high probability

Using this estimate, (6.6) and (6.29), we can estimate Zi:=Zi()\mathcal{Z}_{i}:=\mathcal{Z}_{i}^{(\emptyset)} by

We can now use (6.32), (6.29) and (6.6) to estimate the right hand side of (6.11) such that

where E1\mathcal{E}_{1} and Zi\mathcal{Z}_{i} are bounded in (6.30) and (6.32) and E2\mathcal{E}_{2} is bounded by

In the last inequality, we have used (6.24) to bound 1+mz2w(1+m)1+m-\frac{|z|^{2}}{w(1+m)} and (4.15) for mcm_{\rm c}.

Summing over the index ii in (6.34), we have

Together with the assumption (6.20) on Λ\Lambda and (4.15) on the order of mcm_{\rm c}, this proves (6.21). ∎

Comparing (6.38) with (6.34), we have proved (6.37).

We will omit AA in the following argument.

We define for any sequence AiA_{i} (1iN1\leqslant i\leqslant N) the quantity

In application, we often use A=ZA=Z or A=ZA=\mathcal{Z}. Define

The following lemma is our stability estimate for the equation D(m)=0\mathcal{D}(m)=0. Notice that it is a deterministic result. It assumes that D(m)|\mathcal{D}(m)| has a crude upper bound and then derives a more precise estimate on Λ=mmc\Lambda=|m-m_{c}|.

Suppose that, for a fixed EE with 0EC0\leqslant E\leqslant C for some constant CC independent of NN, (6.20) and the estimate

The three upper bounds (i.e., the first inequalities in (6.42)-(6.44)) can be summarized as

By definition of Υ\Upsilon (6.21), we have

By definition of Pw,zP_{w,z}, we can express AA and BB by

Case 1: In this case, we claim that the following estimates concerning AA and BB hold:

Since AA and BB are explicit functions of mcm_{\rm c}, equation (6.46) is just properties of the solution mcm_{\rm c} of the third order polynomial Pw,z(m)P_{w,z}(m). We now give a sketch of the proof. Consider first the case w1|w|\ll 1. Then (6.46) follows from (4.15), (4.16), (4.9) and the definitions of AA and BB.

We now assume that w1w\sim 1 . Clearly, BO(1)w1/2|B|\leqslant\operatorname{O}(1)\sim|w^{1/2}|, which gives (6.46) for BB. To prove AC/M|A|\geqslant C/M, by definition of mcm_{\rm c} (3.1), we have w=1mc+mcz2mc(1+mc)2w=\frac{-1-m_{\rm c}+m_{\rm c}|z|^{2}}{m_{\rm c}(1+m_{\rm c})^{2}}. Thus we can rewrite AA as

By (4.15) and (4.17) (where α=1+8z2\alpha=\sqrt{1+8|z|^{2}}), we obtain (6.46).

We now prove (6.42) by contradiction. If (6.42) is violated then with u=mmcu=m-m_{c} we have

where MM is a large constant in the last inequality. By (6.41) and (4.15), ΥCδ/w|\Upsilon|\leqslant C\delta/|w|. Thus we have

which is a contradiction provided that MM is large enough.

Case 2: ε2:=κ+η1/M2{\varepsilon}^{2}:=\kappa+\eta\leqslant 1/M^{2}. Note in this case w1w\sim 1. Then by (4.3) we have

where the last equation can be checked by direct computation and we used z2<1t<1|z|^{2}<1-t<1. There is a more intrinsic reason why the last equation for AA holds. Notice that λ+\lambda_{+} is a point that the polynomial Pw,z(m)w=λ+P_{w,z}(m)|_{w=\lambda_{+}} has a double root. Therefore, we have 0=Pw,z(mc(λ+,z))=A(λ+,z)0=P^{\prime}_{w,z}(m_{\rm c}(\lambda_{+},z))=A(\lambda_{+},z).

Notice that in the case κ+η\kappa+\eta is small enough, we can approximate A(w,z)A(w,z) by linearizing w.r.t. w=λ+w=\lambda_{+}. Thus by the defining equation Pw,z(mc(λ+,z))=A(λ+,z)P^{\prime}_{w,z}(m_{\rm c}(\lambda_{+},z))=A(\lambda_{+},z), we have

where we have used that Pw,z(mc(λ+,z))=B(λ+,z)1P_{w,z}^{\prime\prime}(m_{\rm c}(\lambda_{+},z))=B(\lambda_{+},z)\sim 1, Pw,zw(mc(λ+,z))1\frac{\partial P_{w,z}}{\partial w}(m_{\rm c}(\lambda_{+},z))\sim 1 and, by (4.3), that (mc(w,z)mc(λ+,z))κ+η(m_{\rm c}(w,z)-m_{\rm c}(\lambda_{+},z))\sim\sqrt{\kappa+\eta}. While we can also check the conclusion of (6.48) by direction computation, the current derivation provides a more intrinsic reason why it is correct.

Case 2a: Suppose (6.43) is violated. We first choose MM large enough so that mc(1+mc)M1/4|m_{\rm c}(1+m_{\rm c})|\leqslant M^{1/4} in this regime. Then by (6.47) and (6.48), with w1w\sim 1, we have

which is a contradiction provided that MM is large enough. Here we have used that, by the restriction of ε{\varepsilon} and δ\delta in (6.43) that εM3/4δ{\varepsilon}\geqslant M^{3/4}\sqrt{\delta}, MM is large enough constant and δ1\delta\ll 1.

Case 2b: Suppose (6.44) is violated. Similarly we have

which is a contradiction. Here we have used, by the restriction of ε{\varepsilon} and δ\delta in (6.44) and MM is large enough constant, that C2εC2M3/4δMδ/20C_{2}{\varepsilon}\leqslant C_{2}M^{3/4}\sqrt{\delta}\leqslant M\sqrt{\delta}/20. ∎

With a slighter strong condition on δ\delta and an initial estimate Λ1\Lambda\ll 1 when η1\eta\sim 1, the first inequalities in (6.42)-(6.44), i.e., (6.45), always hold. We state this as the following Corollary, which is a deterministic statement.

Suppose that the assumptions of Lemma 6.9 hold. If we have

Suppose that as η\eta decreases, we get to Case 2a. Notice that when we decrease η\eta, by the conditions on ε{\varepsilon} we will not go back to Case 1 from either Case 2a or Case 2b. For any ε1/M{\varepsilon}\leqslant 1/M with MM large, we have

Hence at the transition point from Case 1 to Case 2a, the inequality Λ(E+iη)Mδε\Lambda(E+i\eta)\leqslant\frac{M\delta}{{{\varepsilon}}} holds. Thus by continuity of Λ\Lambda, the bound Λ(E+iη)Mδε\Lambda(E+i\eta)\leqslant\frac{M\delta}{{{\varepsilon}}} in (6.44) holds until we leave Case 2a.

It is possible that we cross from Case 2a to Case 2b. At the transition point, we have δ=ε2M3/2\delta=\frac{{\varepsilon}^{2}}{M^{3/2}} and thus

for MM large. Hence the first inequality of Case 2b, i.e., ΛMδ\Lambda\leqslant M\sqrt{\delta} holds. By continuity, this bound continues to hold unless we leave Case 2b. Since δ\delta is decreasing in η\eta when ε{\varepsilon} is small, once we get to Case 2b, we will not go back to Case 2a (or Case 1 as explained before).

It is possible that the Case 2a is omitted and we get to Case 2b directly from Case 1. Notice that ε=1/M{\varepsilon}=1/M at such a transition point and we have w1|w|\sim 1. Furthermore, by (6.40), we get δ1/logN\delta\leqslant 1/\log N at the transition point. Putting these together, we have for MM large,

Hence the bound Λ(E+iη)Mδ\Lambda(E+i\eta)\leqslant M\sqrt{\delta} in (6.44) holds. ∎

3 The large η𝜂\eta case.

Our method to estimate the Green functions and the Stieltjes transform is to fix the energy EE and apply a continuity argument in η\eta by first showing that the crude bound in Lemma 6.9 holds for large η\eta. In order to start this scheme, we need to establish estimates on the Green functions when η=O(1)\eta=\operatorname{O}(1). This is the main focus of this subsection. We start with the following lemma which provide a crude bound on the Green functions.

For any wS(0)w\in{\rm S}(0) and η>c>0\eta>c>0 for fixed cc, we have the bound

for some C>0C>0. Notice that this bound is deterministic and is independent of the randomness.

The main result of this subsection is the following bound on Λ\Lambda.

For any ζ>0\zeta>0 and ε>0{\varepsilon}>0, we have

From (6.25)-(6.27), for η=O(1)\eta=\operatorname{O}(1) we have

From (6.49), we have Gij+Gijη1O(1)|G_{ij}|+|\mathcal{G}_{ij}|\leqslant\eta^{-1}\leqslant\operatorname{O}(1) and mG(i,i)O(1)|m_{G}^{(i,i)}|\leqslant\operatorname{O}(1). Hence the large deviation estimate (6.27) becomes, with ζ\zeta-high probability,

By an argument similar to the one used in (6.51), we can estimate Zi\mathcal{Z}_{i} by

for any ε>0{\varepsilon}>0 with ζ\zeta-high probability. This implies that, with ζ\zeta-high probability,

For any η\eta fixed, we claim that the following inequality between the real and imaginary parts of mm holds:

Assume that Immc(logN)1\operatorname{Im}m\leqslant c(\log N)^{-1}. From (6.53), we have mc(logN)1/2|m|\leqslant c(\log N)^{-1/2}. Together with Imw=η1\operatorname{Im}w=\eta\sim 1,

for some constant CC. This contradicts mc(logN)1/2|m|\leqslant c(\log N)^{-1/2} and we can thus assume that Immc(logN)1\operatorname{Im}m\geqslant c(\log N)^{-1} when η1\eta\sim 1 and w=O(1)w=\operatorname{O}(1). In this case, we also have

Then (6.52) implies for any ε>0{\varepsilon}>0 that with ζ\zeta-high probability

Summing up all ii, we have the following equation for mm with ζ\zeta-high probability:

We can rewrite this equation into the following form:

It can be checked (with computer calculation or rather complicated but elementary algebraic calculation) that for 0E5λ+0\leqslant E\leqslant 5\lambda_{+} and η=O(1)\eta=O(1), the third order polynomial Pw,z(m)P_{w,z}(m) has no double root and there is only one root with positive real part. We denote this root by m1m_{1} and the other two roots by m2m_{2} and m3m_{3}. For 0E5λ+0\leqslant E\leqslant 5\lambda_{+} and tηt1t\leqslant\eta\leqslant t^{-1} for any tt fixed, the three roots are separate by order one due to compactness. Since there is no double root, we have Pw,z(m1)c>0|P^{\prime}_{w,z}(m_{1})|\geqslant c>0 whenever 0E5λ+0\leqslant E\leqslant 5\lambda_{+} and tηt1t\leqslant\eta\leqslant t^{-1}. Thus the stability of (6.54) is trivial and we have proved that in this range of parameters

for any ε>0{\varepsilon}>0 with ζ\zeta-high probability. ∎

4 Proof of the weak local Green function estimates.

In this subsection, we finish the proof of Theorem 6.1. We fix an energy EE and we will decrease the imaginary part η\eta of w=E+iηw=E+i\eta. Recall all stability results are based on assumption (6.20), i.e., Λαmcαw1/2\Lambda\leqslant\alpha|m_{c}|\sim\alpha|w|^{-1/2} for some small constant α\alpha, which so far was established only for large η\eta in (6.50). We would like to know that this condition continue to hold for smaller η\eta. More precisely, suppose that (6.20) holds in a set AA for all w=E+ηiw=E+\eta i with η[η~,10]\eta\in[\widetilde{\eta},10] where η~\widetilde{\eta} satisfies

for all w=E+iηjw=E+i\eta_{j} for all 1jn1\leqslant j\leqslant n. Since Λ(E+iη)\Lambda(E+i\eta) is continuous in η\eta at a scale, say, N10N^{-10}, (6.56) holds for all η[η~,10]\eta\in[\widetilde{\eta},10] with ζ\zeta-high probability in AA. Hence for η~\widetilde{\eta} satisfying (6.55) the estimate (6.41) holds with

With this choice, we can check that the assumption on δ\delta, (6.40), holds as well. Furthermore δ\delta is decreasing in η\eta when ε=κ+η{\varepsilon}=\sqrt{\kappa+\eta} is small enough. By Corollary 6.10, (6.45) holds all η[η~,10]\eta\in[\widetilde{\eta},10].

For z<1t|z|<1-t for some t>0t>0, if κ1\kappa\ll 1 then w1|w|\sim 1 and (6.45) implies

If κc>0\kappa\geqslant c>0 for some c>0c>0 then

By definition of Ψ\Psi, (6.58) and mcw1/2m_{c}\sim|w^{-1/2}|, we have

Using the restriction on η\eta so that Nηw1/2φ5QζN\eta\geqslant|w|^{1/2}\varphi^{5Q_{\zeta}}, we have

With (6.57) and (6.59), we have thus proved that

To conclude Theorem 6.1, it remains to prove the estimate on the off-diagonal elements. Recall the identity (6.12) for GijG_{ij} and the equations (C.3) and (C.4). We can estimate the off-diagonal Green function by

Here we have used GiiGjj(i,)=O(w1)|G_{ii}G^{(i,\emptyset)}_{jj}|=O(|w|^{-1}), which follows from (6.36), Λmc\Lambda\ll m_{c} and mcw1/2|m_{c}|\sim|w^{-1/2}|

where we have used (C.4) and that, by definition, ImGii(ij,ij)=0=ImGjj(ij,ij)\operatorname{Im}G^{(ij,ij)}_{ii}=0=\operatorname{Im}G^{(ij,ij)}_{jj}. Therefore, we have with ζ\zeta-high probability,

where we also used Gii(ij,)Gjj(ij,i)Cmc2Cw1|\mathcal{G}_{ii}^{(ij,\emptyset)}\mathcal{G}^{(ij,i)}_{jj}|\leqslant C|m_{c}|^{2}\leqslant C|w|^{-1}. Together with (6.61) and (6.36), we have proved that with ζ\zeta-high probability

Proof of the strong local Green function estimates

Lemma 6.7 provides an error estimate to the self-consistent equation of mm linearly in Ψ\Psi. The following Lemma improves this estimate to quadratic in Ψ\Psi. This is the key improvement leading to a proof of the strong local Green function estimates, i.e., Theorem 3.4.

For any ζ>1\zeta>1, there exists Rζ>0R_{\zeta}>0 such that the following statement holds. Suppose for some deterministic number Λ~(w,z)\widetilde{\Lambda}(w,z) (which can depend on ζ\zeta) we have

Notice that the probability deteriorates in the exponent by a (logN)2(\log N)^{-2} factor.

We remark that, by Lemma 4.1, Immcmc\operatorname{Im}m_{\rm c}\ll|m_{\rm c}| when η+κ1\eta+\kappa\ll 1. Hence we have to track the dependence of Immc\operatorname{Im}m_{\rm c} carefully in the previous Lemma. This is one major difference between the weak and strong local Green function estimates. Similar phenomena occur for the Stieltjes transforms of the eigenvalue distributions of Wigner matrices. Lemma 7.1 will be proved later in this section; we now use it to prove Theorem 3.4. We first give a heuristic argument.

Suppose that we have the estimate (7.2) with Ψ~\widetilde{\Psi} replaced by Ψ\Psi. We assume Λ(Nη)1\Lambda\geqslant(N\eta)^{-1} for convenience so that Ψ2(Immc+Λ)/(Nη)\Psi^{2}\sim(\operatorname{Im}m_{\rm c}+\Lambda)/(N\eta) (If this assumption is violated then then (3.5) holds automatically and we have nothing to prove). Then we can apply Corollary 6.10 by choosing

which implies (6.45). Consider first the case κ+ηO(1)\kappa+\eta\sim\operatorname{O}(1). Using (6.45) with the choice of δ\delta in (7.3) and κ+η+δO(1)\kappa+\eta+\delta\geqslant\operatorname{O}(1), we have

When η\eta satisfies the condition (6.55), the coefficient of Λ\Lambda on the right side of the last equation is smaller than 1/21/2. Hence, using ImmcmcCw1/2\operatorname{Im}m_{\rm c}\leqslant|m_{\rm c}|\leqslant C|w|^{-1/2} (see Proposition 3.2), we have

We now consider the case κ+η1\kappa+\eta\ll 1 and thus wO(1)|w|\sim\operatorname{O}(1). From the first inequality of (6.45), we have

Also, in the regime κ+η1\kappa+\eta\ll 1, (4.7) asserts that

Using the choice of δ\delta in (7.3), we have

where we have used (7.4) to absorb the last term involving Λ\Lambda in the last inequality with a change of constant CC. This completes the heuristic proof of Theorem 3.4. We now give a formal proof of this theorem assuming Lemma 7.1.

We first prove (3.6) assuming (3.5). By (6.63) and the definition of Ψ\Psi, we have for iji\neq j,

where we have used (3.5) in the last step. This proves (3.6).

holds with the probability larger than 1exp(φζ+5(logN)2)1-\exp(-\varphi^{\zeta+5}(\log N)^{-2}). Notice that the application of Lemma 7.1 causes the probability in the exponent to deteriorate by a (logN)2(\log N)^{-2} factor.

Using (7.6), we can apply Corollary 6.10 with

holds with the probability larger than 1exp(φζ+5(logN)2)1-\exp(-\varphi^{\zeta+5}(\log N)^{-2}). We have thus proved (3.5) provided that κ+η(logN)1\kappa+\eta\geqslant(\log N)^{-1}.

holds with probability larger than 1exp(φζ+5(logN)2)1-\exp(-\varphi^{\zeta+5}(\log N)^{-2}). Here C1C_{1} depends only on C0C_{0}. From the definition of δ1\delta_{1} and Ψ1\Psi_{1}, we have

holds with the probability larger than 1exp(φζ+5(logN)2)1-\exp(-\varphi^{\zeta+5}(\log N)^{-2}) for some C3C_{3}. Notice that we have used Nηφ5RζN\eta\geqslant\varphi^{5R_{\zeta}} in the last step in (7.8).

holds with the probability larger than 1exp(φζ+5(logN)4)1-\exp(-\varphi^{\zeta+5}(\log N)^{-4}). Notice that the last constant C3C_{3} is the same as the one appears in (7.8) and it does not change in the iteration procedure. We now iterate this process KK times to have

holds with the probability larger than 1exp(φζ+5(logN)2K)1-\exp(-\varphi^{\zeta+5}(\log N)^{-2K}). We need KK so large that

On the other hand, we need KK small enough so that

We note that it also guarantees (7.1), since φζ+5p1p2pKφ\varphi^{\zeta+5}\geqslant p_{1}\geqslant p_{2}\geqslant\cdots\geqslant p_{K}\geqslant\varphi. We choose K=loglogN/log2K=\log\log N/\log 2 and we have thus proved that

with the probability larger than 1exp(φζ)1-\exp(-\varphi^{\zeta}) which implies (3.5) when κ+η(logN)1\kappa+\eta\leqslant(\log N)^{-1}. This completes the proof of Theorem 3.4. ∎

The first step in proving Lemma 7.1 is to derive a second order self-consistent equation which identifies the first order dependence of the correction in the self-consistent equation derived in Lemma 6.7. The second error terms will be bounded by Ψ2\Psi^{2}; the first order terms are of the forms of averages of Zi(i)Z^{(i)}_{i} and Zi\mathcal{Z}_{i}. In Lemma 7.3, the averages of Zi(i)Z^{(i)}_{i} and Zi\mathcal{Z}_{i} will be estimated by Ψ2\Psi^{2}. This improvement from the naive order Ψ\Psi to Ψ2\Psi^{2} is the key ingredient to obtain the strong local law. We remark that Immcmc\operatorname{Im}m_{\rm c}\ll|m_{\rm c}| when η+κ1\eta+\kappa\ll 1. Hence the dependence of Immc\operatorname{Im}m_{\rm c} verses mcm_{c} has to be tracked carefully. We now state the second order self-consistent equation: as the following lemma.

We first take the inverse of both sides of (6.33) and sum up ii to get, with ζ\zeta-high probability,

where we have used (6.30) and the bound (6.22). Recall the estimates of Zi\mathcal{Z}_{i} and Zi(i)Z^{(i)}_{i} by Ψ\Psi in (6.27) and (6.32). Hence we have

where b5Qζb\geqslant 5Q_{\zeta} and QζQ_{\zeta} is defined in Lemma C.1. We now perform the expansion Gii1=[(Giim)+m]1G_{ii}^{-1}=[(G_{ii}-m)+m]^{-1} to have

Using this approximation in (7.13), we have

The diagonal element GiiG_{ii} can be estimated by (7.14) so that

Notice that only the imaginary part of mcm_{\rm c} appears through Ψ\Psi instead of mcm_{\rm c} which can be much bigger near the spectral edge.

We now estimate the last term in (7.16). Notice that G(i,)\mathcal{G}^{(i,\emptyset)} is the Green function of the matrix A+AA^{+}A where A=(Y(i,))A=(Y^{(i,\emptyset)})^{*}. Then m(i,i)m^{(i,i)} is the Green function of A(i,),+A(i,)A^{(i,),+}A^{(i,)} where we have used A(i,)=Y(i,i)A^{(i,)}=Y^{(i,i)}. Thus we can apply (7.17) (which holds for matrices of the form A+AA^{+}A with A not necessarily a square matrix) to get

Inserting (7.18) and (7.19) into (7.15), we obtain

To conclude Lemma 7.2, we choose Cζ=2QζC_{\zeta}=2Q_{\zeta} and it remains to prove 1mc3Ψ2O(N1)|\frac{1}{m_{\rm c}^{3}}\Psi^{2}|\geqslant\operatorname{O}(N^{-1}). By definition of Ψ\Psi and the fact that mcw1/2|m_{\rm c}|\sim|w|^{-1/2} (4.15), this inequality follows from the following property of Immc\operatorname{Im}m_{c}:

This estimate on Immc\operatorname{Im}m_{c} is a direct consequence of (4.2), (4.7), (4.9) and (4.10). This completes the proof of Lemma 7.2 ( with CζC_{\zeta} increasing by 1).

We now estimate the averages [Z][\mathcal{Z}] and [Z][Z_{\ast}^{\ast}]. Our goal is to catch cancellation effects due to the average over the indices ii. This is the content of the next lemma, to be proved in next subsection. Clearly this lemma completes the proof of Lemma 7.1.

For any ζ>1\zeta>1, there exists Rζ>0R_{\zeta}>0 such that the following statement holds. Suppose for some deterministic number Λ~(w,z)\widetilde{\Lambda}(w,z) (which can depend on ζ\zeta) we have

where Ψ~\widetilde{\Psi} is defined in (7.2).

2 Strong bounds on [Z]delimited-[]𝑍[Z].

In this subsection, we prove Lemma 7.3. The main tool is the abstract cancellation Lemma D.1.

We first perform a cutoff for all random variables XijX_{ij} in XX so that XijN10|X_{ij}|\leqslant N^{10}. Due to the subexponential decay assumption, the probability of the complement of this event is eNce^{-N^{c}}, which is negligible.

Define PiP_{i} and Pi\mathcal{P}_{i} as the operator for the expectation value w.r.t. the ii-th row and ii-th column. Let

With this convention and Lemma 6.5, we can rewrite Zi\mathcal{Z}_{i} and Zi(i)Z_{i}^{(i)}, from Definition 6.4, as

Choosing Cζ=2Dζ+20ζC_{\zeta}=2D_{\zeta}+20\zeta, one can see that (7.21) follows from (7.20), (7.30) and the Markov inequality.

It remains to prove (7.28) and (7.29). We prove (7.28) first. For simplicity, we assume that A={1,,A}{A=\{1,\ldots,\lvert A\rvert\}}. Denote the first A|A| column of YzY_{z} by a{\bf a} so that a\bf a is a N×AN\times|A| matrix. Similarly, denote by BB the matrix obtained after removing the first KK-columns of YY. Then we have the identity

Recall the identity (6.16): for any matrix MM,

We will prove R1\|R\|\ll 1 with high probability. Using (3.1), Λmc\Lambda\ll m_{\rm c} (7.24) and (6.6), we have

With (7.31) and the definition of RR, we have wαGij=[(I+R)1]ij-w\alpha G_{ij}=[(I+R)^{-1}]_{ij} for i,jAi,j\in A. Therefore,

Then, together with (7.32), (7.24) and mcw1/2αm_{c}\sim|w^{-1/2}|\sim\alpha, we have thus proved that, in Ξ\Xi,

where we used Apφ2ζ|A|\leqslant p\leqslant\varphi^{2\zeta} and UAU_{A} is a linear combination of the following products of (Rj)ii(R^{j})_{ii}’s

Define ΩA\Omega_{A} as the probability space for the columns {yk:kA}\{{\bf{y}}_{k}:k\in A\} and ΩAc\Omega_{A^{c}} the one for the columns {yk:kAc}\{{\bf{y}}_{k}:k\in A^{c}\}. Then the full probability space Ω\Omega equals to Ω=ΩA×ΩAc\Omega=\Omega_{A}\times\Omega_{A^{c}}. Define πAc\pi_{A^{c}} to be the projection onto ΩAc\Omega_{A^{c}} and Ξ=(πAc1πAcΞ)\Xi^{*}=\left(\pi^{-1}_{A^{c}}\cdot\pi_{A^{c}}\cdot\Xi\right). Then 1(Ξ){\bf 1}(\Xi^{*}) is independent of {yk:kA}\{{\bf{y}}_{k}:k\in A\}. Hence we can extend (7.35) to

By (7.33), Zi,AO(w1/2(AφDζ+2ζw1/2Ψ~)A)|\mathcal{Z}_{i,A}|\leqslant\operatorname{O}(|w|^{-1/2}(|A|\varphi^{D_{\zeta}+2\zeta}|w|^{1/2}\widetilde{\Psi})^{{|A|}}) in Ξ\Xi. We now prove that

By (7.22), we have (wGii)1=O(NC)\left(wG_{ii}\right)^{-1}=\operatorname{O}(N^{C}). Notice that α\alpha is independent of {yk:kA}\{{\bf{y}}_{k}:k\in A\}. Since αw1/2\alpha\sim|w^{-1/2}| in Ξ\Xi, the same asymptotic holds in ΞΞ\Xi^{*}\setminus\Xi. By definitions of UAU_{A} (7.33) and RR, and the assumption Xij=O(NC)X_{ij}=O(N^{C}), we obtain (7.36) and this completes the proof of (7.28). Similarly, we can prove (7.29) and this completes the proof of Lemma 7.3.

In this appendix we are going to prove the lemma 4.1, 4.2 and 4.3. We can solve mcm_{\rm c} explicitly by the following formula.

By definition, mcm_{\rm c} is an analytic function, so we only need to prove (A.1). By definition, mcm_{\rm c} is one of the three solutions of (3.1), and needs to have positive imaginary part. Solving explicitly this degree three polynomial equation proves that there is just one such solution, with the limit A.1 close to the critical axis. ∎

With Lemma A.1 and (A.2), one can easily prove Proposition 3.1.

Moreover, recall that α=1+8z2\alpha=\sqrt{1+8|z|^{2}}, so (still in the first case)

We also have easily mc1|m_{\rm c}|\sim 1 easily from (A.3), we therefore obtained the l.h.s. of (4.2). Similarly, one can prove Immcη\operatorname{Im}m_{\rm c}\sim\eta thanks to

and complete the proof for the first case.

For the second case, it is easy to prove (4.3) when w=λ+w=\lambda_{+}, as we did from an explicit calculation. Then one obtains (4.3) by expanding mcm_{\rm c} around mc(λ+,z)m_{\rm c}(\lambda_{+},z), using (3.1). The estimate (4.7) directly follows from (4.3).

Similarly, for the third case, first mc=m_{\rm c}=\infty, i.e., mc1=0m_{\rm c}^{-1}=0 when w=0w=0, then one can easily obtain (4.8) in case 3 by solving (3.1) with expanding mc1m_{\rm c}^{-1} around (mc(0,z))1(m_{\rm c}(0,z))^{-1}. The estimate (4.9) directly follows from (4.8). The fourth case follows from

and the properties of ρ\rho stated in proposition 3.1. ∎

This is similar to the proof of Lemma 4.1. ∎

We are going to prove this lemma in the case z1τ|z|\leqslant 1-\tau, the other cases can be proved similarly. Note first that (4.15) is a consequence of all possible cases in Lemma 4.1.

We now prove (4.16) in the four different cases, which have been classified in Lemma 4.1. In the first case, if additionally η1\eta\sim 1, as 0>Re(mc)>1/20>\operatorname{Re}(m_{\rm c})>-1/2, the l.h.s. in (4.16) is bounded by O(1)\operatorname{O}(1), which implies (4.16). For the first case if η\eta is small enough, since Rew(1+mc)1|\operatorname{Re}w|\sim(1+m_{\rm c})\sim 1 and Im(mc)η|\operatorname{Im}(m_{\rm c})|\sim\eta, so

which gives (4.16) in the first case. In the same way we get (4.16) in the second case, where Immccη\operatorname{Im}m_{\rm c}\geqslant c\eta. For the third case, using (4.8), one can easily prove (4.16). Finally, the fourth case is simple since the l.h.s. in (4.16) is clearly O(1)\operatorname{O}(1).

We now prove (4.17). Using (4.9) and (4.10), (α=1+8z2\alpha=\sqrt{1+8|z|^{2}} is a real number) we have that, in the cases three and four,

Note mc(λ+)=2/(3+α)m_{\rm c}(\lambda_{+})=-2/(3+\alpha). For case one, with (A.4), it is easy to prove that either Immc1\operatorname{Im}m_{\rm c}\sim 1 or Remcmc(λ+)=Remc+2/(3+α)1\operatorname{Re}m_{\rm c}-m_{\rm c}(\lambda_{+})=\operatorname{Re}m_{\rm c}+2/(3+\alpha)\sim 1. It implies that mc23+α1\left|m_{\rm c}-\frac{-2}{3+\alpha}\right|\sim 1. This completes the proof. ∎

Appendix B Perturbation theorem

In this section, we introduce the theorem on the relations between the Green function GG of the matrix HH and the Green function of the minor of the matrix. This theorem was proved in [ErdYauYin2010PTRF]. We first introduce some notations (here we use [][] instead of ()() in [ErdYauYin2010PTRF], since upper index ()() has been used in the main part of the paper).

The following formulas were proved in Lemma 4.2 from [ErdYauYin2010PTRF].

Appendix C Large deviation estimates.

In order to obtain the self-consistent equations for the Green functions, we needed the following large deviation estimate.

We first recall the following large deviation estimates concerning independent random variables, which were proved in Appendix B of [ErdYauYin2010PTRF].

Let aia_{i} (1iN1\leqslant i\leqslant N) be independent complex random variables with mean zero, variance σ2\sigma^{2} and having a uniform subexponential decay

for any sufficiently large NN0N\geqslant N_{0}, where N0=N0(ϑ)N_{0}=N_{0}(\vartheta) depends on ϑ\vartheta.

We will only prove the assertion of this lemma concerning the Green function GG. Similar statement for G\mathcal{G} can be proved with the row-column symmetry. From now on, we will only prove all statements concerning GG if identical proofs are valid for G\mathcal{G} and we will not repeat this comment.

with ζ\zeta-high probability. Similarly, with (C.5), the second term on the right hand side is bounded by

The proofs for the other bounds follow from similar arguments. ∎

Appendix D Abstract decoupling lemma

We recall an abstract cancellation Lemma proved in [PilYin2011].

Let I\mathcal{I} be a finite set which may depend on NN and

Let S1,,SN{S}_{1},\dots,{S}_{N} be random variables which depend on the independent random variables {xα,αI}\{x_{\alpha},\alpha\in\mathcal{I}\}. In application, we often take I=1,N\mathcal{I}=\llbracket 1,N\rrbracket and Ii={i}\mathcal{I}_{i}=\{i\}.

Let pp be an even integer Suppose for some constants C0C_{0}, c0>0c_{0}>0 there is a set Ξ\Xi (the "good configurations") so that the following assumptions hold:

(Bound on QASiQ_{A}S_{i} in Ξ\Xi). There exist deterministic positive numbers X<1\mathcal{X}<1 and Y\mathcal{Y} such that for any set A1,NA\subset\llbracket 1,N\rrbracket with iAi\in A and Ap\lvert A\rvert\leqslant p, QASiQ_{A}S_{i} in Ξ\Xi can be written as the sum of two random variables

Then, under the assumptions (i) – (iii), we have

for some C>0C>0 and any sufficiently large NN.

Roughly speaking, this lemma increase the estimate of Zi{\bf Z}_{i} from X\mathcal{X} to X2\mathcal{X}^{2} after averaging over ii.

References