Fluctuations of the extreme eigenvalues of finite rank deformations of random matrices

Florent Benaych-Georges, Alice Guionnet, Mylène Maïda

Introduction

Most of the spectrum of a large matrix is not much altered if one adds a finite rank perturbation to the matrix, simply because of Weyl’s interlacement properties of the eigenvalues. But the extreme eigenvalues, depending on the strength of the perturbation, can either stick to the extreme eigenvalues of the non-perturbed matrix or deviate to some larger values. This phenomenon was made precise in , where a sharp phase transition, known as the BBP transition , was exhibited for finite rank perturbations of a complex Gaussian Wishart matrix. In this case, it was shown that if the strength of the perturbation is above a threshold, the largest eigenvalue of the perturbed matrix deviates away from the bulk and has then Gaussian fluctuations, otherwise it sticks to the bulk and fluctuates according to the Tracy-Widom law. The fluctuations of the extreme eigenvalues which deviate from the bulk were studied as well when the non-perturbed matrix is a Wishart (or Wigner) matrix with non-Gaussian entries; they were shown to be Gaussian if the perturbation is chosen randomly with i.i.d. entries in , or with completely delocalised eigenvectors , whereas in , a non-Gaussian behaviour was exhibited when the perturbation has localised eigenvectors. The influence of the localisation of the eigenvectors of the perturbation was studied more precisely in .

In this paper, we also focus on the behaviour of the extreme eigenvalues of a finite rank perturbation of a large matrix, this time in the framework where the large matrix is deterministic whereas the perturbation has delocalised random eigenvectors. We show that the eigenvalues which deviate away from the bulk have Gaussian fluctuations, whereas those which stick to the bulk are extremely close to the extreme eigenvalues of the non-perturbed matrix. In a one-dimensional perturbation situation, we can as well study the fluctuations of the next eigenvalues, for instance showing that if the first eigenvalue deviates from the bulk, the second eigenvalue will stick to the first eigenvalue of the non-perturbed matrix, whereas if the first eigenvalue sticks to the bulk, the second eigenvalue will be very close to the second eigenvalue of the non-perturbed matrix. Hence, for a one dimensional perturbation, the eigenvalues which stick to the bulk will fluctuate as the eigenvalues of the non-perturbed matrix. We can also extend these results beyond the case when the non-perturbed matrix is deterministic. In particular, if the non-perturbed matrix is a Wishart (or Wigner) matrix with rather general entries, or a matrix model, we can use the universality of the fluctuations of the extreme eigenvalues of these random matrices to show that the ppth extreme eigenvalue which sticks to the bulk fluctuates according to the ppth dimensional Tracy-Widom law. This proves the universality of the BBP transition at the fluctuation level, provided the perturbation is delocalised and random. The reader should notice however that we do not deal with the asymptotics of eigenvalues corresponding to critical deformations. This probably requires a case-by-case analysis and may depend on the model under consideration.

Let us now describe more precisely the models we will be dealing with. We consider a deterministic self-adjoint matrix XnX_{n} with eigenvalues λ1nλnn\lambda_{1}^{n}\leq\cdots\leq\lambda_{n}^{n} satisfying the following hypothesis.

The spectral measure μn:=n1l=1nδλln\mu_{n}:=n^{-1}\sum_{l=1}^{n}\delta_{\lambda_{l}^{n}} of XnX_{n} converges towards a deterministic probability measure μX\mu_{X} with compact support. Moreover, the smallest and largest eigenvalues of XnX_{n} converge respectively to aa and bb, the lower and upper bounds of the support of μX\mu_{X}.

We consider now a random vector vn=1n(x1,,xn)Tv^{n}=\frac{1}{\sqrt{n}}(x_{1},\ldots,x_{n})^{T} with (xi)1in(x_{i})_{1\leq i\leq n} i.i.d. real or complex random variables with law ν\nu. Then

Either the uinu_{i}^{n}’s (i=1,,ri=1,\ldots,r) are independent copies of vnv^{n}

Or (uin)1ir(u_{i}^{n})_{1\leq i\leq r} are obtained by the Gram-Schmidt orthonormalisation of rr independent copies of a vector vn.v^{n}.

We shall refer to the model (1) as the i.i.d. model and to the model (2) as the orthonormalised model.

Before giving a rough statement of our results, let us make a few remarks. We first recall that a probability measure ν\nu is said to satisfy a logarithmic Sobolev inequality with constant cc if, for any differentiable funtion ff in L2(ν),L^{2}(\nu),

Consequently, in the sequel, we shall restrict ourselves to the event when the model (2) is well defined without mentioning it explicitly.

In this work, we study the asymptotics of the eigenvalues of Xn~\widetilde{X_{n}} outside the spectrum of XnX_{n}.

It has already been observed in similar situations, see , that these eigenvalues converge to the boundary of the support of XnX_{n} if the θi\theta_{i}’s are small enough, whereas for sufficiently large values of the θi\theta_{i}’s, they stay away from the bulk of XnX_{n}. More precisely, if we let GμXG_{\mu_{X}} be the Cauchy-Stieltjes transform of μX\mu_{X}, defined, for z<az<a or z>b,z>b, by the formula

then the eigenvalues of Xn~\widetilde{X_{n}} outside the bulk converge to the solutions of GμX(z)=θi1G_{\mu_{X}}(z)=\theta_{i}^{-1} if they exist.

then we have the following theorem. Let r0{0,,r}r_{0}\in\{0,\ldots,r\} be such that

Assume that Hypothesis 1.1 and Assumption 1.2 are satisfied. For all i{1,,r0}i\in\{1,\ldots,r_{0}\}, we have

and for all i{r0+1,,r}i\in\{{r_{0}+1},\ldots,r\},

Moreover, for all i>r0i>r_{0} (resp. for all irr0i\geq r-r_{0}) independent of nn,

The uniform case was proved in [10, Theorem 2.1] and we will follow a similar strategy to prove Theorem 1.3 under our assumptions in Section 2.

The main object of this paper is to study the fluctuations of the extreme eigenvalues of Xn~.\widetilde{X_{n}}. Precise statements will be given in Theorems 3.2, 3.4, 4.3, 4.4 and 4.5. For any xx such that xax\leq a or xb,x\geq b, we denote by IxI_{x} the set of indices ii such that ρθi=x.\rho_{\theta_{i}}=x. The results roughly state as follows.

Let α1<<αq\alpha_{1}<\cdots<\alpha_{q} be the different values of the θi\theta_{i}’s such that ρθi{a,b}\rho_{\theta_{i}}\notin\{a,b\} and denote, for each jj, kj=Iραjk_{j}=|I_{\rho_{\alpha_{j}}}| and q0q_{0} the largest index so that αq0<0\alpha_{q_{0}}<0. Then, the law of the random vector

converges to the law of the eigenvalues of (cαjMj)1jq(c_{\alpha_{j}}M_{j})_{1\leq j\leq q} with the MjM_{j}’s being independent matrices following the law of a kj×kjk_{j}\times k_{j} matrix from the GUE or the GOE, depending whether ν\nu is supported on the complex plane or the real line. The constant cαjc_{\alpha_{j}} is explicitly defined in Equation (6).

If none of the θi\theta_{i}’s are critical (i.e. equal to θ\underline{\theta} or θ\overline{\theta}), with overwhelming probability, the extreme eigenvalues converging to aa or bb are at distance at most n1+ϵn^{-1+{\epsilon}} of the extreme eigenvalues of XnX_{n} for some ϵ>0{\epsilon}>0.

If r=1r=1 and θ1=θ>0\theta_{1}=\theta>0, we have the following more precise picture about the extreme eigenvalues:

If ρθ>b\rho_{\theta}>b, n(λ~nnρθ)\sqrt{n}(\widetilde{\lambda}^{n}_{n}-\rho_{\theta}) converges towards a Gaussian variable, whereas n1ϵ(λ~ninλni+1)n^{1-{\epsilon}}(\widetilde{\lambda}^{n}_{n-i}-\lambda_{n-i+1}) vanishes in probability as nn goes to infinity for any fixed i1i\geq 1 and some ϵ>0{\epsilon}>0.

If ρθ=b\rho_{\theta}=b and θθ,\theta\neq\overline{\theta}, n1ϵ(λ~ninλni)n^{1-{\epsilon}}(\widetilde{\lambda}^{n}_{n-i}-\lambda_{n-i}) vanishes in probability as nn goes to infinity for any fixed i1i\geq 1 and some ϵ>0{\epsilon}>0.

For any fixed j1j\geq 1, n1ϵ(λ~jnλj)n^{1-{\epsilon}}(\widetilde{\lambda}^{n}_{j}-\lambda_{j}) vanishes in probability as nn goes to infinity for some ϵ>0{\epsilon}>0.

These different behaviours are illustrated in Figure 1 below.

𝑋diag𝜃0…0\widetilde{X_{n}}=X+\operatorname{diag}(\theta,0,\ldots,0) (above the dotted line). In the left picture, θ=0.5<θ=1\theta=0.5<\overline{\theta}=1 and as predicted, λ~1b=2\widetilde{\lambda}_{1}\approx b=2, whereas in the right one, θ=1.5>θ\theta=1.5>\overline{\theta}, which indeed implies that λ~1ρθ=θ+1θ=2.17\widetilde{\lambda}_{1}\approx\rho_{\theta}=\theta+\frac{1}{\theta}=2.17 and λ~2b\widetilde{\lambda}_{2}\approx b. Moreover, in the left picture, we have, for all ii, λ~iλi\widetilde{\lambda}_{i}\approx\lambda_{i}, with some deviations In the same way, in the right picture, ii, λ~i+1λi\widetilde{\lambda}_{i+1}\approx\lambda_{i}, with some deviations At last, here, in the right picture, we have λ~12.167\widetilde{\lambda}_{1}\approx 2.167, which gives n(λ~1ρθ)cθ0.040\frac{\sqrt{n}(\widetilde{\lambda}_{1}-\rho_{\theta})}{c_{\theta}}\approx 0.040, reasonable value for a standard Gaussian variable. The first part of this theorem will be proved in Section 3, whereas Section 4 will be devoted to the study of the eigenvalues sticking to the bulk, i.e. to the proof of the second and third parts of the theorem. Moreover, our results can be easily generalised to non-deterministic self-adjoint matrices XnX_{n} that satisfy our hypotheses with probability tending to one. This will allow us to study in Section 5 the deformations of various classical models. This will include the study of the Gaussian fluctuations away from the bulk for rather general Wigner and Wishart matrices, hence providing a new proof of the first part of [18, Theorem 1.1] and of [5, Theorem 3.1] but also a new generalisation to non-white ensembles. The study of the eigenvalues that stick to the bulk requires a finer control on the eigenvalues of XnX_{n} in the vicinity of the edges of the bulk, which we prove for random matrices such as Wigner and Wishart matrices with entries having a sub-exponential tail. This result complements [18, Theorem 1.1], where the fluctuations of the largest eigenvalue of a non-Gaussian Wishart matrix perturbed by a delocalised but deterministic rank one perturbation was studied. One should remark that our result depends very little on the law ν\nu (only through its fourth moment in fact).

Our approach is based upon a determinant computation (see Lemma 6.1), which shows that the eigenvalues of Xn~\widetilde{X_{n}} we are interested in are the solutions of the equation

and hence it is clear that one should expect the eigenvalues of Xn~\widetilde{X_{n}} outside of the bulk to converge to the solutions of GμX(z)=θi1G_{\mu_{X}}(z)=\theta_{i}^{-1} if they exist. Studying the fluctuations of these eigenvalues amounts to analyse the behavior of the solutions of (3) around their limit. Such an approach was already developed in several papers (see e.g or ). However, to our knowledge, the model we consider, with a fixed deterministic matrix XnX_{n}, was not yet studied and the fluctuations of the eigenvalues which stick to the bulk of XnX_{n} was never achieved in such a generality.

For the sake of clarity, throughout the paper, we will call “hypothesis” any hypothesis we need to make on the deterministic part of the model XnX_{n} and “assumption” any hypothesis we need to make on the deformation Rn.R_{n}. Moreover, because of concentration considerations that are developed in the Appendix of the paper, the proofs will be quite similar in the i.i.d. and orthonormalised models. Therefore, we will detail each proof in the i.i.d. model, which is simpler and then check that the argument is the same in the orthonormalised model or detail the slight changes to make in the proofs.

\bullet p+p_{+} is the number of ii’s such that ρθi>b\rho_{\theta_{i}}>b, pp_{-} is the number of ii’s such that ρθi<a\rho_{\theta_{i}}<a and α1<<αq\alpha_{1}<\cdots<\alpha_{q} are the different values of the θi\theta_{i}’s such that ρθi{a,b}\rho_{\theta_{i}}\notin\{a,b\} (so that qp+p+q\leq p_{-}+p_{+}, with equality in the particular case where the θi\theta_{i}’s are pairwise distinct), \bullet γ1n,γp+p+n\gamma_{1}^{n},\ldots\ldots\gamma_{p_{-}+p_{+}}^{n} are the rescaled differences between the eigenvalues with limit out of [a,b][a,b] and their limits:

\bullet for any xx such that xax\leq a or xb,x\geq b, IxI_{x} is the set of indices ii such that ρθi=x\rho_{\theta_{i}}=x, \bullet for any j=1,,qj=1,\ldots,q, kjk_{j} is the number of indices ii such that θi=αj\theta_{i}=\alpha_{j}, i.e. kj=Iραjk_{j}=|I_{\rho_{\alpha_{j}}}|.

Almost sure convergence of the extreme eigenvalues

For the sake of completeness, in this section, we prove Theorem 1.3. In fact, we shall even prove the more general following result.

Assume that Hypothesis 1.1 and Assumption 1.2 are satisfied.

Let us fix, independently of nn, an integer i1i\geq 1 and VV, a neighborhood of ρθi\rho_{\theta_{i}} if ir0i\leq r_{0} and of aa if i>r0i>r_{0}. Then λ~inV\widetilde{\lambda}_{i}^{n}\in V with overwhelming probability.

The analogue result exists for largest eigenvalues: for any fixed integer i0i\geq 0 and VV, a neighborhood of ρθri\rho_{\theta_{r-i}} if i<rr0i<r-r_{0} and of bb if irr0i\geq r-r_{0}, λ~ninV\widetilde{\lambda}_{n-i}^{n}\in V with overwhelming probability.

By Lemma 6.1, the eigenvalues of Xn~\widetilde{X_{n}} which are not in the spectrum of XnX_{n} are the solutions of the equation

the functions Gs,tn()G_{s,t}^{n}(\cdot) being defined in (4). For zz out of the support of μX\mu_{X}, let us introduce the r×rr\times r matrix

The key point, to prove Theorem 2.1, is the following lemma. For A=[Ai,j]i,j=1rA=[A_{i,j}]_{i,j=1}^{r} and r×rr\times r matrix, we set A:=supi,jAi,j|A|_{\infty}:=\sup_{i,j}|A_{i,j}|.

Assume that Hypothesis 1.1 and Assumption 1.2 are satisfied. For any δ,ε>0,\delta,\varepsilon>0, with overwhelming probability,

In the case where the θi\theta_{i}’s are pairwise distinct, Theorem 2.1 follows directly from this lemma, because the zz’s such that det(M(z))=0\det(M(z))=0 are precisely the zz’s such that for some ii, GμX(z)=1θiG_{\mu_{X}}(z)=\frac{1}{\theta_{i}} and because close continuous functions on an interval have close zeros. The case where the θi\theta_{i}’s are not pairwise distinct can then be deduced by an approximation procedure similar to the one of Section 6.2.3 of .

Then since the support of μX\mu_{X} is contained in [a,b][a,b] and for nn large enough, the eigenvalues of XnX_{n} are all in [aδ/2,b+δ/2][a-\delta/2,b+\delta/2], it suffices to prove that with overwhelming probability,

Now, fix some zz such that zR|z|\leq R, d(z,[a,b])>δ,d(z,[a,b])>\delta, and nn large enough. By Proposition 6.2 with A=(zXn)1A=(z-X_{n})^{-1}, whose operator norm is bounded by 2δ1,2\delta^{-1}, we find that for any ϵ>0{\epsilon}>0, there exists c>0c>0 such that

It follows that there are c,η>0c,\eta>0 such that for all zz such that zR|z|\leq R, d(z,[a,b])>δ,d(z,[a,b])>\delta,

As a consequence, since the number of zz’s such that zR|z|\leq R and nznz have integer real and imaginary parts has order n2n^{2}, there is a constant CC such that

This concludes the proof for the i.i.d. model.

The orthonormalised model can be treated similarly, by writing Un=WnGnU_{n}=W^{n}G_{n} with nWn\sqrt{n}W^{n} a matrix converging almost surely to the identity by Proposition 6.3. \square

Fluctuations of the eigenvalues away from the bulk

Let p+p_{+} be the number of ii’s such that ρθi>b\rho_{\theta_{i}}>b and pp_{-} be the number of ii’s such that ρθi<a\rho_{\theta_{i}}<a. In this section, we study the fluctuations of the eigenvalues of Xn~\widetilde{X_{n}} with limit out of the bulk, that is (λ~1n,,λ~pn,λ~np++1n,,λ~nn)(\widetilde{\lambda}_{1}^{n},\ldots,\widetilde{\lambda}_{p_{-}}^{n},\widetilde{\lambda}_{n-p_{+}+1}^{n},\ldots,\widetilde{\lambda}_{n}^{n}). We shall assume throughout this section that the spectral measure of XnX_{n} converges to μX\mu_{X} faster than 1/n.1/\sqrt{n}. More precisely,

For all z{ρα1,,ραq}z\in\{\rho_{\alpha_{1}},\ldots,\rho_{\alpha_{q}}\}, n(Gμn(z)GμX(z))\sqrt{n}(G_{\mu_{n}}(z)-G_{\mu_{X}}(z)) converges to .

Our theorem deals with the limiting joint distribution of the variables γ1n,,γp+p+n\gamma^{n}_{1},\ldots,\gamma^{n}_{p_{-}+p_{+}}, the rescaled differences between the eigenvalues with limit out of [a,b][a,b] and their limits:

Let us recall that for k1k\geq 1, GOE(k)\operatorname{GOE}(k) (resp. GUE(k)\operatorname{GUE}(k)) is the distribution of a k×kk\times k symmetric (resp. Hermitian) random matrix [gi,j]i,j=1k[g_{i,j}]_{i,j=1}^{k} such that the random variables {12gi,i;1ik}{gi,j;1i<jk}\{\frac{1}{\sqrt{2}}g_{i,i}\,;\,1\leq i\leq k\}\cup\{g_{i,j}\,;\,1\leq i<j\leq k\} (resp. {gi,i;1ik}{2(gi,j);1i<jk}{2(gi,j);1i<jk}\{g_{i,i}\,;\,1\leq i\leq k\}\cup\{\sqrt{2}\Re(g_{i,j})\,;\,1\leq i<j\leq k\}\cup\{\sqrt{2}\Im(g_{i,j})\,;\,1\leq i<j\leq k\}) are independent standard Gaussian variables.

The limiting behaviour of the eigenvalues with limit outside the bulk will depend on the law ν\nu through the following quantity, called the fourth cumulant of ν\nu

Note that if ν\nu is Gaussian standard, then κ4(ν)=0\kappa_{4}(\nu)=0.

The definitions of the αj\alpha_{j}’s and of the kjk_{j}’s have been given in Theorem 1.4 and recalled in the Notations gathered at the end of the introduction above.

Suppose that Assumption 1.2 holds with κ4(ν)=0,\kappa_{4}(\nu)=0, as well as Hypotheses 1.1 and 3.1. Then the law of

converges to the law of (λi,j,1ikj)1jq,(\lambda_{i,j},1\leq i\leq k_{j})_{1\leq j\leq q}, with λi,j\lambda_{i,j} the iith largest eigenvalue of cαjMjc_{\alpha_{j}}M_{j} with (M1,,Mq)(M_{1},\ldots,M_{q}) being independent matrices, MjM_{j} following the GUE(kj)(k_{j}) (resp. GOE(kj)(k_{j})) distribution if ν\nu is supported on the complex plane (resp. the real line). The constant cαc_{\alpha} is given by

When κ4(ν)0,\kappa_{4}(\nu)\neq 0, we need a bit more than Hypothesis 3.1, namely

In the case when Assumption 1.2 holds with κ4(ν)0,\kappa_{4}(\nu)\neq 0, under Hypotheses 1.1, 3.1 and 3.3, Theorem 3.2 stays true, replacing the matrices cαjMjc_{\alpha_{j}}M_{j} by matrices cαjMj+Djc_{\alpha_{j}}M_{j}+D_{j} where the DjD_{j}’s are independent diagonal random matrices, independent of the MjM_{j}’s, and such that for all jj, the diagonal entries of DjD_{j} are independent centred real Gaussian variables, with variance l(ραj)κ4(ν)/GμX(ραj)-l({\rho_{\alpha_{j}}})\kappa_{4}(\nu)/G_{\mu_{X}}^{\prime}(\rho_{\alpha_{j}}).

2. Proof of Theorems 3.2 and 3.4

We prove hereafter Theorem 3.2 and we will indicate briefly at the end of this section the minor changes to make to get Theorem 3.4. The main ingredient will be a central limit theorem for quadratic forms, stated in Theorem 6.4 in the appendix.

We set ρni(x):=ραi+xn.\rho_{n}^{i}(x):=\rho_{\alpha_{i}}+\frac{x}{\sqrt{n}}.

The first step of the proof will be to get the asymptotic behavior of Mn(i,x).M^{n}(i,x).

with (ns,t)s,t=1,,r(n_{s,t})_{s,t=1,\ldots,r} a family of independent Gaussian variables with ns,sN(0,2)n_{s,s}\sim\mathcal{N}(0,2) and ns,tN(0,1)n_{s,t}\sim\mathcal{N}(0,1) when sts\neq t in the real case (resp. ns,sN(0,1)n_{s,s}\sim\mathcal{N}(0,1) and (ns,t),(ns,t)N(0,1/2)\Re(n_{s,t}),\Im(n_{s,t})\sim\mathcal{N}(0,1/2) and independent in the complex case).

From (5), we know that for sIραis\notin I_{\rho_{\alpha_{i}}},

Let sIραi.s\in I_{\rho_{\alpha_{i}}}. We write the decomposition

The asymptotics of the first term is given by Theorem 6.4 with a variance given by

As ραi\rho_{\alpha_{i}} is at distance of order one from the support of XnX_{n}, we can expand x/nx/\sqrt{n} in Ms,tn,2(i,x)M_{s,t}^{n,2}(i,x) to deduce that

Equations (8), (9), (10) and (11) prove the lemma (using the fact that the distribution of the Gaussian variables ns,sn_{s,s} and ns,tn_{s,t} are symmetric). ∎

The last point to check is a result of asymptotic independence, from which the independence of the matrices M1,,MqM_{1},\ldots,M_{q} will be inherited. In fact, the matrices (Mn(1,x1),,Mn(q,xq))(M^{n}(1,x_{1}),\ldots,M^{n}(q,x_{q})) won’t be asymptotically independent but their determinants will.

Then, as the set of indices Iρα1,,IραqI_{\rho_{\alpha_{1}}},\ldots,I_{\rho_{\alpha_{q}}} are disjoint, the submatrices involved in the main terms are independent in the i.i.d case and asymptotically independent in the orthonormalised case.

Let us now show (12). Firstly, note that by the convergence of Ms,tn(i,x)M^{n}_{s,t}(i,x) obtained in the proof of the Lemma 3.5, we have for all s,t{1,,r}s,t\in\{1,\ldots,r\} such that sts\neq t or sIραis\in I_{\rho_{\alpha_{i}}}, for all κ<1/2\kappa<1/2,

it suffices to prove that for any σSr\sigma\in S_{r} such that for some i0{1,,r}\Iραii_{0}\in\{1,\ldots,r\}\backslash I_{\rho_{\alpha_{i}}}, σ(i0)i0\sigma(i_{0})\neq i_{0},

It follows immediately from (13) since for any κ<1/2\kappa<1/2, in the above product, all the terms with index in IραiI_{\rho_{\alpha_{i}}} are of order at most nκn^{-\kappa}, giving a contribution nkiκn^{-k_{i}\kappa}, and i0i_{0} is not in IραiI_{\rho_{\alpha_{i}}} and satisfies σ(i0)i0\sigma(i_{0})\neq i_{0}, yielding another term of order at most nκn^{-\kappa}. Hence, the other terms being bounded because ρni(x)\rho_{n}^{i}(x) stays bounded away from [a,b][a,b], the above product is at most of order nκ(ki+1)n^{-\kappa(k_{i}+1)} and so taking κ(ki2(ki+1),12)\kappa\in(\frac{k_{i}}{2(k_{i}+1)},\frac{1}{2}) proves (14). ∎

we can deduce from the lemmata above the following

Under the hypothesis of Theorem 3.2, the random process

converges weakly, as nn goes to infinity to the random process

in the sense of finite dimensional marginals, with the constants cαic_{\alpha_{i}} and the joint distribution of (M1,,Mq)(M_{1},\ldots,M_{q}) as in the statement of Theorem 3.2.

From there, the proof of Theorem 3.2 is straightforward.

To prove Theorem 3.4, the only substantial change to make is in the definition (7), in the case when sIραi,s\in I_{\rho_{\alpha_{i}}}, we have to put

The convergence of [Mn(i,x)]s,t\left[M^{n}(i,x)\right]_{s,t} to [M(i,x)]s,t\left[\mathcal{M}(i,x)\right]_{s,t} is again obtained by applying Theorem 6.4.

The sticking eigenvalues

To study the fluctuations of the eigenvalues which stick to the bulk, we need a more precise information on the eigenvalues of XnX_{n} in the vicinity of their extremes. More explicitly, we shall need the following additional hypothesis, which depends on a positive integer pp and a real number α(0,1)\alpha\in(0,1). Note that this hypothesis has two versions: Hypothesis 4.1[p,α,a][p,\alpha,a] is adapted to the study of the smallest eigenvalues (it is the version detailed below) and Hypothesis 4.1[p,α,b][p,\alpha,b] is adapted to the study of the largest eigenvalues (this version is only outlined below).

[p,α,a][p,\alpha,a] There exists a sequence mnm_{n} of positive integers tending to infinity such that mn=O(nα)m_{n}=O(n^{\alpha}),

and there exist η2>0\eta_{2}>0 and η4>0\eta_{4}>0, so that for nn large enough

Hypothesis 4.1. [p,α,b][p,\alpha,b] is the same hypothesis where we replace λpnλin\lambda_{p}^{n}-\lambda_{i}^{n} by λnp+1nλni+1n\lambda_{n-p+1}^{n}-\lambda_{n-i+1}^{n}, and (15) becomes

For many matrix models, the behaviors of largest and smallest eigenvalues are similar, and Hypothesis 4.1 [p,α,a][p,\alpha,a] is satisfied if and only if Hypothesis 4.1 [p,α,b][p,\alpha,b] is satisfied. In such cases, we shall simply say that Hypothesis 4.1 [p,α][p,\alpha] is satisfied.

For rank one perturbations and in the i.i.d. model, we will only require the two first conditions (15) and (16) whereas for higher rank perturbations, we will need in addition (17) to control the off-diagonal terms of the determinant.

Moreover, we shall not study the critical case where for some ii, θi{θ,θ}\theta_{i}\in\{\underline{\theta},\overline{\theta}\}.

For all ii, θiθ\theta_{i}\neq\underline{\theta} and θiθ\theta_{i}\neq\overline{\theta}.

In fact, Assumption 4.2 can be weakened into: for all ii, θiθ\theta_{i}\neq\underline{\theta} (resp. θiθ\theta_{i}\neq\overline{\theta}) if we only study the smallest (resp. largest) eigenvalues.

The fact that the eigenvalues of the non-perturbed matrix are sufficiently spread at the edges to insure the above hypothesis allow the eigenvalues of the perturbed matrix to be very close to them, as stated in the following theorem.

Let Ia={i[1,r]:ρθi=a}=[p+1,r0]I_{a}=\{i\in[1,r]:\rho_{\theta_{i}}=a\}=[p_{-}+1,r_{0}] (resp. Ib={i[1,r]:ρθi=b}=[r0+1,rp+]I_{b}=\{i\in[1,r]:\rho_{\theta_{i}}=b\}=[r_{0}+1,r-p_{+}]) be the set of indices corresponding to the eigenvalues λ~in\widetilde{\lambda}_{i}^{n} (resp. λ~nr+in\widetilde{\lambda}_{n-r+i}^{n}) converging to the lower (resp. upper) bound of the support of μX\mu_{X}. Let us suppose Hypothesis 1.1, Hypothesis 4.1 [r,α,a][r,\alpha,a] (resp. Hypothesis 4.1 [r,α,b][r,\alpha,b]) and Assumptions 1.2 and 4.2 to hold. Then for any α>α,\alpha^{\prime}>\alpha, we have, for all iIai\in I_{a} (resp. iIbi\in I_{b}),

Moreover, in the case where the perturbation has rank one, we can locate exactly in the neighborhood of which eigenvalues of the non-perturbed matrix the eigenvalues of the perturbed matrix lie.

We state hereafter the result for the smallest eigenvalues, but of course a similar statement holds for the largest ones.

Let (λ~in)i1(\widetilde{\lambda}_{i}^{n})_{i\geq 1} be the eigenvalues of Xn+θu1u1X_{n}+\theta u_{1}u_{1}^{*}, with θ<0\theta<0. Then, under Assumption 1.2 and Hypothesis 1.1, if (15) and (16) in Hypothesis 4.1 [p,α,a][p,\alpha,a] hold for some α(0,1)\alpha\in(0,1) and a positive integer pp, then for any α>α\alpha^{\prime}>\alpha, we have

if θ<θ\theta<\underline{\theta}, λ~1n\widetilde{\lambda}_{1}^{n} converges to ρθ<a\rho_{\theta}<a whereas n1α(λ~i+1nλin)1ip1n^{1-\alpha^{\prime}}(\widetilde{\lambda}_{i+1}^{n}-\lambda_{i}^{n})_{1\leq i\leq p-1} vanishes in probability as nn goes to infinity,

if θ(θ,0)\theta\in(\underline{\theta},0), n1α(λ~inλin)1ipn^{1-\alpha^{\prime}}(\widetilde{\lambda}_{i}^{n}-\lambda_{i}^{n})_{1\leq i\leq p} vanishes in probability as nn goes to infinity,

if, instead of (15) and (16) in Hypothesis 4.1 [p,α,a][p,\alpha,a], one supposes (15) and (16) in Hypothesis 4.1 [p,α,b][p,\alpha,b] to hold, then n1α(λ~ninλnin)0i<pn^{1-\alpha^{\prime}}(\widetilde{\lambda}_{n-i}^{n}-\lambda_{n-i}^{n})_{0\leq i<p} vanishes in probability as nn goes to infinity.

Consider the i.i.d. model and let (λ~in)i1(\widetilde{\lambda}_{i}^{n})_{i\geq 1} be the eigenvalues of Xn+i=1rθiuiuiX_{n}+\sum_{i=1}^{r}\theta_{i}u_{i}u_{i}^{*}. Let pp_{-} (resp. p+p_{+}) be the number of indices ii so that ρθi<a\rho_{\theta_{i}}<a (resp. ρθi>b\rho_{\theta_{i}}>b). We assume that Assumptions 1.2 and 4.2, Hypothesis 1.1, and (15) and (16) in Hypotheses 4.1 [p,α,a][p,\alpha,a] and [q,α,b][q,\alpha,b] hold for some α(0,1)\alpha\in(0,1) and integers p,qp,q. Then, for all α>α\alpha^{\prime}>\alpha, for all fixed 1ip(p+r)1\leq i\leq p-(p_{-}+r) and 0j<p(p++r)0\leq j<p-(p_{+}+r),

both vanish in probability as nn goes to infinity.

Note that if p(p+r)0p-(p_{-}+r)\leq 0 (resp. if p(p++r)<0p-(p_{+}+r)<0), then the statement of the theorem is empty as far as ii’s (resp. jj’s) are concerned. The same convention is made throughout the proof.

2. Proofs

Let us first prove Theorem 4.3. Let us choose i0Iai_{0}\in I_{a} and study the behaviour of λ~i0n\widetilde{\lambda}_{i_{0}}^{n} (the case of the largest eigenvalues can be treated similarly). We assume throughout the section that Hypotheses 1.1, 4.1 [r,α,a][r,\alpha,a] and Assumptions 1.2 and 4.2 are satisfied. We also fix α>α.\alpha^{\prime}>\alpha.

We know, by Lemma 6.1, that the eigenvalues of Xn~\widetilde{X_{n}} which are not eigenvalues of XnX_{n} are the zz’s such that

Recall that by Weyl’s interlacing inequalities (see [1, Th. A.7])

Let ζ\zeta be a fixed constant such that max1ipρθi<ζ<a\max_{1\leq i\leq p_{-}}\rho_{\theta_{i}}<\zeta<a. By Theorem 2.1, we know that

With overwhelming probability, λ~i0n>ζ\widetilde{\lambda}^{n}_{i_{0}}>\zeta.

We want to show that (18) is not possible on

The following lemma deals with the asymptotic behaviour of the off-diagonal terms of the matrix Mn(z)M_{n}(z) of (19).

For sts\neq t and κ>0\kappa>0 small enough,

The following lemma deals with the asymptotic behaviour of the diagonal terms of the matrix Mn(z)M_{n}(z) of (19).

For all s=1,,rs=1,\ldots,r, for all δ>0\delta>0, any δ>0\delta>0,

Let us assume these lemmas proven for a while and complete the proof of Theorem 4.3. By these two lemmas, for zΩnz\in\Omega_{n}, we find by expanding the determinant that with overwhelming probability,

where the O(nκ)O(n^{-\kappa}) is uniform on zΩnz\in\Omega_{n}. Indeed, in the second term of the right hand side of

each diagonal term is bounded and each non diagonal term is O(nκ)O(n^{-\kappa}).

Since for all ii, θiθ\theta_{i}\neq\underline{\theta}, (22) and Lemma 4.8 allow to assert that with overwhelming probability, for all zΩnz\in\Omega_{n}, det(Mn(z))0\det(M_{n}(z))\neq 0. It completes the proof of the theorem. ∎

Proof of Lemma 4.7. Let us consider zΩnz\in\Omega_{n} (zz might depend on nn, but for notational brevity, we omit to denote it by znz_{n}). We treat simultaneously the orthonormalised model and the i.i.d. model (in the i.i.d. model, one just takes Wn=IW^{n}=I and replaces (Gn(Wn)T)s2\|(G^{n}(W^{n})^{T})_{s}\|_{2} by n\sqrt{n} in the proof below). Observe that if we write Xn=ODnOX_{n}=O^{*}D_{n}O with Dn=(λ1n,,λnn)D_{n}=(\lambda_{1}^{n},\ldots,\lambda_{n}^{n}) and OO a unitary or orthogonal matrix,

The first step is to show that for any ϵ>0{\epsilon}>0, with overwhelming probability,

Indeed, with OlO_{l} the llth row vector of OO and using the notations of Section 6.2,

But gOl,gsng\mapsto\langle O_{l},g^{n}_{s}\rangle is Lipschitz for the Euclidean norm with constant one. Hence, by concentration inequality due to the log-Sobolev hypothesis (see e.g. [1, section 4.4]), there exists c>0c>0 such that for all δ>0\delta>0,

From Proposition 6.3, we know that with overwhelming probability, (Gn(Wn)T)s2\|(G^{n}(W^{n})^{T})_{s}\|_{2} is bounded below by nnϵ\sqrt{n}n^{-{\epsilon}} and the entries of WnW^{n} are of order one. This gives therefore (23).

Note that as (Ousn)l,1lmn|(Ou_{s}^{n})_{l}|,1\leq l\leq m_{n}, are smaller than n12+ϵn^{-\frac{1}{2}+{\epsilon}^{\prime}} by (23), for any ϵ>0{\epsilon}^{\prime}>0, with overwhelming probability, we have, uniformly on zΩnz\in\Omega_{n},

We choose 0<ϵ(αα)/40<{\epsilon}^{\prime}\leq(\alpha^{\prime}-\alpha)/4 and now study Bn(z)B_{n}(z) which can be written

with PP the orthogonal projection onto the linear span of the eigenvectors of XnX_{n} corresponding to the eigenvalues λmn+1n,,λnn\lambda_{m_{n}+1}^{n},\ldots,\lambda_{n}^{n}. By the second point in Proposition 6.2, with zΩnz\in\Omega_{n}, for all sts\neq t,

Moreover, by Hypothesis 4.1, for nn large enough, for all zΩnz\in\Omega_{n},

We deduce that there is C,η>0C,\eta>0 such that for all zΩnz\in\Omega_{n},

A similar control is verified for s=ts=t since we have, by Proposition 6.2,

whereas Hypothesis 4.1 insures that the term 1nTr(P(zXn)1)\frac{1}{n}{\rm Tr}(P(z-X_{n})^{-1}) is bounded uniformly on Ωn\Omega_{n}. Thus, up to a change of the constants CC and η\eta, there is a constant MM such that for all zΩnz\in\Omega_{n},

Therefore, with Proposition 6.3 and developing the vectors usnu_{s}^{n}’s as the normalised column vectors of Gn(Wn)TG^{n}(W^{n})^{T}, we conclude that, up to a change of the constants CC and η\eta, for all zΩnz\in\Omega_{n},

Hence, we have proved that there exists κ>0,C\kappa>0,C and η>0\eta>0 so that for all zΩnz\in\Omega_{n},

We finally obtain this control uniformly on zΩnz\in\Omega_{n} by noticing that zGs,tn(z)z{\rightarrow}G^{n}_{s,t}(z) is Lipschitz on Ωn\Omega_{n}, with constant bounded by (minzλi)2n22α(\min|z-\lambda_{i}|)^{-2}\leq n^{2-2\alpha^{\prime}}. Thus, if we take a grid (zkn)0kcn2(z_{k}^{n})_{0\leq k\leq cn^{2}} of Ωn\Omega_{n} with mesh n2+2ακ\leq n^{-2+2\alpha^{\prime}-\kappa} (there are about n2n^{2} such zknz_{k}^{n}’s) we have

Since there are at most cn2cn^{2} such kk and n2n^{2} possible i,ji,j, we conclude that

Proof of Lemma 4.8. We shall use the decomposition

with PP as above the orthogonal projection onto the linear span of the eigenvectors of XnX_{n} corresponding to the eigenvalues λmn+1n,,λnn\lambda_{m_{n}+1}^{n},\ldots,\lambda_{n}^{n}, and then prove that for zΩnz\in\Omega_{n},

Let us now give a formal proof. Again, we first prove the estimate for a fixed zΩnz\in\Omega_{n}, the uniform estimate on zz being obtained by a grid argument as in the previous proof (a key point being that the constants CC and η\eta of the definition of overwhelming probability are independent of the choice of zΩnz\in\Omega_{n}).

First, observe that (15) implies that for any sequence εn\varepsilon_{n} tending to zero,

Indeed, for all ϵ>0{\epsilon}>0, for nn such that λnp\lambda_{n}^{p} and aεna-\varepsilon_{n} are both aϵ\geq a-{\epsilon}, we have, for all z[aεn,λpn]z\in[a-\varepsilon_{n},\lambda_{p}^{n}],

So let us consider zΩnz\in\Omega_{n} (zz might depend on nn, but for notational brevity, we omit to denote it by znz_{n}). By the inequality zλkn>n1+α|z-\lambda_{k}^{n}|>n^{-1+\alpha^{\prime}} for all 1kmn1\leq k\leq m_{n} and (27), we have

with, by (24), the off diagonal terms tvt\neq v of order nη2η4/8n^{-\eta_{2}\wedge\eta_{4}/8} with overwhelming probability, whereas the diagonal terms are close to 1nTr(P(zXn)1)\frac{1}{n}{\rm Tr}(P(z-X_{n})^{-1}) with overwhelming probability by (25). Hence, we deduce with Proposition 6.2 that for any δ>0\delta>0,

with overwhelming probability. Hence, by (28), for any δ>0\delta>0

with overwhelming probability. On the other hand

By Proposition 6.3, the denominator is of order nn with overwhelming probability, whereas by Proposition 6.2, the numerator is of order mn+nϵmnm_{n}+n^{\epsilon}\sqrt{m_{n}} (since Tr(1P)=mn{\rm Tr}(1-P)=m_{n}) with overwhelming probability. As WnW^{n} is bounded by Proposition 6.3 we conclude that

with overwhelming probability. Putting together Equations (29), (30) and (31), we have proved that for any zΩnz\in\Omega_{n}, any δ>0\delta>0,

with overwhelming probability, the constants CC and η\eta of the definition of overwhelming probability being independent of the choice of zΩnz\in\Omega_{n} We do not detail the grid argument used to get a control uniform on zz because this argument is similar to what we did in the proof of the previous lemma. ∎

Proof of Theorem 4.4. In the one dimensional case, the eigenvalues of X~n\widetilde{X}_{n} which do not belong to the spectrum of XnX_{n} are the zeroes of

with εn(g)=1\varepsilon_{n}(g)=1 or g22/n\|g\|_{2}^{2}/n according to the model we are considering. A straightforward study of the function fnf_{n} tells us that the eigenvalues of X~n\widetilde{X}_{n} are distinct from those of XnX_{n} as soon as XnX_{n} has no multiple eigenvalue and

has no null entry, which we can always assume up to modify XnX_{n} and gg so slightly that the fluctuations of the eigenvalues are not affected. We do not detail these arguments but the reader can refer to Lemmas 9.3, 9.4 and 11.2 of for a full proof in the finite rank case. Therefore, (32) characterises all the eigenvalues of X~n\widetilde{X}_{n}. Moreover, by Weyl’s interlacing properties, for θ<0\theta<0,

Theorems 2.1 and 4.3 thus already settle the study of λ~1n\widetilde{\lambda}_{1}^{n} which either goes to ρθ\rho_{\theta} or is at distance O(n1+α)O(n^{-1+\alpha^{\prime}}) of λ1n\lambda_{1}^{n} depending on the strength of θ\theta. We consider α>α\alpha^{\prime}>\alpha and i{2,,p}i\in\{2,\ldots,p\} and define

Note first that if Λn\Lambda_{n} is empty, then the eigenvalue of Xn~\widetilde{X_{n}} which lies between λi1n\lambda_{i-1}^{n} and λin\lambda_{i}^{n} is within n1+αn^{-1+\alpha^{\prime}} to both λi1n\lambda_{i-1}^{n} and λin\lambda_{i}^{n}, so we have nothing to prove. Now, we want to prove that fnf_{n} does not vanish on Λn\Lambda_{n} and that according to the sign of 1θ1θ\frac{1}{\theta}-\frac{1}{\underline{\theta}}, it vanishes on one side or the other of Λn\Lambda_{n} in ]λi1n,λin[]\lambda_{i-1}^{n},\lambda_{i}^{n}[. This will prove (i) and (ii) of the theorem. Part (iii) can be proved in the same way, proving that with overwhelming probability, fnf_{n} does not vanish in ]λni1n+n1+α,λninn1+α[\left]\lambda_{n-i-1}^{n}+{n^{-1+\alpha^{\prime}}},\lambda_{n-i}^{n}-{n^{-1+\alpha^{\prime}}}\right[.

The proof of this fact will follow the same lines as the proof of Lemma 4.8 and we recall that PP was defined above as the orthogonal projection onto the linear span of the eigenvectors of XnX_{n} corresponding to the eigenvalues λmn+1n,,λnn\lambda_{m_{n}+1}^{n},\ldots,\lambda_{n}^{n}. Then, exactly as for (30), we can show that for all δ>0\delta>0,

with overwhelming probability. Moreover, for any zΛnz\in\Lambda_{n}, for any j=1,,mnj=1,\ldots,m_{n}, we have

By Proposition 6.2, we deduce that for any ϵ>0{\epsilon}>0,

with overwhelming probability. We choose ϵ{\epsilon} in such a way that the latter right hand side goes to zero. Therefore, we know that uniformly on Λn\Lambda_{n},

with overwhelming probability. Since for all nn, fnf_{n} is decreasing, going to ++\infty (resp. -\infty) as zz goes to any λi1n\lambda_{i-1}^{n} on the right (resp. λin\lambda_{i}^{n} on the left), it follows that according to the sign of 1θ1θ\frac{1}{\underline{\theta}}-\frac{1}{\theta}, the zero of fnf_{n} in ]λi1n,λin[]\lambda_{i-1}^{n},\lambda_{i}^{n}[ is either in ]λi1n,λi1n+n1+α[]\lambda_{i-1}^{n},\lambda_{i-1}^{n}+{n^{-1+\alpha^{\prime}}}[ or in ]λinn1+α,λin[]\lambda_{i}^{n}-{n^{-1+\alpha^{\prime}}},\lambda_{i}^{n}[.∎

Let us also choose ζa<a\zeta_{a}<a and ζb>b\zeta_{b}>b such that

Application to classical models of matrices

Let (Xn)(X_{n}) be a sequence of random matrices independent of the uinu_{i}^{n}’s. Under Assumption 1.2,

If Hypothesis 1.1 holds in probability, Theorem 2.1 holds.

If κ4(ν)=0\kappa_{4}(\nu)=0 and Hypotheses 1.1 and 3.1 hold in probability, Theorem 3.2 holds. If κ4(ν)0\kappa_{4}(\nu)\neq 0 and Hypotheses 1.1 and 3.3 hold in probability, Theorem 3.4 holds.

Under Assumption 4.2, if Hypotheses 1.1 and 4.1 hold in probability, Theorem 4.3 holds “with probability converging to one” instead of “with overwhelming probability”; Theorems 4.4 and Corollary 4.5 hold.

The remaining of this section is devoted to showing that such results hold if XnX_{n}, independent of (uin)1ir(u_{i}^{n})_{1\leq i\leq r}, is a Wigner or a Wishart matrix or a random matrix which law has density proportional to eTrVe^{-\operatorname{Tr}V} for a certain potential VV. In each case, we have to check that the hypotheses hold in probability.

Let (xi,j)i,j1(x_{i,j})_{i,j\geq 1} be an infinite Hermitian random matrix which entries are independent up to the condition xj,i=xi,jx_{j,i}=\overline{x_{i,j}} such that the xi,ix_{i,i}’s are distributed according to μ2\mu_{2} and the xi,jx_{i,j}’s (iji\neq j) are distributed according to μ1\mu_{1}. We take Xn=1n[xi,j]i,j=1n,X_{n}=\frac{1}{\sqrt{n}}\left[{x_{i,j}}\right]_{i,j=1}^{n}, which is said to be a Wigner matrix. For certain results, we will also need an additional hypothesis, which we present here:

The probability measures μ1\mu_{1} and μ2\mu_{2} have a sub-exponential decay, that is there exists positive constants C,CC,C^{\prime} such that if XX is distributed according to μ1\mu_{1} or μ2,\mu_{2}, for all tCt\geq C^{\prime},

Moreover, μ1\mu_{1} and μ2\mu_{2} are symmetric.

The following Proposition generalises some results of which study the effect of a finite rank perturbation on a non-Gaussian Wigner matrix. In particular, it includes the study of the eigenvalues which stick to the bulk.

Let XnX_{n} be a Wigner matrix. Assume that Assumption 1.2 holds. The limits of the extreme eigenvalues of Xn~\widetilde{X_{n}} are given by Theorem 2.1 and the fluctuations of the ones which limits are out of [2σ,2σ][-2\sigma,2\sigma] are given by Theorem 3.2, where the parameters a,b,ρθ,cαa,b,\rho_{\theta},c_{\alpha} are given by the following formulas : b=a=2σb=-a=2\sigma,

Assume moreover that, for all i,i, θi∉{σ,σ}\theta_{i}\not\in\{-\sigma,\sigma\} and Hypothesis 5.2 holds. If the perturbation has rank one, we have the following precise description of the fluctuations of the sticking eigenvalues :

If θ>σ\theta>\sigma (resp. θ<σ\theta<-\sigma), for all p2p\geq 2, n2/3(λ~np+1n2σ)n^{2/3}(\widetilde{\lambda}_{n-p+1}^{n}-2\sigma) (resp. n2/3(λ~pn+2σ)n^{2/3}(\widetilde{\lambda}_{p}^{n}+2\sigma)) converges in law to the p1p-1th Tracy Widom law.

If 0θ<σ0\leq\theta<\sigma (resp. σ<θ0-\sigma<\theta\leq 0), for all p1,p\geq 1, n2/3(λ~np+1n2σ)n^{2/3}(\widetilde{\lambda}_{n-p+1}^{n}-2\sigma) (resp. n2/3(λ~pn+2σ)n^{2/3}(\widetilde{\lambda}_{p}^{n}+2\sigma)) converges in law to the ppth Tracy Widom law.

If the perturbation is rank more than one and Assumption 4.2 holds, the extreme eigenvalues of Xn~\widetilde{X_{n}} are at distance less than n1+ϵn^{-1+{\epsilon}} for any ϵ>0{\epsilon}>0 to the extreme eigenvalues of Xn,X_{n}, which have Tracy-Widom fluctuations. We can localize exactly near which eigenvalue of XnX_{n} they lie by using Theorem 4.5 in the i.i.d model.

According to Theorem 5.1, it suffices to verify that the hypotheses hold in probability for (Xn)n1(X_{n})_{n\geq 1}. We study separately the eigenvalues which stick to the bulk and those which deviate from the bulk.

If XnX_{n} is a Wigner matrix (that is, with our terminology, with entries having a finite fourth moment), the fact that XnX_{n} satisfies Hypothesis 1.1 in probability is a well known result (see for example [4, Th. 5.2]) for μX\mu_{X} the semicircle law with support [2σ,2σ][-2\sigma,2\sigma]. The formulas for ρθ\rho_{\theta} and cαc_{\alpha} can be checked with the well known formula [1, Sect. 2.4]:

Moreover, [5, Th. 1.1] shows that Tr(f(Xn))nf(x)dσ(x){\rm Tr}(f(X_{n}))-n\int f(x)d\sigma(x) converges in law to a Gaussian distribution for any function ff which is analytic in a neighborhood of [2σ,2σ][-2\sigma,2\sigma]. For any fixed z[2σ,2σ]z\notin[-2\sigma,2\sigma], applied for f(t)=1ztf(t)=\frac{1}{z-t}, we get that n(Gμn(z)GμX(z))n(G_{\mu_{n}}(z)-G_{\mu_{X}}(z)) converges in law to a Gaussian distribution, hence n(Gμn(z)GμX(z))\sqrt{n}(G_{\mu_{n}}(z)-G_{\mu_{X}}(z)) converges in probability to zero, so that Hypothesis 3.1 holds in probability.

We now assume moreover that the laws of the entries satisfy Hypothesis 5.2. In order to lighten the notation, we shall now suppose that σ=1\sigma=1. Let us first recall that by , the extreme eigenvalues of the non-perturbed matrix XnX_{n}, once re-centred and renormalised by n2/3n^{2/3}, converge to the Tracy-Widom law (which depends on whether the entries are complex or real). We need to verify that Hypothesis 4.1[p,α\alpha] for any finite pp and an α<1/3\alpha<1/3 is fulfilled in probability. By , the spacing between the two smallest eigenvalues of XnX_{n} is of order greater than nγn^{-\gamma} for γ>2/3\gamma>2/3 with probability going to one and therefore, by the inequality

it is sufficient to prove the first point of Hypothesis 4.1[p,α\alpha]. We shall prove it by replacing first the smallest eigenvalue by the edge 2-2 thanks to a lemma that Benjamin Schlein kindly communicated to us. We will then prove that the sum of the inverse of the distance of the eigenvalues to the edge indeed converges to the announced limit, thanks to both Soshnikov paper (for sub-Gaussian tails) or (for finite moments), and Tao and Vu article .

Suppose the entries of XnX_{n} have a uniform sub-exponential tail. Then for all δ>0\delta>0, for all integer number pp,

Now, for any K2>K1K_{2}>K_{1}, on the event {λpn+2<K1n2/3}\{|\lambda_{p}^{n}+2|<K_{1}n^{-2/3}\}, for any κ>0\kappa>0, we have

Let us fix κ(23,1)\kappa\in(\frac{2}{3},1). It follows that the first term of the r.h.s. of (39) can be estimated by

Let us now estimate the second term of the r.h.s. of (39). For any positive integer K3K_{3}, we have

From (38), (39), (41) and (42), we conclude that

for arbitrary 0<K1<K30<K_{1}<K_{3} and K31K_{3}\geq 1. Taking the limit nn\to\infty, the last two terms disappear, because by [42, Th. 1.16], the distribution of the smallest K3K_{3} eigenvalues lives on scales of order n2/3n5/6n^{-2/3}\gg n^{-5/6}. Therefore,

still for any 0<K1<K30<K_{1}<K_{3} and K31K_{3}\geq 1. Now, note that for K1K_{1} large enough, the first term can be made as small as we want. Then, keeping K1K_{1} fixed, K2K_{2} can be chosen in such a way to make the second term as small as we want too. At last, keeping K2K_{2} fixed, one can choose K3K_{3} large enough to make the third term as small as we want (as can be computed since the limit is given by the K3K_{3} correlation function of the Airy kernel). \square

To complete the proof of Hypothesis 4.1, we therefore need to show that

Assume that the entries of XnX_{n} satisfy Hypothesis 5.2. Then, for any δ>0\delta>0, any finite integer number pp,

Proof. Notice that by we know that the pp smallest eigenvalues of XnX_{n} converge in law towards the Tracy-Widom law, so that

Thus, for any finite pp, with large probability,

and therefore it is enough to prove the lemma for any particular pp. As in the previous proof, we choose pp large enough so that λpn2+n23\lambda_{p}^{n}\geq-2+n^{-\frac{2}{3}} with probability greater than 1δ(p)1-\delta(p) with δ(p)\delta(p) going to zero as pp goes to infinity. We shall prove that with high probability

This is enough to prove the statement as for any γ>0\gamma>0, 2+λ[nγ]n2+\lambda^{n}_{[n\gamma]} converges to δ(γ)>0\delta(\gamma)>0 so that μX([δ(γ),2])=1γ\mu_{X}([{\delta(\gamma)},2])=1-\gamma, see [43, Theorem 1.3],

which converges as γ\gamma goes to zero to (2+x)1dμX(x)=1\int(2+x)^{-1}d\mu_{X}(x)=1 (by e.g. (37)). To prove (43), we choose ρ(2/3,2/3)\rho\in(2/3,\sqrt{2/3}) and write, on the event λjn+2λpn+2n23nρ\lambda_{j}^{n}+2\geq\lambda_{p}^{n}+2\geq n^{-\frac{2}{3}}\geq n^{-\rho} for jpj\geq p,

For the first term, we use Sinai-Soshnikov bound, which under the weakest hypothesis are given in [39, Theorem 2.1]. It implies that with probability going to one with MM going to infinity, for sn=o(n2/3)s_{n}=o(n^{2/3}) going to infinity,

This implies, by Tchebychev’s inequality and taking sn=n+ρk+1s_{n}=n^{+\rho^{k+1}} that

which goes to zero as ρ>2/3\rho>2/3. For the second term BnB_{n}, note that by [42, Theorem 1.10], for any ϵ>0{\epsilon}>0 small enough,

which goes to zero as nn goes to infinity and then γ\gamma goes to zero. ∎

2. Coulomb Gases

We can also consider random matrices XnX_{n} which law is invariant under the action of the unitary or the orthogonal group and with eigenvalues with law given by

with a polynomial function VV of even degree and positive leading coefficient and β=1,2\beta=1,2 or 44. We assume moreover that VV is such that the limiting spectral measure μV\mu_{V} of (Xn)(X_{n}) is connected and compact and that its smallest and largest eigenvalues converge to the boundaries of the support. This set of hypotheses is often referred to as the “one-cut assumption”. It holds in particular if VV is strictly convex and this includes the classical Gaussian ensembles GOE and GUE (with V(x)=x2/4V(x)=x^{2}/4 and β=1,2\beta=1,2).

Under the above hypothesis on V,V, the extreme eigenvalues of XnX_{n} converge to the boundary of the support. The convergence of the extreme eigenvalues of Xn~\widetilde{X_{n}} is given by Theorem 2.1. These eigenvalues have Gaussian fluctuations as stated in Theorem 3.2 if they deviate away from the bulk. Suppose moreover that Assumption 4.2 holds. If the perturbation is of rank one and is strong enough so that the largest eigenvalues deviates from the bulk, for all k2,k\geq 2, the rescaled kkth largest eigenvalue n23(λ~nk+1nbV)n^{\frac{2}{3}}(\widetilde{\lambda}_{n-k+1}^{n}-b_{V}) converges weakly towards the k1k-1-th Tracy Widom law. If the perturbation is of rank one and is weak enough, for all k1,k\geq 1, the rescaled kkth largest eigenvalue n23(λ~nk+1nbV)n^{\frac{2}{3}}(\widetilde{\lambda}_{n-k+1}^{n}-b_{V}) converges weakly towards the kk-th Tracy Widom law. If the perturbation is of rank more than one, the extreme eigenvalues of Xn~\widetilde{X_{n}} sticking to the bulk are at distance less than n1+ϵn^{-1+{\epsilon}} for any ϵ>0{\epsilon}>0 from the eigenvalues of Xn.X_{n}. In the i.i.d model, Theorem 4.5 prescribes exactly in the neighborhood of which eigenvalues of XnX_{n} each of them lie.

Proof. As explained above, it suffices to verify that the hypotheses hold in probability for (Xn)n1(X_{n})_{n\geq 1}.

Note that the convergence of the spectral measure, of the edges and the fluctuations of the extreme eigenvalues were obtained in . The fact that n(Gμn(z)Gsc(z))\sqrt{n}(G_{\mu_{n}}(z)-G_{\operatorname{sc}}(z)) converges in probability to zero is a consequence of so that Hypothesis 3.1 holds.

We next check Hypothesis 4.1[p,α\alpha] for the matrix model Pn.P_{n}. We shall prove it for any α>1/3\alpha>1/3 and any integer pp. We first show that

Indeed, the joint distribution of (λ1n,,λnn)(\lambda_{1}^{n},\ldots,\lambda_{n}^{n}) is

with β=1,2\beta=1,2 or 44, ZnβZ_{n}^{\beta} is the normalising constant and Δn={λ1<<λn}\Delta_{n}=\{\lambda_{1}<\cdots<\lambda_{n}\}. Therefore,

by integration by parts. Equation (45) follows, since λpn\lambda^{n}_{p} converges almost surely to aVa_{V} (and concentration inequalities insures V(λpn)V^{\prime}(\lambda_{p}^{n}) is uniformly integrable). But, for any ϵ>0{\epsilon}>0,

with, by convergence of the spectral measure and of λpn\lambda^{n}_{p}, the right hand side converging to GμX(aVϵ)-G_{\mu_{X}}(-a_{V}-{\epsilon}) which converges as ϵ{\epsilon} decreases to zero to GμX(aV)=V(aV)-G_{\mu_{X}}(-a_{V})=-V^{\prime}(a_{V}). Hence, 1nip1λinλpn\frac{1}{n}\sum_{i\neq p}\frac{1}{\lambda^{n}_{i}-\lambda^{n}_{p}} is bounded below by V(aV)-V^{\prime}(a_{V}) with large probability for large nn, and converges in expectation to V(aV)-V^{\prime}(a_{V}), and therefore converges in probability to V(aV)-V^{\prime}(a_{V}).

Moreover, by (see in the Gaussian case), the joint law of

converges weakly towards a probability measure which is absolutely continuous with respect to Lebesgue measure. As a consequence, we also deduce from the first point that n1i<mn(λpnλin)1n^{-1}\sum_{i<m_{n}}(\lambda_{p}^{n}-\lambda_{i}^{n})^{-1} vanishes as nn goes to infinity in probability for mnn1/3m_{n}\ll n^{1/3} and therefore (45) proves the lacking point of Hypothesis 4.1.

so that by (45) and Markov’s inequality, Hypothesis 4.1 holds in probability for any η<1/3\eta<1/3, η4<1\eta_{4}<1 and α>1/3\alpha>1/3. \square

3. Wishart matrices

Let n,mn,m tend to infinity in such a way that n/mc(0,1)n/m\to c\in(0,1). The limits of the extreme eigenvalues of Xn~\widetilde{X_{n}} are given by Theorem 2.1 and the fluctuations of those which limits are out of [a,b][a,b] are given by Theorem 3.2, where the parameters a,b,ρθ,cαa,b,\rho_{\theta},c_{\alpha} are given by the following formulas: a=(1c)2a=(1-\sqrt{c})^{2}, b=(1+c)2b=(1+\sqrt{c})^{2}

Assume now that the law of the entries satisfy Hypothesis 5.2. If the perturbation has rank one, we have the following precise description of the fluctuations of the extreme eigenvalues of Xn~\widetilde{X_{n}} :

If θ>c+c\theta>c+\sqrt{c} (resp. θ<cc\theta<c-\sqrt{c}), for all p2p\geq 2, n2/3(λ~np+1n2σ)n^{2/3}(\widetilde{\lambda}_{n-p+1}^{n}-2\sigma) (resp. n2/3(λ~pn2σ)n^{2/3}(\widetilde{\lambda}_{p}^{n}-2\sigma)) converges in law to the p1p-1th Tracy Widom law.

If 0θ<c+c0\leq\theta<c+\sqrt{c} (resp. cc<θ0c-\sqrt{c}<\theta\leq 0), for all p1,p\geq 1, n2/3(λ~np+1n2σ)n^{2/3}(\widetilde{\lambda}_{n-p+1}^{n}-2\sigma) (resp. n2/3(λ~pn2σ)n^{2/3}(\widetilde{\lambda}_{p}^{n}-2\sigma)) converges in law to the ppth Tracy Widom law.

If the perturbation has rank more than one and for all i,i, θi{c+c,cc}\theta_{i}\notin\{c+\sqrt{c},c-\sqrt{c}\}, the extreme eigenvalues of Xn~\widetilde{X_{n}} are at distance less than n1+ϵn^{-1+{\epsilon}} for any ϵ>0{\epsilon}>0 to the extreme eigenvalues of Xn,X_{n}, which have Tracy-Widom fluctuations.

Before getting into the proof, let us make a remark. The Proposition above generalizes some results first appeared in . In these papers, the authors consider models with multiplicative perturbations (in the sense that the population covariance Σ\Sigma matrix is assumed to be a perturbation of the identity). Here, we consider additive perturbations but the two models are in fact similar, since a Wishart matrix can be written as a sum of rank one matrices i=1mσiYiYi,\sum_{i=1}^{m}\sigma_{i}Y_{i}Y_{i}^{*}, with σi\sigma_{i} the eigenvalues of Σ\Sigma and YiY_{i} nn-dimensional vectors with i.i.d. entries. So, adding our perturbation i=1rθiUiUi\sum_{i=1}^{r}\theta_{i}U_{i}U_{i}^{*} boils down to change mm into m+rm+r (the limit of m/nm/n is not changed) and to extend Σ\Sigma with some new eigenvalues θ1,,θr\theta_{1},\ldots,\theta_{r}.

Proof. Again, it suffices to verify that the hypotheses hold in probability for (Xn)n1(X_{n})_{n\geq 1}.

It is known, , that the spectral measure of XnX_{n} converges to the so-called Marčenko-Pastur distribution

where a=(1c)2a=(1-\sqrt{c})^{2} and b=(1+c)2b=(1+\sqrt{c})^{2}. It is known, [4, Th. 5.11], that the extreme eigenvalues converge to the bounds of this support. The formula

allows to compute ρθ\rho_{\theta} and cαc_{\alpha}. Moreover, by [3, Th. 1.1] or [4, Th. 9.10], we also know that a central limit theorem holds for the linear statistics of Wishart matrices, giving Hypothesis 3.1 as in the Wigner case.

For Hypothesis 4.1, the proof is similar to the Wigner case. The convergence to the Tracy-Widom law of the non-perturbed matrix is due to S. Péché (see and for the Gaussian case). The approximation of the eigenvalues by the quantiles of the limiting law can be found in [17, Theorem 9.1] whereas the absolute continuity property needed to prove Lemma 5.5 is derived in [17, Lemma 8.1]. This allows to prove Hypothesis 4.1 in this setting as in the Wigner case, we omit the details. \square

4. Non-white ensembles

In the case of non-white matrices, we can only study the fluctuations away from the bulk (since we do not have the appropriate information about the top eigenvalues to prove Hypothesis 4.1). We illustrate this generalisation in a few cases, but it is rather clear that Theorem 3.2 applies in a much wider generality.

4.2. Non-white Wigner matrices

There are less results in the literature about the central limit theorem for band matrices (with centring with respect to the limit) and the convergence of the spectrum. We therefore concentrate on a special case, namely a Hermitian matrix XnX_{n} with independent Gaussian centred entries so that E[Xij2]=n1σ(i/n,j/n)E[|X_{ij}|^{2}]=n^{-1}\sigma(i/n,j/n) with a stepwise constant function

which entails the convergence of the spectrum of XnX_{n} towards the support of the limiting measure [31, Proposition 11] with exponential speed by [31, Proof of Lemma 14]. Thus XnX_{n} satisfies Hypothesis 1.1. Hypothesis 3.1 can be checked by modifying slightly the proof of (46) which is based on an integration by parts to be able to take zz on the real line but away from the limiting support. Indeed, as in [23, Section 3.3], we can add a smooth cut-off function in the expectation which vanishes outside of the event AnA_{n} that XnX_{n} has all its eigenvalues within a small neighborhood of the limiting support. This additional cut-off will only give a small error in the integration by parts due to the previous point. Then, (46), but with an expectation restricted to this event, is proved exactly in the same way, except that z\Im z can be replaced by the distance of zz to the neighborhood of the limiting support where the eigenvalues of XnX_{n} lives. Finally, concentration inequalities, in the local version [22, Lemma 5.9 and Part II], insure that on AnA_{n},

is at most of order n1+ϵn^{-1+{\epsilon}} with overwhelming probability. This completes the proof of Hypothesis 3.1.

5. Some models for which our hypothesis are not satisfied

We gather hereafter a few remarks about some models for which the hypothesis we made on XnX_{n} are not satisfied. For sake of simplicity, we present hereafter only the case of i.i.d. perturbations (1).

We assume that XnX_{n} is diagonal with i.i.d. entries which law μ\mu is compactly supported. As in the core of the paper, we denote by a (resp. b) the left (resp. right) edge of the support of μ.\mu. We also denote by FμF_{\mu} its cumulative distribution function and assume that there is κ>0\kappa>0 such that for all c>0,c>0,

In this situation, it is easy to check that Hypothesis 1.1 holds in probability with μX=μ\mu_{X}=\mu. But Hypothesis 3.1 is not satisfied. Indeed, by classical CLT, we have, for ρα[a,b],\rho_{\alpha}\notin[a,b],

converges in law, as nn goes to infinity to a Gaussian variable WαW_{\alpha} with variance Gμ(ρα)Gμ(ρα)2.-G^{\prime}_{\mu}(\rho_{\alpha})-G_{\mu}(\rho_{\alpha})^{2}. Moreover,

Nevertheless, Theorem 3.2 holds for this model. Indeed, the whole proof of this theorem goes through in this context, except the proof of Lemma 3.5, where we have to make the following decomposition Ms,tn(i,x)=Ms,tn,1(i,x)+Ms,tn,2(i,x)+Ms,tn,3(i,x)M_{s,t}^{n}(i,x)=M_{s,t}^{n,1}(i,x)+M_{s,t}^{n,2}(i,x)+M_{s,t}^{n,3}(i,x) with the difference that this time Ms,tn,3M_{s,t}^{n,3} does not go to zero but converges towards WαiW_{\alpha_{i}}. Hence, the eigenvalues fluctuate according to the distribution of the eigenvalues of (cjMj+WαjIkj)1jq(c_{j}M_{j}+W_{\alpha_{j}}I_{k_{j}})_{1\leq j\leq q}, with cjc_{j} and MjM_{j} as in the statement of Theorem 3.2 and IkjI_{k_{j}} denotes the kj×kjk_{j}\times k_{j} identity matrix.

5.2. Coulomb gases with non-convex potentials

In , Pastur showed that for a Coulomb gas law (44) with a potential VV so that the equilibrium measure has a disconnected support, the central limit theorem does not hold in the sense that the variance may have different limits according to subsequences (see [35, (3.4)]. Moreover the asymptotics of n(Tr(Xn)μ(x))\sqrt{n}({\rm Tr}(X_{n})-\mu(x)) can be computed sometimes and do not lead to a Gaussian limit. We might expect then that also n(Gμn(x)Gμ(x))\sqrt{n}(G_{\mu_{n}}(x)-G_{\mu}(x)) converges to a non-Gaussian limit, which would then result with non-Gaussian fluctuations for the eigenvalues outside of the bulk.

Appendix

We here state formula (3), which can be deduced from the well known formula \det\left(\begin{array}[]{cc}A&B\cr C&D\cr\end{array}\right)=\det(D)\det(A-BD^{-1}C).

2. Concentration estimates

Under Assumption 1.2, there exists a constant c>0c>0 so that for any matrix A:=(ajk)1j,knA:=(a_{jk})_{1\leq j,k\leq n} with complex entries, for any δ>0\delta>0, for any g=(g1,,gn)Tg=(g_{1},\ldots,g_{n})^{T} with i.i.d. entries (gi)1in(g_{i})_{1\leq i\leq n} with law ν\nu,

On the other hand, the previous estimate shows that

As a consequence, we deduce the second point of the proposition. \square

Let Gn=[g1ngrn]{G}^{n}=\begin{bmatrix}g_{1}^{n}\cdots g_{r}^{n}\end{bmatrix} be an n×rn\times r matrix which columns g1n,,grng^{n}_{1},\ldots,g^{n}_{r}, are independent copies of an n×1n\times 1 matrix with i.i.d. entries with law ν\nu and define

and, for ji1j\leq i-1, if det[Vk,ln]k,l=1i10\det[V^{n}_{k,l}]_{k,l=1}^{i-1}\neq 0,

On det[Vk,ln]k,l=1i1=0\det[V^{n}_{k,l}]_{k,l=1}^{i-1}=0, we give to Wi,jnW_{i,j}^{n} an arbitrary value, say one. Putting Wiin=1W^{n}_{ii}=1 and Wijn=0W^{n}_{ij}=0 for ji+1j\geq i+1, it is a standard linear algebra exercise to check that the column vectors

For any γ>0\gamma>0, there exists finite positive constants c,Cc,C (depending on rr) so that for Zn=VnZ^{n}=V^{n} or WnW^{n},

Moreover, with v22=i=1nvi2\|v||_{2}^{2}=\sum_{i=1}^{n}|v_{i}|^{2}, for any γ(0,n(2rϵ)\gamma\in(0,\sqrt{n}(2^{-r}-\epsilon) for some ϵ>0,{\epsilon}>0,

Proof. We first consider the case Zn=VnZ^{n}=V^{n}. The maximum of Vijnδij|V_{ij}^{n}-\delta_{ij}| is controlled by the previous proposition with A=n1IA=n^{-1}I, and the result follows from TrAA=n1{\rm Tr}AA^{*}=n^{-1} and Tr((AA)2)=n3{\rm Tr}((AA^{*})^{2})=n^{-3}, and choosing δ=γ/2\delta=\gamma/\sqrt{2}, κ=n\kappa=\sqrt{n}. The result for WnW^{n} follows as on VnIγn121\|V^{n}-I\|_{\infty}\leq\gamma n^{-\frac{1}{2}}\leq 1

For the last point, we just notice that since 1nj=1rZi,jngjn22=(ZVZ)i,i\frac{1}{n}\|\sum_{j=1}^{r}Z_{i,j}^{n}g^{n}_{j}\|_{2}^{2}=(ZVZ^{*})_{i,i}, we have

for a finite constant C(r)C(r) which only depends on rr. Thus the result follows from the previous point. \square

3. Central Limit Theorem for quadratic forms

Let us fix r1r\geq 1 and let, for each nn, An(s,t)A^{n}(s,t) (1s,tr1\leq s,t\leq r) be a family of n×nn\times n real (resp. complex) matrices such that for all s,ts,t, An(t,s)=An(s,t)A^{n}(t,s)=A^{n}(s,t)^{*} and such that for all s,t=1,,rs,t=1,\ldots,r,

for some finite numbers σs,t,ωs\sigma_{s,t},\omega_{s} (in the case where κ4(ν)=0\kappa_{4}(\nu)=0, the part of the hypothesis related to ωs\omega_{s} can be removed). For each nn, let us define the r×rr\times r random matrix

Then the distribution of GnG_{n} converges weakly to the distribution of a real symmetric (resp. Hermitian) random matrix G=[gs,t]s,t=1rG=[g_{s,t}]_{s,t=1}^{r} such that the random variables

are independent and for all ss, gs,sN(0,2σs,s2+κ4(ν)ωs)g_{s,s}\sim\mathcal{N}(0,2\sigma_{s,s}^{2}+\kappa_{4}(\nu)\omega_{s}) (resp. gs,sN(0,σs,s2+κ4(ν)ωs)g_{s,s}\sim\mathcal{N}(0,\sigma_{s,s}^{2}+\kappa_{4}(\nu)\omega_{s})) and for all sts\neq t, gs,tN(0,σs,t2)g_{s,t}\sim\mathcal{N}(0,\sigma_{s,t}^{2}) (resp. (gs,t),(gs,t)N(0,σs,t2/2)\Re(g_{s,t}),\Im(g_{s,t})\sim\mathcal{N}(0,\sigma_{s,t}^{2}/2)).

then it follows directly from Theorem 6.4 and from a second moment computation that each finite dimensional marginal of the process

We have to prove that for any real symmetric (resp. Hermitian) matrix B:=[bs,t]s,t=1rB:=[b_{s,t}]_{s,t=1}^{r}, the distribution of Tr(BGn)\operatorname{Tr}(BG_{n}) converges weakly to the distribution of Tr(BG)\operatorname{Tr}(BG). Note that

where CnC^{n} is the rn×rnrn\times rn matrix and UnU_{n} is the rn×1rn\times 1 random vector defined by

In the real (resp. complex) case, let us now apply Theorem 7.1 of in the case K=1K=1. It follows that the distribution of

converges weakly to a centred real Gaussian law with variance

It completes the proof in the i.i.d. model.

\bullet In the orthonormalised model, we can write usn=1i=1sWsingi2j=1sWsjngju^{n}_{s}=\frac{1}{\|\sum_{i=1}^{s}W^{n}_{si}g_{i}\|_{2}}\sum_{j=1}^{s}W^{n}_{sj}g_{j}, where the matrix WnW^{n} is the one introduced in this section. It follows that, with

by orthonormalization of the usnu_{s}^{n}’s

But, by the previous result, if iji\neq j,

converges in distribution to a Gaussian law, whereas if i=ji=j,

where both terms converge to a Gaussian. Thus this term is also bounded as nn goes to infinity.

Hence, by Proposition 6.3, we may and shall replace WnW^{n} by the identity (since the error term would be of order at most n12+ϵn^{-\frac{1}{2}+{\epsilon}}), which yields

so that we are back to the previous setting with BB instead of AA. \square

Acknowledgments: We are very grateful to B. Schlein for communicating us Lemma 5.5. We also thank G. Ben Arous and J. Baik for fruitful discussions. We also thank the referee, who pointed some vagueness in the first version of the paper.

References