On Finite Rank Deformations of Wigner Matrices
Alessandro Pizzo, David Renfrew, Alexander Soshnikov
Introduction and Formulation of Main Results
Let be a random real symmetric (Hermitian) Wigner matrix with independent entries up from the diagonal. In the real symmetric case, we assume that the off-diagonal entries
are independent random variables such that
are independent random variables (that are also independent from the off-diagonal entries), such that
In a similar fashion, in the Hermitian case, we assume that the off-diagonal entries
are independent random variables such that
are independent centered random variables, independent from the off-diagonal entries, with uniformly bounded third moment of the absolute values.
For a real symmetric (Hermitian) matrix of order its empirical distribution of the eigenvalues is defined as where are the (ordered) eigenvalues of Wigner semicircle law (see e.g. , , ) states that almost surely the empirical distribution of a random real symmetric (Hermitian) Wigner matrix converges weakly to the nonrandom limiting distribution The limiting distribution is known as the semicircle distribution. It is absolutely continuous with respect to the Lebesgue measure and has the compact support The density of the Wigner semicircle distribution is given by
converges to almost surely; here and throughout the paper, we use the notation to denote the normalized trace.
The Stieltjes transform of the semicircle law is
In this paper, we study the fluctuations of the outliers in the spectrum of finite-dimensional deformations of Wigner matrices. Starting with the pioneering work by Füredi and Komlós , there have been several results on finite rank perturbations of matrices with i.i.d. entries, in particular , , , , , , , , , . We also note several papers on the eigenvalues of sample covariance matrices of spiked population models (, , , ).
This manuscript can be viewed as a companion paper to our recent works and on the non-Gaussian fluctuation of the matrix entries of regular functions of Wigner matrices. However, no knowledge of the machinery used in and is required, and the paper can be read independently from these papers.
Here is a random real symmetric (Hermitian) Wigner matrix as defined in (1.1-1.4) ((1.5-1.7)), and is a deterministic Hermitian matrix of fixed finite rank We assume that the eigenvalues of and their multiplicities are fixed. Let
be the eigenvalues of each with fixed multiplicity . Clearly, the eigenvalue has multiplicity and
The first theorem of this section, Theorem 1.1, concerns the convergence of the extreme eigenvalues of the deformed random matrix. Let us denote We shall use the shorthand notation for Theorem 1.1 was originally proved by Capitaine, Donati-Martin, and Feral in in the case when the common marginal distribution of the matrix entries is symmetric and satisfies a Poincaré inequality.
Let be a random real symmetric (Hermitian) Wigner matrix satisfying (1.1-1.4) (respectively (1.5-1.7)). Let be the number of ’s such that and be the number of ’s such that .
For all and ,
For all and ,
In other words, the first largest eigenvalues of converge to the next largest eigenvalues converge to the th bunch of the largest eigenvalues converge to the next largest eigenvalue converges to (since it corresponds to a nonnegative eigenvalue of which is not bigger than ), etc.
If random variables satisfy a Poincaré inequality (1.12) with constant uniformly bounded from zero, the convergence holds with probability one.
Note that the Poincaré inequality tensorizes and the probability measures satisfying the Poincaré inequality have subexponential tails (, ) . In particular, if the marginal distributions of the matrix entries of satisfy the Poincaré inequality with constant then the joint distribution of also satisfies the Poincaré inequality with the same constant By a standard scaling argument, we note that if the marginal distributions of the matrix entries of satisfy the Poincaré inequality with then the marginal distributions of the matrix entries of satisfy the Poincaré inequality with constant N\*\upsilon.\
Theorem 1.1 follow from Theorem 1.2 formulated below. Theorem 1.2 is concerned with the distribution of the outliers, i.e. the eigenvalues of corresponding to Namely, we are interested in the fluctuation of the outliers around Let us consider a fixed eigenvalue of such that In general, if one does not assume some additional information about the structure of the eigenvectors of corresponding to the sequence of random vectors
Let be a random real symmetric (Hermitian) Wigner matrix defined in (1.1-1.4) (respectively (1.5-1.7)). Let so the eigenvalue of satisfies . Then the sequence of random vectors
is bounded in probability. In addition, if the marginal distributions of the matrix entries of satisfy the Poincaré inequality (1.12) with constant uniformly bounded from zero, the following holds with probability
Theorem 1.2 clearly implies parts (a) and (d) of Theorem 1.1. To see that parts (b) and (c) of Theorem 1.1 also follow, we note that for any fixed positive integer the -th largest eigenvalue of converges in probability to This is a simple consequence of the convergence of the largest eigenvalue of to and the semicircle law. Then the interlacing property and Theorem 1.2 imply the desired result.
The bound (1.15) means that there exists a sufficiently large deterministic constant such that with probability
To study the fluctuations of the outliers in more detail, we consider two special cases following .
Case A (“The eigenvectors don’t spread out”)
Case B (“The eigenvectors are delocalized”)
The norm of every orthonormal eigenvector of corresponding to goes to zero as
The next theorem is a consequence of Proposition 1.1 below and Theorems 1.1 and 1.5 in . We use a standard notation in the real symmetric case and in the Hermitian case.
Let be a random real symmetric (Hermitian) Wigner matrix defined in (1.1-1.4) (respectively (1.5-1.7)) such that the off-diagonal entries are i.i.d. real (complex) random variables with probability distribution and the diagonal entries are i.i.d. random variables with probability distribution In Case A, the -dimensional vector
converges in distribution to the distribution of the ordered eigenvalues of the random matrix defined as
(i) is a Wigner random matrix of size with the same marginal distribution of the matrix entries as
(ii) is a real symmetric (Hermitian) Gaussian matrix of size independent of with centered independent entries in the Hermitian case) with the variance of the entries given by
(iii) is a such that the (-dimensional) columns of are written from the first coordinates of the orthonormal eigenvectors corresponding to
In , Theorem 1.3 was proved for symmetric marginal distribution satisfying the Poincaré inequality (1.12) under an additional technical assumption that k=o(\sqrt{N}),\ where is defined in the paragraph above (1.16).
Using Theorems 4.1 and 4.2 from , one can extend the results of Theorem 1.3 to the case when the entries of are not identically distributed provided the distribution of the entries does not depend on
Let be a random real symmetric (Hermitian) Wigner matrix defined in (1.1-1.4) (respectively (1.5-1.7)) such that the distribution of the entries does not depend on Let us assume that the limits
Then in case A, the results of Theorem 1.3 hold with in (1.18) replaced by
Let be a random real symmetric (Hermitian) Wigner matrix defined in (1.1-1.4) (respectively (1.5-1.7)) such that the off-diagonal entries are i.i.d. random variables with probability distribution and the diagonal entries are i.i.d. random variables with probability distribution In Case B, the -dimensional vector
converges in distribution to the distribution of the (ordered) eigenvalues of a GOE (GUE) matrix with the variance of the matrix entries given by provided
We recall that has been defined above as the minimal number of canonical basis vectors required to span the eigenvectors corresponding to the eigenvalues
Theorem 1.5 is an immediate extension of the result of Capitaine, Donati-Martin, and Féral from to our setting since their arguments apply essentially unchanged as soon as Theorem 1.1 is established.
It should be noted that Benaych-Georges, Guionnet, and Maida consider in perturbations of a random Wigner matrix by a finite rank random matrix with eigenvectors that are either independent copies of a random vector with i.i.d. centered components satisfying the log-Sobolev inequality or are obtained by Gram-Schmidt orthonormalization of such independent copies. The distribution of the outliers is given in Proposition 5.3. of . Let us denote the distribution of the first component of by If the fourth cumulant of vanishes, the limiting distribution of the outliers is similar to the result of Theorem 1.5, and given by the distribution of the ordered eigenvalues of a GOE (GUE) matrix. If the fourth cumulant does not vanish, one has to add a diagonal matrix with i.i.d. real Gaussian entries to a GOE (GUE) matrix.
One of the most important results of , concerns the distribution of the “sticking” eigenvalues (i.e. the eigenvalues that correspond to In Theorem 5.3 of , Benaych-Georges, Guionnet, and Maida prove that their limiting distribution is given by the Tracy-Widom law.
Let us briefly describe a key ingredient of the proofs of Theorems 1.2-1.4. We use the notation
Let us consider a fixed eigenvalue of such that and denote by the orthonormal eigenvectors of that correspond to the eigenvalue Denote by the matrix with the entries
where we recall that \rho_{j}=\theta_{j}+\frac{\sigma^{2}}{\theta_{j}}\. The following proposition plays an important part in our proofs.
Let be the ordered eigenvalues of the matrix Then
It should be mentioned that the key part of the proof of Proposition 1.1 is a lemma from which is stated as Lemma 4.2 in Section 4. Proposition 1.1 indicates that the question of the limiting distribution of the outliers of the spectrum of the deformed Wigner matrix can be reduced to the question about the limiting distribution of the entries of (1.25).
Without additional assumptions on and the sequence
does not necessarily converge in distribution. However, one can show that it is tight.
Let be a random real symmetric (Hermitian) Wigner matrix defined in (1.1-1.4) (respectively (1.5-1.7)). Then the following statements hold:
where depends on and
(iii) If the marginal distributions of the entries of satisfy the Poincaré inequality (1.12) with a uniform constant , and is a Lipschitz continuous function on that satisfies a subexponential growth condition
for some positive constants and then
where is defined in (1.32),
and is the constant in the Poincaré inequality (1.12).
We finish this section by formulating our last theorem, Theorem 1.7, which allows us to extend Theorem 1.3 (see Remark 5.1 in Section 5). Assume that that the off-diagonal entries are i.i.d. random variables with probability distribution and the diagonal entries are i.i.d. random variables with probability distribution
converges in distribution as Without loss of generality, we will consider the real symmetric case; the Hermitian case is essentially identical. Let be an arbitrary fixed positive integer. Denote by the upper-left corner of the matrix Theorem 1.1 in states that a matrix-valued random field
with values in the space of complex symmetric matrices, converges in finite-dimensional distributions to a random field
where is the upper-left corner submatrix of a Wigner matrix is the Stieltjes transform (1.9) of the Wigner semicircle law, and
is a Gaussian random field with the covariance matrix given by the formulas (1.18)-(1.23) in the real-symmetric case and (1.50)-(1.55) in the Hermitian case in . It is important to note that are independent random processes for different indices
Let us extend the definition of to that of an infinite-dimensional matrix \Upsilon(z)_{pq},\ 1\leq p,q<\infty,\ using the formulas (1.18)-(1.23) (respectively (1.50)-(1.55)) from . Thus, the r.h.s. in (1.43) defines now the upper-left corner of the infinite matrix Then Theorem 1.1 of implies that
Let be a random real symmetric (Hermitian) Wigner matrix defined in (1.1-1.4) (respectively (1.5-1.7)) such that that the off-diagonal entries are i.i.d. random variables with probability distribution and the diagonal entries are i.i.d. random variables with probability distribution
converges weakly to the joint distribution of
We would like to thank A. Guionnet for bringing our attention to the preprints and .
Mathematical Expectation and Variance of Resolvent Sesquilinear Form
This section is devoted to the proof of the main building block Theorem 1.6, namely Proposition 2.1.
When it does not lead to ambiguity we will use the shorthand notation, , for the -th entry of the resolvent matrix
In the case when and are standard basis vectors, the mathematical expectation and the variance of have been studied in . In particular, it has been shown there in Proposition 2.1 and (3.27) that
In , Erdös, Yau, and Yin studied generalized Wigner matrices (defined at the beginning of Section 2 of ), and obtained the following estimates provided the marginal distributions have subexponential tails
where are some constants, and is sufficiently large.
It follows from our proofs that the error term on the r.h.s. of (2.2) can be replaced by O\left(\frac{\min(\|u\|_{1},\|v\|_{1})}{|\operatorname{\mathfrak{Im}}z|^{7}\*N}\right),\ where
The rest of the section is devoted to the proof of Proposition 2.1.
Without loss of generality, we can restrict our attention to the real symmetric case. The proof in the Hermitian case is very similar. We start by proving (2.2). Using we write
where is defined in (2.1), and contains the third and the fourth cumulant terms corresponding to and in the decoupling formula (6.1) for and the error terms due to the truncation of the decoupling formula (6.1) for at and for at
where by we denote the third cumulant of We note that
uniformly in and To estimate the absolute value of the first term in (2.15), we first sum with respect to and then use the Cauchy-Schwarz inequality and (6.7) to obtain
To estimate the absolute value of the second term in (2.15), we write
Finally, we bound the last of the third cumulant terms in (2.15) as
Combining the bounds (2.16-2.18), we see that the contribution of the third cumulant terms to in (2.12-2.13) is bounded from above by The fourth cumulant terms give
To estimate the absolute value of the first term in (2.19), we note that
(6.7), and the fact that the fourth cumulants of are uniformly bounded in absolute value by some constant
To estimate the second term in (2.19), we write
The other two terms in (2.19) are estimated in a similar fashion. Each of them is Therefore, the fourth cumulant terms give the contribution to in (2.12-2.13).
Finally, we estimate the error terms due to the truncation of the decoupling formula at for and at for Here, we treat the error term due to the truncation of the decoupling formula at for The second error term can be treated in a similar way. To estimate the error term, we have to consider expressions of the following form
where a,b,c,d,e,f,p,q,s\in\{i,k\},\ the supremum in (2.22) is considered over the resolvents of rank two perturbations of with Estimating each entry of by taking into account that
and using the fact that the fifth cumulants of the off-diagonal entries of are uniformly bounded, we bound (2.22) from above by O\left(\frac{1}{N\*|\operatorname{\mathfrak{Im}}z|^{5}}\right).\
Combining the estimates of the third and the fourth cumulant terms and the truncation error term, we can rewrite the Master equation (2.12) as
where we recall that by we denote a polynomial of degree with positive coefficients that do not depend on
which is exactly the estimate (2.2) of Proposition 2.1.
To prove (2.3), we note that (2.25-2.26), (2.28) and (2.6) imply
Now, we turn our attention to the proof of (2.4). The key part of the proof is the following lemma.
where contains the third and the fourth cumulant terms corresponding to and in (6.1) for , and the error due to the truncation of the decoupling formula (6.1) at for and at for Clearly,
Using (2.42) and (2.47), one can write the last term in (2.39) as
The third cumulant terms in in (2.40) can be written as
We are going to estimate the terms (2.53-2.55) separately. We start with the last two. We claim that both (2.54) and (2.55) are . Indeed, consider first (2.54). It follows from (6.4-6.5), (2.42), and (2.47), that it is equal to
Combining (2.59) and (2.58), we estimate (2.57) as The other terms in (2.56) can be estimated in a similar way, which implies that (2.54) is .
Now, we turn our attention to (2.55). Using (2.43-2.45) and (2.48), one can rewrite (2.55) as
We estimate (2.60). The subsums (2.61-2.63) can be estimated in a similar way. The summation with respect to in (2.60) gives
Combining the last two bounds, we obtain that (2.60) is
Finally, let us estimate (2.53). It can be written as
The subsums (2.64) and (2.66) are bounded from above by The calculations are very similar to the ones used above and are left to the reader. The subsum (2.65) can be written as
It follows from the estimates in (2.17) that one has a deterministic upper bound
Combining the estimates (2.53-2.70), we obtain that the third cumulant term (2.52) contributing to in (2.38) can be written as
Somewhat long but straightforward calculations using (6.4-6.5) and (2.42-2.51) show that the fourth cumulant term in in (2.38) can be estimated from above by O\left(\frac{1}{|\operatorname{\mathfrak{Im}}z|^{5}\*N}\right).\ Since the calculations are very similar to those in (2.19- 2.21), we leave the details to the reader. In a similar fashion, the error terms in due to the truncation of the decoupling formula at for and at for are bounded from above by O\left(\frac{1}{|\operatorname{\mathfrak{Im}}z|^{6}\*N}\right).\ The considerations are similar to those given in the analysis of (2.22).
Combining (2.41), (2.50-2.51), (2.71-2.72), and the bounds on the fourth cumulant term and the error terms discussed in the above paragraph, one rewrites the Master equation (2.38-2.39) as
Subtracting the r.h.s. in (2.35) from the r.h.s. in (2.76), we obtain (2.32). Lemma 2.1 is proven. ∎
Now, we are ready to finish the proof of Proposition 2.1. To obtain the estimate (2.4) from (2.32), we use the same arguments as in Section 3 of and Section 2 of . We note (see e.g. (3.9) in ) that
where the constant is chosen sufficiently large so that the term on the r.h.s. of (2.77) is at most in absolute value. Multiplying both sides of (2.32) by and using (6.8), we obtain that
for It follows from (2.78) that
On the other hand, if then Since we have
for such that Combining (2.79) and (2.80), we obtain (2.4). This finishes the proof of Proposition 2.1. ∎
Proof of Theorem 1.6
Our exposition follows closely the ones in Section 3 of and Section 4 of . In order to extend the estimates of Proposition 2.1 to a more general class of test functions, we use the Helffer Sjöstrand functional calculus (see , ).
To prove (1.34), we let in (3.2) and assume that has compact support. It follows from (2.2) that
uniformly on and is a constant depending on We conclude that the second term on the r.h.s. of (3.8) can be estimated as follows
where and are the characteristic functions of the support of and of respectively, and is such that This proves (1.34).
where Taking into account (2.4), we get
Plugging (3.5) with in (3.15), we prove (1.33). Thus, we have proved the parts (i) and (ii) of Theorem 1.6.
Now, let us assume that the marginal distributions of the entries of satisfy the Poincaré inequality (1.12) with a uniform constant and prove the parts (iii)-(v), i.e. the estimates (1.37), (1.39), and (1.40). Since the proof of (1.37-1.40) is very similar to the proof of Proposition 3.3 in , we discuss here only the main ingredients.
where the Hilbert-Schmidt norm is defined as
In particular, if and are unit vectors, then
is a complex-valued Lipschitz continuous function on the space of real symmetric (Hermitian) matrices with the Lipschitz constant
The second observation is that joint distribution of the matrix entries
of satisfies the Poincaré inequality with the constant since the Poincaré inequality tensorizes (, ). Therefore, for any complex-valued Lipschitz continuous function of the matrix entries with the Lipschitz constant the distribution of has exponential tails (see e.g. Lemma 4.4.3 and Exercise 4.4.5 in ), i.e.
Applying (3.19) to the spectral norm of the matrix and using the universality results for the largest eigenvalues (see and references therein), we obtain
Outliers in the Spectrum of Finite Rank Perturbations of Wigner Matrices
This section is devoted to the proof of Theorem 1.2
is decreasing and Let us choose in such a way that
i.e. for all that correspond to the outliers (so ). Let
where is defined in (4.6).
where x_{i+1}-x_{i}=N^{-1/3},\ 0\leq i\leq l(N)-1,\ and Clearly, the number of elements in the sequence is We have
uniformly in and N\geq 1.\ Indeed,
Now, we are ready to start the proof of Theorem 1.2. Let us denote by the orthonormal eigenvectors of corresponding to the non-zero eigenvalues. We recall that we used the notation for the (fixed) eigenvalues of and denoted the (fixed) multiplicity of by . The zero eigenvalue has multiplicity Clearly, Let us denote by the diagonal matrix built from the non-zero eigenvalues of
Let us also denote by the matrix whose columns are given by the orthonormal eigenvectors of Clearly,
For any we define the matrix as follows. Let
The first step in the proof of Theorem 1.2 is the following lemma from .
Suppose that is not an eigenvalue of Then is an eigenvalue of with multiplicity if and only if is an eigenvalue of the matrix
For the convenience of the reader, we sketch the proof of Lemma 4.2 below.
Let Therefore is well defined, and
We obtain that for that if and only if
where one uses the identity \det(I-B\*C)=\det(I-C\*B).\ Rewriting
Proposition 1.1 plays an important role in the proof of Theorem 1.2. Before we prove Proposition 1.1, we need to introduce some notations and prove Lemma 4.3.
Consider a family of matrices defined in (4.23) for Fix an eigenvalue of such that and use the notation for the eigenvectors of that correspond to the eigenvalue \theta_{j}.\ Without loss of generality we can assume that We do it just to simplify notations. The case is identical. We recall that is defined in (1.25) as the submatrix of restricted to the rows and columns corresponding to The central role in the proof of Proposition 1.1 is played by the following lemma.
Let be as in (4.23), with defined in (4.22), and defined in (4.20). Let
be the ordered eigenvalues of Then, for sufficiently large constant
in probability, i.e. is bounded in probability,
We claim that (4.27) follows from Lemma 4.1. Indeed, (4.11) and (4.6) imply that
as Since we conclude that (4.29) implies (4.27).
in probability. Indeed, the entries of the matrix are bounded in probability since the expectation and variance of
almost surely. Thus, is also bounded in probability. Since the first eigenvalues of are equal to we obtain (4.28). Lemma 4.3 is proven. ∎
Now, we are ready to prove Proposition 1.1.
By Lemma 4.2, the outliers of are given by those values of such that
We recall that is a monotonically decreasing function on and
Since for (4.28) gives us that in probability, it follows from (4.31) and (4.27) that with probability going to there exist such that g_{\sigma}(x_{i})=z_{i}(x_{i}),\ 1\leq i\leq k_{1},\ and
in probability. Applying (4.27) one more time, we get that
in probability. By a standard perturbation theory argument (see e.g. section XII.1 in ), one proves that the first smallest eigenvalues of the matrix differ from the (increasingly ordered) eigenvalues of the matrix by at most O\left(\frac{1}{N}\right),\ in probability, where the matrix has been defined in (1.25). To see this, we use the following standard lemma from the perturbation theory
Let be an real symmetric (Hermitian) matrix that can be written in the block form as where is an matrix. Suppose that all eigenvalues of are smaller than all eigenvalues of and the gap between the spectra of and is at least In addition, suppose that the operator norm of the offdiagonal block is bounded from above by so that
Then there exists such that the first smallest eigenvalues of differ from the (increasingly ordered) eigenvalues of by at most
and Then it is easy to see that
is an approximate eigenvector of with the approximate eigenvalue such that
Since and we obtain that
The result of the lemma can be immediately extended by induction to the case of block matrices To apply it in our setting, we note that the matrix is the upper-left block of The other diagonal blocks of are given by defined in (1.25). Since the operator norms of the off-diagonal blocks of are the desired statement follows.
where are the eigenvalues of the matrix The result of Proposition 1.1 now follows from (4.36) and (4.33). ∎
Since the eigenvalues of the matrix are bounded in probability, the first part of Theorem 1.2, i.e. (1.14), follows from (1.26) in Proposition 1.1.
almost surely, where are sufficiently large, improving (4.11). Reasoning as before, (4.37) implies that
almost surely for sufficiently large constant Thus, we have
almost surely, which implies (1.15) since Theorem 1.2 is proven. ∎
Proof of Theorems 1.3, 1.4, and 1.7
In this section, we prove Theorems 1.3, 1.4, and 1.7. We start with Theorem 1.3.
Let be an eigenvalue of with the multiplicity Let us assume that Case A takes place. Thus, without loss of generality, we can assume that the eigenvectors of corresponding to the eigenvalue belong to where is a fixed positive integer. As always, we consider the real symmetric case. The treatment of the Hermitian case is very similar. Consider a matrix such that the (-dimensional) columns of are filled by the first coordinates of the orthonormal vectors of corresponding to the eigenvalue We recall that the remaining coordinates of these orthonormal vectors are zero. Let us denote by the upper-left submatrix of the resolvent matrix Finally, we define the random matrix-valued field
We recall that Theorem 1.1 in states that converges weakly in finite-dimensional distributions to a random field
Now, Theorem 1.3 follows from Proposition 1.1 in this paper, and Theorem 1.1 in , since
Theorem 1.3 is proven. The proof of Theorem 1.4 is very similar to the given proof of Theorem 1.3. One has to use Theorems 4.1 and 4.2 and Remark 4.1 in that generalize Theorems 1.1 and 1.5 in to the non-i.i.d. case, and replace in (5.4) with ∎
Now, we turn to the proof of Theorem 1.7.
converges weakly to the joint distribution of Choosing sufficiently large, we can make
arbitrary small uniformly in Indeed, the variance in (5.7) is bounded by O(\|u_{p}-u^{(n)}_{p}\|^{2}+\|u_{q}-u^{(n)}_{q}\|^{2})\ since the entries of are i.i.d random variables with bounded variance on the diagonal and i.i.d. random variables with bounded variance off the diagonal. In addition,
and we can use the bounds (1.33) and (1.34) in Theorem 1.6 rewritten as
are arbitrary small (uniformly in ) provided one chooses sufficiently large. This finishes the proof. ∎
Theorem 1.7 allows the following extension of Theorem 1.3:
where denotes the projection of onto the subspace spanned by the first standard basis vectors Let be the matrix whose columns are given by the vectors u^{(1)}_{N},\ldots,u^{(r)}_{N}.\ Also denote by the diagonal matrix
The result of Theorem 1.3 can be extended for such , with the matrix given by
Appendix
The appendix contains several basic formulas used throughout the paper.
and can be immediately verified by integration by parts.
Next, we write a basic resolvent identity. For any two Hermitian matrices and and non-real we have:
As a corollary of (6.3), one has the following formulas. If is a real symmetric matrix with resolvent then
In a similar way, if is a Hermitian matrix then
Finally, we will use the following properties of the resolvent:
where by we denote the spectrum of a real symmetric (Hermitian) matrix The bound (6.6) implies
Therefore, all entries of the resolvent matrix are bounded by . In a similar fashion, we have the following bound for the Stieltjes transform, , of any probability measure: