Logarithmic law of large random correlation matrices

Journal Article (2024)
Author(s)

N. Parolya (TU Delft - Statistics)

Johannes Heiny (Ruhr-Universität Bochum)

Dorota Kurowicka (TU Delft - Applied Probability)

Research Group
Statistics
Copyright
© 2024 N. Parolya, Johannes Heiny, D. Kurowicka
DOI related publication
https://doi.org/10.3150/23-BEJ1600
More Info
expand_more
Publication Year
2024
Language
English
Copyright
© 2024 N. Parolya, Johannes Heiny, D. Kurowicka
Research Group
Statistics
Issue number
1
Volume number
30
Pages (from-to)
346 - 370
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Consider a random vector y = Σ
1/2 x, where the p elements of the vector x are i.i.d. real-valued random variables with zero mean and finite fourth moment, and Σ
1/2 is a deterministic p × p matrix such that the eigenvalues of the population correlation matrix R of y are uniformly bounded away from zero and infinity. In this paper, we find that the log determinant of the sample correlation matrix R based on a sample of size n from the distribution of y satisfies a CLT (central limit theorem) for p/n → γ ∈ (0, 1] and p ≤ n. Explicit formulas for the asymptotic mean and variance are provided. In case the mean of y is unknown, we show that after re-centering by the empirical mean the obtained CLT holds with a shift in the asymptotic mean. This result is of independent interest in both large dimensional random matrix theory and high-dimensional statistical literature of large sample correlation matrices for non-normal data. Finally, the obtained findings are applied for testing of uncorrelatedness of p random variables. Surprisingly, in the null case R = I, the test statistic becomes distribution-free and the extensive simulations show that the obtained CLT also holds if the moments of order four do not exist at all, which conjectures a promising and robust test statistic for heavy-tailed high-dimensional data.

Files

Bej1600.pdf
(pdf | 0.765 Mb)
- Embargo expired in 05-08-2024
License info not available