Abstract
The Matrix-based Rényi's entropy enables us to directly measure information quantities from given data without the costly probability density estimation of underlying distributions, thus has been widely adopted in numerous statistical learning and inference tasks. However, exactly calculating this new information quantity requires access to the eigenspectrum of a semi-positive definite (SPD) matrix A which grows linearly with the number of samples n , resulting in a O(n3) time complexity that is prohibitive for large-scale applications. To address this issue, this paper takes advantage of stochastic trace approximations for matrix-based Rényi's entropy with arbitrary α \in R+ orders, lowering the complexity by converting the entropy approximation to a matrix-vector multiplication problem. Specifically, we develop random approximations for integer-order α cases and polynomial series approximations (Taylor and Chebyshev) for fractional α cases, leading to a O(n2sm) overall time complexity, where s, m \ll n denote the number of vector queries and the polynomial order respectively. We theoretically establish statistical guarantees for all approximation algorithms and give explicit order of s and m with respect to the approximation error \epsilon , showing optimal convergence rate for both parameters up to a logarithmic factor. Large-scale simulations and real-world applications validate the effectiveness of the developed approximations, demonstrating remarkable speedup with negligible loss in accuracy.
Original language | English |
---|---|
Pages (from-to) | 4218-4234 |
Number of pages | 17 |
Journal | IEEE Transactions on Information Theory |
Volume | 69 |
Issue number | 7 |
Early online date | 21 Mar 2023 |
DOIs | |
Publication status | Published - Jul 2023 |
Bibliographical note
Funding Information:This work was supported in part by the Key Research and Development Program of China under Grant 2021ZD0110700; in part the National Natural Science Foundation of China under Grant 62106191; in part by the Key Research and Development Program of Shaanxi Province under Grant 2021GXLH-Z-095; in part by the Innovative Research Group of the National Natural Science Foundation of China under Grant 61721002; in part by the Consulting Research Project of the Chinese Academy of Engineering, The Online and Offline Mixed Educational Service System for The Belt and Road Training in MOOC China; in part by the Project of China Knowledge Centre for Engineering Science and Technology; and in part by the Innovation Team from the Ministry of Education under Grant IRT_17R86.
Publisher Copyright:
© 1963-2012 IEEE.
Funding
This work was supported in part by the Key Research and Development Program of China under Grant 2021ZD0110700; in part the National Natural Science Foundation of China under Grant 62106191; in part by the Key Research and Development Program of Shaanxi Province under Grant 2021GXLH-Z-095; in part by the Innovative Research Group of the National Natural Science Foundation of China under Grant 61721002; in part by the Consulting Research Project of the Chinese Academy of Engineering, The Online and Offline Mixed Educational Service System for The Belt and Road Training in MOOC China; in part by the Project of China Knowledge Centre for Engineering Science and Technology; and in part by the Innovation Team from the Ministry of Education under Grant IRT_17R86.
Funders | Funder number |
---|---|
National Natural Science Foundation of China | 62106191 |
National Natural Science Foundation of China | |
Ministry of Education of the People's Republic of China | IRT_17R86 |
Ministry of Education of the People's Republic of China | |
National Key Research and Development Program of China | 2021ZD0110700 |
National Key Research and Development Program of China | |
China Knowledge Centre for Engineering Sciences and Technology | |
Shanxi Provincial Key Research and Development Project | 61721002, 2021GXLH-Z-095 |
Shanxi Provincial Key Research and Development Project | |
Chinese Academy of Engineering |
Keywords
- Matrix-based Rényis entropy
- mutual information
- polynomial approximation
- randomized numerical linear algebra
- trace estimation