Optimal Randomized Approximations for Matrix-Based Rényi's Entropy

Yuxin Dong, Tieliang Gong*, Shujian Yu, Chen Li

*Corresponding author for this work

Research output: Contribution to JournalArticleAcademicpeer-review

90 Downloads (Pure)

Abstract

The Matrix-based Rényi's entropy enables us to directly measure information quantities from given data without the costly probability density estimation of underlying distributions, thus has been widely adopted in numerous statistical learning and inference tasks. However, exactly calculating this new information quantity requires access to the eigenspectrum of a semi-positive definite (SPD) matrix A which grows linearly with the number of samples n , resulting in a O(n3) time complexity that is prohibitive for large-scale applications. To address this issue, this paper takes advantage of stochastic trace approximations for matrix-based Rényi's entropy with arbitrary α \in R+ orders, lowering the complexity by converting the entropy approximation to a matrix-vector multiplication problem. Specifically, we develop random approximations for integer-order α cases and polynomial series approximations (Taylor and Chebyshev) for fractional α cases, leading to a O(n2sm) overall time complexity, where s, m \ll n denote the number of vector queries and the polynomial order respectively. We theoretically establish statistical guarantees for all approximation algorithms and give explicit order of s and m with respect to the approximation error \epsilon , showing optimal convergence rate for both parameters up to a logarithmic factor. Large-scale simulations and real-world applications validate the effectiveness of the developed approximations, demonstrating remarkable speedup with negligible loss in accuracy.

Original languageEnglish
Pages (from-to)4218-4234
Number of pages17
JournalIEEE Transactions on Information Theory
Volume69
Issue number7
Early online date21 Mar 2023
DOIs
Publication statusPublished - Jul 2023

Bibliographical note

Funding Information:
This work was supported in part by the Key Research and Development Program of China under Grant 2021ZD0110700; in part the National Natural Science Foundation of China under Grant 62106191; in part by the Key Research and Development Program of Shaanxi Province under Grant 2021GXLH-Z-095; in part by the Innovative Research Group of the National Natural Science Foundation of China under Grant 61721002; in part by the Consulting Research Project of the Chinese Academy of Engineering, The Online and Offline Mixed Educational Service System for The Belt and Road Training in MOOC China; in part by the Project of China Knowledge Centre for Engineering Science and Technology; and in part by the Innovation Team from the Ministry of Education under Grant IRT_17R86.

Publisher Copyright:
© 1963-2012 IEEE.

Funding

This work was supported in part by the Key Research and Development Program of China under Grant 2021ZD0110700; in part the National Natural Science Foundation of China under Grant 62106191; in part by the Key Research and Development Program of Shaanxi Province under Grant 2021GXLH-Z-095; in part by the Innovative Research Group of the National Natural Science Foundation of China under Grant 61721002; in part by the Consulting Research Project of the Chinese Academy of Engineering, The Online and Offline Mixed Educational Service System for The Belt and Road Training in MOOC China; in part by the Project of China Knowledge Centre for Engineering Science and Technology; and in part by the Innovation Team from the Ministry of Education under Grant IRT_17R86.

FundersFunder number
National Natural Science Foundation of China62106191
National Natural Science Foundation of China
Ministry of Education of the People's Republic of ChinaIRT_17R86
Ministry of Education of the People's Republic of China
National Key Research and Development Program of China2021ZD0110700
National Key Research and Development Program of China
China Knowledge Centre for Engineering Sciences and Technology
Shanxi Provincial Key Research and Development Project61721002, 2021GXLH-Z-095
Shanxi Provincial Key Research and Development Project
Chinese Academy of Engineering

    Keywords

    • Matrix-based Rényis entropy
    • mutual information
    • polynomial approximation
    • randomized numerical linear algebra
    • trace estimation

    Fingerprint

    Dive into the research topics of 'Optimal Randomized Approximations for Matrix-Based Rényi's Entropy'. Together they form a unique fingerprint.

    Cite this