Coping with change: Learning invariant and minimum sufficient representations for fine-grained visual categorization

Shuo Ye, Shujian Yu*, Wenjin Hou, Yu Wang, Xinge You

*Corresponding author for this work

Research output: Contribution to JournalArticleAcademicpeer-review

24 Downloads (Pure)

Abstract

Fine-grained visual categorization (FGVC) is a challenging task due to similar visual appearances between various species. Previous studies always implicitly assume that the training and test data have the same underlying distributions, and that features extracted by modern backbone architectures remain discriminative and generalize well to unseen test data. However, we empirically justify that these conditions are not always true on benchmark datasets. To this end, we combine the merits of invariant risk minimization (IRM) and information bottleneck (IB) principle to learn invariant and minimum sufficient (IMS) representations for FGVC, such that the overall model can always discover the most succinct and consistent fine-grained features. We apply the matrix-based Rényi's α-order entropy to simplify and stabilize the training of IB; we also design a “soft” environment partition scheme to make IRM applicable to FGVC task. To the best of our knowledge, we are the first to address the problem of FGVC from a generalization perspective and develop a new information-theoretic solution accordingly. Extensive experiments demonstrate the consistent performance gain offered by our IMS. Code is available at: https://github.com/SYe-hub/IMS.

Original languageEnglish
Article number103837
Pages (from-to)1-11
Number of pages11
JournalComputer Vision and Image Understanding
Volume237
Early online date26 Sept 2023
DOIs
Publication statusPublished - Dec 2023

Bibliographical note

Funding Information:
This work was supported in part by the National Key R&D Program of China2022YFC3301000, in part by the Fundamental Research Funds for the Central Universities, HUST: 2023JYCXJJ031.

Funding Information:
This work was supported in part by the National Key R&D Program of China 2022YFC3301000 , in part by the Fundamental Research Funds for the Central Universities , HUST: 2023JYCXJJ031 .

Publisher Copyright:
© 2023 Elsevier Inc.

Funding

This work was supported in part by the National Key R&D Program of China2022YFC3301000, in part by the Fundamental Research Funds for the Central Universities, HUST: 2023JYCXJJ031. This work was supported in part by the National Key R&D Program of China 2022YFC3301000 , in part by the Fundamental Research Funds for the Central Universities , HUST: 2023JYCXJJ031 .

FundersFunder number
National Key R&D Program of China2022YFC3301000
Huazhong University of Science and Technology2023JYCXJJ031
Huazhong University of Science and Technology
National Key Research and Development Program of China2022YFC3301000
National Key Research and Development Program of China
Fundamental Research Funds for the Central Universities

    Keywords

    • Fine-grained visual categorization
    • Information bottleneck
    • Invariant risk minimization

    Fingerprint

    Dive into the research topics of 'Coping with change: Learning invariant and minimum sufficient representations for fine-grained visual categorization'. Together they form a unique fingerprint.

    Cite this