Open Long-Tailed Recognition in a Dynamic World

被引:10
作者
Liu, Ziwei [1 ]
Miao, Zhongqi [2 ]
Zhan, Xiaohang [3 ]
Wang, Jiayun [2 ]
Gong, Boqing [4 ]
Yu, Stella X. [2 ]
机构
[1] Nanyang Technol Univ, Singapore 639798, Singapore
[2] Univ Calif Berkeley, Int Comp Sci Inst, Berkeley, CA 94720 USA
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[4] Google Inc, Mountain View, CA 94043 USA
关键词
Tail; Visualization; Head; Training; Task analysis; Measurement; Magnetic heads; Long-tailed recognition; few-shot learning; active learning;
D O I
10.1109/TPAMI.2022.3200091
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real world data often exhibits a long-tailed and open-ended (i.e., with unseen classes) distribution. A practical recognition system must balance between majority (head) and minority (tail) classes, generalize across the distribution, and acknowledge novelty upon the instances of unseen classes (open classes). We define Open Long-Tailed Recognition++ (OLTR++) as learning from such naturally distributed data and optimizing for the classification accuracy over a balanced test set which includes both known and open classes. OLTR++ handles imbalanced classification, few-shot learning, open-set recognition, and active learning in one integrated algorithm, whereas existing classification approaches often focus only on one or two aspects and deliver poorly over the entire spectrum. The key challenges are: 1) how to share visual knowledge between head and tail classes, 2) how to reduce confusion between tail and open classes, and 3) how to actively explore open classes with learned knowledge. Our algorithm, OLTR++, maps images to a feature space such that visual concepts can relate to each other through a memory association mechanism and a learned metric (dynamic meta-embedding) that both respects the closed world classification of seen classes and acknowledges the novelty of open classes. Additionally, we propose an active learning scheme based on visual memory, which learns to recognize open classes in a data-efficient manner for future expansions. On three large-scale open long-tailed datasets we curated from ImageNet (object-centric), Places (scene-centric), and MS1M (face-centric) data, as well as three standard benchmarks (CIFAR-10-LT, CIFAR-100-LT, and iNaturalist-18), our approach, as a unified framework, consistently demonstrates competitive performance. Notably, our approach also shows strong potential for the active exploration of open classes and the fairness analysis of minority groups.
引用
收藏
页码:1836 / 1851
页数:16
相关论文
共 50 条
  • [21] Margin-aware rectified augmentation for long-tailed recognition
    Xiang, Liuyu
    Han, Jungong
    Ding, Guiguang
    PATTERN RECOGNITION, 2023, 141
  • [22] SWRM: Similarity Window Reweighting and Margin for Long-Tailed Recognition
    Chen, Qiong
    Huang, Tianlin
    Liu, Qingfa
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (06)
  • [23] Long-tailed image recognition through balancing discriminant quality
    Yan-Xue Wu
    Fan Min
    Ben-Wen Zhang
    Xian-Jie Wang
    Artificial Intelligence Review, 2023, 56 : 833 - 856
  • [24] Long-tailed image recognition through balancing discriminant quality
    Wu, Yan-Xue
    Min, Fan
    Zhang, Ben-Wen
    Wang, Xian-Jie
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 1) : 833 - 856
  • [25] Long-Tailed Recognition Based on Self-attention Mechanism
    Feng, Zekai
    Jia, Hong
    Li, Mengke
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT II, ICIC 2024, 2024, 14876 : 380 - 391
  • [26] Constructing Balance from Imbalance for Long-Tailed Image Recognition
    Xu, Yue
    Li, Yong-Lu
    Li, Jiefeng
    Lu, Cewu
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 38 - 56
  • [27] Open-set long-tailed recognition via orthogonal prototype learning and false rejection correction
    Deng, Binquan
    Kamel, Aouaidjia
    Zhang, Chongsheng
    NEURAL NETWORKS, 2025, 181
  • [28] Dynamic Adaptive Federated Learning on Local Long-Tailed Data
    Pu, Juncheng
    Fu, Xiaodong
    Dong, Hai
    Zhang, Pengcheng
    Liu, Li
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (06) : 3485 - 3498
  • [29] Open-Set Long-Tailed Radio Frequency Fingerprint Identification
    He, Yixin
    Ma, Ying
    Qian, Ruiqi
    Zhao, Yanqing
    Ding, Haichuan
    An, Jianping
    2024 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA, ICCC, 2024,
  • [30] Rebalanced supervised contrastive learning with prototypes for long-tailed visual recognition
    Chang, Xuhui
    Zhai, Junhai
    Qiu, Shaoxin
    Sun, Zhengrong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 252