Open Long-Tailed Recognition in a Dynamic World

被引：10

作者：

Liu, Ziwei ^{[1
]}

Miao, Zhongqi ^{[2
]}

Zhan, Xiaohang ^{[3
]}

Wang, Jiayun ^{[2
]}

Gong, Boqing ^{[4
]}

Yu, Stella X. ^{[2
]}

机构：

[1] Nanyang Technol Univ, Singapore 639798, Singapore

[2] Univ Calif Berkeley, Int Comp Sci Inst, Berkeley, CA 94720 USA

[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[4] Google Inc, Mountain View, CA 94043 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2024年 / 46卷 / 03期

关键词：

Tail; Visualization; Head; Training; Task analysis; Measurement; Magnetic heads; Long-tailed recognition; few-shot learning; active learning;

D O I：

10.1109/TPAMI.2022.3200091

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Real world data often exhibits a long-tailed and open-ended (i.e., with unseen classes) distribution. A practical recognition system must balance between majority (head) and minority (tail) classes, generalize across the distribution, and acknowledge novelty upon the instances of unseen classes (open classes). We define Open Long-Tailed Recognition++ (OLTR++) as learning from such naturally distributed data and optimizing for the classification accuracy over a balanced test set which includes both known and open classes. OLTR++ handles imbalanced classification, few-shot learning, open-set recognition, and active learning in one integrated algorithm, whereas existing classification approaches often focus only on one or two aspects and deliver poorly over the entire spectrum. The key challenges are: 1) how to share visual knowledge between head and tail classes, 2) how to reduce confusion between tail and open classes, and 3) how to actively explore open classes with learned knowledge. Our algorithm, OLTR++, maps images to a feature space such that visual concepts can relate to each other through a memory association mechanism and a learned metric (dynamic meta-embedding) that both respects the closed world classification of seen classes and acknowledges the novelty of open classes. Additionally, we propose an active learning scheme based on visual memory, which learns to recognize open classes in a data-efficient manner for future expansions. On three large-scale open long-tailed datasets we curated from ImageNet (object-centric), Places (scene-centric), and MS1M (face-centric) data, as well as three standard benchmarks (CIFAR-10-LT, CIFAR-100-LT, and iNaturalist-18), our approach, as a unified framework, consistently demonstrates competitive performance. Notably, our approach also shows strong potential for the active exploration of open classes and the fairness analysis of minority groups.

引用

页码：1836 / 1851

页数：16

共 50 条

[1] ResLT: Residual Learning for Long-Tailed Recognition
Cui, Jiequan
Liu, Shu
Tian, Zhuotao
Zhong, Zhisheng
Jia, Jiaya
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3695 - 3706
[2] Attentive Feature Augmentation for Long-Tailed Visual Recognition
Wang, Weiqiu
Zhao, Zhicheng
Wang, Pingyu
Su, Fei
Meng, Hongying
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 5803 - 5816
[3] Enhanced Long-Tailed Recognition With Contrastive CutMix Augmentation
Pan, Haolin
Guo, Yong
Yu, Mianjie
Chen, Jian
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4215 - 4230
[4] Towards Effective Collaborative Learning in Long-Tailed Recognition
Xu, Zhengzhuo
Chai, Zenghao
Xu, Chengyin
Yuan, Chun
Yang, Haiqin
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3754 - 3764
[5] Normalizing Batch Normalization for Long-Tailed Recognition
Bao, Yuxiang
Kang, Guoliang
Yang, Linlin
Duan, Xiaoyue
Zhao, Bo
Zhang, Baochang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 209 - 220
[6] A Comprehensive Framework for Long-Tailed Learning via Pretraining and Normalization
Kang, Nan
Chang, Hong
Ma, Bingpeng
Shan, Shiguang
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3437 - 3449
[7] The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition
Tan, Jingru
Li, Bo
Lu, Xin
Yao, Yongqiang
Yu, Fengwei
He, Tong
Ouyang, Wanli
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13876 - 13892
[8] Dynamic prior probability network for long-tailed visual recognition
Zhou, Xuesong
Sun, Jiaqi
Zhai, Junhai
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 268
[9] Multimodal Framework for Long-Tailed Recognition
Chen, Jian
Zhao, Jianyin
Gu, Jiaojiao
Qin, Yufeng
Ji, Hong
APPLIED SCIENCES-BASEL, 2024, 14 (22):
[10] Open world long-tailed data classification through active distribution optimization
Wang, Min
Zhou, Lei
Li, Qian
Zhang, An-an
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213

← 1 2 3 4 5 →