Deep Long-Tailed Learning: A Survey

被引:269
作者
Zhang, Yifan [1 ]
Kang, Bingyi [2 ]
Hooi, Bryan [1 ]
Yan, Shuicheng [3 ]
Feng, Jiashi [2 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore 119077, Singapore
[2] ByteDance AI Lab, Shanghai 201103, Peoples R China
[3] SEA AI Lab, Singapore 658065, Singapore
关键词
Deep learning; imbalanced learning; long-tailed learning; DOMAIN ADAPTATION; NEURAL-NETWORKS;
D O I
10.1109/TPAMI.2023.3268118
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep long-tailed learning, one of the most challenging problems in visual recognition, aims to train well-performing deep models from a large number of images that follow a long-tailed class distribution. In the last decade, deep learning has emerged as a powerful recognition model for learning high-quality image representations and has led to remarkable breakthroughs in generic visual recognition. However, long-tailed class imbalance, a common problem in practical visual recognition tasks, often limits the practicality of deep network based recognition models in real-world applications, since they can be easily biased towards dominant classes and perform poorly on tail classes. To address this problem, a large number of studies have been conducted in recent years, making promising progress in the field of deep long-tailed learning. Considering the rapid evolution of this field, this article aims to provide a comprehensive survey on recent advances in deep long-tailed learning. To be specific, we group existing deep long-tailed learning studies into three main categories (i.e., class re-balancing, information augmentation and module improvement), and review these methods following this taxonomy in detail. Afterward, we empirically analyze several state-of-the-art methods by evaluating to what extent they address the issue of class imbalance via a newly proposed evaluation metric, i.e., relative accuracy. We conclude the survey by highlighting important applications of deep long-tailed learning and identifying several promising directions for future research.
引用
收藏
页码:10795 / 10816
页数:22
相关论文
共 173 条
[1]  
[Anonymous], Technical report
[2]  
[Anonymous], 2000, P AAAI 2000 WORKSH I
[3]  
[Anonymous], 2005, Semi-supervised learning literature survey
[4]   Deep Learning for AI [J].
Bengio, Yoshua ;
Lecun, Yann ;
Hinton, Geoffrey .
COMMUNICATIONS OF THE ACM, 2021, 64 (07) :58-65
[5]   ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot [J].
Cai, Jiarui ;
Wang, Yizhou ;
Hwang, Jenq-Neng .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :112-121
[6]   Domain Balancing: Face Recognition on Long-Tailed Domains [J].
Cao, Dong ;
Zhu, Xiangyu ;
Huang, Xingyu ;
Guo, Jianzhu ;
Lei, Zhen .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5670-5678
[7]  
Cao K., 2021, PROC INT C LEARN REP
[8]  
Cao KD, 2019, ADV NEUR IN, V32
[9]   Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts [J].
Changpinyo, Soravit ;
Sharma, Piyush ;
Ding, Nan ;
Soricut, Radu .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :3557-3567
[10]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)