Normalizing Batch Normalization for Long-Tailed Recognition

被引:0
|
作者
Bao, Yuxiang [1 ]
Kang, Guoliang [1 ]
Yang, Linlin [2 ]
Duan, Xiaoyue [1 ]
Zhao, Bo [3 ]
Zhang, Baochang [4 ,5 ,6 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing 100024, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Artificial Intelligence, Shanghai 200240, Peoples R China
[4] Beihang Univ, Sch Artificial Intelligence, Beijing 100191, Peoples R China
[5] Beihang Univ, Hangzhou Res Inst, Hangzhou 310051, Peoples R China
[6] Nanchang Inst Technol, Nanchang 330044, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Heavily-tailed distribution; Vectors; Batch normalization; Training; Standards; Accuracy; Visualization; Image recognition; Benchmark testing; Gaussian distribution; Long-tailed recognition; batch normalization; deep learning;
D O I
10.1109/TIP.2024.3518099
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In real-world scenarios, the number of training samples across classes usually subjects to a long-tailed distribution. The conventionally trained network may achieve unexpected inferior performance on the rare class compared to the frequent class. Most previous works attempt to rectify the network bias from the data-level or from the classifier-level. Differently, in this paper, we identify that the bias towards the frequent class may be encoded into features, i.e., the rare-specific features which play a key role in discriminating the rare class are much weaker than the frequent-specific features. Based on such an observation, we introduce a simple yet effective approach, normalizing the parameters of Batch Normalization (BN) layer to explicitly rectify the feature bias. To achieve this end, we represent the Weight/Bias parameters of a BN layer as a vector, normalize it into a unit one and multiply the unit vector by a scalar learnable parameter. Through decoupling the direction and magnitude of parameters in BN layer to learn, the Weight/Bias exhibits a more balanced distribution and thus the strength of features becomes more even. Extensive experiments on various long-tailed recognition benchmarks (i.e., CIFAR-10/100-LT, ImageNet-LT and iNaturalist 2018) show that our method outperforms previous state-of-the-arts remarkably.
引用
收藏
页码:209 / 220
页数:12
相关论文
共 50 条
  • [1] Compound Batch Normalization for Long-tailed Image Classification
    Cheng, Lechao
    Fang, Chaowei
    Zhang, Dingwen
    Li, Guanbin
    Huang, Gang
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1925 - 1934
  • [2] Mutual Learning for Long-Tailed Recognition
    Park, Changhwa
    Yim, Junho
    Jun, Eunji
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2674 - 2683
  • [3] A Survey on Long-Tailed Visual Recognition
    Yang, Lu
    Jiang, He
    Song, Qing
    Guo, Jun
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (07) : 1837 - 1872
  • [4] Multimodal Framework for Long-Tailed Recognition
    Chen, Jian
    Zhao, Jianyin
    Gu, Jiaojiao
    Qin, Yufeng
    Ji, Hong
    APPLIED SCIENCES-BASEL, 2024, 14 (22):
  • [5] Improving Calibration for Long-Tailed Recognition
    Zhong, Zhisheng
    Cui, Jiequan
    Liu, Shu
    Jia, Jiaya
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16484 - 16493
  • [6] A Survey on Long-Tailed Visual Recognition
    Lu Yang
    He Jiang
    Qing Song
    Jun Guo
    International Journal of Computer Vision, 2022, 130 : 1837 - 1872
  • [7] Long-Tailed Recognition via Weight Balancing
    Alshammari, Shaden
    Wang, Yu-Xiong
    Ramanan, Deva
    Kong, Shu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6887 - 6897
  • [8] ResLT: Residual Learning for Long-Tailed Recognition
    Cui, Jiequan
    Liu, Shu
    Tian, Zhuotao
    Zhong, Zhisheng
    Jia, Jiaya
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3695 - 3706
  • [9] Equalization Loss for Long-Tailed Object Recognition
    Tan, Jingru
    Wang, Changbao
    Li, Buyu
    Li, Quanquan
    Ouyang, Wanli
    Yin, Changqing
    Yan, Junjie
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 11659 - 11668
  • [10] Learning Prototype Classifiers for Long-Tailed Recognition
    Sharma, Saurabh
    Xian, Yongqin
    Yu, Ning
    Singh, Ambuj
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1360 - 1368