Equalization Loss v2: A New Gradient Balance Approach for Long-tailed Object Detection

被引：130

作者：

Tan, Jingru ^{[1
]}

Lu, Xin ^{[2
]}

Zhang, Gang ^{[3
]}

Yin, Changqing ^{[1
]}

Li, Quanquan ^{[2
]}

机构：

[1] Tongji Univ, Shanghai, Peoples R China

[2] SenseTime Res, Hong Kong, Peoples R China

[3] Tsinghua Univ, Beijing, Peoples R China

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

关键词：

D O I：

10.1109/CVPR46437.2021.00173

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently proposed decoupled training methods emerge as a dominant paradigm for long-tailed object detection. But they require an extra fine-tuning stage, and the disjointed optimization of representation and classifier might lead to suboptimal results. However, end-to-end training methods, like equalization loss (EQL), still perform worse than decoupled training methods. In this paper, we reveal the main issue in long-tailed object detection is the unbalanced gradients between positives and negatives, and find that EQL does not solve it well. To address the problem of Unbalanced gradients, we introduce a new version of equalization loss, called equalization loss v2 (EQL v2), a novel gradient guided reweighing mechanism that rebalances the training process for each category independently and equally. Extensive experiments are performed on the challenging LVIS benchmark. EQI, v2 outperforms origin EQI, by about 4 points overall AP with 14 similar to 18 points improvements on the rare categories. More importantly, it also surpasses decoupled training methods. Without further tuning for the Open Images dataset, EQL v2 improves EQL by 7.3 points AP showing strong generalization ability. Codes have been released at https://github.com/tztztztztz/eq1v2

引用

页码：1685 / 1694

页数：10

共 46 条

[1]

[Anonymous], 2017, ADV NEURAL INFORM PR

[2]

Cao KD, 2019, ADV NEUR IN, V32

[3] SMOTE: Synthetic minority over-sampling technique [J].

Chawla, Nitesh V. ;

Bowyer, Kevin W. ;

Hall, Lawrence O. ;

Kegelmeyer, W. Philip .

2002, American Association for Artificial Intelligence (16)

[4]

Chen K., 2019, arXiv:1906.07155

[5] Hybrid Task Cascade for Instance Segmentation [J].

Chen, Kai ;

Pang, Jiangmiao ;

Wang, Jiaqi ;

Xiong, Yu ;

Li, Xiaoxiao ;

Sun, Shuyang ;

Feng, Wansen ;

Liu, Ziwei ;

Shi, Jianping ;

Ouyang, Wanli ;

Loy, Chen Change ;

Lin, Dahua .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4969-4978

[6]

Chu Peng, 2020, ARXIV200803673

[7] Class-Balanced Loss Based on Effective Number of Samples [J].

Cui, Yin ;

Jia, Menglin ;

Lin, Tsung-Yi ;

Song, Yang ;

Belongie, Serge .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9260-9269

[8] Quantitative Identification Method of Injection Production Dominant Channel in a Marine Sandstone Reservoir [J].

Ding, Shuaiwei ;

Xi, Yi ;

Jiang, Hanqiao .

PROCEEDINGS OF THE INTERNATIONAL FIELD EXPLORATION AND DEVELOPMENT CONFERENCE 2017, 2019, :704-727

[9] CenterNet: Keypoint Triplets for Object Detection [J].

Duan, Kaiwen ;

Bai, Song ;

Xie, Lingxi ;

Qi, Honggang ;

Huang, Qingming ;

Tian, Qi .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577

[10]

Duerig T, 2018, ARXIV181100982

← 1 2 3 4 5 →