Deep active learning for object detection

被引：39

作者：

Li, Ying ^{[1
]}

Fan, Binbin ^{[1
]}

Zhang, Weiping ^{[2
]}

Ding, Weiping ^{[3
]}

Yin, Jianwei ^{[1
]}

机构：

[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310000, Peoples R China

[2] Zhejiang Univ, Binhai Ind Technol Res Inst, Tianjin 300450, Peoples R China

[3] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China

来源：

INFORMATION SCIENCES | 2021年 / 579卷

基金：

中国国家自然科学基金;

关键词：

Active learning; Loss; Gradient; Object detection;

D O I：

10.1016/j.ins.2021.08.019

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Active learning (AL) for object detection (OD) aims to reduce labeling costs by selecting the most valuable samples that enhance the detection network from the unlabeled pool. Due to the complexity of OD compared with image classification, more consideration should be given when designing the selection strategies. Previous works have studied aggregating information of multiple outputs (especially the location information) and aggregating information of batch boxes, all of which indicate improved performances. However, the evaluation index-mean average precision (mAP) has not been considered seriously, although improving it is the goal of AL. Moreover, the background class is far more than other classes (15:1 or more) in each batch of samples, leading to a class imbalance problem. Therefore, AL strategies for OD, which take mAP and class imbalance in batch into consideration, may perform better. In this paper, WBetGS is proposed, which not only considers aggregating information of multiple outputs and batch boxes but also aims to mAP improvement and to address the class imbalance in batch. A weighted algorithm is introduced to promote the mAP more effectively. Besides, WBetGS eliminates the impact of class imbalance between background and object categories by extracting class-balanced information. Moreover, a diversity and uncertainty based sampling algorithm is introduced for batch mode active learning in object detection. The experimental results demonstrate that our method performs better than basic methods, saving up 100% of the labeling efforts while reaching the same performance in an actual industrial application. (c) 2021 Elsevier Inc. All rights reserved.

引用

页码：418 / 433

页数：16

共 31 条

[1] Active Learning for Deep Detection Neural Networks [J].

Aghdam, Hamed H. ;

Gonzalez-Garcia, Abel ;

van de Weijer, Joost ;

Lopez, Antonio M. .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3671-3679

[2] A data aggregation based approach to exploit dynamic spatio-temporal correlations for citywide crowd flows prediction in fog computing [J].

Ali, Ahmad ;

Zhu, Yanmin ;

Zakarya, Muhammad .

MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (20) :31401-31433

[3]

[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4

[4] The power of ensembles for active learning in image classification [J].

Beluch, William H. ;

Genewein, Tim ;

Nuernberger, Andreas ;

Koehler, Jan M. .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9368-9377

[5]

Brust Clemens-Alexander, 2018, ARXIV180909875

[6] Ranked batch-mode active learning [J].

Cardoso, Thiago N. C. ;

Silva, Rodrigo M. ;

Canuto, Sergio ;

Moro, Mirella M. ;

Goncalves, Marcos A. .

INFORMATION SCIENCES, 2017, 379 :313-337

[7]

Chen K., 2019, arXiv:1906.07155

[8]

Desai S. V., 2019, 30 BRIT MACH VIS C 2

[9] Towards Fine-grained Sampling for Active Learning in Object Detection [J].

Desai, Sai Vikas ;

Balasubramanian, Vineeth N. .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :4010-4014

[10]

Haussmann E, 2020, IEEE INT VEH SYM, P1430, DOI 10.1109/IV47402.2020.9304793

← 1 2 3 4 →