Faster ILOD: Incremental learning for object detectors based on faster RCNN

Cited by: 68

Authors:

Peng, Can [1]

Zhao, Kun [1]

Lovell, Brian C. [1]

Affiliations:

[1] Univ Queensland, Sch ITEE, Brisbane, Qld, Australia

Source:

PATTERN RECOGNITION LETTERS | 2020 / Volume 140

Funding:

Australian Research Council;

Keywords:

Deep learning; Object detection; Incremental learning;

DOI:

10.1016/j.patrec.2020.09.030

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The human vision and perception system is inherently incremental: new knowledge is continually learned over time whilst existing knowledge is retained. Deep learning networks, by contrast, are ill-equipped for incremental learning. When a well-trained network is adapted to new categories, its performance on the old categories degrades dramatically. To address this problem, incremental learning methods that preserve the old knowledge of deep learning models have been explored. However, the state-of-the-art incremental object detector employs an external fixed region proposal method that increases overall computation time and reduces accuracy compared to Region Proposal Network (RPN) based object detectors such as Faster RCNN. The purpose of this paper is to design an efficient end-to-end incremental object detector using knowledge distillation. We first evaluate and analyze the performance of the RPN-based detector with classic distillation on incremental detection tasks. Then, we introduce multi-network adaptive distillation that properly retains knowledge from the old categories when finetuning the model for a new task. Experiments on the benchmark datasets, PASCAL VOC and COCO, demonstrate that the proposed incremental detector based on Faster RCNN is more accurate as well as being 13 times faster than the baseline detector. © 2020 Elsevier B.V. All rights reserved.

Citation

Pages: 109-115

Page count: 7
