Attentional and adversarial feature mimic for efficient object detection

被引：6

作者：

Wang, Hongxing ^{[1
]}

Chen, Yuquan ^{[1
]}

Wu, Mei ^{[1
]}

Zhang, Xin ^{[1
]}

Huang, Zheng ^{[1
]}

Mao, Weiping ^{[2
]}

机构：

[1] Jiangsu Frontier Elect Power Technol Co Ltd, Nanjing 210036, Peoples R China

[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China

来源：

VISUAL COMPUTER | 2023年 / 39卷 / 02期

关键词：

Object detection; Knowledge distillation; Attention network; Feature mimic;

D O I：

10.1007/s00371-021-02363-4

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In this paper, we focus on learning efficient object detectors by knowledge (or network) distillation. More specifically, we mimic features from deeper and larger teacher networks to help train better efficient student networks. Unlike the previous method that mimics features through minimizing an L2 loss between feature generated by teacher and student networks, we propose an attentional and adversarial feature mimic (AAFM) method which consists of an attentional feature mimic module and an adversarial feature mimic module, where the former module uses an attentional L2 loss which learns to pay attention to important object-related regions for feature mimic, and the latter module uses an adversarial loss which makes features generated by teacher and student networks have similar distributions. We apply our AAFM method in the two-stage Faster R-CNN detector. Experiments on the PASCAL VOC 2007 and COCO datasets show that our method consistently improves the performance of detectors without feature mimic or with other feature mimic methods. In particular, our method obtains 72.1% mAP on the PASCAL VOC 2007 dataset using the ResNet-18-based detector.

引用

页码：639 / 650

页数：12

共 67 条

[1]

[Anonymous], 2017, ARXIV170201478

[2]

[Anonymous], 2015, P 3 INT C LEARNING

[3]

[Anonymous], 2014, P BRIT MACH VIS C 20

[4]

[Anonymous], 2015, CVPR

[5]

Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473

[6]

Carion Nicolas., 2020, EUR C COMPUT VIS

[7]

Chen GB, 2017, ADV NEUR IN, V30

[8] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[9]

Chorowski J, 2015, ADV NEUR IN, V28

[10]

Dai X., ARXIV PREPRINT ARXIV

← 1 2 3 4 5 6 7 →