M2-Net: A Multi-scale Multi-level Feature Enhanced Network for Object Detection in Optical Remote Sensing Images

Cited by: 4

Authors:

Ye, Xinhai [1]

Xiong, Fengchao [1]

Lu, Jianfeng [1]

Zhao, Haifeng [2]

Zhou, Jun [3]

Affiliations:

[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China

[2] Jinling Inst Technol, Sch Software Engn, Nanjing, Peoples R China

[3] Griffith Univ, Sch Informat & Commun, Nathan, Qld, Australia

Source:

2020 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA) | 2020

Funding:

National Natural Science Foundation of China;

Keywords:

Convolutional neural network (CNN); object detection; feature fusion; remote sensing image; multi-scale analysis;

DOI:

10.1109/DICTA51227.2020.9363420

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Object detection in remote sensing images is challenging because objects vary in orientation and scale, are densely distributed, and appear against complex backgrounds. In this paper, we tackle this problem by proposing a novel multi-scale multi-level feature enhanced network (M2-Net) that integrates a Feature Map Enhancement (FME) module and a Feature Fusion Block (FFB) into Rotational RetinaNet. The FME module enhances weak features by factorizing the convolutional operation into two similar branches instead of a single branch, which broadens the receptive field with fewer parameters. This module is embedded into different layers of the backbone network to capture multi-scale semantic and location information for detection. The FFB shortens the information propagation path between low-level, high-resolution features in shallow layers and high-level semantic features in deep layers, facilitating more effective feature fusion and better detection, especially of small objects. Experimental results on three benchmark datasets show that our method not only outperforms many one-stage detectors but also achieves competitive accuracy at lower time cost than two-stage detectors.

Pages: 8
