Learning Semantic Keypoints for Object Detection in Aerial Images

被引：0

作者：

Kim, Minsu ^{[1
]}

Joung, Sunghun ^{[2
]}

Song, Taeyong ^{[1
]}

Kim, Hanjae ^{[1
]}

Sohn, Kwanghoon ^{[1
,3
]}

机构：

[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul 03722, South Korea

[2] Hyundai Motor Co, Seoul 06182, South Korea

[3] Korea Inst Sci & Technol, Artificial Intelligence & Robot Inst, Seoul 23792, South Korea

来源：

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS | 2023年 / 20卷

基金：

新加坡国家研究基金会;

关键词：

Semantics; Object detection; Feature extraction; Location awareness; Heating systems; Head; Image color analysis; Convolutional neural networks (CNNs); equivariant representation; oriented object detection; remote sensing;

D O I：

10.1109/LGRS.2022.3226201

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Object detection in aerial images has achieved remarkable progress with the advent of deep convolutional neural networks (CNNs). It is, however, still a challenging task since the objects in aerial images are arbitrarily oriented and often densely packed. In this letter, we propose a novel method for oriented object detection in aerial images that represents objects as rotation equivariant semantic keypoints. Unlike conventional methods that represent object rotation according to angles from each axis in the Cartesian coordinate system, we represent object using a canonical orientation to ensure rotation equivariance. We accomplish this by representing an object as semantic keypoints, where each keypoint of the object consistently corresponds to the semantic part, regardless of rotation variation. To this end, we define the "head" point of the object as the canonical orientation and the remaining bounding box vectors as semantic keypoints in clockwise order. To discriminate visual attributes between different categories, we further use category-specific semantic keypoints, so that object classification and localization can be jointly solved in a cooperative manner. Our experiments demonstrate the effectiveness of rotation equivariant semantic keypoints on oriented object detection.

引用

页数：5

共 36 条

[1] Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery [J].

Azimi, Seyed Majid ;

Vig, Eleonora ;

Bahmanyar, Reza ;

Koerner, Marco ;

Reinartz, Peter .

COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 :150-165

[2] Anchor-Free Oriented Proposal Generator for Object Detection [J].

Cheng, Gong ;

Wang, Jiabao ;

Li, Ke ;

Xie, Xingxing ;

Lang, Chunbo ;

Yao, Yanqing ;

Han, Junwei .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[3] Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].

Cheng, Gong ;

Zhou, Peicheng ;

Han, Junwei .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415

[4]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[5] Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].

Ding, Jian ;

Xue, Nan ;

Long, Yang ;

Xia, Gui-Song ;

Lu, Qikai .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853

[6] CenterNet: Keypoint Triplets for Object Detection [J].

Duan, Kaiwen ;

Bai, Song ;

Xie, Lingxi ;

Qi, Honggang ;

Huang, Qingming ;

Tian, Qi .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577

[7] The Pascal Visual Object Classes (VOC) Challenge [J].

Everingham, Mark ;

Van Gool, Luc ;

Williams, Christopher K. I. ;

Winn, John ;

Zisserman, Andrew .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338

[8] Beyond Bounding-Box: Convex-hull Feature Adaptation for Oriented and Densely Packed Object Detection [J].

Guo, Zonghao ;

Liu, Chang ;

Zhang, Xiaosong ;

Jiao, Jianbin ;

Ji, Xiangyang ;

Ye, Qixiang .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :8788-8797

[9] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[10]

Kingma DP, 2014, ADV NEUR IN, V27

← 1 2 3 4 →