SOLARNet: A single stage regression based framework for efficient and robust object recognition in aerial images

被引:2
作者
Saini, Nandini [1 ]
Chattopadhyay, Chiranjoy [2 ]
Das, Debasis [1 ]
机构
[1] Indian Inst Technol, Comp Sci & Engn, Jodhpur 342037, India
[2] FLAME Univ, Sch Comp & Data Sci, Pune 412115, India
关键词
Adversarial attacks; Deep learning; Aerial image; Object detection; DOTA; DIOR; REMOTE-SENSING IMAGES;
D O I
10.1016/j.patrec.2023.05.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object recognition and localization play a crucial role in aerial images and their applications. The aerial images are challenging due to the large aspect ratio, arbitrary orientation, variation in scales, and non-uniform and cluttered object distribution. To address these challenges, we propose an efficient and robust model called the Simultaneous Object Localization and Recognition Network (SOLARNet), which is a fu-sion network that integrates two different sub-networks: PixelAttentionDetector (PD) and RotationDetec-tor (RD). The PD considers features from different scales and cluttered objects, while RD handles rotation invariance, giving horizontal and oriented object detection results. The state-of-the-art model fails to im-prove accuracy when images are adversarially attacked. SOLARNet is not only efficient in terms of accu-racy but also robust concerning to Fast Gradient Sign Method (FGSM), Basic Iterative Method (BIM), Jaco-bian Based Saliency Map (JSM) adversarial attacks, which is the crucial factor for any mission-critical sys-tem. We have performed experiments and achieved the accuracy on the publicly available DOTA dataset (75.00% mAP, 66.40% mAP) and DIOR dataset (88.60% mAP, 81.50% mAP) for horizontal and oriented ob-ject recognition tasks respectively while having high inference speed. Qualitative and quantitative results reported in this paper substantiate the superiority of SOLARNet over other state-of-the-art methodologies. & COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页码:37 / 43
页数:7
相关论文
共 44 条
[1]   Threat of Adversarial Attacks on Deep Learning in Computer Vision: A Survey [J].
Akhtar, Naveed ;
Mian, Ajmal .
IEEE ACCESS, 2018, 6 :14410-14430
[2]   Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery [J].
Azimi, Seyed Majid ;
Vig, Eleonora ;
Bahmanyar, Reza ;
Koerner, Marco ;
Reinartz, Peter .
COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 :150-165
[3]   Traffic congestion-aware graph-based vehicle rerouting framework from aerial imagery [J].
Bayraktar, Ertugrul ;
Korkmaz, Burla Nur ;
Erarslan, Aras Umut ;
Celebi, Numan .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
[4]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
[5]   Weakly supervised vehicle detection in satellite images via multi-instance discriminative learning [J].
Cao, Liujuan ;
Luo, Feng ;
Chen, Li ;
Sheng, Yihan ;
Wang, Haibin ;
Wang, Cheng ;
Ji, Rongrong .
PATTERN RECOGNITION, 2017, 64 :417-424
[6]  
Carlini N, 2019, Arxiv, DOI arXiv:1902.06705
[7]  
Carlini N, 2017, PROCEEDINGS OF THE 10TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, AISEC 2017, P3, DOI 10.1145/3128572.3140444
[8]  
Chen L, 2020, Arxiv, DOI arXiv:1910.13222
[9]   Anchor-Free Oriented Proposal Generator for Object Detection [J].
Cheng, Gong ;
Wang, Jiabao ;
Li, Ke ;
Xie, Xingxing ;
Lang, Chunbo ;
Yao, Yanqing ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[10]   Dual-Aligned Oriented Detector [J].
Cheng, Gong ;
Yao, Yanqing ;
Li, Shengyang ;
Li, Ke ;
Xie, Xingxing ;
Wang, Jiabao ;
Yao, Xiwen ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60