Tiny Object Detection via Regional Cross Self-Attention Network

Cited by: 7
Authors
Cheng, Keyang [1]
Cui, Honggang [1]
Ghafoor, Humaira Abdul [1]
Wan, Hao [1]
Mao, Qirong [1]
Zhan, Yongzhao [1]
Affiliations
[1] Jiangsu Univ, Sch Comp Sci & Commun Engn, Zhenjiang 212013, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Detectors; Object detection; Encoding; Feature extraction; Transformers; Image coding; Generators; Tiny object detection; context aggregation; vision transformer; self-attention; position coding; feature fusion;
DOI
10.1109/TCSVT.2022.3232688
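Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract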
As vision sensor technology continues to evolve, the requirements for detecting targets of interest in the images captured by the sensors are increasing. Because they offer fast detection and high accuracy, geometric key point-based solutions are favored in industry. However, the real world contains a large number of small and fuzzy objects. Geometric key point detectors do not effectively utilize the contextual features of the region of interest, leading to excessive false positives and false negatives. In this work, a simple, effective, and interpretable tiny object detection method called Regional Cross Self-Attention Object Detection Network (RCSANet) is proposed. It adopts Region Proposal Networks and transformers to capture regional background relations and uses these relations to generate key point sequences. The regional cross self-attention mechanism is introduced to curtail computational redundancy and minimize the interference of redundant information with the target region. Additionally, a position coding called dynamic implicit position coding is proposed to work together with regional cross self-attention. Dynamic implicit position coding can encode arbitrarily long input sequences. The computational cost of RCSANet is significantly lower than that of state-of-the-art object detection solutions. Moreover, RCSANet improves performance on four benchmark datasets (MS COCO, TinyPerson, DOTA, and AI-TOD) by about 3.0% AP.
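
The abstract states that restricting attention to regions curtails redundant computation. The following is a minimal illustrative sketch of that general idea only, not RCSANet's actual mechanism, which this record does not describe. It assumes a hypothetical integer region label per token (`region_ids`), uses the tokens directly as queries, keys, and values (no learned projections), and lets each token attend only to tokens in its own region. The helper `pairwise_score_count` compares the number of query-key products against full attention.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def region_restricted_attention(tokens, region_ids):
    """Self-attention where each token attends only to tokens in its own region.

    tokens: list of equal-length feature vectors; used as queries, keys and
            values at once (an assumption made for brevity).
    region_ids: hypothetical integer region label per token.
    """
    dim = len(tokens[0])
    scale = 1.0 / math.sqrt(dim)
    outputs = []
    for i, query in enumerate(tokens):
        # Indices of the tokens that share the query's region.
        peers = [j for j, r in enumerate(region_ids) if r == region_ids[i]]
        scores = [scale * sum(q * k for q, k in zip(query, tokens[j]))
                  for j in peers]
        weights = softmax(scores)
        # Weighted sum of the peer tokens, one feature dimension at a time.
        out = [sum(w * tokens[j][d] for w, j in zip(weights, peers))
               for d in range(dim)]
        outputs.append(out)
    return outputs


def pairwise_score_count(region_ids, restricted):
    """Number of query-key dot products computed for the given layout."""
    n = len(region_ids)
    if not restricted:
        return n * n
    counts = {}
    for r in region_ids:
        counts[r] = counts.get(r, 0) + 1
    return sum(c * c for c in counts.values())
```

With four tokens split evenly into two regions, full attention computes 16 query-key products while the region-restricted variant computes 8; the gap widens as the number of regions grows.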
Pages: 8984-8996
Page count: 13