Tiny Object Detection via Regional Cross Self-Attention Network

被引:3
|
作者
Cheng, Keyang [1 ]
Cui, Honggang [1 ]
Ghafoor, Humaira Abdul [1 ]
Wan, Hao [1 ]
Mao, Qirong [1 ]
Zhan, Yongzhao [1 ]
机构
[1] Jiangsu Univ, Sch Comp Sci & Commun Engn, Zhenjiang 212013, Peoples R China
基金
中国国家自然科学基金;
关键词
Detectors; Object detection; Encoding; Feature extraction; Transformers; Image coding; Generators; Tiny object detection; context aggregation; vision transformer; self-attention; position coding; feature fusion;
D O I
10.1109/TCSVT.2022.3232688
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As vision sensor technology continues to evolve, the requirements for detecting targets of interest in the images captured by the sensors are increasing. Considering fast detection and high accuracy, the industry favors geometric key point-based solutions. However, there are a large number of small and fuzzy objects in the real world. Geometric key point detectors do not effectively utilize the contextual features of the region of interest, leading to excessive false positive and false negative results. In this work, a simple, effective, and interpretable tiny object detection method called Regional Cross Self-Attention Object Detection Network (RCSANet) is proposed. It adopts Region Proposal Networks and transformers to capture regional background relations and uses regional background relations to generate key point sequences. The regional cross self-attention mechanism is introduced to curtail computation redundancy and minimize the interference of redundant information to the target region. Additionally, a position coding called dynamic implicit position coding is proposed to cooperate with regional cross self-attentiveness. Dynamic implicit location coding can encode arbitrarily long input sequences. The computational cost of RCSANet is significantly lower than that of state-of-the-art object detection solutions. Moreover, RCSANet improves the performance on the four benchmark datasets, of MSCOCO, Tinyperson, DOTA, and AI-TOD, by about 3.0%AP.
引用
收藏
页码:8984 / 8996
页数:13
相关论文
共 50 条
  • [1] SpotNet: Self-Attention Multi-Task Network for Object Detection
    Perreault, Hughes
    Bilodeau, Guillaume-Alexandre
    Saunier, Nicolas
    Heritier, Maguelonne
    2020 17TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2020), 2020, : 230 - 237
  • [2] Improving Object Detection Quality by Incorporating Global Contexts via Self-Attention
    Lee, Donghyeon
    Kim, Joonyoung
    Jung, Kyomin
    ELECTRONICS, 2021, 10 (01) : 1 - 15
  • [3] Regional Prediction-Aware Network With Cross-Scale Self-Attention for Ship Detection in SAR Images
    Zhang, Lili
    Liu, Yuxuan
    Huang, Yufeng
    Qu, Lele
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [4] Rethinking Self-Attention for Multispectral Object Detection
    Hu, Sijie
    Bonardi, Fabien
    Bouchafa, Samia
    Prendinger, Helmut
    Sidibe, Desire
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 16300 - 16311
  • [5] Save the Tiny, Save the All: Hierarchical Activation Network for Tiny Object Detection
    Guo, Guangqian
    Chen, Pengfei
    Yu, Xuehui
    Han, Zhenjun
    Ye, Qixiang
    Gao, Shan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 221 - 234
  • [6] CAG-FPN: CHANNEL SELF-ATTENTION GUIDED FEATURE PYRAMID NETWORK FOR OBJECT DETECTION
    Chang, Jie
    Dai, Huhe
    Zheng, Yuan
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 9616 - 9620
  • [7] Enhancing small object detection in point clouds with self-attention voting network
    Zhu, Minghao
    Wang, Gaihua
    Li, Mingjie
    Long, Qian
    Zhou, Zhengshu
    OPTICAL ENGINEERING, 2024, 63 (04)
  • [8] Sampling Equivariant Self-Attention Networks for Object Detection in Aerial Images
    Yang, Guo-Ye
    Li, Xiang-Li
    Xiao, Zi-Kai
    Mu, Tai-Jiang
    Martin, Ralph R.
    Hu, Shi-Min
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 6413 - 6425
  • [9] Object Detection Algorithm Based on Context Information and Self-Attention Mechanism
    Liang, Hong
    Zhou, Hui
    Zhang, Qian
    Wu, Ting
    SYMMETRY-BASEL, 2022, 14 (05):
  • [10] Joint self-attention and branch sampling for object detection on drone imagery
    Zhang Y.
    Wu C.
    Liu Y.
    Zhang T.
    Zheng Y.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2023, 31 (18): : 2723 - 2735