Enhanced object recognition from remote sensing images based on hybrid convolution and transformer structure

被引:0
|
作者
Nguyen, Hoanh [1 ]
Ngo, Thanh Quyen [1 ]
Uyen, Hoang Thi Tu [1 ]
Duong, Mien Ka [1 ]
机构
[1] Ind Univ Ho Chi Minh City, Fac Elect Engn Technol, Ho Chi Minh City 700000, Vietnam
关键词
Object detection; Remote sensing images; Depthwise separable convolution; Attention mechanisms; NETWORK;
D O I
10.1007/s12145-025-01751-x
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Object recognition in remote sensing images presents unique challenges due to the diverse scales, shapes, and distributions of objects, particularly small and complex ones. Existing frameworks, such as RT-DETR, struggle to accurately detect small objects because of their limited ability to extract fine-grained details and integrate multi-scale information. To overcome these challenges, we propose an enhanced object recognition model based on a hybrid convolution and transformer structure. This model improves two critical components of the original RT-DETR by introducing the Multi-Scale Adaptive Attention Module (MSAAM) and the Hybrid Feature Fusion Module (HFFM), specifically designed to enhance feature extraction and integration. The MSAAM strengthens the ResNet backbone by adaptively combining local and global information, ensuring the effective extraction of fine-grained details while emphasizing features critical for small object detection. The HFFM, integrated into the final stages of the neck, employs a dual-branch design to balance fine-grained local detail extraction and large-scale contextual understanding. By employing group convolution, depthwise separable convolution, and attention mechanisms, the HFFM mitigates the loss of fine details caused by downsampling while leveraging the expanded receptive field for broader context understanding. Experimental results demonstrate that the proposed model achieves superior object recognition performance, particularly for small objects, making it well-suited for remote sensing applications.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Object Recognition in Remote Sensing Images Using Combined Deep Features
    Jiang, Bitao
    Li, Xiaobin
    Yin, Lu
    Yue, Wenzhen
    Wang, Shengjin
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 606 - 610
  • [22] OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images
    Zhao, Jiaqi
    Ding, Zeyu
    Zhou, Yong
    Zhu, Hancheng
    Du, Wen-Liang
    Yao, Rui
    El Saddik, Abdulmotaleb
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [23] Hybrid Transformer and Convolution for Image Compressed Sensing
    Nan, Ruili
    Sun, Guiling
    Zheng, Bowen
    Zhang, Pengchen
    ELECTRONICS, 2024, 13 (17)
  • [24] Aircraft Recognition from Remote Sensing Images Based on Machine Vision
    Chen, Lu
    Zhou, Liming
    Liu, Jinming
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2020, 16 (04): : 795 - 808
  • [25] Remote Sensing Object Recognition Based on Transfer Learning
    Dan, Zhiping
    Sang, Nong
    Chen, Yanfei
    Chen, Xi
    2013 10TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2013, : 930 - 934
  • [26] Object and topology extraction from remote sensing images
    Maire, C
    Datcu, M
    2005 International Conference on Image Processing (ICIP), Vols 1-5, 2005, : 1949 - 1952
  • [27] Cross Teaching-Enhanced Multispectral Remote Sensing Object Detection With Transformer
    Zhu, Jiahe
    Zhang, Huan
    Li, Simin
    Wang, Shengjin
    Ma, Hongbing
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 2401 - 2413
  • [28] QETR: A Query-Enhanced Transformer for Remote Sensing Image Object Detection
    Ma, Xinyu
    Lv, Pengyuan
    Zhong, Yanfei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [29] Improved Deformable Convolution Method for Aircraft Object Detection in Flight Based on Feature Separation in Remote Sensing Images
    Yu, Lijian
    Zhi, Xiyang
    Hu, Jianming
    Zhang, Shuqing
    Niu, Ruize
    Zhang, Wei
    Jiang, Shikai
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 8313 - 8323
  • [30] Shallow multiplexing and multiscale dilation convolution combined attention based oriented object detection in remote sensing images
    Wang, Jiangtao
    Shi, Jiawei
    DIGITAL SIGNAL PROCESSING, 2025, 156