Unified diffusion-based object detection in multi-modal and low-light remote sensing images

被引:0
|
作者
Sun, Xu [1 ]
Yu, Yinhui [1 ]
Cheng, Qing [1 ]
机构
[1] Jilin Univ, Sch Commun Engn, Changchun, Peoples R China
基金
中国国家自然科学基金;
关键词
computer vision; convolutional neural nets; image processing;
D O I
10.1049/ell2.70093
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Remote sensing object detection remains a challenge under complex conditions such as low light, adverse weather, modality attacks or losses. Previous approaches typically alleviate this problem by enhancing visible images or leveraging multi-modal fusion technologies. In view of this, the authors propose a unified framework based on YOLO-World that combines the advantages of both schemes, achieving more adaptable and robust remote sensing object detection in complex real-world scenarios. This framework introduces a unified modality modelling strategy, allowing the model to learn abundant object features from multiple remote sensing datasets. Additionally, a U-fusion neck based on the diffusion method is designed to effectively remove modality-specific noise and generate missing complementary features. Extensive experiments were conducted on four remote sensing image datasets: Multimodal VEDAI, DroneVehicle, unimodal VisDrone and UAVDT. This approach achieves average precision scores of 50.5%$\%$, 55.3%$\%$, 25.1%$\%$, and 20.7%$\%$, which outperforms advanced multimodal remote sensing object detection methods and low-light image enhancement techniques.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Multi-Modal Prototypes for Few-Shot Object Detection in Remote Sensing Images
    Liu, Yanxing
    Pan, Zongxu
    Yang, Jianwei
    Zhou, Peiling
    Zhang, Bingchen
    REMOTE SENSING, 2024, 16 (24)
  • [2] LOW-LIGHT PEDESTRIAN DETECTION FROM RGB IMAGES USING MULTI-MODAL KNOWLEDGE DISTILLATION
    Kruthiventi, Srinivas S. S.
    Sahay, Pratyush
    Biswal, Rajesh
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 4207 - 4211
  • [3] Classification of multi-modal remote sensing images based on knowledge graph
    Fang, Jianyong
    Yan, Xuefeng
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (15) : 4815 - 4835
  • [4] Object detection in multi-modal images using genetic programming
    Bhanu, B
    Lin, YQ
    APPLIED SOFT COMPUTING, 2004, 4 (02) : 175 - 201
  • [5] RGB-INFRARED MULTI-MODAL REMOTE SENSING OBJECT DETECTION USING CNN AND TRANSFORMER BASED FEATURE FUSION
    Tian, Tao
    Cai, Jiang
    Xu, Yang
    Wu, Zebin
    Wei, Zhihui
    Chanussot, Jocelyn
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5728 - 5731
  • [6] Ship detection in haze and low-light remote sensing images via colour balance and DCNN
    Song, Runyu
    Li, Tieshan
    Li, Taoying
    APPLIED OCEAN RESEARCH, 2023, 139
  • [7] Deep-Learning for Change Detection Using Multi-Modal Fusion of Remote Sensing Images: A Review
    Saidi, Souad
    Idbraim, Soufiane
    Karmoude, Younes
    Masse, Antoine
    Arbelo, Manuel
    REMOTE SENSING, 2024, 16 (20)
  • [8] Vehicle detection method based on remote sensing image fusion of superpixel and multi-modal sensing network
    Lian Y.
    Li G.
    Shen S.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2023, 31 (06): : 905 - 919
  • [9] Diffusion model for multi-scale ship object detection and recognition in remote sensing images
    Chen, Lei
    Wang, Bin
    Liu, Ying
    Zhao, Shuang
    Guan, Qinghe
    Li, Guandian
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [10] RSCNN: A CNN-Based Method to Enhance Low-Light Remote-Sensing Images
    Hu, Linshu
    Qin, Mengjiao
    Zhang, Feng
    Du, Zhenhong
    Liu, Renyi
    REMOTE SENSING, 2021, 13 (01) : 1 - 13