Multi-modal object detection using unsupervised transfer learning and adaptation techniques

被引:1
|
作者
Abbott, Rachael [1 ]
Robertson, Neil [1 ]
del Rincon, Jesus Martinez [1 ]
Connor, Barry [2 ]
机构
[1] Queens Univ Belfast, Belfast, Antrim, North Ireland
[2] Thales UK, London, England
来源
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS | 2019年 / 11169卷
关键词
Object detection; Transfer learning; Modality adaption; Thermal imagery; Multi-modal detection;
D O I
10.1117/12.2532794
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks achieve state-of-the-art performance on object detection tasks with RGB data. However, there are many advantages of detection using multi-modal imagery for defence and security operations. For example, the IR modality offers persistent surveillance and is essential in poor lighting conditions and 24hr operation. It is, therefore, crucial to create an object detection system which can use IR imagery. Collecting and labelling large volumes of thermal imagery is incredibly expensive and time-consuming. Consequently, we propose to mobilise labelled RGB data to achieve detection in the IR modality. In this paper, we present a method for multi-modal object detection using unsupervised transfer learning and adaptation techniques. We train faster RCNN on RGB imagery and test with a thermal imager. The images contain object classes; people and land vehicles and represent real-life scenes which include clutter and occlusions. We improve the baseline F1-score by up to 20% through training with an additional loss function, which reduces the difference between RGB and IR feature maps. This work shows that unsupervised modality adaptation is possible, and we have the opportunity to maximise the use of labelled RGB imagery for detection in multiple modalities. The novelty of this work includes; the use of the IR imagery, modality adaption from RGB to IR for object detection and the ability to use real-life imagery in uncontrolled environments. The practical impact of this work to the defence and security community is an increase in performance and the saving of time and money in data collection and annotation.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Unsupervised Multi-modal Learning
    Iqbal, Mohammed Shameer
    ADVANCES IN ARTIFICIAL INTELLIGENCE (AI 2015), 2015, 9091 : 343 - 346
  • [2] Multi-Modal Object Tracking and Image Fusion With Unsupervised Deep Learning
    LaHaye, Nicholas
    Ott, Jordan
    Garay, Michael J.
    El-Askary, Hesham Mohamed
    Linstead, Erik
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (08) : 3056 - 3066
  • [3] Multi-modal anchor adaptation learning for multi-modal summarization
    Chen, Zhongfeng
    Lu, Zhenyu
    Rong, Huan
    Zhao, Chuanjun
    Xu, Fan
    NEUROCOMPUTING, 2024, 570
  • [4] Multi-modal cyber security based object detection by classification using deep learning and background suppression techniques
    Srinivas, Kalyanapu
    Singh, Laxman
    Chavva, Subba Reddy
    Dappuri, Bhasker
    Chandrasekaran, Saravanan
    Qamar, Shamimul
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 103
  • [5] Imagery in multi-modal object learning
    Jüttner, M
    Rentschler, I
    BEHAVIORAL AND BRAIN SCIENCES, 2002, 25 (02) : 197 - +
  • [6] Small Object Detection Technology Using Multi-Modal Data Based on Deep Learning
    Park, Chi-Won
    Seo, Yuri
    Sun, Teh-Jen
    Lee, Ga-Won
    Huh, Eui-Nam
    2023 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN, 2023, : 420 - 422
  • [7] Object detection in multi-modal images using genetic programming
    Bhanu, B
    Lin, YQ
    APPLIED SOFT COMPUTING, 2004, 4 (02) : 175 - 201
  • [8] Human head detection using multi-modal object features
    Luo, Y
    Murphey, YL
    Khairallah, F
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 2134 - 2139
  • [9] Unsupervised scene detection and commentator building using multi-modal chains
    Gert-Jan Poulisse
    Yorgos Patsis
    Marie-Francine Moens
    Multimedia Tools and Applications, 2014, 70 : 159 - 175
  • [10] Unsupervised Change Detection in Multi-Modal SAR Images using CycleGAN
    Bergamasco, Luca
    Bovolo, Francesca
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXVIII, 2022, 12267