Multi-modal object detection using unsupervised transfer learning and adaptation techniques

被引:1
作者
Abbott, Rachael [1 ]
Robertson, Neil [1 ]
del Rincon, Jesus Martinez [1 ]
Connor, Barry [2 ]
机构
[1] Queens Univ Belfast, Belfast, Antrim, North Ireland
[2] Thales UK, London, England
来源
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS | 2019年 / 11169卷
关键词
Object detection; Transfer learning; Modality adaption; Thermal imagery; Multi-modal detection;
D O I
10.1117/12.2532794
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks achieve state-of-the-art performance on object detection tasks with RGB data. However, there are many advantages of detection using multi-modal imagery for defence and security operations. For example, the IR modality offers persistent surveillance and is essential in poor lighting conditions and 24hr operation. It is, therefore, crucial to create an object detection system which can use IR imagery. Collecting and labelling large volumes of thermal imagery is incredibly expensive and time-consuming. Consequently, we propose to mobilise labelled RGB data to achieve detection in the IR modality. In this paper, we present a method for multi-modal object detection using unsupervised transfer learning and adaptation techniques. We train faster RCNN on RGB imagery and test with a thermal imager. The images contain object classes; people and land vehicles and represent real-life scenes which include clutter and occlusions. We improve the baseline F1-score by up to 20% through training with an additional loss function, which reduces the difference between RGB and IR feature maps. This work shows that unsupervised modality adaptation is possible, and we have the opportunity to maximise the use of labelled RGB imagery for detection in multiple modalities. The novelty of this work includes; the use of the IR imagery, modality adaption from RGB to IR for object detection and the ability to use real-life imagery in uncontrolled environments. The practical impact of this work to the defence and security community is an increase in performance and the saving of time and money in data collection and annotation.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Industrial object detection with multi-modal SSD: closing the gap between synthetic and real images
    Cohen, Julia
    Crispim-Junior, Carlos
    Chiappa, Jean-Marc
    Rodet, Laure Tougne
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 12111 - 12138
  • [42] Multi-Modal System for Walking Safety for the Visually Impaired: Multi-Object Detection and Natural Language Generation
    Lee, Jekyung
    Cha, Kyung-Ae
    Lee, Miran
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [43] Improving Transfer Learning in Unsupervised Language Adaptation
    Rocha, Gil
    Cardoso, Henrique Lopes
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 588 - 599
  • [44] Deep Multi-Modal Transfer Learning for Augmented Patient Acuity Assessment in the Intelligent ICU
    Shickel, Benjamin
    Davoudi, Anis
    Ozrazgat-Baslanti, Tezcan
    Ruppert, Matthew
    Bihorac, Azra
    Rashidi, Parisa
    [J]. FRONTIERS IN DIGITAL HEALTH, 2021, 3
  • [45] A multi camera unsupervised domain adaptation pipeline for object detection in cultural sites through adversarial learning and self-training
    Pasqualino, Giovanni
    Furnari, Antonino
    Farinella, Giovanni Maria
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 222
  • [46] Research on ship safety risk early warning model integrating transfer learning and multi-modal learning
    Wu, Zhizheng
    Wang, Shengzheng
    Xu, He
    Shi, Faqin
    Li, Qian
    Li, Leyao
    Qian, Feng
    [J]. APPLIED OCEAN RESEARCH, 2024, 150
  • [47] Re-transfer learning and multi-modal learning assisted early diagnosis of Alzheimer's disease
    Fang, Meie
    Jin, Zhuxin
    Qin, Feiwei
    Peng, Yong
    Jiang, Chao
    Pan, Zhigeng
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (20) : 29159 - 29175
  • [48] Re-transfer learning and multi-modal learning assisted early diagnosis of Alzheimer’s disease
    Meie Fang
    Zhuxin Jin
    Feiwei Qin
    Yong Peng
    Chao Jiang
    Zhigeng Pan
    [J]. Multimedia Tools and Applications, 2022, 81 : 29159 - 29175
  • [49] Incremental learning based multi-domain adaptation for object detection
    Wei, Xing
    Liu, Shaofan
    Xiang, Yaoci
    Duan, Zhangling
    Zhao, Chong
    Lu, Yang
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 210 (210)
  • [50] Transfer Learning Method for Object Detection Model Using Genetic Algorithm
    Ito, Ryuji
    Nobuhara, Hajime
    Kato, Shigeru
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2022, 26 (05) : 776 - 783