Multi-modal object detection using unsupervised transfer learning and adaptation techniques

被引：1

作者：

Abbott, Rachael ^{[1
]}

Robertson, Neil ^{[1
]}

del Rincon, Jesus Martinez ^{[1
]}

Connor, Barry ^{[2
]}

机构：

[1] Queens Univ Belfast, Belfast, Antrim, North Ireland

[2] Thales UK, London, England

来源：

ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS | 2019年 / 11169卷

关键词：

Object detection; Transfer learning; Modality adaption; Thermal imagery; Multi-modal detection;

D O I：

10.1117/12.2532794

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep neural networks achieve state-of-the-art performance on object detection tasks with RGB data. However, there are many advantages of detection using multi-modal imagery for defence and security operations. For example, the IR modality offers persistent surveillance and is essential in poor lighting conditions and 24hr operation. It is, therefore, crucial to create an object detection system which can use IR imagery. Collecting and labelling large volumes of thermal imagery is incredibly expensive and time-consuming. Consequently, we propose to mobilise labelled RGB data to achieve detection in the IR modality. In this paper, we present a method for multi-modal object detection using unsupervised transfer learning and adaptation techniques. We train faster RCNN on RGB imagery and test with a thermal imager. The images contain object classes; people and land vehicles and represent real-life scenes which include clutter and occlusions. We improve the baseline F1-score by up to 20% through training with an additional loss function, which reduces the difference between RGB and IR feature maps. This work shows that unsupervised modality adaptation is possible, and we have the opportunity to maximise the use of labelled RGB imagery for detection in multiple modalities. The novelty of this work includes; the use of the IR imagery, modality adaption from RGB to IR for object detection and the ability to use real-life imagery in uncontrolled environments. The practical impact of this work to the defence and security community is an increase in performance and the saving of time and money in data collection and annotation.

引用

页数：10

共 50 条

[21] Multi-Modal Dataset Generation using Domain Randomization for Object Detection
Marez, Diego
Nans, Lena
Borden, Samuel
GEOSPATIAL INFORMATICS XI, 2021, 11733
[22] Multi-modal deep feature learning for RGB-D object detection
Xu, Xiangyang
Li, Yuncheng
Wu, Gangshan
Luo, Jiebo
PATTERN RECOGNITION, 2017, 72 : 300 - 313
[23] Deep learning based object detection from multi-modal sensors: an overview
Ye Liu
Shiyang Meng
Hongzhang Wang
Jun Liu
Multimedia Tools and Applications, 2024, 83 : 19841 - 19870
[24] UNSUPERVISED BUILDING CHANGE DETECTION IN MULTI-MODAL SAR IMAGES USING CYCLEGAN
Bergamasco, Luca
Bovolo, Francesca
IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 483 - 486
[25] Deep learning based object detection from multi-modal sensors: an overview
Liu, Ye
Meng, Shiyang
Wang, Hongzhang
Liu, Jun
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19841 - 19870
[26] Unsupervised Multi-modal Style Transfer for Cardiac MR Segmentation
Chen, Chen
Ouyang, Cheng
Tarroni, Giacomo
Schlemper, Jo
Qiu, Huaqi
Bai, Wenjia
Rueckert, Daniel
STATISTICAL ATLASES AND COMPUTATIONAL MODELS OF THE HEART: MULTI-SEQUENCE CMR SEGMENTATION, CRT-EPIGGY AND LV FULL QUANTIFICATION CHALLENGES, 2020, 12009 : 209 - 219
[27] Multi-modal Hate Speech Detection using Machine Learning
Boishakhi, Fariha Tahosin
Shill, Ponkoj Chandra
Alam, Md Golam Rabiul
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4496 - 4499
[28] Deep Multi-modal Object Detection for Autonomous Driving
Ennajar, Amal
Khouja, Nadia
Boutteau, Remi
Tlili, Fethi
2021 18TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2021, : 7 - 11
[29] Fast unsupervised multi-modal hashing based on piecewise learning
Li, Yinan
Long, Jun
Tu, Zerong
Yang, Zhan
KNOWLEDGE-BASED SYSTEMS, 2024, 299
[30] Multi-modal object detection via transformer network
Liu, Wenbing
Wang, Haibo
Gao, Quanxue
Zhu, Zhaorui
IET IMAGE PROCESSING, 2023, 17 (12) : 3541 - 3550

← 1 2 3 4 5 →