BiCLR: Radar–Camera-Based Cross-Modal Bi-Contrastive Learning for Human Motion Recognition

被引:1
作者
Chen, Yuh-Shyan [1 ]
Cheng, Kuang-Hung [1 ]
机构
[1] Natl Taipei Univ, Dept Comp Sci & Informat Engn, Taipei 23741, Taiwan
关键词
Radar; Cameras; Task analysis; Radar imaging; Transformers; Human activity recognition; Sensors; Camera; cross-modal; contrastive learning; human motion recognition; radar;
D O I
10.1109/JSEN.2023.3344789
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Radar-based human motion recognition is gaining attention due to its inherent resistance to lighting conditions, especially in healthcare and safety applications concerning personal privacy. In this article, we propose a novel cross-modal bi-contrastive learning model named "BiCLR," which utilizes a Transformer-based network for temporal modeling to conduct instance discrimination in both single- and cross-modal modalities in a self-supervised learning manner. To enhance data density, a new radar data format, the "radar combination map (RCM)," is presented to seamlessly integrate range-Doppler map (RDM), range-azimuth map (RAM), and range-elevation map (REM) into a single map. The objective of this article is to address the inherent sparsity of radar data through cross-modality and newly introduced RCM, offering a transferable framework for various kinds of downstream tasks, advancing understanding through radar-based recognition. After a comprehensive evaluation, the pretrained encoder demonstrates effectiveness in a new human motion recognition task using only radar data, despite being trained on a significantly smaller dataset. The experimental results clearly demonstrate BiCLR's capability to utilize cross-modal and contrastive learning methods, as well as the improved performance in downstream tasks.
引用
收藏
页码:4102 / 4119
页数:18
相关论文
共 63 条
[1]   Activity Classification Based on Feature Fusion of FMCW Radar Human Motion Micro-Doppler Signatures [J].
Abdu, Fahad Jibrin ;
Zhang, Yixiong ;
Deng, Zhenmiao .
IEEE SENSORS JOURNAL, 2022, 22 (09) :8648-8662
[2]  
Al Hadhrami E, 2018, 2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD), P148, DOI 10.1109/ICAIBD.2018.8396184
[3]   Multi-Modal Cross Learning for an FMCW Radar Assisted by Thermal and RGB Cameras to Monitor Gestures and Cooking Processes [J].
Altmann, Marco ;
Ott, Peter ;
Stache, Nicolaj C. ;
Waldschmidt, Christian .
IEEE ACCESS, 2021, 9 :22295-22303
[4]   ViViT: A Video Vision Transformer [J].
Arnab, Anurag ;
Dehghani, Mostafa ;
Heigold, Georg ;
Sun, Chen ;
Lucic, Mario ;
Schmid, Cordelia .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :6816-6826
[5]   A Joint Global-Local Network for Human Pose Estimation With Millimeter Wave Radar [J].
Cao, Zhongping ;
Ding, Wen ;
Chen, Rihui ;
Zhang, Jianxiong ;
Guo, Xuemei ;
Wang, Guoli .
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (01) :434-446
[6]   HIGH-RESOLUTION FREQUENCY-WAVENUMBER SPECTRUM ANALYSIS [J].
CAPON, J .
PROCEEDINGS OF THE IEEE, 1969, 57 (08) :1408-&
[7]   DIAT-μRadHAR (Micro-Doppler Signature Dataset) & μRadNet (A Lightweight DCNN)-For Human Suspicious Activity Recognition [J].
Chakraborty, Mainak ;
Kumawat, Harish C. ;
Dhavale, Sunita Vikrant ;
Raj, A. Arockia Bazil .
IEEE SENSORS JOURNAL, 2022, 22 (07) :6851-6858
[8]  
Chen HY, 2022, 2022 IEEE MTT-S INTERNATIONAL MICROWAVE BIOMEDICAL CONFERENCE (IMBIOC), P245, DOI [10.1109/IMBioC52515.2022.9790101, 10.1109/IMBIOC52515.2022.9790101]
[9]   A Three-Stage Low-Complexity Human Fall Detection Method Using IR-UWB Radar [J].
Chen, Mengxia ;
Yang, Zhaocheng ;
Lai, Jialei ;
Chu, Ping ;
Lin, Jinghong .
IEEE SENSORS JOURNAL, 2022, 22 (15) :15154-15168
[10]   A HAND GESTURE RECOGNITION METHOD FOR MMWAVE RADAR BASED ON ANGLE-RANGE JOINT TEMPORAL FEATURE [J].
Chen, Qin ;
Li, Yiwei ;
Cui, Zongyong ;
Cao, Zongjie .
2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, :2650-2653