Vision Transformers for Human Activity Recognition Using WiFi Channel State Information

被引:25
作者
Luo, Fei [1 ]
Khan, Salabat [2 ]
Jiang, Bin [3 ]
Wu, Kaishun [4 ]
机构
[1] Great Bay Univ, Sch Comp & Informat Technol, Dongguan 523000, Peoples R China
[2] Qilu Inst Technol, Sch Comp & Informat Engn, Jinan 250202, Shandong, Peoples R China
[3] Univ Petr, Coll Oceanog & Space Informat, Qingdao 266580, Peoples R China
[4] Hong Kong Univ Sci & Technol, Informat Hub, Guangzhou 510000, Peoples R China
关键词
Wireless fidelity; Human activity recognition; Transformers; Sensors; Feature extraction; Deep learning; Computer architecture; Human activity recognition (HAR); vision transformer (ViT); WiFi channel state information (CSI); WiFi sensing;
D O I
10.1109/JIOT.2024.3375337
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Wireless sensing and communication evolved separately in the past. However, integrated sensing and communication (ISAC) unlocks a new era of mobile network capabilities, with WiFi emerging as a prime candidate. By leveraging existing WiFi infrastructure and frequencies, ISAC enables powerful services like accurate localization and human activity recognition (HAR). WiFi-based HAR is a prime example powered by the magic of ISAC. WiFi channel state information (CSI) is susceptible to human movement disturbances; the alterations in CSI mirror the dynamic attributes of human activities. Given the intricate relationship between human activities and CSI, numerous deep learning models have been introduced to enhance HAR accuracy. Recently, transformer-based models have achieved excellent performance in various tasks, including speech recognition, natural language processing, and image classification. This has spurred research into incorporating transformer-based models into WiFi sensing applications. However, their application in WiFi-based HAR remains nascent. Vision transformer (ViT) is well-suited for analyzing WiFi CSI signals in the form of spectra, such as the Doppler frequency spectrum frequently utilized in related studies, owing to its data structure mimicking that of images. In this study, we explored five widely used ViT architectures (vanilla ViT, SimpleViT, DeepViT, SwinTransformer, and CaiT) for WiFi CSI-based HAR using two publicly available data sets, UT-HAR and NTU-Fi HAR. Our work aims to assess and compare the performance of diverse ViT architectures for WiFi CSI-based HAR and provide guidelines for WiFi-based HAR modeling and ViT selection, considering accuracy, model size, and computational efficiency.
引用
收藏
页码:28111 / 28122
页数:12
相关论文
共 72 条
[1]   H2HI-Net: A Dual-Branch Network for Recognizing Human-to-Human Interactions From Channel-State Information [J].
Abdel-Basset, Mohamed ;
Hawash, Hossam ;
Moustafa, Nour ;
Mohammad, Nazeeruddin .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (12) :10010-10021
[2]  
Abdelnasser Heba., 2015, Proceedings of the 16th ACM International Symposium on Mobile Ad Hoc Networking and Computing, P277
[3]   Device free human gesture recognition using Wi-Fi CSI: A survey [J].
Ahmed, Hasmath Farhana Thariq ;
Ahmad, Hafisoh ;
Aravind, C., V .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 87
[4]   Keystroke Recognition Using WiFi Signals [J].
Ali, Kamran ;
Liu, Alex X. ;
Wang, Wei ;
Shahzad, Muhammad .
MOBICOM '15: PROCEEDINGS OF THE 21ST ANNUAL INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING, 2015, :90-102
[5]   Human activity recognition based on wrist PPG via the ensemble method [J].
Almanifi, Omair Rashed Abdulwareth ;
Khairuddin, Ismail Mohd ;
Razman, Mohd Azraai Mohd ;
Musa, Rabiu Muazu ;
Majeed, Anwar P. P. Abdul .
ICT EXPRESS, 2022, 8 (04) :513-517
[6]  
Beyer Lucas., 2022, arXiv, DOI 10.48550/arXiv.2205.01580
[7]  
Butyrskaya D., 2023, Human movement recognition using deep learning on visualized CSI Wi-Fi data
[8]   Exposing the CSI: A Systematic Investigation of CSI-based Wi-Fi Sensing Capabilities and Limitations [J].
Cominelli, Marco ;
Gringoli, Francesco ;
Restuccia, Francesco .
2023 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS, PERCOM, 2023, :81-90
[9]  
Cubuk ED, 2019, Arxiv, DOI arXiv:1909.13719
[10]   WiFi CSI-Based Human Activity Recognition Using Deep Recurrent Neural Network [J].
Ding, Jianyang ;
Wang, Yong .
IEEE ACCESS, 2019, 7 :174257-174269