CSITime: Privacy-preserving human activity recognition using WiFi channel state information

被引:52
作者
Yadav, Santosh Kumar [1 ,2 ,3 ]
Sai, Siva [4 ]
Gundewar, Akshay [4 ]
Rathore, Heena [5 ]
Tiwari, Kamlesh [4 ]
Pandey, Hari Mohan [6 ]
Mathur, Mohit [3 ]
机构
[1] Acad Sci & Innovat Res AcSIR, Ghaziabad 201002, UP, India
[2] CSIR Cent Elect Engn Res Inst CEERI, Cyber Phys Syst, Pilani 333031, Rajasthan, India
[3] DeepBlink LLC, 30 N Gould St Ste R, Sheridan, WY 82801 USA
[4] Birla Inst Technol & Sci Pilani, Dept CSIS, Pilani Campus, Pilani 333031, Rajasthan, India
[5] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX USA
[6] Edge Hill Univ, Dept Comp Sci, Ormskirk, Lancs, England
关键词
Human activity recognition; WiFi channel state information; Time series classification; Data augmentation;
D O I
10.1016/j.neunet.2021.11.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human activity recognition (HAR) is an important task in many applications such as smart homes, sports analysis, healthcare services, etc. Popular modalities for human activity recognition involving computer vision and inertial sensors are in the literature for solving HAR, however, they face serious limitations with respect to different illumination, background, clutter, obtrusiveness, and other factors. In recent years, WiFi channel state information (CSI) based activity recognition is gaining momentum due to its many advantages including easy deployability, and cost-effectiveness. This work proposes CSITime, a modified InceptionTime network architecture, a generic architecture for CSI-based human activity recognition. We perceive CSI activity recognition as a multi-variate time series problem. The methodology of CSITime is threefold. First, we pre-process CSI signals followed by data augmentation using two label-mixing strategies - mixup and cutmix to enhance the neural network's learning. Second, in the basic block of CSITime, features from multiple convolutional kernels are concatenated and passed through a self-attention layer followed by a fully connected layer with Mish activation. CSITime network consists of six such blocks followed by a global average pooling layer and a final fully connected layer for the final classification. Third, in the training of the neural network, instead of adopting general training procedures such as early stopping, we use one-cycle policy and cosine annealing to monitor the learning rate. The proposed model has been tested on publicly available benchmark datasets, i.e., ARIL, StanWiFi, and SignFi datasets. The proposed CSITime has achieved accuracy of 98.20%, 98%, and 95.42% on ARIL, StanWiFi, and SignFi datasets, respectively, for WiFi-based activity recognition. This is an improvement on state-of-the-art accuracies by 3.3%, 0.67%, and 0.82% on ARIL, StanWiFi, and SignFi datasets, respectively. In lab-5 users' scenario of the SignFi dataset, which has the training and testing data from different distributions, our model achieved accuracy was 2.17% higher than state-of-the-art, which shows the comparative robustness of our model. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页码:11 / 21
页数:11
相关论文
共 55 条
[51]   CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features [J].
Yun, Sangdoo ;
Han, Dongyoon ;
Oh, Seong Joon ;
Chun, Sanghyuk ;
Choe, Junsuk ;
Yoo, Youngjoon .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6022-6031
[52]  
Zhang H., 2018, ARXIV171009412
[53]  
Zhang M., 2019, INT C LEARN REPR
[54]   DRCDN: learning deep residual convolutional dehazing networks [J].
Zhang, Shengdong ;
He, Fazhi .
VISUAL COMPUTER, 2020, 36 (09) :1797-1808
[55]   WiFi and Vision Multimodal Learning for Accurate and Robust Device -Free Human Activity Recognition [J].
Zou, Han ;
Yang, Jianfei ;
Das, Hari Prasanna ;
Liu, Huihan ;
Zhou, Yuxun ;
Spanos, Costas J. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :426-433