RepMobile: A MobileNet-Like Network With Structural Reparameterization for Sensor-Based Human Activity Recognition

被引:1
作者
Yu, Jianglai [1 ]
Zhang, Lei [1 ]
Cheng, Dongzhou [1 ]
Bu, Can [1 ]
Wu, Hao [2 ]
Song, Aiguo [3 ]
机构
[1] Nanjing Normal Univ, Sch Elect & Automat Engn, Nanjing 210023, Peoples R China
[2] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650500, Peoples R China
[3] Southeast Univ, Sch Instrument Sci & Engn, Nanjing 210096, Peoples R China
关键词
Computer architecture; Sensors; Accuracy; Training; Human activity recognition; Low latency communication; Feature extraction; Deep learning; human activity recognition; mobile devices; sensor; structural reparameterization; CONVOLUTIONAL NEURAL-NETWORK; ACCELEROMETER DATA;
D O I
10.1109/JSEN.2024.3412736
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
How to design and deploy efficient neural network backbones always plays a crucial role in sensor-based human activity recognition (HAR) on mobile devices, which have targeted at reducing floating-point operations (FLOPs) and parameter count, hence offering a faster activity inference. However, since the most commonly used efficiency indicators such as FLOPs and parameter count are indirect metrics for activity inference, they could not directly translate into practical inference latency in real-world. This article is the first attempt to mitigate such discrepancy between theoretic computational efficiency and practical inference latency. Our main goal is to identify and analyze key performance bottlenecks including activations and branching, which might potentially incur higher inference latency for on-device activity inference due to the degree of parallelism and memory access (MAC) cost. To this end, we provide an in-depth analysis about how the indirect metrics and different structural design choices correlate with practical inference latency, which has been rarely explored in ubiquitous HAR scenario. In particular, based on the above analyses, motivated by an idea of structural reparameterization, we present an efficient architecture called RepMobile built on MobileNet by decoupling training-time and inference-time architectures, where a linearly overparameterized model is used to improve accuracy at training time, and a reparameterization is performed for faster activity inference. We validate the effectiveness and efficiency of our proposed method on several public HAR benchmarks, including PAMAP2, UNIMIB-SHAR, WISDM, and USC-HAD, as well as a resource-restrained mobile device, indicating a better tradeoff between accuracy and on-device latency.
引用
收藏
页码:24224 / 24237
页数:14
相关论文
共 56 条
[1]  
Agarap A. F., 2018, arXiv, DOI DOI 10.48550/ARXIV.1803.08375
[2]  
[Anonymous], 2008, P 25 INT C MACH LEAR
[3]  
[Anonymous], 2011, SIGKDD Explor. Newslett., DOI 10.1145/1964897.1964918
[4]   Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks [J].
Chen, Jierun ;
Kao, Shiu-Hong ;
He, Hao ;
Zhuo, Weipeng ;
Wen, Song ;
Lee, Chul-Ho ;
Chan, S. -H. Gary .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :12021-12031
[5]  
Chen KX, 2019, Arxiv, DOI arXiv:1905.08948
[6]   Deep Learning for Sensor-based Human Activity Recognition: Overview, Challenges, and Opportunities [J].
Chen, Kaixuan ;
Zhang, Dalin ;
Yao, Lina ;
Guo, Bin ;
Yu, Zhiwen ;
Liu, Yunhao .
ACM COMPUTING SURVEYS, 2021, 54 (04)
[7]   Learning hierarchical time series data augmentation invariances via contrastive supervision for human activity recognition [J].
Cheng, Dongzhou ;
Zhang, Lei ;
Bu, Can ;
Wu, Hao ;
Song, Aiguo .
KNOWLEDGE-BASED SYSTEMS, 2023, 276
[8]   Applying singular value decomposition on accelerometer data for 1D convolutional neural network based fall detection [J].
Cho, H. ;
Yoon, S. M. .
ELECTRONICS LETTERS, 2019, 55 (06) :320-+
[9]   Sensor-based and vision-based human activity recognition: A comprehensive survey [J].
Dang, L. Minh ;
Min, Kyungbok ;
Wang, Hanxiang ;
Piran, Md. Jalil ;
Lee, Cheol Hee ;
Moon, Hyeonjoon .
PATTERN RECOGNITION, 2020, 108
[10]   Diverse Branch Block: Building a Convolution as an Inception-like Unit [J].
Ding, Xiaohan ;
Zhang, Xiangyu ;
Han, Jungong ;
Ding, Guiguang .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :10881-10890