MultiSense: Cross-labelling and Learning Human Activities Using Multimodal Sensing Data

被引：1

作者：

Zhang, Lan ^{[1
,2
]}

Zheng, Daren ^{[3
]}

Yuan, Mu ^{[3
]}

Han, Feng ^{[3
]}

Wu, Zhengtao ^{[3
]}

Liu, Mengjing ^{[3
]}

Li, Xiang-Yang ^{[3
]}

机构：

[1] Univ Sci & Technol China, Hefei 230026, Anhui, Peoples R China

[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei 230026, Anhui, Peoples R China

[3] Univ Sci & Technol China, Hefei 230026, Anhui, Peoples R China

来源：

ACM TRANSACTIONS ON SENSOR NETWORKS | 2023年 / 19卷 / 03期

基金：

国家重点研发计划;

关键词：

Multimodel sensing data; cross-labelling; cross-learning;

D O I：

10.1145/3578267

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To tap into the gold mine of data generated by Internet of Things (IoT) devices with unprecedented volume and value, there is an urgent need to efficiently and accurately label raw sensor data. To this end, we explore and leverage the hidden connections among the multimodal data collected by various sensing devices and propose to let different modal data complement and learn from each other. But it is challenging to align and fuse multimodal data without knowing their perception (and thus the correct labels). In this work, we propose MultiSense, a paradigm for automatically mining potential perception, cross-labelling each modal data, and then updating the learning models for recognizing human activity to achieve higher accuracy or even recognize new activities. We design innovative solutions for segmenting, aligning, and fusing multimodal data from different sensors, as well as model updating mechanism. We implement our framework and conduct comprehensive evaluations on a rich set of data. Our results demonstrate that MultiSense significantly improves the data usability and the power of the learning models. With nine diverse activities performed by users, our framework automatically labels multimodal sensing data generated by five different sensing mechanisms (video, smart watch, smartphone, audio, and wireless-channel) with an average accuracy 98.5%. Furthermore, it enables models of some modalities to learn unknown activities from other modalities and greatly improves the activity recognition ability.

引用

页数：26

共 47 条

[31] Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study [J].

Triguero, Isaac ;

Garcia, Salvador ;

Herrera, Francisco .

KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 42 (02) :245-284

[32] Long-Term Temporal Convolutions for Action Recognition [J].

Varol, Gul ;

Laptev, Ivan ;

Schmid, Cordelia .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (06) :1510-1517

[33] Particle Swarm Optimisation for Evolving Deep Neural Networks for Image Classification by Evolving and Stacking Transferable Blocks [J].

Wang, Bin ;

Xue, Bing ;

Zhang, Mengjie .

2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,

[34] SGSF: A Small Groups Based Serial Fusion Method [J].

Wang, Nian ;

Zhang, Zhe ;

Li, Tingting ;

Xiao, Jing ;

Cui, Li .

IPSN '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS, 2019, :97-108

[35] Convolutional Pose Machines [J].

Wei, Shih-En ;

Ramakrishna, Varun ;

Kanade, Takeo ;

Sheikh, Yaser .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4724-4732

[36]

Wilk Stefan, 2016, P 24 ACM INT C MULTI, P626

[37]

Wu JJ, 2015, PROC CVPR IEEE, P3460, DOI 10.1109/CVPR.2015.7298968

[38] Enabling Edge Devices that Learn from Each Other: Cross Modal Training for Activity Recognition [J].

Xing, Tianwei ;

Sandha, Sandeep Singh ;

Balaji, Bharathan ;

Chakraborty, Supriyo ;

Srivastava, Mani .

EDGESYS'18: PROCEEDINGS OF THE FIRST ACM INTERNATIONAL WORKSHOP ON EDGE SYSTEMS, ANALYTICS AND NETWORKING, 2018, :37-42

[39] Learning Multimodal Attention LSTM Networks for Video Captioning [J].

Xu, Jun ;

Yao, Ting ;

Zhang, Yongdong ;

Mei, Tao .

PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, :537-545

[40] DeepFusion: A Deep Learning Framework for the Fusion of Heterogeneous Sensory Data [J].

Xue, Hongfei ;

Jiang, Wenjun ;

Miao, Chenglin ;

Yuan, Ye ;

Ma, Fenglong ;

Ma, Xin ;

Wang, Yijiang ;

Yao, Shuochao ;

Xu, Wenyao ;

Zhang, Aidong ;

Su, Lu .

PROCEEDINGS OF THE 2019 THE TWENTIETH ACM INTERNATIONAL SYMPOSIUM ON MOBILE AD HOC NETWORKING AND COMPUTING (MOBIHOC '19), 2019, :151-160

← 1 2 3 4 5 →