Domain generalization with semi-supervised learning for people-centric activity recognition

Cited by: 4

Authors:

Liu, Jing [1,2]

Zhu, Wei [1]

Li, Di [2]

Hu, Xing [3]

Song, Liang [1,2]

Affiliations:

[1] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China

[2] Shanghai East Bund Res Inst Networking Syst AI, Shanghai 202162, Peoples R China

[3] Univ Shanghai Sci & Technol, Sch Optoelect Informat & Comp Engn, Shanghai 200093, Peoples R China

Source:

SCIENCE CHINA-INFORMATION SCIENCES | 2025 / Vol. 68 / Issue 01

Keywords:

activity recognition; deep learning; domain generalization; semi-supervised learning; adversarial training; TIME-SERIES; ADAPTATION;

DOI:

10.1007/s11432-022-3860-y

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

People-centric activity recognition is one of the most critical technologies in a wide range of real-world applications, including intelligent transportation systems, healthcare services, and brain-computer interfaces. Large-scale data collection and annotation make the application of machine learning algorithms prohibitively expensive when adapting to new tasks. One way of circumventing this limitation is to train the model in a semi-supervised learning manner that utilizes a percentage of unlabeled data to reduce the labeling burden in prediction tasks. Despite their appeal, these models often assume that labeled and unlabeled data come from similar distributions, which leads to the domain shift problem caused by the presence of distribution gaps. To address these limitations, we propose herein a novel method for people-centric activity recognition, called domain generalization with semi-supervised learning (DGSSL), that effectively enhances the representation learning and domain alignment capabilities of a model. We first design a new autoregressive discriminator for adversarial training between unlabeled and labeled source domains, extracting domain-specific features to reduce the distribution gaps. Second, we introduce two reconstruction tasks to capture the task-specific features to avoid losing information related to representation learning while maintaining task-specific consistency. Finally, benefiting from the collaborative optimization of these two tasks, the model can accurately predict both the domain and category labels of the source domains for the classification task. We conduct extensive experiments on three real-world sensing datasets. The experimental results show that DGSSL surpasses the three state-of-the-art methods with better performance and generalization.


Pages: 18

66 references in total

