Domain generalization with semi-supervised learning for people-centric activity recognition

被引:4
作者
Liu, Jing [1 ,2 ]
Zhu, Wei [1 ]
Li, Di [2 ]
Hu, Xing [3 ]
Song, Liang [1 ,2 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
[2] Shanghai East Bund Res Inst Networking Syst AI, Shanghai 202162, Peoples R China
[3] Univ Shanghai Sci & Technol, Sch Optoelect Informat & Comp Engn, Shanghai 200093, Peoples R China
关键词
activity recognition; deep learning; domain generalization; semi-supervised learning; adversarial training; TIME-SERIES; ADAPTATION;
D O I
10.1007/s11432-022-3860-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
People-centric activity recognition is one of the most critical technologies in a wide range of real-world applications, including intelligent transportation systems, healthcare services, and brain-computer interfaces. Large-scale data collection and annotation make the application of machine learning algorithms prohibitively expensive when adapting to new tasks. One way of circumventing this limitation is to train the model in a semi-supervised learning manner that utilizes a percentage of unlabeled data to reduce the labeling burden in prediction tasks. Despite their appeal, these models often assume that labeled and unlabeled data come from similar distributions, which leads to the domain shift problem caused by the presence of distribution gaps. To address these limitations, we propose herein a novel method for people-centric activity recognition, called domain generalization with semi-supervised learning (DGSSL), that effectively enhances the representation learning and domain alignment capabilities of a model. We first design a new autoregressive discriminator for adversarial training between unlabeled and labeled source domains, extracting domain-specific features to reduce the distribution gaps. Second, we introduce two reconstruction tasks to capture the task-specific features to avoid losing information related to representation learning while maintaining task-specific consistency. Finally, benefiting from the collaborative optimization of these two tasks, the model can accurately predict both the domain and category labels of the source domains for the classification task. We conduct extensive experiments on three real-world sensing datasets. The experimental results show that DGSSL surpasses the three state-of-the-art methods with better performance and generalization.
引用
收藏
页数:18
相关论文
共 66 条
  • [1] DynaLAP: Human Activity Recognition in Fixed Protocols via Semi-Supervised Variational Recurrent Neural Networks With Dynamic Priors
    An, Sungtae
    Gazi, Asim H.
    Inan, Omer T.
    [J]. IEEE SENSORS JOURNAL, 2022, 22 (18) : 17963 - 17976
  • [2] AdaptNet: Human Activity Recognition via Bilateral Domain Adaptation Using Semi-Supervised Deep Translation Networks
    An, Sungtae
    Medda, Alessio
    Sawka, Michael N.
    Hutto, Clayton J.
    Millard-Stafford, Mindy L.
    Appling, Scott
    Richardson, Kristine L. S.
    Inan, Omer T.
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (18) : 20398 - 20411
  • [3] Anguita D., 2012, Proceedings, P216
  • [4] Bousmalis K, 2016, ADV NEUR IN, V29
  • [5] Carpineti C, 2018, INT CONF PERVAS COMP, DOI 10.1109/PERCOMW.2018.8480119
  • [6] Chan A., 2020, PMLR, P1392
  • [7] Chen C, 2020, AAAI CONF ARTIF INTE, V34, P3422
  • [8] Chen DD, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2014
  • [9] Chen ZK, 2022, AAAI CONF ARTIF INTE, P6342
  • [10] A language-independent neural network for event detection
    Feng, Xiaocheng
    Qin, Bing
    Liu, Ting
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2018, 61 (09)