SiSe: Simultaneous and Sequential Transformers for multi-label activity recognition

被引:0
|
作者
Chen, Zhao-Min [1 ]
Jin, Xin [2 ]
Chan, Sixian [3 ]
机构
[1] Wenzhou Univ, Key Lab Intelligent Informat Safety & Emergency Zh, Wenzhou 325035, Peoples R China
[2] Samsung Elect China R&D Ctr, Samsung Elect, Nanjing 210012, Peoples R China
[3] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label; Activity recognition; Sequential transformer; Hierarchical structure;
D O I
10.1016/j.patcog.2024.110844
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label activity recognition is extremely challenging, where multiple activities may appear simultaneously or sequentially in a video. While previous works have realized the temporal co-occurrence of activities, the sequential order of activities have been largely overlooked. However, we argue that the sequential order of activities should also be preserved in correlation modeling, because shuffling the order might not form a semantically meaningful video. In this work, we present plug-and-play Simultaneous and Sequential Transformer (SiSe) modules for multi-label activity recognition. Upon frame features of all time steps, SiSe enhances spatiotemporal feature learning for multi-label activity recognition, by capturing the simultaneous and sequential activity correlations. Specifically, we employ a Simultaneous Transformer module to connect multiple activities that probably appear at each frame, and a hierarchical Sequential Transformer module to efficiently capture the sequential activity correlations in an order-preserved manner. Despite the straightforward and class- agnostic design of SiSe, it can outperform state-of-the-art approaches on three multi-label activity recognition benchmarks. In particular, we verify the significance of preserving the sequential order of activities with our Sequential Transformer in correlation modeling. We also conduct ablation studies and visual analysis for better understanding of our SiSe.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Multi-Label Retinal Disease Classification Using Transformers
    Rodriguez, Manuel Alejandro
    AlMarzouqi, Hasan
    Liatsis, Panos
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (06) : 2739 - 2750
  • [2] MLMO-HSM: Multi-label Multi-output Hybrid Sequential Model for multi-resident smart home activity recognition
    Ramanujam E.
    Perumal T.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (03) : 2313 - 2325
  • [3] Transformers for Multi-label Classification of Medical Text: An Empirical Comparison
    Yogarajan, Vithya
    Montiel, Jacob
    Smith, Tony
    Pfahringer, Bernhard
    ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2021), 2021, : 114 - 123
  • [4] Modeling Label Correlations with Latent Context for Multi-label Recognition
    Chen, Zhaomin
    Cui, Quan
    Deng, Ruoxi
    Hu, Jie
    Zhang, Guodao
    COMPUTER VISION - ECCV 2024, PT XXXIII, 2025, 15091 : 218 - 234
  • [5] Multi-label classification based ensemble learning for human activity recognition in smart home
    Jethanandani, Manan
    Sharma, Abhishek
    Perumal, Thinagaran
    Chang, Jieh-Ren
    INTERNET OF THINGS, 2020, 12
  • [6] Appliance Recognition with Combined Single- and Multi-label Approaches
    Manca, Marco Manolo
    Faustine, Anthony
    Pereira, Lucas
    PROCEEDINGS OF THE 2022 THE 9TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2022, 2022, : 388 - 392
  • [7] Radar emitter multi-label recognition based on residual network
    Hong-hai, Yu
    Xiao-peng, Yan
    Shao-kun, Liu
    Ping, Li
    Xin-hong, Hao
    DEFENCE TECHNOLOGY, 2022, 18 (03) : 410 - 417
  • [8] Multi-Label Relevant Vector Machine based Simultaneous Fault Diagnosis
    Song Chao
    Xie Lei
    Zeng Jiusun
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 4792 - 4796
  • [9] Disorder recognition in clinical texts using multi-label structured SVM
    Wutao Lin
    Donghong Ji
    Yanan Lu
    BMC Bioinformatics, 18
  • [10] Improving Multi-Label Facial Expression Recognition With Consistent and Distinct Attentions
    Jiang, Jing
    Deng, Weihong
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (03) : 1279 - 1288