SiSe: Simultaneous and Sequential Transformers for multi-label activity recognition

被引:0
|
作者
Chen, Zhao-Min [1 ]
Jin, Xin [2 ]
Chan, Sixian [3 ]
机构
[1] Wenzhou Univ, Key Lab Intelligent Informat Safety & Emergency Zh, Wenzhou 325035, Peoples R China
[2] Samsung Elect China R&D Ctr, Samsung Elect, Nanjing 210012, Peoples R China
[3] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label; Activity recognition; Sequential transformer; Hierarchical structure;
D O I
10.1016/j.patcog.2024.110844
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label activity recognition is extremely challenging, where multiple activities may appear simultaneously or sequentially in a video. While previous works have realized the temporal co-occurrence of activities, the sequential order of activities have been largely overlooked. However, we argue that the sequential order of activities should also be preserved in correlation modeling, because shuffling the order might not form a semantically meaningful video. In this work, we present plug-and-play Simultaneous and Sequential Transformer (SiSe) modules for multi-label activity recognition. Upon frame features of all time steps, SiSe enhances spatiotemporal feature learning for multi-label activity recognition, by capturing the simultaneous and sequential activity correlations. Specifically, we employ a Simultaneous Transformer module to connect multiple activities that probably appear at each frame, and a hierarchical Sequential Transformer module to efficiently capture the sequential activity correlations in an order-preserved manner. Despite the straightforward and class- agnostic design of SiSe, it can outperform state-of-the-art approaches on three multi-label activity recognition benchmarks. In particular, we verify the significance of preserving the sequential order of activities with our Sequential Transformer in correlation modeling. We also conduct ablation studies and visual analysis for better understanding of our SiSe.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] MULTI-LABEL TEXT CLASSIFICATION WITH A ROBUST LABEL DEPENDENT REPRESENTATION
    Alfaro, Rodrigo
    Allende, Hector
    2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 3: COMPUTER-AIDED DESIGN, MANUFACTURING AND MANAGEMENT, 2011, : 211 - 214
  • [32] Exploiting Label Dependency and Feature Similarity for Multi-Label Classification
    Nedungadi, Prema
    Haripriya, H.
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 2196 - 2200
  • [33] Semi-Supervised Multi-Label Learning from Crowds via Deep Sequential Generative Model
    Shi, Wanli
    Sheng, Victor S.
    Li, Xiang
    Gu, Bin
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1141 - 1149
  • [34] Multi-label body constitution recognition via HWmixer-MLP for facial and tongue images
    Zhang, Mengjian
    Wen, Guihua
    Yang, Pei
    Wang, Changjun
    Chen, Chuyun
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 269
  • [35] Real-Time Multi-Label Upper Gastrointestinal Anatomy Recognition from Gastroscope Videos
    Yu, Tao
    Hu, Huiyi
    Zhang, Xinsen
    Lei, Honglin
    Liu, Jiquan
    Hu, Weiling
    Duan, Huilong
    Si, Jianmin
    APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [36] Dynamic Classifier Chains for Multi-label Learning
    Trajdos, Pawel
    Kurzynski, Marek
    PATTERN RECOGNITION, DAGM GCPR 2019, 2019, 11824 : 567 - 580
  • [37] Feature Selection for Multi-label Classification Problems
    Doquire, Gauthier
    Verleysen, Michel
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2011, PT I, 2011, 6691 : 9 - 16
  • [38] Set Labelling using Multi-label Classification
    Sanjaya, Ngurah Agus E. R.
    Read, Jesse
    Abdessalem, Talel
    Bressan, Stephane
    IIWAS2018: THE 20TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2014, : 216 - 220
  • [39] Multi-label large margin hierarchical perceptron
    Woolam, Clay
    Khan, Latifur
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2008, 1 (01) : 5 - 22
  • [40] Multi-Label Requirements Classification with Large Taxonomies
    Abdeen, Waleed
    Unterkalmsteiner, Michael
    Wnuk, Krzysztof
    Chirtoglou, Alexandros
    Schimanski, Christoph
    Goli, Heja
    32ND IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE, RE 2024, 2024, : 264 - 274