Diffusion-Based Unsupervised Pre-training for Automated Recognition of Vitality Forms

被引:1
作者
Canovi, Noemi [1 ]
Montagna, Federico [1 ]
Niewiadomski, Radoslaw [2 ]
Sciutti, Alessandra [3 ]
Di Cesare, Giuseppe [3 ,4 ]
Beyan, Cigdem [5 ]
机构
[1] Univ Trento, Dep Informat Engn & Comp Sci, Trento, Italy
[2] Univ Genoa, Dept Informat Bioengn Robot & Syst Engn, Genoa, Italy
[3] Ist Italiano Tecnol, CONTACT Unit, Genoa, Italy
[4] Univ Parma, Dept Med & Surg, Parma, Italy
[5] Univ Verona, Dept Comp Sci, Verona, Italy
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED VISUAL INTERFACES, AVI 2024 | 2024年
基金
欧洲研究理事会;
关键词
Vitality forms; nonverbal communication; unsupervised pre-training; diffusion models; autoencoders; gestures; actions; trajectory; EXPRESSION;
D O I
10.1145/3656650.3656689
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Social communication involves interpreting nonverbal behaviors, detecting and anticipating others' actions and intentions. Actions convey not only the goal and motor intention but also the form, i.e., variations in action execution. These variations, termed vitality forms, communicate attitudes during interactions, such as being gentle, calm, vigorous, and rude. Automatic vitality form recognition may have several applications in social robotics, social skills training, and therapy, yet it remains a rarely studied topic. This paper introduces an unsupervised pre-training approach that utilizes 2D-body key point trajectories as input and employs diffusion models to derive more effective features for representing these trajectories. The features learned from the diffusion model's encoder are utilized to train a multilayer perceptron for vitality form recognition. Experimental analysis showcases the superior performance of the proposed method not only across various videos but also for action classes not encountered during training.
引用
收藏
页数:9
相关论文
共 51 条
  • [1] Bodily Behaviors in Social Interaction: Novel Annotations and State-of-the-Art Evaluation
    Balazia, Michal
    Mueller, Philipp
    Tanczos, Akos Levente
    von Liechtenstein, August
    Bremond, Francois
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [2] Expression of emotion in the kinematics of locomotion
    Barliya, Avi
    Omlor, Lars
    Giese, Martin A.
    Berthoz, Alain
    Flash, Tamar
    [J]. EXPERIMENTAL BRAIN RESEARCH, 2013, 225 (02) : 159 - 176
  • [3] Modeling Multiple Temporal Scales of Full-Body Movements for Emotion Classification
    Beyan, Cigdem
    Karumuri, Sukumar
    Volpe, Gualtiero
    Camurri, Antonio
    Niewiadomski, Radoslaw
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1070 - 1081
  • [4] Moving as a Leader: Detecting Emergent Leadership in Small Groups using Body Pose
    Beyan, Cigdem
    Katsageorgiou, Vasiliki-Maria
    Murino, Vittorio
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1425 - 1433
  • [5] Beyan Cigdem, 2023, Comput. Surveys, V56, P1
  • [6] OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields
    Cao, Zhe
    Hidalgo, Gines
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 172 - 186
  • [7] Virtual agent multimodal mimicry of humans
    Caridakis, George
    Raouzaiou, Amaryllis
    Bevacqua, Elisabetta
    Mancini, Maurizio
    Karpouzis, Kostas
    Malatesta, Lori
    Pelachaud, Catherine
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2007, 41 (3-4) : 367 - 388
  • [8] Chen SF, 2023, Arxiv, DOI [arXiv:2211.09788, DOI 10.48550/ARXIV.2211.09788]
  • [9] Unleashing the Transferability Power of Unsupervised Pre-Training for Emotion Recognition in Masked and Unmasked Facial Images
    D'Inca, Moreno
    Beyan, Cigdem
    Niewiadomski, Radoslaw
    Barattin, Simone
    Sebe, Nicu
    [J]. IEEE ACCESS, 2023, 11 : 90876 - 90890
  • [10] Perceived gesture dynamics in nonverbal expression of emotion
    Dael, Nele
    Goudbeek, Martijn
    Scherer, K. R.
    [J]. PERCEPTION, 2013, 42 (06) : 642 - 657