Pose Uncertainty Aware Movement Synchrony Estimation via Spatial-Temporal Graph Transformer

被引:6
作者
Li, Jicheng [1 ]
Bhat, Anjana [1 ]
Barmaki, Roghayeh [1 ]
机构
[1] Univ Delaware, Newark, DE 19716 USA
来源
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022 | 2022年
关键词
deep learning; movement synchrony estimation; contrastive learning; transformer networks; knowledge distillation; autism spectrum disorder; NEURAL-NETWORKS; DATASETS;
D O I
10.1145/3536221.3556627
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The concept of movement synchrony is derived from the scientifc study of interacting dyads in the autism feld. Automated movement synchrony estimation has been achieved by utilizing deep learning models applied to other tasks, such as human activity recognition. To better adapt to the movement synchrony estimation task, we proposed a skeleton-based uncertainty-aware graph transformer incorporating joint confdence scores. We uniquely designed a joint position embedding shared between the same joints of interacting individuals and introduced a temporal similarity matrix in temporal attention computation considering the periodic intrinsic of body movements. To further improve the performance, we constructed a dataset for movement synchrony estimation using Human3.6M and pretrained our model on it via contrastive learning. We further applied knowledge distillation to alleviate information loss introduced by pose detector failure in a privacy-preserving way. Our method achieved an overall accuracy of 88.98% on PT13, a dataset collected from autism therapy interventions, and surpassed its counterpart approaches by a good margin. This work also has implications for synchronous movement activity recognition in group settings, with broad applications in education and sports.
引用
收藏
页码:73 / 82
页数:10
相关论文
共 66 条
  • [1] Movement Synchrony and Facial Synchrony as Diagnostic Features of Depression A Pilot Study
    Altmann, Uwe
    Bruemmel, Maria
    Meier, Julija
    Strauss, Bernhard
    [J]. JOURNAL OF NERVOUS AND MENTAL DISEASE, 2021, 209 (02) : 128 - 136
  • [2] Bai RW, 2022, Arxiv, DOI arXiv:2109.02860
  • [3] Berndt D.J., 1994, P KDD WORKSH SEATTL, V10, P359, DOI DOI 10.5555/3000850.3000887
  • [4] Bucilua C., 2006, 12 INT C KNOWL DISC, P535, DOI DOI 10.1145/1150402.1150464
  • [5] Calabro G., 2021, Progresses in Artificial Intelligence and Neural Systems, P543
  • [6] OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields
    Cao, Zhe
    Hidalgo, Gines
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 172 - 186
  • [7] D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation
    Chang, Chien-Yi
    Huang, De-An
    Sui, Yanan
    Li Fei-Fei
    Niebles, Juan Carlos
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3541 - 3550
  • [8] Chen T, 2020, PR MACH LEARN RES, V119
  • [9] Cascaded Pyramid Network for Multi-Person Pose Estimation
    Chen, Yilun
    Wang, Zhicheng
    Peng, Yuxiang
    Zhang, Zhiqiang
    Yu, Gang
    Sun, Jian
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7103 - 7112
  • [10] Unsupervised Synchrony Discovery in Human Interaction
    Chu, Wen-Sheng
    Zeng, Jiabei
    De la Torre, Fernando
    Cohn, Jeffrey F.
    Messinger, Daniel S.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3146 - 3154