Pose Uncertainty Aware Movement Synchrony Estimation via Spatial-Temporal Graph Transformer

被引：6

作者：

Li, Jicheng ^{[1
]}

Bhat, Anjana ^{[1
]}

Barmaki, Roghayeh ^{[1
]}

机构：

[1] Univ Delaware, Newark, DE 19716 USA

来源：

PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022 | 2022年

关键词：

deep learning; movement synchrony estimation; contrastive learning; transformer networks; knowledge distillation; autism spectrum disorder; NEURAL-NETWORKS; DATASETS;

D O I：

10.1145/3536221.3556627

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The concept of movement synchrony is derived from the scientifc study of interacting dyads in the autism feld. Automated movement synchrony estimation has been achieved by utilizing deep learning models applied to other tasks, such as human activity recognition. To better adapt to the movement synchrony estimation task, we proposed a skeleton-based uncertainty-aware graph transformer incorporating joint confdence scores. We uniquely designed a joint position embedding shared between the same joints of interacting individuals and introduced a temporal similarity matrix in temporal attention computation considering the periodic intrinsic of body movements. To further improve the performance, we constructed a dataset for movement synchrony estimation using Human3.6M and pretrained our model on it via contrastive learning. We further applied knowledge distillation to alleviate information loss introduced by pose detector failure in a privacy-preserving way. Our method achieved an overall accuracy of 88.98% on PT13, a dataset collected from autism therapy interventions, and surpassed its counterpart approaches by a good margin. This work also has implications for synchronous movement activity recognition in group settings, with broad applications in education and sports.

引用

页码：73 / 82

页数：10

共 66 条

[1] Movement Synchrony and Facial Synchrony as Diagnostic Features of Depression A Pilot Study
Altmann, Uwe
Bruemmel, Maria
Meier, Julija
Strauss, Bernhard
[J]. JOURNAL OF NERVOUS AND MENTAL DISEASE, 2021, 209 (02) : 128 - 136
[2] Bai RW, 2022, Arxiv, DOI arXiv:2109.02860
[3] Berndt D.J., 1994, P KDD WORKSH SEATTL, V10, P359, DOI DOI 10.5555/3000850.3000887
[4] Bucilua C., 2006, 12 INT C KNOWL DISC, P535, DOI DOI 10.1145/1150402.1150464
[5] Calabro G., 2021, Progresses in Artificial Intelligence and Neural Systems, P543
[6] OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields
Cao, Zhe
Hidalgo, Gines
Simon, Tomas
Wei, Shih-En
Sheikh, Yaser
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 172 - 186
[7] D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation
Chang, Chien-Yi
Huang, De-An
Sui, Yanan
Li Fei-Fei
Niebles, Juan Carlos
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3541 - 3550
[8] Chen T, 2020, PR MACH LEARN RES, V119
[9] Cascaded Pyramid Network for Multi-Person Pose Estimation
Chen, Yilun
Wang, Zhicheng
Peng, Yuxiang
Zhang, Zhiqiang
Yu, Gang
Sun, Jian
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7103 - 7112
[10] Unsupervised Synchrony Discovery in Human Interaction
Chu, Wen-Sheng
Zeng, Jiabei
De la Torre, Fernando
Cohn, Jeffrey F.
Messinger, Daniel S.
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3146 - 3154

← 1 2 3 4 5 6 7 →