Gesture and Action Discovery for Evaluating Virtual Environments with Semi-Supervised Segmentation of Telemetry Records

Cited by: 14
Authors
Batch, Andrea [1]
Lee, Kyungjun [2]
Maddali, Hanuma Teja [2]
Elmqvist, Niklas [1]
Affiliations
[1] Univ Maryland, Coll Informat Studies, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
Source
2018 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY (AIVR) | 2018
Keywords
DOI
10.1109/AIVR.2018.00009
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we propose a novel pipeline for semi-supervised behavioral coding of videos of users testing a device or interface, with an eye toward human-computer interaction evaluation for virtual reality. Our system applies existing statistical techniques for time-series classification, including e-divisive change point detection and "Symbolic Aggregate approXimation" (SAX) with agglomerative hierarchical clustering, to 3D pose telemetry data. These techniques create classes of short segments of single-person video data: short actions of potential interest called "micro-gestures." A long short-term memory (LSTM) layer then learns these micro-gestures from pose features generated purely from video via a pre-trained OpenPose convolutional neural network (CNN) to predict their occurrence in unlabeled test videos. We present and discuss the results from testing our system on the single-user pose videos of the CMU Panoptic Dataset.
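As an illustration of the SAX step named in the abstract, the following is a minimal sketch of the general technique, not the authors' implementation. The function name `sax_word`, the four-letter alphabet, and the requirement that the series length be a multiple of the segment count are assumptions made for this example; a real pipeline would run on pose telemetry channels rather than toy lists.

```python
from statistics import NormalDist, mean, pstdev


def sax_word(series, n_segments, alphabet="abcd"):
    """Convert a 1-D numeric sequence into a SAX word (illustrative sketch).

    Hypothetical helper: the name and defaults are assumptions for this example.
    """
    if len(series) % n_segments != 0:
        raise ValueError("this sketch assumes len(series) is a multiple of n_segments")

    # 1. Z-normalize so the values are comparable to a standard normal.
    mu, sigma = mean(series), pstdev(series)
    z = [0.0 if sigma == 0 else (x - mu) / sigma for x in series]

    # 2. Piecewise Aggregate Approximation: average each equal-width segment.
    width = len(z) // n_segments
    paa = [mean(z[i * width:(i + 1) * width]) for i in range(n_segments)]

    # 3. Breakpoints that split the standard normal into equiprobable regions.
    k = len(alphabet)
    breakpoints = [NormalDist().inv_cdf(j / k) for j in range(1, k)]

    # 4. Map each segment mean to the letter of the region it falls in.
    return "".join(alphabet[sum(v > b for b in breakpoints)] for v in paa)


if __name__ == "__main__":
    print(sax_word([0, 0, 0, 0, 10, 10, 10, 10], 2))  # low then high segment
    print(sax_word([1, 2, 3, 4, 5, 6, 7, 8], 4))      # steadily rising series
```

Words produced this way can be compared with a string or lookup-table distance and passed to an agglomerative clustering routine, which is the role SAX plays alongside clustering in the pipeline the abstract describes.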
Pages: 1-10
Number of pages: 10