Gestures over video streams to support remote collaboration on physical tasks

Cited by: 181
Authors
Fussell, SR [1]
Setlock, LD [1]
Yang, J [1]
Ou, JZ [1]
Mauer, E [1]
Kramer, ADI [1]
Affiliation
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Source
HUMAN-COMPUTER INTERACTION | 2004 / Vol. 19 / No. 3
Funding
U.S. National Science Foundation
DOI
10.1207/s15327051hci1903_3
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This article considers tools to support remote gesture in video systems being used to complete collaborative physical tasks, that is, tasks in which two or more individuals work together manipulating three-dimensional objects in the real world. We first discuss the process of conversational grounding during collaborative physical tasks, particularly the role of two types of gestures in the grounding process: pointing gestures, which are used to refer to task objects and locations, and representational gestures, which are used to represent the form of task objects and the nature of actions to be used with those objects. We then consider ways in which both pointing and representational gestures can be instantiated in systems for remote collaboration on physical tasks. We present the results of two studies that use a "surrogate" approach to remote gesture, in which images are intended to express the meaning of gestures through visible embodiments, rather than direct views of the hands. In Study 1, we compare performance with a cursor-based pointing device that allows remote partners to point to objects in a video feed of the work area to performance side-by-side or with the video system alone. In Study 2, we compare performance with two variations of a pen-based drawing tool that allows for both pointing and representational gestures to performance with video alone. The results suggest that simple surrogate gesture tools can be used to convey gestures from remote sites, but that the tools need to be able to convey representational as well as pointing gestures to be effective. The results further suggest that an automatic erasure function, in which drawings disappear a few seconds after they were created, is more beneficial for collaboration than tools requiring manual erasure. We conclude with a discussion of the theoretical and practical implications of the results, as well as several areas for future research.
Pages: 273-309
Number of pages: 37