A database-based framework for gesture recognition

被引:0
作者
Vassilis Athitsos
Haijing Wang
Alexandra Stefan
机构
[1] University of Texas at Arlington,Computer Science and Engineering Department
来源
Personal and Ubiquitous Computing | 2010年 / 14卷
关键词
Gesture recognition; Hand pose estimation; Embeddings; American Sign Language; Indexing methods; Image and video databases;
D O I
暂无
中图分类号
学科分类号
摘要
Gestures are an important modality for human–machine communication. Computer vision modules performing gesture recognition can be important components of intelligent homes, assistive environments, and human–computer interfaces. A key problem in recognizing gestures is that the appearance of a gesture can vary widely depending on variables such as the person performing the gesture, or the position and orientation of the camera. This paper presents a database-based approach for addressing this problem. The large variability in appearance among different examples of the same gesture is addressed by creating large gesture databases, that store enough exemplars from each gesture to capture the variability within that gesture. This database-based approach is applied to two gesture recognition problems: handshape categorization and motion-based recognition of American Sign Language signs. A key aspect of our approach is the use of database indexing methods, in order to address the challenge of searching large databases without violating the time constraints of an online interactive system, where system response times of over a few seconds are oftentimes considered unacceptable. Our experiments demonstrate the benefits of the proposed database-based framework, and the feasibility of integrating large gesture databases into online interacting systems.
引用
收藏
页码:511 / 526
页数:15
相关论文
共 48 条
  • [1] Athitsos V(2008)Boostmap: an embedding method for efficient nearest neighbor retrieval IEEE Trans Pattern Anal Mach Intell 30 89-104
  • [2] Alon J(2002)Shape matching and object recognition using shape contexts IEEE Trans Pattern Anal Mach Intell 24 509-522
  • [3] Sclaroff S(2001)Searching in high-dimensional spaces: index structures for improving the performance of multimedia databases ACM Comput Surv 33 322-373
  • [4] Kollios G(1985)On Lipschitz embeddings of finite metric spaces in Hilbert space Isr J Math 52 46-52
  • [5] Belongie S(1986)A computational approach to edge detection IEEE Trans Pattern Anal Mach Intell 8 679-698
  • [6] Malik J(2000)Appearance-based hand sign recognition from intensity image sequences Comput Vis Image Underst 78 157-176
  • [7] Puzicha J(1996)Task-specific gesture analysis in real-time using interpolated views IEEE Trans Pattern Anal Mach Intell 18 1236-1242
  • [8] Böhm C(1968)The condensed nearest neighbor rule IEEE Trans Inf Theory 14 515-516
  • [9] Berchtold S(2003)Index-driven similarity search in metric spaces ACM Trans Database Syst 28 517-580
  • [10] Keim DA(2003)Properties of embedding methods for similarity searching in metric spaces IEEE Trans Pattern Anal Mach Intell 25 530-549