LEARNING THE BASIC UNITS IN AMERICAN SIGN LANGUAGE USING DISCRIMINATIVE SEGMENTAL FEATURE SELECTION

被引:10
作者
Yin, Pei [1 ]
Starner, Thad [1 ]
Hamilton, Harley [1 ]
Essa, Irfan [1 ]
Rehg, James M. [1 ]
机构
[1] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30332 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年
关键词
Machine Learning; American Sign Language; Feature Selection;
D O I
10.1109/ICASSP.2009.4960694
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The natural language for most deaf signers in the United States is American Sign Language (ASL). ASL has internal structure like spoken languages, and ASL linguists have introduced several phonemic models. The study of ASL phonemes is not only interesting to linguists, but also useful for scalability in recognition by machines. Since machine perception is different than human perception, this paper learns the basic units for ASL directly from data. Comparing with previous studies, our approach computes a set of data-driven Units (fenemes) discriminatively from the results of segmental feature selection. The learning iterates the following two steps: first apply discriminative feature selection segmentally to the signs, and then tie the most similar temporal segments to re-train. Intuitively, the sign parts indistinguishable to machines are merged to form basic units, which we call ASL fenemes. Experiments on publicly available ASL recognition data show that the extracted data-driven fenemes are meaningful, and recognition using those fenemes achieves improved accuracy at reduced model complexity.
引用
收藏
页码:4757 / 4760
页数:4
相关论文
共 24 条
  • [1] ANDERSEN O, 1994, P 1994 IEEE INT C AC, P121
  • [2] [Anonymous], 1997, Statistical methods for speech recognition
  • [3] [Anonymous], 1989, Sign Language Studies
  • [4] BAUER B, 2002, LNCS LNAI, V2298, P64
  • [5] CHEN Y, 2003, IEEE INT WORKSH AMFG
  • [6] Dietterich T.G, 2002, Structural, Syntactic, and Statistical Pattern Recognition, V2396, P15, DOI 10.1007/3-540-70659-32
  • [7] A decision-theoretic generalization of on-line learning and an application to boosting
    Freund, Y
    Schapire, RE
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) : 119 - 139
  • [8] Hermansky H, 2000, INT CONF ACOUST SPEE, P1635, DOI 10.1109/ICASSP.2000.862024
  • [9] HOLT GT, 2006, SIGN LANGUAGE STUDIE, V6, P416
  • [10] THE SEGMENTAL K-MEANS ALGORITHM FOR ESTIMATING PARAMETERS OF HIDDEN MARKOV-MODELS
    JUANG, BH
    RABINER, LR
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (09): : 1639 - 1641