Comparison of Image Transform-Based Features for Visual Speech Recognition in Clean and Corrupted Videos

被引:0
|
作者
Rowan Seymour
Darryl Stewart
Ji Ming
机构
[1] Queen's University of Belfast,School of Electronics, Electrical Engineering and Computer Science
来源
EURASIP Journal on Image and Video Processing | / 2008卷
关键词
Image Processing; Pattern Recognition; Computer Vision; Feature Type; Head Movement;
D O I
暂无
中图分类号
学科分类号
摘要
We present results of a study into the performance of a variety of different image transform-based feature types for speaker-independent visual speech recognition of isolated digits. This includes the first reported use of features extracted using a discrete curvelet transform. The study will show a comparison of some methods for selecting features of each feature type and show the relative benefits of both static and dynamic visual features. The performance of the features will be tested on both clean video data and also video data corrupted in a variety of ways to assess each feature type's robustness to potential real-world conditions. One of the test conditions involves a novel form of video corruption we call jitter which simulates camera and/or head movement during recording.
引用
收藏
相关论文
共 50 条
  • [1] Comparison of Image Transform-Based Features for Visual Speech Recognition in Clean and Corrupted Videos
    Seymour, Rowan
    Stewart, Darryl
    Ming, Ji
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
  • [2] APPEARANCE FEATURE EXTRACTION VERSUS IMAGE TRANSFORM-BASED APPROACH FOR VISUAL SPEECH RECOGNITION
    Sagheer, Alaa
    Tsuruta, Naoyuki
    Taniguchi, Rin-Ichiro
    Maeda, Sakashi
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2006, 6 (01) : 101 - 122
  • [3] Visual speech recognition using wavelet transform and moment based features
    Yau, Wai C.
    Kumar, Dinesh K.
    Arjunan, Sridhar P.
    Kumar, Sanjay
    ICINCO 2006: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS: ROBOTICS AND AUTOMATION, 2006, : 340 - 345
  • [4] Networks of Transform-Based Evolvable Features for Object Recognition
    Kowaliw, Taras
    Banzhaf, Wolfgang
    Doursat, Rene
    GECCO'13: PROCEEDINGS OF THE 2013 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2013, : 1077 - 1084
  • [5] Comparison of Facial Emotion Recognition Based on Image Visual Features and EEG Features
    Long, Yanfang
    Wanzeng, Kong B.
    Ling, Wenfen
    Yang, Can
    Zhu, Jieyong
    COGNITIVE SYSTEMS AND SIGNAL PROCESSING, PT II, 2019, 1006 : 162 - 172
  • [6] Study on Transform-Based Image Sharpening
    Liu, Ying
    Toh, Yong Ho
    Ng, Tek Ming
    Liew, Beng Keat
    COMPUTER AND INFORMATION SCIENCE 2009, 2009, 208 : 139 - 148
  • [7] A comparison of image processing techniques for visual speech recognition applications
    Gray, MS
    Sejnowski, TJ
    Movellan, JR
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 939 - 945
  • [8] Dynamic visual features based on discriminative speech class projection for visual speech recognition
    Lei, X
    Cai, XL
    Fu, ZH
    Zhao, RC
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 687 - 690
  • [9] Scale-transform based features for application in speech recognition
    Umesh, S
    Cohen, L
    Nelson, D
    WAVELET APPLICATIONS IN SIGNAL AND IMAGE PROCESSING VII, 1999, 3813 : 727 - 731
  • [10] Complex Wavelet Transform-Based Face Recognition
    Eleyan, Alaa
    Ozkaramanli, Huseyin
    Demirel, Hasan
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2008, 2008 (1)