Comparison of Image Transform-Based Features for Visual Speech Recognition in Clean and Corrupted Videos

被引：0

作者：

Rowan Seymour

Darryl Stewart

Ji Ming

机构：

[1] Queen's University of Belfast,School of Electronics, Electrical Engineering and Computer Science

来源：

EURASIP Journal on Image and Video Processing | / 2008卷

关键词：

Image Processing; Pattern Recognition; Computer Vision; Feature Type; Head Movement;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

We present results of a study into the performance of a variety of different image transform-based feature types for speaker-independent visual speech recognition of isolated digits. This includes the first reported use of features extracted using a discrete curvelet transform. The study will show a comparison of some methods for selecting features of each feature type and show the relative benefits of both static and dynamic visual features. The performance of the features will be tested on both clean video data and also video data corrupted in a variety of ways to assess each feature type's robustness to potential real-world conditions. One of the test conditions involves a novel form of video corruption we call jitter which simulates camera and/or head movement during recording.

引用

共 50 条

[1] Comparison of Image Transform-Based Features for Visual Speech Recognition in Clean and Corrupted Videos
Seymour, Rowan
Stewart, Darryl
Ming, Ji
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
[2] APPEARANCE FEATURE EXTRACTION VERSUS IMAGE TRANSFORM-BASED APPROACH FOR VISUAL SPEECH RECOGNITION
Sagheer, Alaa
Tsuruta, Naoyuki
Taniguchi, Rin-Ichiro
Maeda, Sakashi
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2006, 6 (01) : 101 - 122
[3] Visual speech recognition using wavelet transform and moment based features
Yau, Wai C.
Kumar, Dinesh K.
Arjunan, Sridhar P.
Kumar, Sanjay
ICINCO 2006: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS: ROBOTICS AND AUTOMATION, 2006, : 340 - 345
[4] Networks of Transform-Based Evolvable Features for Object Recognition
Kowaliw, Taras
Banzhaf, Wolfgang
Doursat, Rene
GECCO'13: PROCEEDINGS OF THE 2013 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2013, : 1077 - 1084
[5] Comparison of Facial Emotion Recognition Based on Image Visual Features and EEG Features
Long, Yanfang
Wanzeng, Kong B.
Ling, Wenfen
Yang, Can
Zhu, Jieyong
COGNITIVE SYSTEMS AND SIGNAL PROCESSING, PT II, 2019, 1006 : 162 - 172
[6] Study on Transform-Based Image Sharpening
Liu, Ying
Toh, Yong Ho
Ng, Tek Ming
Liew, Beng Keat
COMPUTER AND INFORMATION SCIENCE 2009, 2009, 208 : 139 - 148
[7] A comparison of image processing techniques for visual speech recognition applications
Gray, MS
Sejnowski, TJ
Movellan, JR
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 939 - 945
[8] Dynamic visual features based on discriminative speech class projection for visual speech recognition
Lei, X
Cai, XL
Fu, ZH
Zhao, RC
PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 687 - 690
[9] Scale-transform based features for application in speech recognition
Umesh, S
Cohen, L
Nelson, D
WAVELET APPLICATIONS IN SIGNAL AND IMAGE PROCESSING VII, 1999, 3813 : 727 - 731
[10] Complex Wavelet Transform-Based Face Recognition
Eleyan, Alaa
Ozkaramanli, Huseyin
Demirel, Hasan
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2008, 2008 (1)

← 1 2 3 4 5 →