Subspace-based speaker-independent vowel recognition

被引：0

作者：

Muralishankar, R ^{[1
]}

O'Shaughnessy, D ^{[1
]}

机构：

[1] Univ Quebec, INRS EMT, Quebec City, PQ, Canada

来源：

2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a subspace-based approach for speaker-independent vowel recognition. Five vowels (/aa/,/eh/,/iy/,/ow/ and /uw/) from the TIMIT database were considered for the task. The subspaces representing two different vowel classes may have a large common subspace due to speaker variability, noise and coarticulation. We use common principal component (CPC) [1] and its extension i.e., partial-Common principal component (pCPC) to obtain a specific subspace for each vowel which is insensitive to variations. We perform CPC analysis on the covariance matrices of the vowels. pCPC gives q eigenvectors which are common to all vowels and (p - q) vowel specific eigenvectors. For each value of q, vowel specific subspaces are obtained. An input vector from an unknown vowel is classified based on the maximum length of its projection on the specific subspaces. We have choosen 18-dimensional Mel-Frequency Cepstral coefficients as a feature in our recognition task. The specific subspace is treated as a transformation matrix which enhances the vowel-specific information in the feature vector and, inturn. increases signal-to-noise ratio. Recognition experiments were performed on vowels extracted from a multiple speaker set taken from different dialect regions in the TIMIT database. Results for each vowel-specific subspace are presented for different values of q ranging from 1 to 5. The results are encouraging in the context of a speaker-independent framework. Visual Analysis of the vowel basis spectra provides useful and interesting information by highlighting the importance of different frequency regions.

引用

页码：549 / 552

页数：4

共 50 条

[1] SPEAKER-INDEPENDENT VOWEL RECOGNITION IN PERSIAN SPEECH
Nazari, Mohammad
Sayadiyan, Abolghasem
Valiollahzadeh, Seyyed Majid
2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 672 - 676
[2] Speaker-Independent Time Domain Vowel Recognition.
Zuehlwetter, Juergen
Elektrotechnik und Maschinenbau, 1979, 96 (01): : 13 - 15
[3] Template theory and experiment of speaker-independent vowel recognition
Zhang, Hong
Li, Zhi
Huang, Taiyi
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 1999, 12 (04): : 431 - 436
[4] Speaker-Independent Malay Vowel Recognition of Children using Neural Networks
Ting, H. N.
Lam, Y. M.
WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING, VOL 25, PT 4: IMAGE PROCESSING, BIOSIGNAL PROCESSING, MODELLING AND SIMULATION, BIOMECHANICS, 2010, 25 : 288 - 291
[5] FLEXIBLE VOWEL RECOGNITION BY THE GENERATION OF DYNAMIC COHERENCE IN OSCILLATOR NEURAL NETWORKS - SPEAKER-INDEPENDENT VOWEL RECOGNITION
LIU, F
YAMAGUCHI, Y
SHIMIZU, H
BIOLOGICAL CYBERNETICS, 1994, 71 (02) : 105 - 114
[6] Speaker-independent Malay vowel recognition of children using multi-layer perceptron
Ting, HN
Yunus, D
TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : A68 - A71
[7] Uighur speaker-independent speech recognition based on CDCPM
Wang, K.L.
2001, Science Press (38):
[8] Speaker-independent recognition of Chinese tones
GUAN Cuntai and CHEN Yongbin(Dep. of Radio Eng.
Chinese Journal of Acoustics, 1993, (02) : 142 - 148
[9] SPEAKER-INDEPENDENT DIGIT RECOGNITION SYSTEM
SAMBUR, MR
RABINER, LR
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 56 : S26 - S26
[10] DYNAMIC SPEAKER ADAPTATION IN SPEAKER-INDEPENDENT WORD RECOGNITION
HEWETT, AJ
HOLMES, G
YOUNG, SJ
PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 275 - 282

← 1 2 3 4 5 →