Subspace-based speaker-independent vowel recognition

被引:0
|
作者
Muralishankar, R [1 ]
O'Shaughnessy, D [1 ]
机构
[1] Univ Quebec, INRS EMT, Quebec City, PQ, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a subspace-based approach for speaker-independent vowel recognition. Five vowels (/aa/,/eh/,/iy/,/ow/ and /uw/) from the TIMIT database were considered for the task. The subspaces representing two different vowel classes may have a large common subspace due to speaker variability, noise and coarticulation. We use common principal component (CPC) [1] and its extension i.e., partial-Common principal component (pCPC) to obtain a specific subspace for each vowel which is insensitive to variations. We perform CPC analysis on the covariance matrices of the vowels. pCPC gives q eigenvectors which are common to all vowels and (p - q) vowel specific eigenvectors. For each value of q, vowel specific subspaces are obtained. An input vector from an unknown vowel is classified based on the maximum length of its projection on the specific subspaces. We have choosen 18-dimensional Mel-Frequency Cepstral coefficients as a feature in our recognition task. The specific subspace is treated as a transformation matrix which enhances the vowel-specific information in the feature vector and, inturn. increases signal-to-noise ratio. Recognition experiments were performed on vowels extracted from a multiple speaker set taken from different dialect regions in the TIMIT database. Results for each vowel-specific subspace are presented for different values of q ranging from 1 to 5. The results are encouraging in the context of a speaker-independent framework. Visual Analysis of the vowel basis spectra provides useful and interesting information by highlighting the importance of different frequency regions.
引用
收藏
页码:549 / 552
页数:4
相关论文
共 50 条
  • [1] SPEAKER-INDEPENDENT VOWEL RECOGNITION IN PERSIAN SPEECH
    Nazari, Mohammad
    Sayadiyan, Abolghasem
    Valiollahzadeh, Seyyed Majid
    2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 672 - 676
  • [2] Speaker-Independent Time Domain Vowel Recognition.
    Zuehlwetter, Juergen
    Elektrotechnik und Maschinenbau, 1979, 96 (01): : 13 - 15
  • [3] Template theory and experiment of speaker-independent vowel recognition
    Zhang, Hong
    Li, Zhi
    Huang, Taiyi
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 1999, 12 (04): : 431 - 436
  • [4] Speaker-Independent Malay Vowel Recognition of Children using Neural Networks
    Ting, H. N.
    Lam, Y. M.
    WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING, VOL 25, PT 4: IMAGE PROCESSING, BIOSIGNAL PROCESSING, MODELLING AND SIMULATION, BIOMECHANICS, 2010, 25 : 288 - 291
  • [5] FLEXIBLE VOWEL RECOGNITION BY THE GENERATION OF DYNAMIC COHERENCE IN OSCILLATOR NEURAL NETWORKS - SPEAKER-INDEPENDENT VOWEL RECOGNITION
    LIU, F
    YAMAGUCHI, Y
    SHIMIZU, H
    BIOLOGICAL CYBERNETICS, 1994, 71 (02) : 105 - 114
  • [6] Speaker-independent Malay vowel recognition of children using multi-layer perceptron
    Ting, HN
    Yunus, D
    TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : A68 - A71
  • [7] Uighur speaker-independent speech recognition based on CDCPM
    Wang, K.L.
    2001, Science Press (38):
  • [8] Speaker-independent recognition of Chinese tones
    GUAN Cuntai and CHEN Yongbin(Dep. of Radio Eng.
    Chinese Journal of Acoustics, 1993, (02) : 142 - 148
  • [9] SPEAKER-INDEPENDENT DIGIT RECOGNITION SYSTEM
    SAMBUR, MR
    RABINER, LR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 56 : S26 - S26
  • [10] DYNAMIC SPEAKER ADAPTATION IN SPEAKER-INDEPENDENT WORD RECOGNITION
    HEWETT, AJ
    HOLMES, G
    YOUNG, SJ
    PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 275 - 282