Throat Microphone Speech Recognition using MFCC

被引:0
作者
Vijayan, Amritha [1 ]
Mathai, Bipil Mary [1 ]
Valsalan, Karthik [1 ]
Johnson, Riyanka Raji [1 ]
Mathew, Lani Rachel [1 ]
Gopakumar, K. [2 ]
机构
[1] Mar Baselios Coll Engn & Technol, Dept Elect & Commun Engn, Trivandrum, Kerala, India
[2] TKM Coll Engn & Technol, Dept Elect & Commun Engn, Kollam, Kerala, India
来源
2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT) | 2017年
关键词
Throat Microphone; MFCC; vocal fold vibrations; Minimum Mean Square Analysis;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The Throat Microphone (TM) is a non-acoustic device, relying on the vibrations of vocal folds rather than the audible sound produced. Correctly capturing vocal fold vibrations is difficult due to poor signal representation capabilities. The system recognizes the TM vibrations and produces the corresponding speech sound. This is done by extracting features from the spectrum of the TM vibrations and comparing the obtained features with the values stored in a database. The extracted features include characteristic features of the speech waveform called Mel-Frequency Cepstral Coefficients (MFCC). The selection of the closest speech signal is chosen by the minimum mean square error estimation method, where the signal in the database whose corresponding MFCC values show the least difference from the input speech MFCCs is selected. This system has the potential of having applications for giving voice to those with defective speech and in military communications.
引用
收藏
页码:392 / 395
页数:4
相关论文
共 8 条
  • [1] Boukamcha Hamdi, SPEAKER RECOGNITION
  • [2] Brady K., 2004, P INT C AC SPEECH SI, V1
  • [3] Dash Kshamamayee, SPEAKER IDENTIFICATI
  • [4] Accurate hidden Markov models for non-audible murmur (NAM) recognition based on iterative supervised adaptation
    Heracleous, P
    Nakajima, Y
    Lee, A
    Saruwatari, H
    Shikano, K
    [J]. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 73 - 76
  • [5] Marx M. Arun, MES J TECHNOLOGY MAN
  • [6] Murty K. Sri Rama, 2008 ISCA
  • [7] Passy Victor, 1993, PASSY MUIR TRACHEOST
  • [8] Tura M. A. Tugtekin, 2016, IEEE ACM T AUDIO SPE, V24