Improving Recognition of Syallabic Units of Hindi Languagae Using Combined Features of Throat Microphone and Normal Microphone Speech

被引：0

作者：

Radha, N. ^{[1
]}

Shahina, A. ^{[1
]}

Vinoth, G. ^{[2
]}

Khan, A. Nayeemulla ^{[3
]}

机构：

[1] SSNCE, Dept IT, Madras, Tamil Nadu, India

[2] WIPRO Technol, Madras, Tamil Nadu, India

[3] VIT, Sch Comp Sci & Engn, Madras, Tamil Nadu, India

来源：

2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT) | 2014年

关键词：

Automatic speech recognition; normal microphone; throat microphone; hidden Markov model;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The performance of Automatic Speech recognition system (ASR) built using close talk microphones degrades in noisy environments. ASR built using Throat Microphone (TM) speech shows relatively better performance under such adverse situations. However, some of the sounds are not well captured in TM. In this work we explore the combined use of Normal Microphone (NM) and TM features to improve the recognition rate of ASR. In the proposed work, the combined Mel-Frequency Cepstral Coefficients (MFCC) derived from the two signals are used to built an ASR in the HMM framework to recognize the 145 syllabic units of Indian language Hindi. The performance of this combined ASR system shows a significant improvement in performance when compared with individual ASR systems built using NM and TM features, respectively.

引用

页码：1343 / 1348

页数：6

共 6 条

[1]

Dupont S., 2004, P ROB WORKSH ITRW RO

[2]

Gangashetty SV, 2001, IEEE IJCNN, P1542, DOI 10.1109/IJCNN.2001.939594

[3] Combining standard and throat microphones for robust speech recognition [J].

Graciarena, M ;

Franco, H ;

Sonmez, K ;

Bratt, H .

IEEE SIGNAL PROCESSING LETTERS, 2003, 10 (03) :72-74

[4] Language identification in noisy environments using throat microphone signals [J].

Shahina, A ;

Yegnanarayana, B .

2005 INTERNATIONAL CONFERENCE ON INTELLIGENT SENSING AND INFORMATION PROCESSING, PROCEEDINGS, 2005, :400-403

[5]

Shahina A., 2008, P ICON 2008

[6]

Zhang ZY, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS, P781

← 1 →