Development And Suitability Of Indian Languages Speech Database For Building Watson Based ASR System

被引:0
作者
Pandey, Dipti [1 ]
Mondal, Tapabrata [2 ]
Agrawal, S. S. [1 ]
Bangalore, Srinivas [3 ]
机构
[1] KIIT Coll Engn, Gurgaon, India
[2] Jadavpur Univ, Kolkata 700032, W Bengal, India
[3] AT&T Lab, Florham Pk, NJ USA
来源
2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE) | 2013年
关键词
Speech Recognition; Speech databases; Indian Languages;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we discuss our efforts in the development of Indian spoken languages corpora for building large vocabulary speech recognition systems using WATSON Toolkit. The current paper demonstrates that these corpora can be reduced to a varied degree for various phonemes by comparing the similarity among phonemes of different languages. We also discuss the design and methodology of collection of speech databases and the challenges we have faced during database creation. The experiments have been conducted on commonly known Indian languages, by training the ASR system with WATSON toolkit and evaluation by Sclite. The results for these experiments show that different Indian languages have a great similarity among their phoneme structures and phoneme sequences and we have exploited these features to create speech recognition system. Also, we have developed an algorithm to bootstrapping the phonemes of one particular language into another by mapping the phonemes of different languages. The performance of Hindi and Bangla ASR systems using these databases has been compared.
引用
收藏
页数:6
相关论文
共 7 条
[1]  
Agrawal S S, 2004, PROCEEDINGS OF ICSLT
[2]  
Ahuja R., 1992, PROCEEDINGS OF THE W, P3
[3]  
Chourasia, 2005, PROC O COCOSDA 2005, P132
[4]  
Chourasia V., 2007, J ACOUST SOC INDIA, P41
[5]  
Kumar K., 2011, INTERNATIONAL JOURNA, V2
[6]  
Samudravijaya K, 2000, PROC INT CONF ON SPO
[7]  
Sinha Shweta, OCOCOSDA 2011