TELUGU ANKELU: A Telugu Spoken Digits Corpora for Mobile Speech Recognition

被引:0
作者
Bhagath, Parabattina [1 ]
Pullagura, Meghana
Das, Pradip K.
Yandra, Vikram Kumar
Thetla, Santhi Sri
机构
[1] Jawaharlal Nehru Technol Univ Kakinada, Lakireddy Bali Reddy Coll Engn Mylavaram, Dept Comp Sci & Engn, Kakinada, Andhra Pradesh, India
来源
2022 12TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION SYSTEMS (ICPRS) | 2022年
关键词
Telugu spoken digits; Speech Processing; Mobile Speech Recognition; Speech Corpus; Low-resource languages;
D O I
10.1109/ICPRS54038.2022.9854065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic Speech Recognition (ASR) deals with recognizing spoken words or sentences through an automated program or an algorithm. ASR has multi-faceted applications in various domains such as Mobile Speech Recognition, IoT, Human-Machine interaction, etc. Researchers have been working on different problems in these domains for more than 5 decades. Mobile speech recognition is a pre-dominant area that supports a wide variety of applications useful for physically challenged, senior citizens and novice mobile users. Though the domain has significant practicality, it faces challenges due to the lack of data in the targeted language. Though it is not major for well-developed languages like English, it is very profound in low-resourced and under-resourced languages. India has 1,391.99 million inhabitants speaking a wide variety of languages with Hindi being the most popular language. There are many native languages spoken in each region of the country. But, research that contributes towards the development of ASR is questionable due to many reasons, and the unavailability of data is one of those. Hence, an open-source speech digit data set has been prepared and will be available through this paper for the Telugu language which is the sixth largely spoken language in the country.
引用
收藏
页数:6
相关论文
共 20 条
[1]  
[Anonymous], 2011, CENSUS INDIA
[2]  
Ardila R, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P4218
[3]  
Bhagath P., 2021, 2021 IEEE 28 INT C E, P1
[4]  
Das B., 2011, 2011 Oriental COCOSDA 2011 - International Conference on Speech Database and Assessments, P51, DOI 10.1109/ICSDA.2011.6085979
[5]  
Gaikwad S., 2013, 2013 INT C ORIENTAL, P1
[6]  
Leonard R.G., 1984, P IEEE ICASSP, P328
[7]  
Madhavaraj A., 2017, 2017 14 IEEE INDIA C, P1
[8]  
Manjunath KE, 2018, INTERSPEECH, P1016
[9]  
Manjunath K. E., 2022, DEV ANAL MULTILINGUA, P27
[10]  
Myakala PR, 2016, PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), P864, DOI 10.1109/TENCON.2016.7848128