The Development of Isolated Words Corpus of Pashto for the Automatic Speech Recognition Research

Authors
Ahmed, Irfan [1]
Ahmad, Nasir [2]
Ali, Hazrat [1]
Ahmad, Gulzar [1]
Affiliations
[1] Univ Engn & Technol, Dept Elect Engn, Peshawar, Pakistan
[2] Univ Engn & Technol, Dept Comp Syst Engn, Peshawar, Pakistan
Source
2012 INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE (ICRAI) | 2012
Keywords
Automatic Speech Recognition; Pashto Speech Corpus; Human Computer Interaction
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The availability of a standard speech database is of paramount importance in automatic speech recognition (ASR) research, because it provides a baseline for comparing the performance of ASR approaches. This paper presents the development of a medium-vocabulary speech corpus for the Pashto language. The vocabulary comprises 161 isolated Pashto words: the most frequently used words, the names of the days of the week, and the digits from 0 to 25. The words were uttered by 30 speakers of different ages and genders, including both native and non-native speakers of Pashto. The corpus was recorded in a noise-free office environment. The corpus is then used to develop an automatic speech recognition system for Pashto.
Pages: 139-143
Number of pages: 5