(Speech recognition based on Spanish accent acoustic model)

Cited by: 1
Authors
Plaza, Johanna [1]
Sanchez-Zhunio, Cristina [1]
Acosta-Uriguen, Maria-Ines [1]
Orellana, Marcos [1]
Cedillo, Priscila [1, 2]
Zambrano-Martinez, Jorge Luis [1]
Affiliations
[1] Univ Azuay, Cuenca, Ecuador
[2] Univ Cuenca, Cuenca, Ecuador
Source
ENFOQUE UTE | 2022 / Vol. 13 / No. 03
Keywords
Automatic Speech Recognition; Language Model; CMUSphinx
DOI
10.29019/enfoqueute.839
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
The objective of this article was to build an Automatic Speech Recognition (ASR) model that converts human speech to text, a task regarded as a branch of artificial intelligence. Voice analysis yields information about the acoustics, phonetics, syntax, and semantics of words, among other elements, and exposes characteristics of the language such as ambiguous terms, pronunciation errors, and phrases with similar syntax but different semantics. The model focused on the acoustic analysis of words: the article proposes a methodology for acoustic recognition based on transcripts of audio recordings containing human voice, and uses the word error rate (WER) to measure the accuracy of the model. The recordings are emergency calls registered by the Integrated Security Service ECU 911. The model was trained for the Spanish language with the CMUSphinx tool, without an internet connection. The results showed that the word error rate varies with the number of recordings: the more recordings, the fewer erroneous words and the higher the accuracy of the model. The study concluded by highlighting the duration of each recording as a variable that affects the accuracy of the model.
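The abstract names the word error rate as its accuracy measure but does not spell out how it is computed. As a minimal sketch, assuming the standard definition (substitutions, deletions, and insertions divided by the number of words in the reference transcript), it could be calculated with a word-level edit distance as below. The function name `word_error_rate` and the example sentences are hypothetical; they are not taken from the article or from the ECU 911 recordings.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance divided by reference length.

    Assumes the standard definition WER = (S + D + I) / N; the article's
    abstract does not state which exact formula the authors used.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion of a reference word
                curr[j - 1] + 1,                  # insertion of a hypothesis word
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical transcripts: the recognizer drops one of six words.
    reference = "hay un accidente en la avenida"
    hypothesis = "hay un accidente en avenida"
    print(f"WER = {word_error_rate(reference, hypothesis):.3f}")
```

Under this definition a lower value means a more accurate transcript, and the value can exceed 1 when the hypothesis contains many inserted words.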
Pages: 45-57
Number of pages: 13