Empirical study of neural network language models for Arabic speech recognition

被引:28
|
作者
Emami, Ahmad [1 ]
Mangu, Lidia [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
language modeling; speech recognition; neural networks;
D O I
10.1109/ASRU.2007.4430100
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we investigate the use of neural network language models for Arabic speech recognition. By using a distributed representation of words, the neural network model allows for more robust generalization and is better able to fight the data sparseness problem. We investigate different configurations of the neural probabilistic model, experimenting with such parameters as N-gram order, output vocabulary, normalization method, and model size and parameters. Experiments were carried out on Arabic broadcast news and broadcast conversations data and the optimized neural network language models showed significant improvements over the baseline N-gram. model.
引用
收藏
页码:147 / 152
页数:6
相关论文
共 50 条
  • [1] Arabic Sign Language Recognition and Generating Arabic Speech Using Convolutional Neural Network
    Kamruzzaman, M. M.
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
  • [2] Comparison of Various Neural Network Language Models in Speech Recognition
    Zuo, Lingyun
    Liu, Jian
    Wan, Xin
    2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2016, : 894 - 898
  • [3] A study of neural network Russian language models for automatic continuous speech recognition systems
    Kipyatkova, I. S.
    Karpov, A. A.
    AUTOMATION AND REMOTE CONTROL, 2017, 78 (05) : 858 - 867
  • [4] A study of neural network Russian language models for automatic continuous speech recognition systems
    I. S. Kipyatkova
    A. A. Karpov
    Automation and Remote Control, 2017, 78 : 858 - 867
  • [5] Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition
    Chen, X.
    Ragni, A.
    Liu, X.
    Gales, M. J. F.
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 269 - 273
  • [6] Structured Output Layer Neural Network Language Models for Speech Recognition
    Le, Hai-Son
    Oparin, Ilya
    Allauzen, Alexandre
    Gauvain, Jean-Luc
    Yvon, Francois
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (01): : 195 - 204
  • [7] BIDIRECTIONAL RECURRENT NEURAL NETWORK LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION
    Arisoy, Ebru
    Sethy, Abhinav
    Ramabhadran, Bhuvana
    Chen, Stanley
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5421 - 5425
  • [8] Deep Convolutional Neural Network for Arabic Speech Recognition
    Amari, Rafik
    Noubigh, Zouhaira
    Zrigui, Salah
    Berchech, Dhaou
    Nicolas, Henri
    Zrigui, Mounir
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2022, 2022, 13501 : 120 - 134
  • [9] Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition
    Masumura, Ryo
    Asami, Taichi
    Oba, Takanobu
    Sakauchi, Sumitaka
    Ito, Akinori
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (12) : 2557 - 2567
  • [10] Exploiting Future Word Contexts in Neural Network Language Models for Speech Recognition
    Chen, Xie
    Liu, Xunying
    Wang, Yu
    Ragni, Anton
    Wong, Jeremy H. M.
    Gales, Mark J. F.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (09) : 1444 - 1454