Evaluation of Small-Scale Deep Learning Architectures in Thai Speech Recognition

被引:0
作者
Kaewprateep, Jirayu [1 ]
Prom-on, Santitham [1 ]
机构
[1] King Mongkuts Univ Technol, Dept Comp Engn, Thonburi, Thailand
来源
2018 1ST INTERNATIONAL ECTI NORTHERN SECTION CONFERENCE ON ELECTRICAL, ELECTRONICS, COMPUTER AND TELECOMMUNICATIONS ENGINEERING (ECTI-NCON | 2018年
关键词
Thai speech recognition; deep learning; convolutional neural network; long short term memory network;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a performance evaluation study for small-scale deep learning neural network for Thai speech recognition task. Convolutional neural network and long short term memory networks were built with a relatively small size dataset and small constructs. The aim of this study is to determine which method would be suitable for a small-scale deep learning study. Relatively small speech corpus was used to build deep-learning neural networks with two different architectures, including convolutional neural network (CNN) model and long short term memory (LSTM) model. Models were evaluated using cross validation technique and compare to one another. The result shows that CNN outperformed LSTM for a small-scale deep learning. This suggests that with the limited dataset and small-scale architecture CNN is a more suitable choice in the speech recognition study.
引用
收藏
页码:60 / 64
页数:5
相关论文
共 50 条
  • [21] Arabic Speech Recognition with Deep Learning: A Review
    Algihab, Wajdan
    Alawwad, Noura
    Aldawish, Anfal
    AlHumoud, Sarah
    SOCIAL COMPUTING AND SOCIAL MEDIA: DESIGN, HUMAN BEHAVIOR AND ANALYTICS, SCSM 2019, PT I, 2019, 11578 : 15 - 31
  • [22] Speech Emotion Recognition Using Deep Learning
    Alagusundari, N.
    Anuradha, R.
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843 : 313 - 325
  • [23] Speech Emotion Recognition Using Deep Learning
    Ahmed, Waqar
    Riaz, Sana
    Iftikhar, Khunsa
    Konur, Savas
    ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 : 191 - 197
  • [24] Deep learning architecture for direct probability density prediction of small-scale solar generation
    Afrasiabi, Mousa
    Mohammadi, Mohammad
    Rastegar, Mohammad
    Afrasiabi, Shahabodin
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2020, 14 (11) : 2017 - 2025
  • [25] An Experimental Analysis of Deep Learning Architectures for Supervised Speech Enhancement
    Nossier, Soha A.
    Wall, Julie
    Moniri, Mansour
    Glackin, Cornelius
    Cannings, Nigel
    ELECTRONICS, 2021, 10 (01) : 1 - 32
  • [26] Dementia Detection from Speech Using Machine Learning and Deep Learning Architectures
    Kumar, M. Rupesh
    Vekkot, Susmitha
    Lalitha, S.
    Gupta, Deepa
    Govindraj, Varasiddhi Jayasuryaa
    Shaukat, Kamran
    Alotaibi, Yousef Ajami
    Zakariah, Mohammed
    SENSORS, 2022, 22 (23)
  • [27] Deep Learning in Acoustic Modeling for Automatic Speech Recognition and Understanding - An Overview -
    Gavat, Inge
    Militaru, Diana
    2015 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2015,
  • [28] Emotion recognition of audio/speech data using deep learning approaches
    Gupta, Vedika
    Juyal, Stuti
    Singh, Gurvinder Pal
    Killa, Chirag
    Gupta, Nishant
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2020, 41 (06) : 1309 - 1317
  • [29] Deep Learning-Based Approach for Arabic Visual Speech Recognition
    Alsulami, Nadia H.
    Jamal, Amani T.
    Elrefaei, Lamiaa A.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (01): : 85 - 108
  • [30] Uncertainty-optimized deep learning model for small-scale person re-identification
    Cairong ZHAO
    Kang CHEN
    Di ZANG
    Zhaoxiang ZHANG
    Wangmeng ZUO
    Duoqian MIAO
    Science China(Information Sciences), 2019, 62 (12) : 20 - 32