Evaluation of Small-Scale Deep Learning Architectures in Thai Speech Recognition

被引:0
作者
Kaewprateep, Jirayu [1 ]
Prom-on, Santitham [1 ]
机构
[1] King Mongkuts Univ Technol, Dept Comp Engn, Thonburi, Thailand
来源
2018 1ST INTERNATIONAL ECTI NORTHERN SECTION CONFERENCE ON ELECTRICAL, ELECTRONICS, COMPUTER AND TELECOMMUNICATIONS ENGINEERING (ECTI-NCON | 2018年
关键词
Thai speech recognition; deep learning; convolutional neural network; long short term memory network;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a performance evaluation study for small-scale deep learning neural network for Thai speech recognition task. Convolutional neural network and long short term memory networks were built with a relatively small size dataset and small constructs. The aim of this study is to determine which method would be suitable for a small-scale deep learning study. Relatively small speech corpus was used to build deep-learning neural networks with two different architectures, including convolutional neural network (CNN) model and long short term memory (LSTM) model. Models were evaluated using cross validation technique and compare to one another. The result shows that CNN outperformed LSTM for a small-scale deep learning. This suggests that with the limited dataset and small-scale architecture CNN is a more suitable choice in the speech recognition study.
引用
收藏
页码:60 / 64
页数:5
相关论文
共 50 条
  • [31] Uncertainty-optimized deep learning model for small-scale person re-identification
    Zhao, Cairong
    Chen, Kang
    Zang, Di
    Zhang, Zhaoxiang
    Zuo, Wangmeng
    Mia, Duoqian
    SCIENCE CHINA-INFORMATION SCIENCES, 2019, 62 (12)
  • [32] Advancements in Deep Learning Architectures for Image Recognition and Semantic Segmentation
    Nimma, Divya
    Uddagiri, Arjun
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 1172 - 1185
  • [33] Evaluation of deep learning model for human activity recognition
    Owais Bhat
    Dawood A Khan
    Evolving Systems, 2022, 13 : 159 - 168
  • [34] Uncertainty-optimized deep learning model for small-scale person re-identification
    Cairong Zhao
    Kang Chen
    Di Zang
    Zhaoxiang Zhang
    Wangmeng Zuo
    Duoqian Miao
    Science China Information Sciences, 2019, 62
  • [35] Small-Scale Foreign Object Debris Detection Using Deep Learning and Dual Light Modes
    Mo, Yiming
    Wang, Lei
    Hong, Wenqing
    Chu, Congzhen
    Li, Peigen
    Xia, Haiting
    APPLIED SCIENCES-BASEL, 2024, 14 (05):
  • [36] Evaluation of deep learning model for human activity recognition
    Bhat, Owais
    Khan, Dawood A.
    EVOLVING SYSTEMS, 2022, 13 (01) : 159 - 168
  • [37] DISTRIBUTED DEEP LEARNING STRATEGIES FOR AUTOMATIC SPEECH RECOGNITION
    Zhang, Wei
    Cui, Xiaodong
    Finkler, Ulrich
    Kingsbury, Brian
    Saon, George
    Kung, David
    Picheny, Michael
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5706 - 5710
  • [38] Kannada Continuous Speech Recognition Using Deep Learning
    Paul, Shubhojeet
    Bhattacharjee, Vandana
    Saha, Sujan Kumar
    ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT IV, 2024, 2093 : 258 - 269
  • [39] Deep Learning Techniques for Speech Emotion Recognition : A Review
    Pandey, Sandeep Kumar
    Shekhawat, H. S.
    Prasanna, S. R. M.
    2019 29TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2019, : 197 - 202
  • [40] Small sample face recognition based on ensemble deep learning
    Feng, Yuping
    Pang, Tengfei
    Li, Mengqi
    Guan, Yuyu
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 4402 - 4406