Evaluation of Small-Scale Deep Learning Architectures in Thai Speech Recognition

被引:0
作者
Kaewprateep, Jirayu [1 ]
Prom-on, Santitham [1 ]
机构
[1] King Mongkuts Univ Technol, Dept Comp Engn, Thonburi, Thailand
来源
2018 1ST INTERNATIONAL ECTI NORTHERN SECTION CONFERENCE ON ELECTRICAL, ELECTRONICS, COMPUTER AND TELECOMMUNICATIONS ENGINEERING (ECTI-NCON | 2018年
关键词
Thai speech recognition; deep learning; convolutional neural network; long short term memory network;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a performance evaluation study for small-scale deep learning neural network for Thai speech recognition task. Convolutional neural network and long short term memory networks were built with a relatively small size dataset and small constructs. The aim of this study is to determine which method would be suitable for a small-scale deep learning study. Relatively small speech corpus was used to build deep-learning neural networks with two different architectures, including convolutional neural network (CNN) model and long short term memory (LSTM) model. Models were evaluated using cross validation technique and compare to one another. The result shows that CNN outperformed LSTM for a small-scale deep learning. This suggests that with the limited dataset and small-scale architecture CNN is a more suitable choice in the speech recognition study.
引用
收藏
页码:60 / 64
页数:5
相关论文
共 50 条
  • [41] Deep Learning of Speech Features for Improved Phonetic Recognition
    Lee, Jaehyung
    Lee, Soo-Young
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1256 - 1259
  • [42] An Applied Holistic Landmark with Deep Learning for Thai Sign Language Recognition
    Chaikaew, Anusorn
    2022 37TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2022), 2022, : 1046 - 1049
  • [43] Automatic Speech Recognition Method Based on Deep Learning Approaches for Uzbek Language
    Mukhamadiyev, Abdinabi
    Khujayarov, Ilyos
    Djuraev, Oybek
    Cho, Jinsoo
    SENSORS, 2022, 22 (10)
  • [44] Small-Scale Pedestrian Detection Based on Deep Neural Network
    Han, Bing
    Wang, Yunhao
    Yang, Zheng
    Gao, Xinbo
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (07) : 3046 - 3055
  • [45] A Deep Learning Speech Enhancement Architecture Optimised for Speech Recognition and Hearing Aids
    Nossier, Soha A.
    Wall, Julie
    Moniri, Mansour
    Glackin, Cornelius
    Cannings, Nigel
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 553 - 558
  • [46] Evaluation of Transfer Learning based Deep Learning architectures for Waste Classification
    Sukhendra, Singh
    Jyoti, Gautam
    SurSingh, Rawat
    Vimal, Gupta
    Gynendra, Kumar
    Pratap, Verma Lal
    2021 4TH INTERNATIONAL SYMPOSIUM ON ADVANCED ELECTRICAL AND COMMUNICATION TECHNOLOGIES (ISAECT), 2021,
  • [47] Exploring Advanced Deep Learning Architectures for Older Adults Activity Recognition
    Zafar, Raja Omman
    Latif, Insha
    COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PT II, ICCHP 2024, 2024, 14751 : 320 - 327
  • [48] FUSION OF DEEP LEARNING ARCHITECTURES FOR ENHANCED TARGET RECOGNITION ON SAR IMAGES
    Cheikh, K.
    Aitahcene, R.
    Toumi, A.
    Hammoudi, Z.
    JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY, 2023, 9 (04): : 347 - 359
  • [49] Comparison Study of Traffic Signs Recognition Using Deep Learning Architectures
    Alawaji, Khaldaa
    Hedjar, Ramdane
    2022 13TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2022, : 442 - 447
  • [50] Scalogram based performance comparison of deep learning architectures for dysarthric speech detection
    Shabber, Shaik Mulla
    Sumesh, E. P.
    Ramachandran, Vidhya Lavanya
    ARTIFICIAL INTELLIGENCE REVIEW, 2025, 58 (05)