Thai phoneme segmentation using discrete wavelet transform

被引:0
作者
Thipakorn, Bundit [1 ,2 ]
Kaewkamnerdpong, Boonserm [1 ]
机构
[1] Department of Computer Engineering, King Mongkut's Univ. T. Thonburi, Tung-Kru, Bangkok
[2] Department of Computer Engineering, King Mongkut's Univ. T. Thonburi, Tung-Kru, Bangkok, 10140
来源
| 2003年 / Taylor and Francis Inc.卷 / 05期
关键词
Discrete wavelet transform; Speech signal processing; Speech-to-text; Thai phoneme segmentation; Time-frequency analysis; Tonal language;
D O I
10.1080/10255810390243467
中图分类号
学科分类号
摘要
Currently, the core of Thai speech recognition algorithms focuses on word recognition. However, such algorithms are not appropriate to construct the Speech-to-Text system since the ultimate goal in Speech-to-Text system is to recognize continuous speech states from any speaker of a given language. The meaning in each given language and its sound can be determined by phonemes which are slightly different for each language. The variability in each speaker's voice, for instance, accents, gender and speech style, and the tonal language such as Thai language can create rather different speech signals for the same word. Thus, phoneme recognition is more difficult to perform. Since segmentation takes place prior to recognition in such systems, an incorrect segmentation invariably leads to incorrect recognition results. We proposed a method for phoneme segmentation that based on Discrete Wavelet Transform (DWT). To verify our method, we performed experiments on eleven speakers: five males and six females. Each speaker pronounced one hundred and thirty Thai words. Then, we evaluated the performance of our method by synthesizing the new words from the obtained phonemes. The speech synthesis of the new words was then observed by humans to compare with the natural-sounding speech. The results indicated 95% accuracy.
引用
收藏
页码:389 / 399
页数:10
相关论文
共 50 条
  • [41] Facial expression recognition using weber discrete wavelet transform
    Nazir, Muhammad
    Jan, Zahoor
    Sajjad, Muhammad
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 33 (01) : 479 - 489
  • [42] Image Denoising Using Discrete Wavelet Transform and Edge Information
    Kimlyk, Maxim
    Umnyashkin, Sergei
    [J]. PROCEEDINGS OF THE 2018 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (EICONRUS), 2018, : 1823 - 1825
  • [43] Adaptive colour constancy algorithm using discrete wavelet transform
    Celik, Turgay
    Tjahjadi, Tardi
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2012, 116 (04) : 561 - 571
  • [44] Impedance cardiography signal denoising using discrete wavelet transform
    Chabchoub, Souhir
    Mansouri, Sofienne
    Ben Salah, Ridha
    [J]. AUSTRALASIAN PHYSICAL & ENGINEERING SCIENCES IN MEDICINE, 2016, 39 (03) : 655 - 663
  • [45] Fast License Plate Localization Using Discrete Wavelet Transform
    Wang, Yuh-Rau
    Lin, Wei-Hung
    Horng, Shi-Jinn
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PROCEEDINGS, 2009, 5574 : 408 - +
  • [46] Detection of Transmission Line Faults Using Discrete Wavelet Transform
    Devi, Suman
    Swarnkar, Nagendra K.
    Ola, Sheesh Ram
    Mahela, Om Prakash
    [J]. 2016 CONFERENCE ON ADVANCES IN SIGNAL PROCESSING (CASP), 2016, : 133 - 138
  • [47] On the using of discrete wavelet transform for physical layer key generation
    Zhan, Furui
    Yao, Nianmin
    [J]. AD HOC NETWORKS, 2017, 64 : 22 - 31
  • [48] High Impedance Fault Detection Using Discrete Wavelet Transform
    Torres G., Vicente
    Ruiz P, Hector F.
    [J]. 2011 IEEE ELECTRONICS, ROBOTICS AND AUTOMOTIVE MECHANICS CONFERENCE (CERMA 2011), 2011, : 325 - 329
  • [49] Glaucoma Classification Using Brownian Motion and Discrete Wavelet Transform
    Yun, Wong Li
    Mookiah, Muthu Rama Krishnan
    Koh, Joel E. W.
    [J]. JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2014, 4 (04) : 621 - 627
  • [50] Glaucoma Detection Using Image Channels and Discrete Wavelet Transform
    Kirar, Bhupendra Singh
    Agrawal, Dheeraj Kumar
    Kirar, Seema
    [J]. IETE JOURNAL OF RESEARCH, 2022, 68 (06) : 4421 - 4428