Automatic segmentation of Hindi speech into syllable-like units

Authors
Kumari R. [1, 2]
Dev A. [2]
Kumar A. [2, 3]
Affiliations
[1] Department of ECE, Maharaja Surajmal Institute of Technology GGSIPU, New Delhi
[2] Indira Gandhi Delhi Technical University for Women, New Delhi
[3] Department of ECE, Indira Gandhi Delhi Technical University for Women, New Delhi
Keywords
Convex hull; Database; Short term energy; Speech segmentation; Syllable
DOI
10.14569/IJACSA.2020.0110553
Abstract
To develop a high-quality Text-to-Speech (TTS) system, appropriate segmentation of continuous speech into syllabic units plays an important role. This work implements an automatic syllable-based segmentation technique for clean, continuous Hindi speech using the energy convex hull approach. In this method, a Savitzky-Golay filter is applied to the short-term energy (STE) signal to increase the signal-to-noise ratio (SNR), followed by a median filter that smooths the energy curve while preserving the boundaries. A Hamming sliding window is also applied twice to the speech signal to obtain a more accurate depth for the convex hull valleys. The algorithm was tested on 50 unique utterances chosen from the travel domain. Measured against a manual segmentation reference, 76.07% of syllables have a time error of less than 30 ms. The proposed algorithm also gives better segmentation accuracy than the existing group delay segmentation technique for fricative or nasal sounds. The syllable-based segmented database is suitable for speech technology systems for Hindi in the travel domain. © 2020 Science and Information Organization.
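The pipeline outlined in the abstract (short-term energy, Savitzky-Golay smoothing, median filtering, convex hull valleys) can be illustrated with a minimal standard-library sketch. It is not the authors' implementation: the function names, the 20 ms frame and 10 ms hop, the fixed 5-point quadratic Savitzky-Golay kernel, the 5-point median, and the `min_depth_ratio` threshold are all assumptions chosen for illustration, and the second Hamming-window pass mentioned in the abstract is not modeled.

```python
import math
from statistics import median

# 5-point quadratic Savitzky-Golay smoothing coefficients (standard values).
SG_COEFFS = (-3 / 35, 12 / 35, 17 / 35, 12 / 35, -3 / 35)


def short_term_energy(signal, frame_len, hop):
    """Hamming-windowed short-term energy, one value per frame."""
    window = [0.54 - 0.46 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    return [sum((signal[start + n] * window[n]) ** 2 for n in range(frame_len))
            for start in range(0, len(signal) - frame_len + 1, hop)]


def savgol5(values):
    """Savitzky-Golay smoothing; the two samples at each edge are left as-is."""
    out = list(values)
    for i in range(2, len(values) - 2):
        out[i] = sum(c * values[i + k - 2] for k, c in enumerate(SG_COEFFS))
    return out


def median_filter(values, size=5):
    """Sliding median; the window is truncated at the edges."""
    half = size // 2
    return [median(values[max(0, i - half): i + half + 1])
            for i in range(len(values))]


def hull_boundaries(energy, lo, hi, min_depth):
    """Split [lo, hi] recursively at the deepest valley under the unimodal hull."""
    if hi - lo < 2:
        return []
    peak = max(range(lo, hi + 1), key=lambda i: energy[i])
    hull = [0.0] * (hi - lo + 1)
    running = energy[lo]
    for i in range(lo, peak + 1):          # non-decreasing up to the peak
        running = max(running, energy[i])
        hull[i - lo] = running
    running = energy[hi]
    for i in range(hi, peak - 1, -1):      # non-increasing after the peak
        running = max(running, energy[i])
        hull[i - lo] = running
    depth, cut = max((hull[i - lo] - energy[i], i) for i in range(lo, hi + 1))
    if depth <= 0 or depth < min_depth:
        return []
    return (hull_boundaries(energy, lo, cut, min_depth) + [cut]
            + hull_boundaries(energy, cut, hi, min_depth))


def segment(signal, sample_rate, frame_ms=20, hop_ms=10, min_depth_ratio=0.1):
    """Return candidate syllable boundary times in seconds (assumed parameters)."""
    frame_len = int(sample_rate * frame_ms / 1000)
    hop = int(sample_rate * hop_ms / 1000)
    energy = median_filter(savgol5(short_term_energy(signal, frame_len, hop)))
    if not energy or max(energy) <= 0:
        return []
    cuts = hull_boundaries(energy, 0, len(energy) - 1,
                           min_depth_ratio * max(energy))
    return [c * hop / sample_rate for c in cuts]
```

Calling `segment(samples, 8000)` on a list of float samples returns boundary times measured from the start of the signal. With SciPy available, `scipy.signal.savgol_filter` and `scipy.signal.medfilt` could stand in for the two hand-written filters.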
Pages: 400-406
Number of pages: 6