Automatic segmentation of Hindi speech into syllable-like units

被引:0
|
作者
Kumari R. [1 ,2 ]
Dev A. [2 ]
Kumar A. [2 ,3 ]
机构
[1] Department of ECE, Maharaja Surajmal Institute of Technology GGSIPU, New Delhi
[2] Indira Gandhi Delhi Technical University for Women, New Delhi
[3] Department of ECE, Indira Gandhi Delhi Technical University for Women, New Delhi
关键词
Convex hull; Database; Short term energy; Speech segmentation; Syllable;
D O I
10.14569/IJACSA.2020.0110553
中图分类号
学科分类号
摘要
To develop the high-quality Text-to-Speech (TTS) system, appropriate segmentation of continuous speech into the syllabic units placed an important role. The research work has been implemented for automatic syllable based speech segmentation technique for continuous speech for the Hindi language. The experiments were conducted by using the energy convex hull approach for clean, continuous speech for Hindi. In this method, the Savitzky-Golay filter was applied on the short term energy (STE) signal to increase the signal to noise ratio (SNR), followed by applying the median filter to preserve the boundaries, hence smoothing the energy curve. Also, the Hamming sliding-window was applied twice on speech signal to get the more accurate depth of convex hull valleys. Further, the algorithm was tested on 50 unique utterances chosen from the travel domain. The accuracy of the proposed algorithm has been calculated and obtains that 76.07% syllables have time-error less than 30 ms with manual segmentation reference. The performance of the proposed algorithm is also analyzed and gives better-segmented accuracy as compared to the existing group delay segmentation technique for fricatives or nasal sounds. The syllable base segmented database is suitable for the speech technology system for Hindi in the travel domain. © 2020 Science and Information Organization.
引用
收藏
页码:400 / 406
页数:6
相关论文
共 50 条
  • [1] Automatic Segmentation of Hindi Speech into Syllable-Like Units
    Kumari, Ruchika
    Dev, Amita
    Kumar, Ashwani
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 400 - 406
  • [2] SEMI-AUTOMATIC SYLLABLE-LIKE SEGMENTATION FOR HINDI
    Balyan, Archana
    Agrawal, S. S.
    Dev, Amita
    Kumari, Ruchika
    2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [3] Automatic Segmentation of Chinese Mandarin Speech into Syllable-like
    Li, Jian
    Shen, Furao
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015, : 57 - 60
  • [4] Unsupervised word discovery from speech using automatic segmentation into syllable-like units
    Rasanen, Okko
    Doyle, Gabriel
    Frank, Michael C.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3204 - 3208
  • [5] Pre-linguistic segmentation of speech into syllable-like units
    Rasanen, Okko
    Doyle, Gabriel
    Frank, Michael C.
    COGNITION, 2018, 171 : 130 - 150
  • [6] Automatic transcription of continuous speech into syllable-like units for Indian languages
    Sarada, G. Lakshmi
    Lakshmi, A.
    Murthy, Hema A.
    Nagarajan, T.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2009, 34 (02): : 221 - 233
  • [7] Speech recognition using syllable-like units
    Hu, ZH
    Schalkwyk, J
    Barnard, E
    Cole, R
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1117 - 1120
  • [8] Subband-based group delay segmentation of spontaneous speech Into syllable-like units
    Nagarajan, T. (raju@lantana.iitm.ernet.in), 1600, Hindawi Publishing Corporation (2004):
  • [9] Subband-based group delay segmentation of spontaneous speech into syllable-like units
    Nagarajan, T
    Murthy, HA
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (17) : 2614 - 2625
  • [10] Subband-Based Group Delay Segmentation of Spontaneous Speech into Syllable-Like Units
    T. Nagarajan
    H.A. Murthy
    EURASIP Journal on Advances in Signal Processing, 2004