Estimation of glottal closure instants by considering speech signal as a spectrum

被引:5
|
作者
Sripriya, N. [1 ]
Nagarajan, T. [1 ]
机构
[1] SSN Coll Engn, Madras, Tamil Nadu, India
关键词
D O I
10.1049/el.2014.4444
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Close to glottal closure instants (GCIs), the speech signal is expected to change its amplitude rapidly and, at GCIs, it is expected to have strong negative peaks. A novel algorithm that exploits these two properties for the estimation of GCIs is presented. Here, a symmetrised speech segment is assumed to be a Fourier transform (FT) of an even function. In such a case, at the locations of the GCIs, the strong negative peaks in the symmetrised speech segment correspond to zeros that lie considerably outside the unit circle in the z-plane. The group delay spectrum of the time-domain signal derived by taking inverse FT of this assumed FT is expected to take a value close to -2 pi at the angular locations of these zeros. Mapping frequency scale to time scale, the frequency bins for which group delay reaches -2 pi correspond to the locations of GCIs. Theoretical justification for the proposed approach is also presented by defining a novel function called the conditional group delay function. Systematic evaluation is carried out on the CMU Arctic database and the performance of the proposed technique is better than that of the algorithms namely DYPSA, ZFF, YAGA and is close to that of SEDREAMS.
引用
收藏
页码:649 / 651
页数:2
相关论文
共 50 条
  • [31] Detection of Glottal Closure Instants from Speech Signals: A Convolutional Neural Network Based Method
    Yang, Shuai
    Wu, Zhiyong
    Shen, Binbin
    Meng, Helen
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 317 - 321
  • [32] DETECTION OF THE GLOTTAL CLOSURE BY JUMPS IN THE STATISTICAL PROPERTIES OF THE SPEECH SIGNAL
    MOULINES, E
    DIFRANCESCO, R
    SPEECH COMMUNICATION, 1990, 9 (5-6) : 401 - 418
  • [33] Detection of the Glottal Closure Instants Using Empirical Mode Decomposition
    Rajib Sharma
    S. R. M. Prasanna
    Hugo Leonardo Rufiner
    Gastón Schlotthauer
    Circuits, Systems, and Signal Processing, 2018, 37 : 3412 - 3440
  • [34] Detection of instants of glottal closure using characteristics of excitation source
    Guruprasad, S.
    Yegnanarayana, B.
    Murty, K. Sri Rama
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2572 - +
  • [35] Detection of Glottal Closure Instants Based on the Microcanonical Multiscale Formalism
    Khanagha, Vahid
    Daoudi, Khalid
    Yahia, Hussein M.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (12) : 1941 - 1950
  • [36] Detection of the Glottal Closure Instants Using Empirical Mode Decomposition
    Sharma, Rajib
    Prasanna, S. R. M.
    Leonardo Rufiner, Hugo
    Schlotthauer, Gaston
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (08) : 3412 - 3440
  • [37] A Frobenius Norm Approach to Glottal Closure Detection from the Speech Signal
    Ma, Changxue
    Kamp, Yves
    Willems, Lei F.
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02): : 258 - 265
  • [38] Significance of Glottal Closure Instants Detection Algorithms in Vocal Emotion Conversion
    Vekkot, Susmitha
    Tripathi, Shikha
    SOFT COMPUTING APPLICATIONS, SOFA 2016, VOL 1, 2018, 633 : 462 - 473
  • [39] Determination of glottal closure instants from clean and telephone quality speech signals using single frequency filtering
    Kadiri, Sudarsana Reddy
    Yegnanarayana, B.
    COMPUTER SPEECH AND LANGUAGE, 2020, 64
  • [40] Duration modification using glottal closure instants and vowel onset points
    Rao, K. Sreenivasa
    Yegnanarayana, B.
    SPEECH COMMUNICATION, 2009, 51 (12) : 1263 - 1269