Estimation of glottal closure instants by considering speech signal as a spectrum

被引:5
|
作者
Sripriya, N. [1 ]
Nagarajan, T. [1 ]
机构
[1] SSN Coll Engn, Madras, Tamil Nadu, India
关键词
D O I
10.1049/el.2014.4444
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Close to glottal closure instants (GCIs), the speech signal is expected to change its amplitude rapidly and, at GCIs, it is expected to have strong negative peaks. A novel algorithm that exploits these two properties for the estimation of GCIs is presented. Here, a symmetrised speech segment is assumed to be a Fourier transform (FT) of an even function. In such a case, at the locations of the GCIs, the strong negative peaks in the symmetrised speech segment correspond to zeros that lie considerably outside the unit circle in the z-plane. The group delay spectrum of the time-domain signal derived by taking inverse FT of this assumed FT is expected to take a value close to -2 pi at the angular locations of these zeros. Mapping frequency scale to time scale, the frequency bins for which group delay reaches -2 pi correspond to the locations of GCIs. Theoretical justification for the proposed approach is also presented by defining a novel function called the conditional group delay function. Systematic evaluation is carried out on the CMU Arctic database and the performance of the proposed technique is better than that of the algorithms namely DYPSA, ZFF, YAGA and is close to that of SEDREAMS.
引用
收藏
页码:649 / 651
页数:2
相关论文
共 50 条
  • [21] Classification-Based Detection of Glottal Closure Instants from Speech Signals
    Matousek, Jindrich
    Tihelka, Daniel
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3053 - 3057
  • [22] Glottal instants extraction from speech signal using Deep Feature Loss
    Shetty, Supritha M.
    Durgesht, Suraj
    Deepak, K. T.
    2022 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, SPCOM, 2022,
  • [23] GLOTTAL INSTANTS EXTRACTION FROM SPEECH SIGNAL USING GENERATIVE ADVERSARIAL NETWORK
    Deepak, K. T.
    Kulkarni, Pavitra
    Mudenagudi, U.
    Prasanna, S. R. M.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5946 - 5950
  • [24] Detection of Glottal Closure Instants from Raw Speech using Convolutional Neural Networks
    Goyal, Mohit
    Srivastava, Varun
    Prathosh, A. P.
    INTERSPEECH 2019, 2019, : 1591 - 1595
  • [25] Determination of glottal closure instants by harmonic superposition
    Hu, HT
    Hsu, ST
    Yu, C
    SIGNAL PROCESSING, 2003, 83 (09) : 1985 - 1995
  • [26] Foreground Speech Segmentation and Enhancement Using Glottal Closure Instants and Mel Cepstral Coefficients
    Deepak, K. T.
    Prasanna, S. R. Mahadeva
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (07) : 1204 - 1218
  • [27] Detection of Glottal Closure Instants in Degraded Speech using Single Frequency Filtering Analysis
    Aneeja, G.
    Kadiri, Sudarsana Reddy
    Yegnanarayana, B.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2300 - 2304
  • [28] Estimation of Glottal Closing and Opening Instants in Voiced Speech Using the YAGA Algorithm
    Thomas, Mark R. P.
    Gudnason, Jon
    Naylor, Patrick A.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (01): : 82 - 91
  • [29] Significance of Radius in the Phase-Difference-Based Approach to the Estimation of Glottal Closure Instants
    Rachel, G. Anushiya
    Vijayalakshmi, P.
    Nagarajan, T.
    2018 2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, AND SIGNAL PROCESSING (ICCCSP): SPECIAL FOCUS ON TECHNOLOGY AND INNOVATION FOR SMART ENVIRONMENT, 2018, : 194 - +
  • [30] Exploring Bessel Features for Detection of Glottal Closure Instants
    Prakash, Chetana
    Dhananjaya, N.
    Gangashetty, Suryakanth V.
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1996 - +