Quasi-closed phase forward-backward linear prediction analysis of speech for accurate formant detection and estimation

Cited by: 10
Authors
Gowda, Dhananjaya [1, 2]
Airaksinen, Manu [1 ]
Alku, Paavo [1 ]
Affiliations
[1] Aalto Univ, Dept Signal Proc & Acoust, Otakaari 5, FI-00076 Espoo, Finland
[2] Samsung Elect, DMC R&D Ctr, Seoul, South Korea
Source
Funding
Academy of Finland;
Keywords
SELECTION;
DOI
10.1121/1.5001512
Chinese Library Classification number
O42 [Acoustics];
Subject classification codes
070206; 082403;
Abstract
Recently, a quasi-closed phase (QCP) analysis of speech signals was proposed for accurate glottal inverse filtering. However, QCP analysis, which belongs to the family of temporally weighted linear prediction (WLP) methods, uses the conventional forward type of sample prediction. This may not be the best choice, especially when computing WLP models with a hard-limiting weighting function: sample-selective minimization of the prediction error in WLP reduces the effective number of samples available within a given window frame. To counter this problem, a modified quasi-closed phase forward-backward (QCP-FB) analysis is proposed, wherein each sample is predicted from its past as well as its future samples, thereby using the available samples more effectively. Formant detection and estimation experiments on synthetic vowels generated using a physical modeling approach, as well as on natural speech utterances, show that the proposed QCP-FB method yields statistically significant improvements over the conventional linear prediction and QCP methods. © 2017 Acoustical Society of America.
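The forward-backward idea in the abstract can be illustrated with a small sketch. This is not the authors' QCP-FB implementation: the function names `weighted_fb_lp` and `solve_linear`, the choice to index the weight by the predicted sample, and the toy 0/1 weighting pattern are all assumptions made for illustration. The sketch minimizes a weighted sum of squared forward and backward prediction errors with one shared set of real coefficients, so each retained sample contributes up to two equations instead of one.

```python
import math


def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def weighted_fb_lp(x, w, order):
    """Hypothetical helper: weighted forward-backward linear prediction.

    Returns coefficients a[0..order-1] minimizing
        sum_n w[n] * (x[n] - sum_k a[k-1] * x[n-k])**2   (forward)
      + sum_n w[n] * (x[n] - sum_k a[k-1] * x[n+k])**2   (backward)
    Assumption: the weight is indexed by the sample being predicted.
    """
    N = len(x)
    R = [[0.0] * order for _ in range(order)]
    r = [0.0] * order
    for n in range(N):
        if w[n] == 0.0:
            continue  # a hard-limiting weight drops this sample entirely
        contexts = []
        if n - order >= 0:  # enough past samples for forward prediction
            contexts.append([x[n - k] for k in range(1, order + 1)])
        if n + order < N:   # enough future samples for backward prediction
            contexts.append([x[n + k] for k in range(1, order + 1)])
        # Accumulate the normal equations R a = r over both directions.
        for ctx in contexts:
            for i in range(order):
                r[i] += w[n] * x[n] * ctx[i]
                for j in range(order):
                    R[i][j] += w[n] * ctx[i] * ctx[j]
    return solve_linear(R, r)


# Toy check on a pure sinusoid, which satisfies
# x[n] = 2*cos(omega)*x[n-1] - x[n-2] in both time directions.
omega = 0.3
x = [math.cos(omega * n + 0.5) for n in range(64)]
w = [0.0 if n % 8 < 3 else 1.0 for n in range(64)]  # toy 0/1 weighting, not QCP
a = weighted_fb_lp(x, w, 2)
print(a)  # should be close to [2*cos(omega), -1]
```

A pure sinusoid is used here only because the same coefficients predict it exactly in both directions, which makes the result easy to verify; real speech frames and the actual QCP weighting function would behave differently.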
Pages: 1542-1553
Number of pages: 12
Related papers
14 in total
  • [1] Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks
    Gowda, Dhananjaya N.
    Bollepalli, Bajibabu
    Kadiri, Sudarsana Reddy
    Alku, Paavo
    IEEE ACCESS, 2021, 9 : 151631 - 151640
  • [2] Time-varying quasi-closed-phase weighted linear prediction analysis of speech for accurate formant detection and tracking
    Gowda, Dhananjaya
    Alku, Paavo
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1760 - 1764
  • [3] QUASI CLOSED PHASE ANALYSIS OF SPEECH SIGNALS USING TIME VARYING WEIGHTED LINEAR PREDICTION FOR ACCURATE FORMANT TRACKING
    Gowda, Dhananjaya
    Airaksinen, Manu
    Alku, Paavo
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 4980 - 4984
  • [4] Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals
    Gowda, Dhananjaya
    Kadiri, Sudarsana Reddy
    Story, Brad
    Alku, Paavo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1901 - 1914
  • [5] Time-varying quasi-closed-phase analysis for accurate formant tracking in speech signals
    Gowda, Dhananjaya
    Kadiri, Sudarsana Reddy
    Story, Brad
    Alku, Paavo
    arXiv, 2023,
  • [6] Subspace based analysis of the modified forward-backward linear prediction method
    Reddy, V.U.
    Ajayakumari, K.
    IETE Journal of Research, 1988, 34 (05) : 408 - 415
  • [7] Effect of subarray size on direction estimation of coherent cyclostationary signals based on forward-backward linear prediction
    Xin, J
    Sano, A
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2002, E85A (08): : 1807 - 1821
  • [8] Packet Loss Concealment Estimating Residual Errors of Forward-Backward Linear Prediction for Bone-Conducted Speech
    Ohidujjaman
    Yasui, Nozomiko
    Sugiura, Yosuke
    Shimamura, Tetsuya
    Makinae, Hisanori
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (04) : 1263 - 1268
  • [9] Parameter Estimation in Spectral Resolution Enhancement Based on Forward-Backward Linear Prediction Total Least Square Method
    Qin, Yusheng
    Han, Xin
    Li, Xiangxian
    Tong, Jingjing
    Gao, Minguang
    APPLIED SPECTROSCOPY, 2023, 77 (09) : 1025 - 1032
  • [10] Quasi Closed Phase Glottal Inverse Filtering Analysis With Weighted Linear Prediction
    Airaksinen, Manu
    Raitio, Tuomo
    Story, Brad
    Alku, Paavo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (03) : 596 - 607