Automatic glottal inverse filtering with non-negative matrix factorization

被引:0
作者
Airaksinen, Manu [1 ]
Juvela, Lauri [1 ]
Backstrom, Tom [2 ]
Alku, Paavo [1 ]
机构
[1] Aalto Univ, Espoo, Finland
[2] Friedrich Alexander Univ, Int Audio Labs Erlangen, Erlangen, Germany
来源
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年
基金
芬兰科学院;
关键词
speech analysis; glottal inverse filtering; non-negative matrix factorization; FEATURE-EXTRACTION; LINEAR PREDICTION;
D O I
10.21437/Interspeech.2016-338
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This study presents an automatic glottal inverse filtering (GIF) technique based on separating the effect of the glottal main excitation from the impulse response of the vocal tract. The proposed method is based on a non-negative matrix factorization (NMF) based decomposition of an ultra short-term spectrogram of the analyzed signal. Unlike other state-of-theart GIF techniques, the proposed method does not require estimation of glottal closure instants. The proposed method was objectively evaluated with two test sets of continuous synthetic speech created with a glottal vocoding analysis/synthesis procedure. When compared to a set of reference GIF methods, the proposed NMF technique shows improved estimation accuracy especially for male voices.
引用
收藏
页码:1039 / 1043
页数:5
相关论文
共 33 条
  • [31] Smaragdis P., 2004, INT S ICA BSS
  • [32] Monaural sound source separation by nonnegative matrix factorization with tempora continuity and sparseness criteria
    Virtanen, Tuomas
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 1066 - 1074
  • [33] LEAST-SQUARES GLOTTAL INVERSE FILTERING FROM THE ACOUSTIC SPEECH WAVEFORM
    WONG, DY
    MARKEL, JD
    GRAY, AH
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (04): : 350 - 355