Normalized amplitude quotient for parametrization of the glottal flow

被引:232
作者
Alku, P
Bäckström, T
Vilkman, E
机构
[1] Helsinki Univ Technol, Lab Acoust & Audio Signal Proc, FIN-02015 Helsinki, Finland
[2] Univ Oulu, Dept Otolaryngol & Phoniatr, Oulu, Finland
[3] Univ Helsinki, Cent Hosp, FIN-00029 Hus, Finland
关键词
D O I
10.1121/1.1490365
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Normalized amplitude quotient (NAQ) is presented as a method to parametrize the glottal closing phase using two amplitude-domain measurements from waveforms estimated by inverse filtering. In this technique, the ratio between the amplitude of the ac flow and the negative peak amplitude of the flow derivative is first computed using,the concept of equivalent rectangular pulse, a hypothetical signal located at the instant of the main excitation of the vocal tract. This ratio is then normalized with respect to the length of the fundamental period. Comparison between NAQ and its counterpart among the conventional time-domain parameters, the closing quotient, shows that the proposed parameter is more robust against distortion such as measurement noise that make the extraction of conventional time-based parameters of the glottal flow problematic. Experiments with breathy, normal, and pressed vowels indicate that NAQ is also able to separate the type of phonation effectively. (C) 2002 Acoustical Society of America.
引用
收藏
页码:701 / 710
页数:10
相关论文
共 28 条
[1]   Parabolic spectral parameter - A new method for quantification of the glottal flow [J].
Alku, P ;
Strik, H ;
Vilkman, E .
SPEECH COMMUNICATION, 1997, 22 (01) :67-79
[2]   A comparison of glottal voice source quantification parameters in breathy, normal and pressed phonation of female and male speakers [J].
Alku, P ;
Vilkman, E .
FOLIA PHONIATRICA ET LOGOPAEDICA, 1996, 48 (05) :240-254
[3]   Amplitude domain quotient for characterization of the glottal volume velocity waveform estimated by inverse filtering [J].
Alku, P ;
Vilkman, E .
SPEECH COMMUNICATION, 1996, 18 (02) :131-138
[4]  
ALKU P, 1994, P INT C SPOK LANG PR, P1619
[5]  
[Anonymous], 1985, STL QPSR, DOI DOI 10.1016/0167-6393(89)90001-0
[6]  
[Anonymous], P IEEE INT C AC SPEE
[7]  
CARLSON B, 1986, COMMUNICATION SYSTEM, P177
[8]   VOCAL QUALITY FACTORS - ANALYSIS, SYNTHESIS, AND PERCEPTION [J].
CHILDERS, DG ;
LEE, CK .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 90 (05) :2394-2410
[9]   GLOTTAL AIR-FLOW AND ELECTROGLOTTOGRAPHIC MEASURES OF VOCAL FUNCTION AT MULTIPLE INTENSITIES [J].
DROMEY, C ;
STATHOPOULOS, ET ;
SAPIENZA, CM .
JOURNAL OF VOICE, 1992, 6 (01) :44-54
[10]   DISCRETE ALL-POLE MODELING [J].
ELJAROUDI, A ;
MAKHOUL, J .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (02) :411-423