Frequency and voice: Perspectives in the time domain

被引:40
作者
Roark, Rick M.
机构
[1] New York Med Coll, Dept Otolaryngol, Valhalla, NY 10595 USA
[2] New York Eye & Ear Infirm, New York, NY 10003 USA
关键词
frequency; analytic signal; empirical mode decomposition; instantaneous frequency; signal modeling;
D O I
10.1016/j.jvoice.2005.12.009
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Frequency variation is one of the most primitive features of voice production, endowing language and communication with richness and efficiency and enhancing enjoyment of the voice arts. In the first of two tutorial articles, the subject of frequency is examined formally, beginning in the time domain. A companion article explores the topic of frequency and voice from the frequency domain perspective. Frequency is a well-defined quantity of the sinusoidal function and of periodic functions of time. However, voice is inherently nonstationary, even over short time segments, to degrees that range from minor (stable vowels of a healthy voice) to major (singing voice and voiced consonants). For signals that are not periodic, the notion of frequency is ambiguous and often altogether unclear, which has led to a multitude of frequency-measurement techniques and discrepancy of measures. This article identifies the source of these discrepancies for a variety of time-domain techniques that are examined in the absence of noise. In the time domain, the subject of frequency is inherently coupled to the topic of signal modeling, which is explored in some detail. Sinusoidal models having time-varying phase are examined with the objective of achieving a frequency description of voice that is both continuous and instantaneous. The analytic signal method of mathematical physics is discussed and applied to the technology of empirical mode decomposition to demonstrate that the frequencies of voice may be comprehensively examined from the time domain point of view.
引用
收藏
页码:325 / 354
页数:30
相关论文
共 45 条
[1]  
[Anonymous], 2000, American Heritage dictionary of the English language, V4th
[2]  
[Anonymous], P IEEE, DOI DOI 10.1109/5.135378
[3]  
[Anonymous], 1983, PITCH DETERMINATION, DOI DOI 10.1007/978-3-642-81926-1
[4]  
Antosik P, 1973, THEORY DISTRIBUTIONS
[5]  
Baken RJ, 2000, CLIN MEASUREMENT SPE
[6]   Comparison of voice analysis systems for perturbation measurement [J].
Bielamowicz, S ;
Kreiman, J ;
Gerratt, BR ;
Dauer, MS ;
Berke, GS .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1996, 39 (01) :126-134
[7]   ESTIMATING AND INTERPRETING THE INSTANTANEOUS FREQUENCY OF A SIGNAL .1. FUNDAMENTALS [J].
BOASHASH, B .
PROCEEDINGS OF THE IEEE, 1992, 80 (04) :520-538
[8]  
BRIEDLANDER B, 1995, IEEE T SIGNAL PROCES, V43, P917
[9]  
COHEN L, 1991, SIGNAL P, V19, P301
[10]  
Cohen L., 1995, TIME FREQUENCY ANAL