Frequency and voice: Perspectives in the time domain

被引:40
作者
Roark, Rick M.
机构
[1] New York Med Coll, Dept Otolaryngol, Valhalla, NY 10595 USA
[2] New York Eye & Ear Infirm, New York, NY 10003 USA
关键词
frequency; analytic signal; empirical mode decomposition; instantaneous frequency; signal modeling;
D O I
10.1016/j.jvoice.2005.12.009
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Frequency variation is one of the most primitive features of voice production, endowing language and communication with richness and efficiency and enhancing enjoyment of the voice arts. In the first of two tutorial articles, the subject of frequency is examined formally, beginning in the time domain. A companion article explores the topic of frequency and voice from the frequency domain perspective. Frequency is a well-defined quantity of the sinusoidal function and of periodic functions of time. However, voice is inherently nonstationary, even over short time segments, to degrees that range from minor (stable vowels of a healthy voice) to major (singing voice and voiced consonants). For signals that are not periodic, the notion of frequency is ambiguous and often altogether unclear, which has led to a multitude of frequency-measurement techniques and discrepancy of measures. This article identifies the source of these discrepancies for a variety of time-domain techniques that are examined in the absence of noise. In the time domain, the subject of frequency is inherently coupled to the topic of signal modeling, which is explored in some detail. Sinusoidal models having time-varying phase are examined with the objective of achieving a frequency description of voice that is both continuous and instantaneous. The analytic signal method of mathematical physics is discussed and applied to the technology of empirical mode decomposition to demonstrate that the frequencies of voice may be comprehensively examined from the time domain point of view.
引用
收藏
页码:325 / 354
页数:30
相关论文
共 45 条
[31]   ENERGY SEPARATION IN SIGNAL MODULATIONS WITH APPLICATION TO SPEECH ANALYSIS [J].
MARAGOS, P ;
KAISER, JF ;
QUATIERI, TF .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1993, 41 (10) :3024-3051
[32]  
Mathews J., 1970, Mathematical methods of physics
[33]  
OONINCX PJ, 2004, ECUA 2004 P 7 EUR C
[34]  
Oppenheim A. V., 1989, DISCRETE TIME SIGNAL
[35]  
PICINBONO B, 1983, ANN TELECOMMUN, V38, P179
[36]  
Rabiner L.R., 2010, Digital Processing of Speech Signals
[37]   B-spline design of maximally flat and prolate spheroidal-type FIR filters [J].
Roark, RM ;
Escabí, MA .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1999, 47 (03) :701-716
[38]   DIGITAL SIGNAL PROCESSING APPROACH TO INTERPOLATION [J].
SCHAFER, RW ;
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1973, 61 (06) :692-702
[39]  
Stevens KN., 2000, ACOUSTIC PHONETICS
[40]  
Teukolsky SA, 1992, NUMERICAL RECIPES C, VSecond