Frequency warping and the Mel scale

被引:25
作者
Umesh, S [1 ]
Cohen, L
Nelson, D
机构
[1] Indian Inst Technol, Dept Elect Engn, Kanpur 208016, Uttar Pradesh, India
[2] CUNY Hunter Coll, Dept Phys, New York, NY 10021 USA
[3] US Dept Def, Ft George G Meade, MD 20755 USA
关键词
frequency-warping; Mel scale; nonuniform scaling; speaker-normalization;
D O I
10.1109/97.995829
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present experimental results that show that the scale-factor relating the formant frequencies of different speakers increases with decreasing values of formant frequency. Based on these results, we experimentally obtain a frequency warping function aimed at separating speaker dependencies from the inherent characterization of the sound. We find that the frequency warping function is similar to the Mel scale, and we believe that this is the first time that a Mel-like scale has been obtained using only speech. Our results and methods may therefore explain, from a speech point of view, the Mel scale, which was obtained historically from hearing based experiments.
引用
收藏
页码:104 / 107
页数:4
相关论文
共 7 条
  • [1] [Anonymous], AMERICAN J PSYCHOL
  • [2] EIDE E, P IEEE ICASSP 96 ATL, P346
  • [3] FANT G, 1975, STL QPSR, V2, P1
  • [4] A frequency warping approach to speaker normalization
    Lee, L
    Rose, R
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (01): : 49 - 60
  • [5] Scale transform in speech analysis
    Umesh, S
    Cohen, L
    Marinovic, N
    Nelson, DJ
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (01): : 40 - 45
  • [6] Umesh S, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P414, DOI 10.1109/ICSLP.1996.607142
  • [7] UMESH S, 1998, P INT SOC OPT ENG SA, P414