Refining algorithmic estimation of relative fundamental frequency: Accounting for sample characteristics and fundamental frequency estimation method

被引:14
|
作者
Vojtech, Jennifer M. [1 ,2 ]
Segina, Roxanne K. [2 ]
Buckley, Daniel P. [2 ,3 ]
Kolin, Katharine R. [2 ]
Tardif, Monique C. [2 ]
Noordzij, J. Pieter [3 ]
Stepp, Cara E. [1 ,2 ,4 ]
机构
[1] Boston Univ, Dept Biomed Engn, 44 Cummington Mall, Boston, MA 02215 USA
[2] Boston Univ, Dept Speech Language & Hearing Sci, 635 Commonwealth Ave, Boston, MA 02215 USA
[3] Boston Univ, Sch Med, Dept Otolaryngol Head & Neck Surg, 72 East Concord St, Boston, MA 02118 USA
[4] Boston Univ, Sch Med, Dept Otolaryngol Head & Neck Surg, Boston, MA 02215 USA
来源
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
AUDITORY-PERCEPTUAL EVALUATION; VOICING OFFSET; VOCAL EFFORT; SPASMODIC DYSPHONIA; PITCH STRENGTH; ONSET; SPEECH; MUSCLE; HYPERFUNCTION; INDIVIDUALS;
D O I
10.1121/1.5131025
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Relative fundamental frequency (RFF) is a promising acoustic measure for evaluating voice disorders. Yet, the accuracy of the current RFF algorithm varies across a broad range of vocal signals. The authors investigated how fundamental frequency (f(o)) estimation and sample characteristics impact the relationship between manual and semi-automated RFF estimates. Acoustic recordings were collected from 227 individuals with and 256 individuals without voice disorders. Common f(o) estimation techniques were compared to the autocorrelation method currently implemented in the RFF algorithm. Pitch strength-based categories were constructed using a training set (1158 samples), and algorithm thresholds were tuned to each category. RFF was then computed on an independent test set (291 samples) using category-specific thresholds and compared against manual RFF via mean bias error (MBE) and root-mean-square error (RMSE). Auditory-SWIPE' for f(o) estimation led to the greatest correspondence with manual RFF and was implemented in concert with category-specific thresholds. Refining f(o) estimation and accounting for sample characteristics led to increased correspondence with manual RFF [MBE - 0.01 semitones (ST), RMSE - 0.28 ST] compared to the unmodified algorithm (MBE - 0.90 ST, RMSE - 0.34 ST), reducing the MBE and RMSE of semi-automated RFF estimates by 88.4% and 17.3%, respectively. (C) 2019 Acoustical Society of America.
引用
收藏
页码:3184 / 3202
页数:19
相关论文
共 50 条
  • [21] FAST AND STATISTICALLY EFFICIENT FUNDAMENTAL FREQUENCY ESTIMATION
    Nielsen, Jesper Kjaer
    Jensen, Tobias Lindstrom
    Jensen, Jesper Rindom
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 86 - 90
  • [22] Validation of an Algorithm for Semi-automated Estimation of Voice Relative Fundamental Frequency
    Lien, Yu-An S.
    Murray, Elizabeth S. Heller
    Calabrese, Carolyn R.
    Michener, Carolyn M.
    Van Stan, Jarrad H.
    Mehta, Daryush D.
    Hillman, Robert E.
    Noordzij, J. Pieter
    Stepp, Cara E.
    ANNALS OF OTOLOGY RHINOLOGY AND LARYNGOLOGY, 2017, 126 (10): : 712 - 716
  • [23] SCIPI, a method of precise fundamental frequency estimation from DFT spectrum
    Michna, Viktor
    Cernohorsky, Jindrich
    2011 INTERNATIONAL CONFERENCE ON APPLIED ELECTRONICS (AE), 2011,
  • [24] A fundamental frequency estimation method for noisy speech based on periodicity and harmonicity
    Ishimoto, Y
    Unoki, M
    Akagi, M
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4019 - 4019
  • [25] Analysis and Improvements of the Cepstrum Method for Fundamental Frequency Estimation in Music Signals
    Gauer, Johannes
    Kleingarn, Diana
    Martin, Rainer
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 371 - 375
  • [26] Robust Estimation of Fundamental Frequency using Single Frequency Filtering Approach
    Pannala, Vishala
    Aneeja, G.
    Kadiri, Sudarsana Reddy
    Yegnanarayana, B.
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2155 - 2159
  • [27] Spectral refinement and its application to fundamental frequency estimation
    Krini, Mohamed
    Schmidt, Gerhard
    2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 181 - 184
  • [28] Robust subspace-based fundamental frequency estimation
    Christensen, Mads G.
    Vera-Candeas, Pedro
    Somasundaram, Samuel D.
    Jakobsson, Andreas
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 101 - +
  • [29] Estimation of fundamental frequency of speech using microphone array
    Tanigawa, Shinichi
    Kikuchi, Takafumi
    Yamaoka, Tateo
    Hamada, Nozomu
    Conference Record of the Asilomar Conference on Signals, Systems and Computers, 1999, 2 : 1115 - 1119
  • [30] Multiple fundamental frequency estimation of polyphonic music signals
    Yeh, C
    Röbel, A
    Rodet, X
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 225 - 228