Automated Relative Fundamental Frequency Algorithms for Use With Neck-Surface Accelerometer Signals

被引:5
作者
Groll, Matti D. [1 ,2 ]
Vojtech, Jennifer M. [1 ,2 ]
Hablani, Surbhi [2 ]
Mehta, Daryush D. [3 ,4 ,5 ,6 ,7 ]
Buckley, Daniel P. [2 ,8 ]
Noordzij, J. Pieter [8 ]
Stepp, Cara E. [1 ,2 ,8 ]
机构
[1] Boston Univ, Dept Biomed Engn, 677 Beacon St, Boston, MA 02215 USA
[2] Boston Univ, Dept Speech Language & Hearing Sci, Boston, MA 02215 USA
[3] Massachusetts Gen Hosp, Ctr Laryneal Surg & Voice Rehabil, Boston, MA 02114 USA
[4] Massachusetts Gen Hosp, MGH Inst Hlth Profess, Boston, MA 02114 USA
[5] Harvard Med Sch, Dept Surg, Boston, MA 02144 USA
[6] MGH Inst Hlth Profess, Program Rehabil Sci, Boston, MA 02129 USA
[7] Harvard Med Sch, Div Med Sci, Speech & Hearing Biosci & Technol Program, Boston, MA 02144 USA
[8] Boston Univ, Sch Med, Dept Otolaryngol Head & Neck Surg, Boston, MA 02118 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Relative fundamental frequency; Accelerometer; Vocal hyperfunction; VOICING OFFSET; VOCAL EFFORT; SPEECH; ONSET; AERODYNAMICS; INDIVIDUALS; PERCEPTION; MICROPHONE; VIBRATION;
D O I
10.1016/j.jvoice.2020.06.001
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Objective. Relative fundamental frequency (RFF) has been suggested as a potential acoustic measure of vocal effort. However, current clinical standards for RFF measures require time-consuming manual markings. Previous semi-automated algorithms have been developed to calculate RFF from microphone signals. The current study aimed to develop fully automated algorithms to calculate RFF from neck-surface accelerometer signals for ecological momentary assessment and ambulatory monitoring of voice. Methods. Training a set of 2646 /vowel-fricative-vowel/ utterances from 317 unique speakers, with and without voice disorders, was used to develop automated algorithms to calculate RFF values from neck-surface accelerometer signals. The algorithms first rejected utterances with poor vowel-to-noise ratios, then identified fricative locations, then used signal features to determine voicing boundary cycles, and finally calculated corresponding RFF values. These automated RFF values were compared to the clinical gold-standard of manual RFF calculated from simultaneously collected microphone signals in a novel test set of 639 utterances from 77 unique speakers. Results. Automated accelerometer-based RFF values resulted in an average mean bias error (MBE) across all cycles of 0.027 ST, with an MBE of 0.152 ST and -0.252 ST in the offset and onset cycles closest to the fricative, respectively. Conclusion. All MBE values were smaller than the expected changes in RFF values following successful voice therapy, suggesting that the current algorithms could be used for ecological momentary assessment and ambulatory monitoring via neck-surface accelerometer signals.
引用
收藏
页码:156 / 169
页数:14
相关论文
共 45 条
  • [1] Azarov E, 2016, INT CONF ACOUST SPEE, P4970, DOI 10.1109/ICASSP.2016.7472623
  • [2] Baken R.J., 1987, J VOICE, V1, P68, DOI [10.1016/s0892-1997(87)80027-9 ala, DOI 10.1016/S0892-1997(87)80027-9ALA]
  • [3] NOISE-LEVELS IN AN URBAN HOSPITAL AND WORKERS SUBJECTIVE RESPONSES
    BAYO, MV
    GARCIA, AM
    GARCIA, A
    [J]. ARCHIVES OF ENVIRONMENTAL HEALTH, 1995, 50 (03): : 247 - 251
  • [4] Boersma P., 2021, Praat: Doing phonetics by computer
  • [5] Camacho A., 2012, 2012 11 INT C INF SC
  • [6] Camacho A., 2007, SWIPE: A sawtooth waveform inspired pitch estimator for speech and music
  • [7] A sawtooth waveform inspired pitch estimator for speech and music
    Camacho, Arturo
    Harris, John G.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 124 (03) : 1638 - 1652
  • [8] Development and testing of a portable vocal accumulator
    Cheyne, HA
    Hanson, HM
    Genereux, RP
    Stevens, KN
    Hillman, RE
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2003, 46 (06): : 1457 - 1467
  • [9] Ambulatory assessment of phonotraumatic vocal hyperfunction using glottal airflow measures estimated from neck-surface acceleration
    Cortes, Juan P.
    Espinoza, Victor M.
    Ghassemi, Marzyeh
    Mehta, Daryush D.
    Van Stan, Jarrad H.
    Hillman, Robert E.
    Guttag, John, V
    Zanartu, Matias
    [J]. PLOS ONE, 2018, 13 (12):
  • [10] Acoustic Correlate of Vocal Effort in Spasmodic Dysphonia
    Eadie, Tanya L.
    Stepp, Cara E.
    [J]. ANNALS OF OTOLOGY RHINOLOGY AND LARYNGOLOGY, 2013, 122 (03) : 169 - 176