No Musician Advantage in the Perception of Degraded-Fundamental Frequency Speech in Noisy Environments

被引:0
|
作者
Hsieh, I-Hui [1 ,2 ]
Guo, Yu-Jyun [1 ]
机构
[1] Natl Cent Univ, Inst Cognit Neurosci, Taoyuan City, Taiwan
[2] Natl Cent Univ, Cognit Intelligence & Precis Healthcare Ctr, Taoyuan City, Taiwan
来源
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH | 2023年 / 66卷 / 08期
关键词
CUES; ABILITY; INTELLIGIBILITY; SEGMENTATION; MANDARIN; CONTOURS; CONTEXT;
D O I
10.1044/2023_JSLHR-22-00662
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Purpose: Pitch variations of the fundamental frequency (fo) contour contribute to speech perception in noisy environments, but whether musicians confer an advantage in speech in noise (SIN) with altered fo information remains unclear. This study investigated the effects of different levels of degraded fo contour (i.e., conveying lexical tone or intonation information) on musician advantage in speech-in-noise perception.Method: A cohort of native Mandarin Chinese speakers, comprising 30 trained musicians and 30 nonmusicians, were tested on the intelligibility of Mandarin Chinese sentences with natural, flattened-tone, flattened-intonation, and flattened-all fo contours embedded in background noise masked under three signal-to-noise ratios (0, -5, and -9 dB). Pitch difference thresholds and innate musical skills associated with speech-in-noise benefits were also assessed.Results: Speech intelligibility score improved with increasing signal-to-noise level for both musicians and nonmusicians. However, no musician advantage was observed for identifying any type of flattened-fo contour SIN. Musicians exhibited smaller fo pitch discrimination limens than nonmusicians, which correlated with benefits for perceiving speech with intact tone-level fo information. Regardless of musician status, performance on the pitch and accent musical skill subtests correlated with speech intelligibility score. Conclusions: Collectively, these results provide no evidence for a musician advantage for perceiving speech with distorted fo information in noisy environments. Results further show that perceptual musical skills on pitch and accent processing may benefit the perception of SIN, independent of formal musical training. Our findings suggest that the potential application of music training in speech perception in noisy backgrounds is not contingent on the ability to process fo pitch contours, at least for Mandarin Chinese speakers.Supplemental Material: https://doi.org/10.23641/asha.23706354
引用
收藏
页码:2643 / 2655
页数:13
相关论文
共 44 条
  • [21] Extraction of Fundamental Frequency From Degraded Speech Using Temporal Envelopes at High SNR Frequencies
    Aneeja, G.
    Yegnanarayana, B.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (04) : 829 - 838
  • [22] Robust Recognition of English Speech in Noisy Environments Using Frequency Warped Signal Processing
    Navneet Upadhyay
    Hamurabi Gamboa Rosales
    National Academy Science Letters, 2018, 41 : 15 - 22
  • [23] Robust Recognition of English Speech in Noisy Environments Using Frequency Warped Signal Processing
    Upadhyay, Navneet
    Gamboa Rosales, Hamurabi
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2018, 41 (01): : 15 - 22
  • [24] Fundamental frequency is critical to speech perception in noise in combined acoustic and electric hearing
    Carroll, Jeff
    Tiaden, Stephanie
    Zeng, Fan-Gang
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 130 (04): : 2054 - 2062
  • [25] Effects of differences in fundamental frequency on across-formant grouping in speech perception
    Summers, Robert J.
    Bailey, Peter J.
    Roberts, Brian
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 128 (06): : 3667 - 3677
  • [26] DISCRIMINATION OF FUNDAMENTAL FREQUENCY CONTOURS IN SYNTHETIC SPEECH - IMPLICATIONS FOR MODELS OF PITCH PERCEPTION
    KLATT, DH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1973, 53 (01): : 8 - 16
  • [27] Fundamental frequency estimation for noisy speech using entropy-weighted periodic and harmonic features
    Ishimoto, Y
    Ishizuka, K
    Aikawa, K
    Akagi, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (01): : 205 - 214
  • [28] Deep Learning-Based Coding Strategy for Improved Cochlear Implant Speech Perception in Noisy Environments
    Essaid, Billel
    Kheddar, Hamza
    Batel, Noureddine
    Chowdhury, Muhammad E. H.
    IEEE ACCESS, 2025, 13 : 35707 - 35732
  • [29] Combining multi-band and frequency-filtering techniques for speech recognition in noisy environments
    Jancovic, P
    Ming, J
    Hanna, P
    Stewart, D
    Smith, J
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 265 - 270
  • [30] PERCEPTION OF SPEECH PATTERN CONTRASTS FROM AUDITORY PRESENTATION OF VOICE FUNDAMENTAL-FREQUENCY
    BOOTHROYD, A
    EAR AND HEARING, 1988, 9 (06): : 313 - 321