No Musician Advantage in the Perception of Degraded-Fundamental Frequency Speech in Noisy Environments

Cited by: 0

Authors:

Hsieh, I-Hui [1, 2]

Guo, Yu-Jyun [1]

Affiliations:

[1] Natl Cent Univ, Inst Cognit Neurosci, Taoyuan City, Taiwan

[2] Natl Cent Univ, Cognit Intelligence & Precis Healthcare Ctr, Taoyuan City, Taiwan

Source:

JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH | 2023 / Vol. 66 / Issue 08

Keywords:

CUES; ABILITY; INTELLIGIBILITY; SEGMENTATION; MANDARIN; CONTOURS; CONTEXT;

DOI:

10.1044/2023_JSLHR-22-00662

Chinese Library Classification (CLC) number:

R36 [Pathology]; R76 [Otorhinolaryngology];

Discipline classification code:

100104 ; 100213 ;

Abstract:

Purpose: Pitch variations of the fundamental frequency (fo) contour contribute to speech perception in noisy environments, but whether musicians confer an advantage in speech in noise (SIN) with altered fo information remains unclear. This study investigated the effects of different levels of degraded fo contour (i.e., conveying lexical tone or intonation information) on musician advantage in speech-in-noise perception.

Method: A cohort of native Mandarin Chinese speakers, comprising 30 trained musicians and 30 nonmusicians, were tested on the intelligibility of Mandarin Chinese sentences with natural, flattened-tone, flattened-intonation, and flattened-all fo contours embedded in background noise masked under three signal-to-noise ratios (0, -5, and -9 dB). Pitch difference thresholds and innate musical skills associated with speech-in-noise benefits were also assessed.

Results: Speech intelligibility score improved with increasing signal-to-noise level for both musicians and nonmusicians. However, no musician advantage was observed for identifying any type of flattened-fo contour SIN. Musicians exhibited smaller fo pitch discrimination limens than nonmusicians, which correlated with benefits for perceiving speech with intact tone-level fo information. Regardless of musician status, performance on the pitch and accent musical skill subtests correlated with speech intelligibility score.

Conclusions: Collectively, these results provide no evidence for a musician advantage for perceiving speech with distorted fo information in noisy environments. Results further show that perceptual musical skills on pitch and accent processing may benefit the perception of SIN, independent of formal musical training. Our findings suggest that the potential application of music training in speech perception in noisy backgrounds is not contingent on the ability to process fo pitch contours, at least for Mandarin Chinese speakers.

Supplemental Material: https://doi.org/10.23641/asha.23706354

Pages: 2643 - 2655

Number of pages: 13
