The 1st Clarity Prediction Challenge: A machine learning challenge for hearing aid intelligibility prediction

被引:9
作者
Barker, Jon [1 ]
Akeroyd, Michael A. [2 ]
Cox, Trevor J. [3 ]
Culling, John F. [4 ]
Firth, Jennifer [2 ]
Graetzer, Simone [3 ]
Griffiths, Holly [2 ]
Harris, Lara [3 ]
Viveros-Munoz, Rhoddy [4 ]
Naylor, Graham [2 ]
Podwinska, Zuzanna [3 ]
Porter, Eszter [2 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield, S Yorkshire, England
[2] Univ Nottingham, Sch Med, Nottingham, England
[3] Univ Salford, Acoust Res Ctr, Salford, Lancs, England
[4] Cardiff Univ, Sch Psychol, Cardiff, Wales
来源
INTERSPEECH 2022 | 2022年
基金
英国工程与自然科学研究理事会;
关键词
speech-in-noise; speech intelligibility; hearing aid; hearing loss; machine learning; SPEECH-INTELLIGIBILITY;
D O I
10.21437/Interspeech.2022-10821
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper reports on the design and outcomes of the 1st Clarity Prediction Challenge (CPC1) for predicting the intelligibility of hearing aid processed signals heard by individuals with a hearing impairment. The challenge was designed to promote the development of new intelligibility measures suitable for use in developing hearing aid algorithms. Participants were supplied with listening test data compromising 7233 responses from 27 individuals. Data was split between training and test sets in a manner that fostered a machine learning approach and allowed both closed-set (known listeners) and open-set (unseen listener/unseen system) evaluation. The paper provides a description of the challenge design including the datasets, the hearing aid algorithms applied, the listeners and the perceptual tests. The challenge attracted submissions from 15 systems. The results are reviewed and the paper summarises, compares and contrasts approaches.
引用
收藏
页码:3508 / 3512
页数:5
相关论文
共 31 条
[11]   Dataset of British English speech recordings for psychoacoustics and speech processing research: The clarity speech corpus [J].
Graetzer, Simone ;
Akeroyd, Michael A. ;
Barker, Jon ;
Cox, Trevor J. ;
Culling, John F. ;
Naylor, Graham ;
Porter, Eszter ;
Viveros-Munoz, Rhoddy .
DATA IN BRIEF, 2022, 41
[12]   Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage [J].
Hauth, Christopher F. ;
Berning, Simon C. ;
Kollmeier, Birger ;
Brand, Thomas .
TRENDS IN HEARING, 2020, 24
[13]  
Huckvale M., 2022, P INTERSPEECH 2022
[14]  
Kamo N., 2022, P 2 CLAR WORKSH MACH
[15]  
Karbasi M., 2020, ARXIV201008574
[16]   The Hearing-Aid Speech Perception Index (HASPI) [J].
Kates, James M. ;
Arehart, Kathryn H. .
SPEECH COMMUNICATION, 2014, 65 :75-93
[17]  
Martinez A. M. C., 2021, COMPUTER SPEECH LANG
[18]  
Mawalim C. O., 2022, P 2 CLAR WORKSH MACH
[19]  
McKinney A. F., 2022, SIGNAL PROCESSING LE
[20]  
McKinney A. F., 2022, P 2 CLAR WORKSH MACH