Evaluating automatic speech recognition systems as quantitative models of cross-lingual phonetic category perception

被引:5
作者
Schatz, Thomas [1 ,2 ]
Bach, Francis [3 ]
Dupoux, Emmanuel [4 ]
机构
[1] Univ Maryland, Dept Linguist, College Pk, MD 20742 USA
[2] Univ Maryland, UMIACS, College Pk, MD 20742 USA
[3] PSL Res Univ, CNRS, Ecole Normale Super, Dept Informat ENS,SIERRA Project Team,INRIA, 45 Rue Ulm, F-75005 Paris, France
[4] PSL Res Univ, CNRS, Ecole Normale Super, Dept Etud Cognit ENS,EHESS,LSCP, 29 Rue Ulm, F-75005 Paris, France
基金
美国国家科学基金会; 欧洲研究理事会;
关键词
JAPANESE;
D O I
10.1121/1.5037615
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Theories of cross-linguistic phonetic category perception posit that listeners perceive foreign sounds by mapping them onto their native phonetic categories, but, until now, no way to effectively implement this mapping has been proposed. In this paper, Automatic Speech Recognition systems trained on continuous speech corpora are used to provide a fully specified mapping between foreign sounds and native categories. The authors show how the machine ABX evaluation method can be used to compare predictions from the resulting quantitative models with empirically attested effects in human cross-linguistic phonetic category perception. (C) 2018 Acoustical Society of America
引用
收藏
页码:EL372 / EL378
页数:7
相关论文
共 26 条
[1]  
[Anonymous], P NEW SOUNDS
[2]  
[Anonymous], P OF ASRU
[3]  
[Anonymous], 1995, SPEECH PERCEPTION LI
[4]  
[Anonymous], 1995, Speech perception and linguistic experience
[5]  
[Anonymous], P WORKSH AUT SPEECH
[6]  
[Anonymous], 2015, P INTERSPEECH
[7]  
[Anonymous], 2002, P INTERSPEECH
[8]  
[Anonymous], 1982, Visual perception
[9]  
[Anonymous], P INTERSPEECH
[10]  
Barlow h., 1961, SENS COMMUN, V13, P217