Bridging the gap between speech technology and natural language processing: an evaluation toolbox for term discovery systems

被引:0
作者
Ludusan, Bogdan [1 ]
Versteegh, Maarten [1 ]
Jansen, Aren [2 ]
Gravier, Guillaume [3 ]
Cao, Xuan-Nga [1 ]
Johnson, Mark [4 ]
Dupoux, Emmanuel [1 ]
机构
[1] CNRS, EHESS, ENS, LSCP, Paris, France
[2] Johns Hopkins Univ, CLSP, Baltimore, MD USA
[3] CNRS, IRISA, Rennes, France
[4] Macquarie Univ, Sydney, NSW 2109, Australia
来源
LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2014年
关键词
evaluation; spoken term discovery; word segmentation;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
The unsupervised discovery of linguistic terms from either continuous phoneme transcriptions or from raw speech has seen an increasing interest in the past years both from a theoretical and a practical standpoint. Yet, there exists no common accepted evaluation method for the systems performing term discovery. Here, we propose such an evaluation toolbox, drawing ideas from both speech technology and natural language processing. We first transform the speech-based output into a symbolic representation and compute five types of evaluation metrics on this representation: the quality of acoustic matching, the quality of the clusters found, and the quality of the alignment with real words (type, token, and boundary scores). We tested our approach on two term discovery systems taking speech as input, and one using symbolic input. The latter was run using both the gold transcription and a transcription obtained from an automatic speech recognizer, in order to simulate the case when only imperfect symbolic information is available. The results obtained are analysed through the use of the proposed evaluation metrics and the implications of these metrics are discussed.
引用
收藏
页码:560 / 567
页数:8
相关论文
共 27 条
[1]   A comparison of extrinsic clustering evaluation metrics based on formal constraints [J].
Amigo, Enrique ;
Gonzalo, Julio ;
Artiles, Javier ;
Verdejo, Felisa .
INFORMATION RETRIEVAL, 2009, 12 (04) :461-486
[2]  
[Anonymous], COMMUNICATION
[3]   Distributional regularity and phonotactic constraints are useful for segmentation [J].
Brent, MR ;
Cartwright, TA .
COGNITION, 1996, 61 (1-2) :93-125
[4]   An efficient, probabilistically sound algorithm for segmentation and word discovery [J].
Brent, MR .
MACHINE LEARNING, 1999, 34 (1-3) :71-105
[5]  
Catanese L, 2013, INTERSPEECH, P2674
[6]  
Daland R., 2013, P 51 ANN M ASS COMP, V2, P873
[7]   Learning Diphone-Based Segmentation [J].
Daland, Robert ;
Pierrehumbert, Janet B. .
COGNITIVE SCIENCE, 2011, 35 (01) :119-155
[8]  
Dredze M., 2010, Proc. Conference on Empirical Methods in Natural Language Processing, P460
[9]  
Elsner M., 2012, P 50 ANN M ASS COMP, P184
[10]  
Flamary R., 2011, 2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI), P133, DOI 10.1109/CBMI.2011.5972534