I-vectors and ILP clustering adapted to cross-show speaker diarization

被引:0
作者
Dupuy, Gregor [1 ]
Rouvier, Mickael [1 ]
Meignier, Sylvain [1 ]
Esteve, Yannick [1 ]
机构
[1] LUNAM Univ, LIUM, Le Mans, France
来源
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年
关键词
speaker diarization; cross-show diarization; i-vectors; up clustering;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose to study speaker diarization from a collection of audio documents. The goal is to detect speakers appearing in several shows. In our approach, each show of the collection is processed separately before being processed collectively, to group speakers involved in several shows. Two clustering methods are studied for the overall processing of the collection: one uses the NCLR metric and the other is inspired by techniques based on i-vectors, mainly used in the speaker verification field. Both methods were evaluated on the whole training corpus of ESTER 2. The method based on the use of i-vectors achieves error rates similar to those obtained by the NCLR method, however, the computation time is on average 8.66 times faster. Therefore, this method is suitable for processing large volumes of data.
引用
收藏
页码:2171 / 2174
页数:4
相关论文
共 9 条
[1]  
Bousquet P.-M., 2011, P INT FLOR IT
[2]  
Chaubard L., 2009, P INT BRIGHT UK
[3]   Front-End Factor Analysis for Speaker Verification [J].
Dehak, Najim ;
Kenny, Patrick J. ;
Dehak, Reda ;
Dumouchel, Pierre ;
Ouellet, Pierre .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04) :788-798
[4]  
Le V. B., 2007, P INT ANTW BELG
[5]  
Meignier S., 2009, CMU SPUD WORKSH DALL
[6]  
Rouvier M., 2012, OD WORKSH SING
[7]  
Shum S., 2011, P INT FLOR IT
[8]  
Tran V.-A., 2011, P INT FLOR IT
[9]  
Yang Q., 2011, P INT FLOR IT