S4D: Speaker Diarization Toolkit in Python

Cited by: 9
Authors
Broux, Pierre-Alexandre [1, 2]
Desnous, Florent [2]
Larcher, Anthony [2]
Petitrenaud, Simon [2]
Carrive, Jean [1]
Meignier, Sylvain [2]
Affiliations
[1] French Natl Audiovisual Inst INA, Paris, France
[2] Le Mans Univ LIUM, Comp Sci Lab, EA 4023, Le Mans, France
Source
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018
Keywords
SIDEKIT; diarization; toolkit; Python; open source; tutorials
DOI
10.21437/Interspeech.2018-1232
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we present S4D, a new open-source Python toolkit dedicated to speaker diarization. S4D provides various state-of-the-art components and makes it easy to develop end-to-end diarization prototype systems. S4D offers a wide range of clustering, segmentation, scoring and visualization algorithms. S4D is designed to be easy to understand, install, modify and use, in order to allow fast transfer of diarization technologies to industry and to facilitate the development of new approaches. Examples, benchmarks on standard tasks and tutorials are provided in this paper. S4D is an extension of SIDEKIT, the open-source toolkit for speaker recognition.
Pages: 1368-1372
Page count: 5