Detection of overlapping speech in meetings using Support Vector Machines and Support Vector Regression

被引:8
作者
Yamamoto, Kiyoshi [1 ]
Asano, Futoshi
Yamada, Takeshi
Kitawaki, Nobuhiko
机构
[1] Univ Tsukuba, Grad Sch Syst & Informat Engn, Tsukuba, Ibaraki 3058573, Japan
[2] AIST, Informat Technol Res Inst, Media Interact Grp, Tsukuba, Ibaraki 3058568, Japan
关键词
Support Vector Machines; Support Vector Regression; detection of overlapping speech;
D O I
10.1093/ietfec/e89-a.8.2158
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a method of detecting overlapping speech segments in meetings is proposed. It is known that the eigenvalue distribution of the spatial correlation matrix calculated from a multiple microphone input reflects information on the number and relative power of sound sources. However, in a reverberant sound field, the feature of the number of sources in the eigenvalue distribution is degraded by the room reverberation. In the Support Vector Machines approach, the eigenvalue distribution is classified into two classes (overlapping speech segments and single speech segments). In the Support Vector Regression approach, the relative power of sound sources is estimated by using the eigenvalue distribution, and overlapping speech segments are detected based on the estimated relative power. The salient feature of this approach is that the sensitivity of detecting overlapping speech segments can be controlled simply by changing the threshold value of the relative power. The proposed method was evaluated using recorded data of an actual meeting.
引用
收藏
页码:2158 / 2165
页数:8
相关论文
共 6 条
[1]   PROPERTIES OF EIGENVECTORS OF PERSYMMETRIC MATRICES WITH APPLICATIONS TO COMMUNICATION-THEORY [J].
CANTONI, A ;
BUTLER, P .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1976, 24 (08) :804-809
[2]  
Christianini N., 2000, INTRO SUPPORT VECTOR, DOI DOI 10.1017/CBO9780511801389
[3]  
Rijsbergen V., 1979, INFORM RETRIEVAL, VSecond Edi
[4]   ESPRIT - ESTIMATION OF SIGNAL PARAMETERS VIA ROTATIONAL INVARIANCE TECHNIQUES [J].
ROY, R ;
KAILATH, T .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (07) :984-995
[5]  
Scholkopf B., 1998, Advances in Kernel Methods-Support Vector Learning
[6]   DETECTION OF SIGNALS BY INFORMATION THEORETIC CRITERIA [J].
WAX, M ;
KAILATH, T .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :387-392