Bimodal speaker identification using dynamic Bayesian network

Cited by: 0

Authors:

Li, DD [1]

Sang, LF [1]

Yang, YC [1]

Wu, ZH [1]

Affiliation:

[1] Zhejiang Univ, Dept Comp Sci, Hangzhou, Peoples R China

Source:

ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS | 2004 / Vol. 3338

Keywords:

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Authenticating a person requires consistently high recognition accuracy, which is difficult to attain with a single recognition modality. This paper assesses the fusion of voiceprint and face features for bimodal speaker identification using a Dynamic Bayesian Network (DBN). Our contribution is a general feature-level fusion framework for bimodal speaker identification. Within the framework, the voice and face features are combined in a single DBN to obtain better performance than either single-modality system alone. The tests were conducted on a multimodal database of 54 users who provided voiceprint and face data of different speech types and content. We compare our approach with mono-modal systems and other classic decision-level methods and show that feature-level fusion using a dynamic Bayesian network improved performance by about 4-5%, much better than the others.

Citation

Pages: 577-585

Page count: 9

