Bimodal speaker identification using dynamic Bayesian network

Cited by: 0
Authors
Li, DD [1 ]
Sang, LF [1 ]
Yang, YC [1 ]
Wu, ZH [1 ]
Affiliation
[1] Zhejiang Univ, Dept Comp Sci, Hangzhou, Peoples R China
Keywords
DOI
Not available
CLC number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Authenticating a person requires consistently high recognition accuracy, which is difficult to attain with a single recognition modality. This paper assesses the fusion of voiceprint and face features for bimodal speaker identification using a Dynamic Bayesian Network (DBN). Our contribution is a general feature-level fusion framework for bimodal speaker identification. Within the framework, the voice and face features are combined into a single DBN to obtain better performance than either single-modality system alone. The tests were conducted on a multimodal database of 54 users who provided voiceprint and face data of different speech types and content. We compare our approach with mono-modal systems and other classic decision-level methods and show that feature-level fusion using a dynamic Bayesian network improves performance by about 4-5%, much better than the others.
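The contrast the abstract draws between feature-level and decision-level fusion can be illustrated with a minimal sketch. It does not reproduce the paper's DBN: a simple nearest-centroid scorer stands in for the model, and every function name, the centroid inputs, and the weight `w` are hypothetical choices made only for illustration.

```python
import numpy as np

def fuse_features(voice, face):
    """Feature-level fusion: concatenate per-frame voice and face vectors."""
    voice = np.asarray(voice, dtype=float)
    face = np.asarray(face, dtype=float)
    if voice.shape[0] != face.shape[0]:
        raise ValueError("voice and face streams need the same number of frames")
    return np.concatenate([voice, face], axis=1)

def centroid_scores(frames, centroids):
    """One score per speaker: negative mean squared distance to that speaker's centroid."""
    frames = np.asarray(frames, dtype=float)
    diffs = frames[:, None, :] - centroids[None, :, :]
    return -np.mean(np.sum(diffs ** 2, axis=2), axis=0)

def softmax(scores):
    """Turn raw scores into normalized per-speaker confidences."""
    shifted = np.exp(scores - np.max(scores))
    return shifted / shifted.sum()

def identify_feature_level(voice, face, voice_centroids, face_centroids):
    """Score the fused observation with one joint model (stand-in for a single DBN)."""
    fused = fuse_features(voice, face)
    fused_centroids = np.concatenate([voice_centroids, face_centroids], axis=1)
    return int(np.argmax(centroid_scores(fused, fused_centroids)))

def identify_decision_level(voice, face, voice_centroids, face_centroids, w=0.5):
    """Score each modality separately, then combine the normalized confidences."""
    p_voice = softmax(centroid_scores(voice, voice_centroids))
    p_face = softmax(centroid_scores(face, face_centroids))
    return int(np.argmax(w * p_voice + (1.0 - w) * p_face))
```

In the feature-level variant a single model sees both modalities at once, whereas the decision-level variant only combines the outputs of two independent models; which one performs better on real data is an empirical question that this toy scorer cannot answer.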
Pages: 577-585
Page count: 9