Bimodal speaker identification using dynamic Bayesian network

Cited by: 0
Authors
Li, DD [1]
Sang, LF [1]
Yang, YC [1]
Wu, ZH [1]
Affiliations
[1] Zhejiang Univ, Dept Comp Sci, Hangzhou, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The authentication of a person requires consistently high recognition accuracy, which is difficult to attain using a single recognition modality. This paper assesses the fusion of voiceprint and face features for bimodal speaker identification using a Dynamic Bayesian Network (DBN). Our contribution is a general feature-level fusion framework for bimodal speaker identification. Within the framework, the voice and face features are combined into a single DBN to obtain better performance than either single-modality system alone. The tests were conducted on a multi-modal database of 54 users who provided voiceprint and face data of different speech types and content. We compare our approach with mono-modal systems and other classic decision-level methods and show that feature-level fusion using a dynamic Bayesian network improved performance by about 4-5%, much better than the others.
Pages: 577 - 585
Number of pages: 9