Bimodal speaker identification using dynamic Bayesian network

被引:0
|
作者
Li, DD [1 ]
Sang, LF [1 ]
Yang, YC [1 ]
Wu, ZH [1 ]
机构
[1] Zhejiang Univ, Dept Comp Sci, Hangzhou, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The authentication of a person requires a consistently high recognition accuracy which is difficult to attain using a single recognition modality. This paper assesses the fusion of voiceprint and face feature for bimodal speaker identification using Dynamic Bayesian Network (DBN). Our contribution is to propose a general feature-level fusion framework in bimodal speaker identification. Within the framework, the voice and face feature are combined into a single DBN to obtain better performance than any single system alone. The tests were conducted on a multi-modal database of 54 users who provided voiceprint and face data of different speech type and content We compare our approach with mono-modal system and other classic decision-level methods and show that feature-level fusion using dynamic Bayesian network improved performance by about 4-5%, much better than the others.
引用
收藏
页码:577 / 585
页数:9
相关论文
共 50 条
  • [31] Speaker identification using Dynamic Time Warping with stress compensation technique
    Shahin, I
    Botros, N
    PROCEEDINGS IEEE SOUTHEASTCON '98: ENGINEERING FOR A NEW ERA, 1998, : 65 - 68
  • [32] Conceptual change modeling using Dynamic Bayesian network
    Ting, Choo-Yee
    Chong, Yen-Kuan
    INTELLIGENT TUTORING SYSTEMS, PROCEEDINGS, 2006, 4053 : 95 - 103
  • [33] A Bayesian approach to audio-visual speaker identification
    Nefian, AV
    Liang, LH
    Fu, TY
    Liu, XX
    AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 761 - 769
  • [34] A quantitative framework for network resilience evaluation using Dynamic Bayesian Network
    Jiang, Shanqing
    Yang, Lin
    Cheng, Guang
    Gao, Xianming
    Feng, Tao
    Zhou, Yuyang
    COMPUTER COMMUNICATIONS, 2022, 194 : 387 - 398
  • [35] Gene Network Learning Using Regulated Dynamic Bayesian Network Methods
    Lin, Xiaotong
    Chen, Xue-wen
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 717 - 722
  • [36] A quantitative framework for network resilience evaluation using Dynamic Bayesian Network
    Jiang, Shanqing
    Yang, Lin
    Cheng, Guang
    Gao, Xianming
    Feng, Tao
    Zhou, Yuyang
    Computer Communications, 2022, 194 : 387 - 398
  • [37] Speaker Identification using Wavelet Shannon Entropy and Probabilistic Neural Network
    Lei, Lei
    She, Kun
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 566 - 571
  • [38] Bayesian network model with dynamic structure identification for real time diagnosis
    Dang Trinh Nguyen
    Duong, Quoc Bao
    Zamai, Eric
    Shahzad, Muhammad Kashif
    2014 IEEE EMERGING TECHNOLOGY AND FACTORY AUTOMATION (ETFA), 2014,
  • [39] A BAYESIAN ATTENTION NEURAL NETWORK LAYER FOR SPEAKER RECOGNITION
    Zhu, Weizhong
    Pelecanos, Jason
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6241 - 6245
  • [40] Dynamic structure identification of Bayesian network model for fault diagnosis of FMS
    Dang Trinh Nguyen
    Quoc Bao Duong
    Zamai, Eric
    Shahzad, Muhammad Kashif
    IECON 2014 - 40TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2014, : 119 - 125