Audio-visual bimodal speaker identification using dynamic Bayesian networks

被引:3
|
作者
Wu, Zhiyong [1 ]
Cai, Lianhong [1 ]
机构
[1] Key Laboratory of Pervasive Computing, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
关键词
9;
D O I
10.1360/crad20060315
中图分类号
学科分类号
摘要
引用
收藏
页码:470 / 475
相关论文
共 50 条
  • [1] Dynamic Bayesian Networks for audio-visual speaker recognition
    Li, DD
    Yang, YC
    Wu, ZH
    ADVANCES IN BIOMETRICS, PROCEEDINGS, 2006, 3832 : 539 - 545
  • [2] A Bayesian approach to audio-visual speaker identification
    Nefian, AV
    Liang, LH
    Fu, TY
    Liu, XX
    AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 761 - 769
  • [3] Boosting and structure learning in dynamic Bayesian networks for audio-visual speaker detection
    Choudhury, T
    Rehg, JM
    Pavlovic, V
    Pentland, A
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 789 - 794
  • [4] Audio-visual speaker identification based on the use of dynamic audio and visual features
    Fox, N
    Reilly, RB
    AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 743 - 751
  • [5] Dynamic Bayesian Networks for Audio-Visual Speech Recognition
    Ara V. Nefian
    Luhong Liang
    Xiaobo Pi
    Xiaoxing Liu
    Kevin Murphy
    EURASIP Journal on Advances in Signal Processing, 2002
  • [6] Dynamic Bayesian networks for audio-visual speech recognition
    Nefian, AV
    Liang, LH
    Pi, XB
    Liu, XX
    Murphy, K
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2002, 2002 (11) : 1274 - 1288
  • [7] Bimodal speaker identification using dynamic Bayesian network
    Li, DD
    Sang, LF
    Yang, YC
    Wu, ZH
    ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2004, 3338 : 577 - 585
  • [8] Weight estimation for audio-visual multi-level fusion in bimodal speaker identification
    Wu, Zhiyong
    Cai, Lianhong
    Meng, Helen M.
    INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 1107 - 1112
  • [9] Audio-visual speaker identification using dynamic facial movements and utterance phonetic content
    Asadpour, Vahid
    Homayounpour, Mohammad Mehdi
    Towhidkhah, Farzad
    APPLIED SOFT COMPUTING, 2011, 11 (02) : 2083 - 2093
  • [10] Performance enhancement for audio-visual speaker identification using dynamic facial muscle model
    Vahid Asadpour
    Farzad Towhidkhah
    Mohammad Mehdi Homayounpour
    Medical and Biological Engineering and Computing, 2006, 44 : 919 - 930