Audio-visual bimodal speaker identification using dynamic Bayesian networks

被引:3
|
作者
Wu, Zhiyong [1 ]
Cai, Lianhong [1 ]
机构
[1] Key Laboratory of Pervasive Computing, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
关键词
9;
D O I
10.1360/crad20060315
中图分类号
学科分类号
摘要
引用
收藏
页码:470 / 475
相关论文
共 50 条
  • [31] Bayesian networks and information theory for audio-visual perception modeling
    Patricia Besson
    Jonas Richiardi
    Christophe Bourdin
    Lionel Bringoux
    Daniel R. Mestre
    Jean-Louis Vercher
    Biological Cybernetics, 2010, 103 : 213 - 226
  • [32] Speaker position detection system using audio-visual information
    Matsuo, N
    Kitagawa, H
    Nagata, S
    FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 1999, 35 (02): : 212 - 220
  • [33] Rethinking the visual cues in audio-visual speaker extraction
    Li, Junjie
    Ge, Meng
    Pan, Zexu
    Cao, Rui
    Wang, Longbiao
    Dang, Jianwu
    Zhang, Shiliang
    INTERSPEECH 2023, 2023, : 3754 - 3758
  • [34] Learning Bimodal Structure in Audio-Visual Data
    Monaci, Gianluca
    Vandergheynst, Pierre
    Sommer, Friedrich T.
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (12): : 1898 - 1910
  • [35] Audio-visual modeling for bimodal speech recognition
    Kaynak, MN
    Zhi, Q
    Cheok, AD
    Sengupta, K
    Chung, KC
    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 181 - 186
  • [36] Bimodal fusion in audio-visual speech recognition
    Zhang, XZ
    Mersereau, RM
    Clements, M
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2002, : 964 - 967
  • [37] Audio-visual speaker identification via adaptive fusion using reliability estimates of both modalities
    Fox, NA
    O'Mullane, BA
    Reilly, RB
    AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 787 - 796
  • [38] Deep Audio-Visual Beamforming for Speaker Localization
    Qian, Xinyuan
    Zhang, Qiquan
    Guan, Guohui
    Xue, Wei
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1132 - 1136
  • [39] Speaker independent audio-visual speech recognition
    Zhang, Y
    Levinson, S
    Huang, T
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1073 - 1076
  • [40] Multi-Speaker Audio-Visual Corpus RUSAVIC: Russian Audio-Visual Speech in Cars
    Ivanko, Denis
    Ryumin, Dmitry
    Axyonov, Alexandr
    Kashevnik, Alexey
    Karpov, Alexey
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1555 - 1559