共 50 条
- [31] Bayesian networks and information theory for audio-visual perception modeling Biological Cybernetics, 2010, 103 : 213 - 226
- [32] Speaker position detection system using audio-visual information FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 1999, 35 (02): : 212 - 220
- [33] Rethinking the visual cues in audio-visual speaker extraction INTERSPEECH 2023, 2023, : 3754 - 3758
- [34] Learning Bimodal Structure in Audio-Visual Data IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (12): : 1898 - 1910
- [35] Audio-visual modeling for bimodal speech recognition 2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 181 - 186
- [36] Bimodal fusion in audio-visual speech recognition 2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2002, : 964 - 967
- [37] Audio-visual speaker identification via adaptive fusion using reliability estimates of both modalities AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 787 - 796
- [39] Speaker independent audio-visual speech recognition 2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1073 - 1076
- [40] Multi-Speaker Audio-Visual Corpus RUSAVIC: Russian Audio-Visual Speech in Cars LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1555 - 1559