50 items in total
[21]
Speech Pattern Discovery using Audio-Visual Fusion and Canonical Correlation Analysis
[J].
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3,
2012,
:2371-2374
[22]
ROBUST CANONICAL CORRELATION ANALYSIS: AUDIO-VISUAL FUSION FOR LEARNING CONTINUOUS INTEREST
[J].
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2014,
[23]
Identification of story units in audio-visual sequences by joint audio and video processing
[J].
1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1,
1998,
:363-367
[24]
An audio-visual speech recognition with a new mandarin audio-visual database
[J].
INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1,
2007,
:19-+
[25]
AUDIO-VISUAL SCENE-AWARE DIALOG AND REASONING USING AUDIO-VISUAL TRANSFORMERS WITH JOINT STUDENT-TEACHER LEARNING
[J].
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2022,
:7732-7736
[26]
Audio-Visual Biometric Recognition Via Joint Sparse Representations
[J].
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR),
2016,
:3031-3035
[29]
Learning joint statistical models for audio-visual fusion and segregation
[J].
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13,
2001, 13
:772-778