共 50 条
- [45] Human interaction categorization by using audio-visual cues Machine Vision and Applications, 2014, 25 : 71 - 84
- [46] An audio-visual speech recognition with a new mandarin audio-visual database INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2007, : 19 - +
- [47] LUMINA: Linguistic unified multimodal Indonesian natural audio-visual dataset DATA IN BRIEF, 2024, 54
- [48] Non-invasive extraction of audio-visual cues for multimodal applications HYBRID IMAGE AND SIGNAL PROCESSING VI, 1998, 3389 : 133 - 138
- [49] Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 281 - 284
- [50] Auxiliary Loss Multimodal GRU Model in Audio-Visual Speech Recognition IEEE ACCESS, 2018, 6 : 5573 - 5583