共 50 条
- [1] Dynamic browsing of audiovisual lecture recordings based on automated speech recognition INTELLIGENT TUTORING SYSTEM, PROCEEDINGS, 2008, 5091 : 662 - 664
- [2] Automatic Acoustic Segmentation for Speech Recognition on Broadcast Recordings INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2580 - 2583
- [3] ALIGNING AUDIOVISUAL FEATURES FOR AUDIOVISUAL SPEECH RECOGNITION 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
- [4] The segmentation of multi-channel meeting recordings for automatic speech recognition INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1213 - +
- [5] Audiovisual Annotation Procedure for Multi-view Field Recordings MULTIMEDIA MODELING (MMM 2019), PT I, 2019, 11295 : 399 - 410
- [6] Audiovisual speech recognition based on a deep convolutional neural network Data Science and Management, 2024, 7 (01): : 25 - 34
- [7] Fusion Architectures for Word-based Audiovisual Speech Recognition INTERSPEECH 2020, 2020, : 3491 - 3495
- [10] Stream-based classification and segmentation of speech events in meeting recordings MULTIMEDIA CONTENT REPRESENTATION, CLASSIFICATION AND SECURITY, 2006, 4105 : 793 - 800