50 entries in total
- [21] NON-AUTOREGRESSIVE END-TO-END APPROACHES FOR JOINT AUTOMATIC SPEECH RECOGNITION AND SPOKEN LANGUAGE UNDERSTANDING. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022: 390-397
- [22] END-TO-END MULTI-TALKER AUDIO-VISUAL ASR USING AN ACTIVE SPEAKER ATTENTION MODULE. INTERSPEECH 2022, 2022: 2828-2832
- [24] END-TO-END MULTI-SPEAKER SPEECH RECOGNITION. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018: 4819-4823
- [25] Utterance invariant training for hybrid two-pass end-to-end speech recognition. INTERSPEECH 2020, 2020: 2827-2831
- [26] Joint CTC/attention decoding for end-to-end speech recognition. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017: 518-529
- [27] Super-human multi-talker speech recognition: A graphical modeling approach. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (01): 45-66
- [28] HYPOTHESIS STITCHER FOR END-TO-END SPEAKER-ATTRIBUTED ASR ON LONG-FORM MULTI-TALKER RECORDINGS. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 6763-6767
- [29] IMPROVING RNN TRANSDUCER MODELING FOR END-TO-END SPEECH RECOGNITION. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019: 114-121
- [30] Multi-Head Decoder for End-to-End Speech Recognition. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 801-805