共 50 条
- [41] Dynamic Speaker Representations Adjustment and Decoder Factorization for Speaker Adaptation in End-to-End Speech Synthesis INTERSPEECH 2020, 2020, : 4701 - 4705
- [42] End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning INTERSPEECH 2019, 2019, : 4425 - 4429
- [43] INCORPORATING END-TO-END FRAMEWORK INTO TARGET-SPEAKER VOICE ACTIVITY DETECTION 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8362 - 8366
- [44] Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1835 - 1841
- [45] Improving End-to-End Speech Translation by Leveraging Auxiliary Speech and Text Data THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13984 - 13992
- [46] ATTENTION-BASED END-TO-END SPEECH RECOGNITION ON VOICE SEARCH 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4764 - 4768
- [48] NEURAL NOISE EMBEDDING FOR END-TO-END SPEECH ENHANCEMENT WITH CONDITIONAL LAYER NORMALIZATION 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7113 - 7117
- [49] Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation INTERSPEECH 2022, 2022, : 121 - 125
- [50] End-to-End Chinese Speaker Identification NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 2274 - 2285