50 items in total
- [41] Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora INTERSPEECH 2019, 2019, : 1303 - 1307
- [43] DNN based multi-speaker speech synthesis with temporal auxiliary speaker ID embedding 2019 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2019, : 61 - 64
- [44] CLeLfPC: a Large Open Multi-Speaker Corpus of French Cued Speech LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 987 - 994
- [45] Phoneme Duration Modeling Using Speech Rhythm-Based Speaker Embeddings for Multi-Speaker Speech Synthesis INTERSPEECH 2021, 2021, : 3141 - 3145
- [46] Integrating Spectral and Spatial Features for Multi-Channel Speaker Separation 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2718 - 2722
- [47] A multi-channel/multi-speaker interactive 3D Audio-Visual Speech Corpus in Mandarin 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
- [48] Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain INTERSPEECH 2021, 2021, : 3720 - 3724
- [49] Normalization Driven Zero-shot Multi-Speaker Speech Synthesis INTERSPEECH 2021, 2021, : 1354 - 1358
- [50] A Purely End-to-end System for Multi-speaker Speech Recognition PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2620 - 2630