共 32 条
- [21] Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 307 - 311
- [22] VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking [J]. INTERSPEECH 2019, 2019, : 2728 - 2732
- [23] Wisdom S., 2020, PROC NEURIPS 20
- [24] WHAT'S ALL THE FUSS ABOUT FREE UNIVERSAL SOUND SEPARATION DATA? [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 186 - 190
- [25] Xiao X, 2019, INT CONF ACOUST SPEE, P86, DOI [10.1109/ICASSP.2019.8682245, 10.1109/icassp.2019.8682245]
- [26] Xu CL, 2019, 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), P327, DOI [10.1109/ASRU46091.2019.9004016, 10.1109/asru46091.2019.9004016]
- [27] Permutation invariant training of deep models for speaker-independent multi-talker speech separation [J]. MECHANICAL ENGINEERING JOURNAL, 2023,
- [28] TOWARDS ROBUST SPEAKER VERIFICATION WITH TARGET SPEAKER ENHANCEMENT [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6693 - 6697
- [29] X-TaSNet: Robust and Accurate Time-Domain Speaker Extraction Network [J]. INTERSPEECH 2020, 2020, : 1421 - 1425