共 21 条
- [1] Gurugubelli Krishna, 2024, ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P12431, DOI 10.1109/ICASSP48485.2024.10445876
- [2] QUERY-BY-EXAMPLE KEYWORD SPOTTING SYSTEM USING MULTI-HEAD ATTENTION AND SOFTTRIPLE LOSS [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6858 - 6862
- [3] Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining [J]. INTERSPEECH 2020, 2020, : 4676 - 4680
- [4] Ito K., 2017, The LJ speech dataset
- [5] Kim B, 2019, 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), P532, DOI [10.1109/ASRU46091.2019.9004014, 10.1109/asru46091.2019.9004014]
- [6] Kingma D. P., ADAM METHOD STOCHAST
- [7] PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords [J]. INTERSPEECH 2023, 2023, : 3964 - 3968
- [8] LEVENSHT.VI, 1965, DOKL AKAD NAUK SSSR+, V163, P845