22 entries in total
[1]
[Anonymous], 2014, ARXIV
[2]
Bluche Théodore, 2020, ARXIV200210851
[3]
Predicting detection filters for small footprint open-vocabulary keyword spotting [J]. INTERSPEECH 2020, 2020: 2552-2556
[4]
QUERY-BY-EXAMPLE KEYWORD SPOTTING SYSTEM USING MULTI-HEAD ATTENTION AND SOFTTRIPLE LOSS [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 6858-6862
[5]
Kamper H, 2020, INT CONF ACOUST SPEE, P6414, DOI [10.1109/ICASSP40776.2020.9054202, 10.1109/icassp40776.2020.9054202]
[6]
Kim B, 2019, 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), P532, DOI [10.1109/asru46091.2019.9004014, 10.1109/ASRU46091.2019.9004014]
[7]
Kudo T, 2018, CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, P66
[8]
Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 1336-1345