共 31 条
[1]
Berg A., 2021, arXiv
[2]
Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
[J].
INTERSPEECH 2019,
2019,
:3372-3376
[3]
Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[4]
Gao YX, 2020, INT CONF ACOUST SPEE, P7479, DOI [10.1109/ICASSP40776.2020.9053313, 10.1109/icassp40776.2020.9053313]
[5]
Heittola T., 2019, PROC WORKSHOP DETECT
[6]
Hinton G, 2015, Arxiv, DOI arXiv:1503.02531
[7]
Kim B, 2022, Arxiv, DOI arXiv:2106.04140
[8]
Kim D, 2023, Arxiv, DOI arXiv:2109.11165
[9]
Kim D, 2022, Arxiv, DOI arXiv:2205.01304
[10]
Dual Stage Learning Based Dynamic Time-Frequency Mask Generation For Audio Event Classification
[J].
INTERSPEECH 2020,
2020,
:836-840