共 36 条
[1]
[Anonymous], 2017, INT CONF ACOUST SPEE
[2]
[Anonymous], 2023, DCASE
[3]
[Anonymous], 2023, DCASE
[4]
MAE-AST: Masked Autoencoding Audio Spectrogram Transformer
[J].
INTERSPEECH 2022,
2022,
:2438-2442
[5]
Baevski A., 2020, ADV NEURAL INFORM PR
[6]
Baevski A, 2020, INT CONF ACOUST SPEE, P7694, DOI [10.1109/ICASSP40776.2020.9054224, 10.1109/icassp40776.2020.9054224]
[7]
HTS-AT: A HIERARCHICAL TOKEN-SEMANTIC AUDIO TRANSFORMER FOR SOUND CLASSIFICATION AND DETECTION
[J].
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2022,
:646-650
[8]
Chen Sanyuan, 2023, PROC ICML, P5178
[9]
Chung JS, 2018, INTERSPEECH, P1086
[10]
Dosovitskiy Alexey, 2021, ICLR