共 45 条
[1]
[Anonymous], 2012, P 13 INT SOC MUS INF
[2]
Exploring the influence of fine-tuning data on wav2vec 2.0 model for blind speech quality prediction
[J].
INTERSPEECH 2022,
2022,
:4088-4092
[4]
HTS-AT: A HIERARCHICAL TOKEN-SEMANTIC AUDIO TRANSFORMER FOR SOUND CLASSIFICATION AND DETECTION
[J].
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2022,
:646-650
[5]
Chen S, 2022, arXiv
[7]
Defferrard M., 2017, P 18 INT SOC MUS INF, P316
[8]
Défossez A, 2021, Arxiv, DOI arXiv:1911.13254
[9]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[10]
Driedger J., 2014, ISMIR C, P611, DOI DOI 10.5281/ZENODO.1415226