22 entries in total
[1]
[Anonymous], 2017, INT CONF ACOUST SPEE
[2]
Baevski A., 2020, wav2vec 2.0: A framework for self-supervised learning of speech representations
[3]
Brown, 2020, ARXIV
[5]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[6]
Dosovitskiy A, 2020, ARXIV
[8]
Gong Yuan, 2021, ARXIV211009784
[9]
He Kaiming, 2021, Masked autoencoders are scalable vision learners
[10]
HUBERT: HOW MUCH CAN A BAD TEACHER BENEFIT ASR PRE-TRAINING? [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 6533-6537