共 24 条
[1]
Ashish V., 2017, ADV NEURAL INFORM PR, DOI [10.48550/arXiv.1706.03762, DOI 10.48550/ARXIV.1706.03762]
[2]
Baevski A., 2020, P ICLR
[3]
Baevski A, 2020, INT CONF ACOUST SPEE, P7694, DOI [10.1109/ICASSP40776.2020.9054224, 10.1109/icassp40776.2020.9054224]
[5]
An Unsupervised Autoregressive Model for Speech Representation Learning
[J].
INTERSPEECH 2019,
2019,
:146-150
[6]
Chung YA, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P2353
[7]
Chung YA, 2020, INT CONF ACOUST SPEE, P3497, DOI [10.1109/icassp40776.2020.9054438, 10.1109/ICASSP40776.2020.9054438]
[8]
Devlin J., 2018, PREPRINT
[9]
Harwath D., 2020, Learning hierarchical discrete linguistic units from visually-grounded speech
[10]
Jang E., 2017, 5 INT C LEARN REPR I