58 entries in total
[1]
Adiwardana D, 2020, arXiv, arXiv:2001.09977
[2]
Akama Reina, 2017, Short Papers, V2, P408
[3]
[Anonymous], 2019, RoBERTa: a robustly optimized BERT pretraining approach
[4]
Bengio Y, 2001, ADV NEUR IN, V13, P932
[5]
Clark K, 2020, arXiv, arXiv:2003.10555, DOI 10.48550/arXiv.2003.10555
[6]
Dai N, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P5997
[7]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]
Feng Song, 2012, P 2012 JOINT C EMP M, P1522
[10]
Fu ZX, 2018, AAAI CONF ARTIF INTE, P663