共 57 条
[1]
Aggarwal P, 2023, 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), P12375
[2]
Bengio Y, 2001, ADV NEUR IN, V13, P932
[3]
Brown TB, 2020, ADV NEUR IN, V33
[4]
Chowdhery A, 2022, Arxiv, DOI [arXiv:2204.02311, DOI 10.48550/ARXIV.2204.02311, 10.48550/arXiv.2204.02311]
[5]
Clark P, 2020, AI MAG, V41, P39
[6]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[7]
Feng YL, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P1295
[8]
Geva M., 2019, BERT-large "whole word masking"model on the open mind common sense (OMCS) corpus
[9]
He P., 2021, arXiv, DOI DOI 10.48550/ARXIV.2111.09543