33 entries in total
[1]
What does BERT look at? An Analysis of BERT's Attention [J]. BLACKBOXNLP WORKSHOP ON ANALYZING AND INTERPRETING NEURAL NETWORKS FOR NLP AT ACL 2019, 2019: 276-286
[2]
Denil M., 2013, ADV NEURAL INFORM PR, P2148, DOI 10.5555/2999792.2999852
[3]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[4]
Gong YC, 2014, LECT NOTES COMPUT SC, V8695, P392, DOI 10.1007/978-3-319-10584-0_26
[5]
Han S., 2016, INT C LEARNING REPRE
[6]
Han S., 2024, P 12 INT C LEARN REP
[7]
Hinton G.E., 2015, Distilling the Knowledge in a Neural Network
[8]
Hou L., 2020, Adv. Neural. Inf. Process. Syst., V33, P9782
[9]
Hu MH, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2077
[10]
Jiao XQ, 2020, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, P4163