76 items in total
[11]
UNITER: UNiversal Image-TExt Representation Learning [J]. COMPUTER VISION - ECCV 2020, PT XXX, 2020, 12375: 104-120
[12]
Clark K., P INT C LEARN REPR I, P1
[13]
What does BERT look at? An Analysis of BERT's Attention [J]. BLACKBOXNLP WORKSHOP ON ANALYZING AND INTERPRETING NEURAL NETWORKS FOR NLP AT ACL 2019, 2019: 276-286
[14]
Denkowski M., 2014, P 9 WORKSH STAT MACH, P376
[15]
Desai K., 2020, ARXIV
[16]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[17]
Fang Z, 2020, C EMP METH NAT LANG
[18]
Modularized Textual Grounding for Counterfactual Resilience [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 6371-6381
[19]
Fang Zhiyuan, ARXIV210104731, P2021
[20]
Gan Zhe, 2020, Advances in Neural Information Processing Systems