共 162 条
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[2]
[Anonymous], 2013, Advances in Neural Information Processing Systems
[3]
VQA: Visual Question Answering
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:2425-2433
[4]
Banerjee S., 2005, P ACL WORKSHOP INTRI, DOI DOI 10.3115/1626355.1626389
[5]
Bao SQ, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P85
[6]
Bhatia Rahul, 2019, INT J TREND SCIENT R, V3, P4
[7]
Billerbeck B., 2003, P 12 INT C INFORM KN, P2
[8]
Improving Transformer with Sequential Context Representations for Abstractive Text Summarization
[J].
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I,
2019, 11838
:512-524
[9]
Cao M, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P6251
[10]
Cao Q., 2021, P IEEECVF INT C COMP, P1614