共 17 条
[1]
Ali A., 2021, Adv. Neural Inf. Process. Syst., V34, P20014, DOI DOI 10.48550/ARXIV.2106.09681
[2]
LANGUAGE TRANSFORMERS FOR REMOTE SENSING VISUAL QUESTION ANSWERING
[J].
2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022),
2022,
:4855-4858
[3]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[4]
Dosovitskiy Alexey., 2021, PROC INT C LEARN REP, P2021
[5]
MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
[J].
2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI),
2021,
:1033-1036
[6]
Li L. H., 2019, 190803557 ARXIV
[7]
RSVQA MEETS BIGEARTHNET: A NEW, LARGE-SCALE, VISUAL QUESTION ANSWERING DATASET FOR REMOTE SENSING
[J].
2021 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM IGARSS,
2021,
:1218-1221
[8]
RSVQA: Visual Question Answering for Remote Sensing Data
[J].
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING,
2020, 58 (12)
:8555-8566
[9]
Mehta S., 2022, ICLR
[10]
Siebert T., 2022, SPIE IMAGE SIGNAL PR