共 62 条
[1]
[Anonymous], 2021, ICCV, DOI DOI 10.1109/ICCV48922.2021.01138
[2]
[Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00575
[3]
[Anonymous], 2022, CVPR, DOI DOI 10.1109/CVPR52688.2022.00513
[4]
[Anonymous], 2021, PMLR
[5]
[Anonymous], 2022, CVPR, DOI DOI 10.1109/CVPR52688.2022.01569
[6]
[Anonymous], 2021, CVPR, DOI DOI 10.1109/CVPR46437.2021.00356
[7]
[Anonymous], 2021, CVPR, DOI DOI 10.1109/CVPR46437.2021.00831
[8]
VQA: Visual Question Answering
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:2425-2433
[9]
Ba J. L., 2016, Layer Normalization
[10]
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:1708-1718