59 entries in total
[1]
[Anonymous], 2022, NAACL 2022 2022 C N
[2]
Argyriou Andreas, 2006, Advances in Neural Information Processing Systems, V19
[3]
ViViT: A Video Vision Transformer [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 6816-6826
[4]
Asai Akari, 2022, ARXIV220511961
[5]
Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016: 2874-2883
[6]
Ben-Zaken E, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, P1
[7]
Carion N., 2020, P EUR C COMP VIS GLA, P213, DOI 10.1007/978-3-030-58452-8_13
[8]
Emerging Properties in Self-Supervised Vision Transformers [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 9630-9640
[9]
Pre-Trained Image Processing Transformer [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 12294-12305
[10]
Chen Shoufa, 2022, ARXIV220513535