共 85 条
[1]
Bar A, 2022, Arxiv, DOI arXiv:2209.00647
[2]
Beattie C, 2016, Arxiv, DOI arXiv:1612.03801
[3]
Bossard L, 2014, LECT NOTES COMPUT SC, V8694, P446, DOI 10.1007/978-3-319-10599-4_29
[5]
End-to-End Object Detection with Transformers
[J].
COMPUTER VISION - ECCV 2020, PT I,
2020, 12346
:213-229
[6]
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:347-356
[7]
Chen H, 2024, Arxiv, DOI [arXiv:2208.07463, DOI 10.48550/ARXIV.2208.07463]
[8]
Chen SF, 2022, Arxiv, DOI arXiv:2205.13535
[9]
An Empirical Study of Training Self-Supervised Vision Transformers
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:9620-9629