61 entries in total
[5]
Chen CF, 2019, Arxiv, DOI [arXiv:1807.03848, 10.48550/arXiv.1807.03848]
[6]
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification [J]. 2021 IEEE/CVF International Conference on Computer Vision (ICCV 2021), 2021: 347-356
[8]
Dosovitskiy A., 2021, 9 INT C LEARN REPR I
[10]
Multiscale Vision Transformers [J]. 2021 IEEE/CVF International Conference on Computer Vision (ICCV 2021), 2021: 6804-6815