共 56 条
[1]
Behera A, 2021, AAAI CONF ARTIF INTE, V35, P929
[4]
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:347-356
[5]
Chou PY, 2023, Arxiv, DOI [arXiv:2303.06442, DOI 10.48550/ARXIV.2303.06442]
[6]
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[9]
Fine-Grained Visual Classification via Progressive Multi-granularity Training of Jigsaw Patches
[J].
COMPUTER VISION - ECCV 2020, PT XX,
2020, 12365
:153-168
[10]
Multiscale Vision Transformers
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:6804-6815