共 78 条
[2]
Chen Chun-Fu, 2021, ARXIV210602689
[3]
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:347-356
[5]
Chu X, 2021, ARXIV210210882
[6]
Dosovitskiy A., 2020, INT C LEARN REPR
[9]
Guo M.-H., 2021, ARXIV210502358, V2021