共 110 条
[1]
Amini A., 2021, arXiv
[2]
Bao H, 2022, INT C LEARNING REPRE
[3]
Carion N, 2020, Img Proc Comp Vis Re, V12346, P213, DOI 10.1007/978-3-030-58452-8_13
[4]
Chang SE, 2021, INT S HIGH PERF COMP, P208, DOI [10.1109/HPCA51647.2021.00027, 10.1109/WRCSARA53879.2021.9612678]
[5]
Transformer Interpretability Beyond Attention Visualization
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:782-791
[6]
Chen BY, 2021, Arxiv, DOI arXiv:2108.03428
[7]
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:347-356
[8]
Pre-Trained Image Processing Transformer
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:12294-12305
[9]
Chen M., 2021, PROC IEEECVF INT C C, P12270
[10]
Chen P., 2021, ARXIV