共 78 条
[1]
Ali A., 2021, NeurIPS, V34
[2]
[Anonymous], 2021, NEURIPS
[3]
Bello I, 2021, Arxiv, DOI arXiv:2102.08602
[4]
Berman M, 2019, Arxiv, DOI arXiv:1902.05509
[5]
Brock A, 2021, INT C MACHINE LEARNI, V139
[6]
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:347-356
[7]
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
[J].
COMPUTER VISION - ECCV 2018, PT VII,
2018, 11211
:833-851
[8]
Chen M, 2020, PR MACH LEARN RES, V119
[9]
Visformer: The Vision-friendly Transformer
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:569-578
[10]
Chu X., 2021, arXiv, DOI 10.48550/arXiv.2104.13840