共 40 条
[31]
MAXIM: Multi-Axis MLP for Image Processing
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:5759-5770
[32]
Vaswani A, 2017, ADV NEUR IN, V30
[34]
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:548-558
[35]
Wang Ziyu, 2022, INT C MACHINE LEARNI, P22691
[36]
Woo Sanghyun, 2023, arXiv
[37]
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:538-547
[38]
Zhang David Junhao, 2021, ARXIV211112527
[39]
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6848-6856