共 67 条
[1]
End-to-End Object Detection with Transformers
[J].
COMPUTER VISION - ECCV 2020, PT I,
2020, 12346
:213-229
[2]
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:347-356
[3]
Conde M.V., 2021, arXiv
[4]
Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:4109-4118
[6]
Dosovitskiy Alexey, 2021, P ICLR
[8]
Fine-Grained Visual Classification via Progressive Multi-granularity Training of Jigsaw Patches
[J].
COMPUTER VISION - ECCV 2020, PT XX,
2020, 12365
:153-168
[9]
Dubey A., 2018, P ADV NEUR INF PROC, P635
[10]
Duke B, 2021, Arxiv, DOI arXiv:2101.08833