共 66 条
[42]
U-Net: Convolutional Networks for Biomedical Image Segmentation
[J].
MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III,
2015, 9351
:234-241
[43]
Training Region-based Object Detectors with Online Hard Example Mining
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:761-769
[44]
Bottleneck Transformers for Visual Recognition
[J].
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021,
2021,
:16514-16524
[45]
Deep High-Resolution Representation Learning for Human Pose Estimation
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:5686-5696
[46]
Tao AD, 2020, Arxiv, DOI arXiv:2005.10821
[47]
Vaswani A, 2017, ADV NEUR IN, V30
[48]
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:548-558
[49]
CBAM: Convolutional Block Attention Module
[J].
COMPUTER VISION - ECCV 2018, PT VII,
2018, 11211
:3-19
[50]
CvT: Introducing Convolutions to Vision Transformers
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:22-31