70 entries in total
[52] Vaswani A, 2017, ADV NEUR IN, V30
[53] Convolutional Embedding Makes Hierarchical Vision Transformer Stronger [J]. COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680: 739-756
[55] Motion Guided 3D Pose Estimation from Videos [J]. COMPUTER VISION - ECCV 2020, PT XIII, 2020, 12358: 764-780
[56] Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 548-558
[60] Graph Stacked Hourglass Networks for 3D Human Pose Estimation [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 16100-16109