共 72 条
[61]
Inverted Pyramid Multi-task Transformer for Dense Scene Understanding
[J].
COMPUTER VISION - ECCV 2022, PT XXVII,
2022, 13687
:514-530
[62]
PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation
[J].
COMPUTER VISION - ECCV 2022, PT XXVII,
2022, 13687
:582-599
[63]
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:538-547
[64]
Zhang LF, 2021, IEEE T CYBERNETICS, V51, P673, DOI [10.1109/TCYB.2019.2910151, 10.1109/TCYB.2019.2935066]
[65]
Zhang QM, 2022, Arxiv, DOI arXiv:2202.10108
[66]
Zhang Xiangtai, 2022, PROC IEEECVF C COMPU, P18847
[67]
Pattern-Affinitive Propagation across Depth, Surface Normal and Semantic Segmentation
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:4101-4110
[69]
Pattern-Structure Diffusion for Multi-Task Learning
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2020,
:4513-4522