245 entries in total
[81]
UniT: Multimodal Multitask Learning with a Unified Transformer [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 1419-1429
[82]
HOT-Net: Non-Autoregressive Transformer for 3D Hand-Object Pose Estimation [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020: 3136-3145
[83]
Hand-Transformer: Non-Autoregressive Structured Modeling for 3D Hand Pose Estimation [J]. COMPUTER VISION - ECCV 2020, PT XXV, 2020, 12370: 17-33
[84]
Huang ZL, 2021, arXiv, arXiv:2106.03650, DOI 10.48550/arXiv.2106.03650
[85]
Ioffe Sergey, 2015, Proceedings of Machine Learning Research, V37, P448
[86]
Jaegle A, 2022, arXiv, arXiv:2107.14795
[87]
Jaegle A, 2021, PR MACH LEARN RES, V139
[88]
Skeletor: Skeletal Transformers for Robust Body-Pose Estimation [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021: 3389-3397
[89]
Jiang Y., 2021, PROC C NEURAL INFORM
[90]
Jiang Z-H., 2020, Advances in Neural Information Processing Systems, V33, P12837, DOI 10.48550/arXiv.2008.02496