共 74 条
[1]
Altindis SF, 2021, Arxiv, DOI arXiv:2109.01123
[2]
End-to-End Referring Video Object Segmentation with Multimodal Transformers
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:4975-4985
[3]
End-to-End Object Detection with Transformers
[J].
COMPUTER VISION - ECCV 2020, PT I,
2020, 12346
:213-229
[4]
Chen YW, 2019, Arxiv, DOI arXiv:1910.04748
[5]
Domain Adaptive Faster R-CNN for Object Detection in the Wild
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:3339-3348
[6]
Cheng B, 2021, ADV NEUR IN, V34
[7]
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]
Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[9]
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
[J].
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV,
2023,
:2694-2703