共 86 条
[1]
Bai JZ, 2023, Arxiv, DOI arXiv:2308.12966
[2]
Cen Jiazhong, 2023, Advances in Neural Information Processing Systems
[3]
See-Through-Text Grouping for Referring Image Segmentation
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:7453-7462
[4]
UNITER: UNiversal Image-TExt Representation Learning
[J].
COMPUTER VISION - ECCV 2020, PT XXX,
2020, 12375
:104-120
[5]
Chen Z., 2022, P 11 INT C LEARN REP, P1
[6]
Masked-attention Mask Transformer for Universal Image Segmentation
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:1280-1289
[7]
Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[8]
Vision-Language Transformer and Query Generation for Referring Segmentation
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:16301-16310
[9]
Ding YH, 2025, IEEE T CIRC SYST VID, V35, P2975, DOI 10.1109/TCSVT.2024.3384503
[10]
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
[J].
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2023,
:19358-19369