共 11 条
[1]
TransVG: End-to-End Visual Grounding with Transformers
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:1749-1759
[2]
Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[3]
Deep Residual Learning for Image Recognition
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:770-778
[4]
LiteFlowNet: A Lightweight Convolutional Neural Network for Optical Flow Estimation
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:8981-8989
[5]
Jia D., 2024, arXiv
[6]
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
[J].
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2024,
:26648-26658
[7]
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
[J].
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021),
2021,
:9992-10002
[8]
Pu YF, 2023, IEEE I CONF COMP VIS, P6566, DOI 10.1109/ICCV51070.2023.00606
[9]
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
[J].
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022),
2022,
:18134-18144
[10]
RRSIS: Referring Remote Sensing Image Segmentation
[J].
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING,
2024, 62
:1-12