96 entries in total
[41]
Feature Pyramid Networks for Object Detection [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 936-944
[42]
GRES: Generalized Referring Expression Segmentation [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 23592-23601
[43]
Recurrent Multimodal Interaction for Referring Image Segmentation [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 1280-1289
[44]
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 18653-18663
[46]
Video Swin Transformer [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 3192-3201
[47]
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 9992-10002
[48]
Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965
[49]
Loshchilov I., 2019, INT C LEARNING REPRE
[50]
Lu JS, 2019, ADV NEUR IN, V32