Instances as Queries

被引:159
作者
Fang, Yuxin [1 ]
Yang, Shusheng [1 ,2 ]
Wang, Xinggang [1 ]
Li, Yu [2 ]
Fang, Chen [3 ]
Shan, Ying [2 ]
Feng, Bin [1 ]
Liu, Wenyu [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch EIC, Wuhan, Peoples R China
[2] Tencent PCG, Appl Res Ctr ARC, Shenzhen, Peoples R China
[3] Tencent, Shenzhen, Peoples R China
来源
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年
关键词
D O I
10.1109/ICCV48922.2021.00683
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present QueryInst, a new perspective for instance segmentation. QueryInst is a multi-stage end-to-end system that treats instances of interest as learnable queries, enabling query based object detectors, e.g., Sparse R-CNN, to have strong instance segmentation performance. The attributes of instances such as categories, bounding boxes, instance masks, and instance association embeddings are represented by queries in a unified manner. In QueryInst, a query is shared by both detection and segmentation via dynamic convolutions and driven by parallelly-supervised multi-stage learning. We conduct extensive experiments on three challenging benchmarks, i.e., COCO, CityScapes, and YouTube-VIS to evaluate the effectiveness of QueryInst in object detection, instance segmentation, and video instance segmentation tasks. For the first time, we demonstrate that a simple end-to-end query based framework can achieve the state-of-the-art performance in various instance-level recognition tasks.
引用
收藏
页码:6890 / 6899
页数:10
相关论文
共 62 条
  • [1] [Anonymous], 2016, INT CONF 3D VISION, DOI DOI 10.1109/3DV.2016.79
  • [2] Athar Ali, 2020, Computer Vision - ECCV 2020 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12356), P158, DOI 10.1007/978-3-030-58621-8_10
  • [3] Bolya D., 2019, YOLACT BETTER REAL T
  • [4] Bolya D., 2019, ICCV
  • [5] Cascade R-CNN: High Quality Object Detection and Instance Segmentation
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1483 - 1498
  • [6] Carion N, 2020, EUR C COMP VIS, P213
  • [7] Chan William, 2020, ICML
  • [8] Chen Hao, 2020, CVPR
  • [9] Hybrid Task Cascade for Instance Segmentation
    Chen, Kai
    Pang, Jiangmiao
    Wang, Jiaqi
    Xiong, Yu
    Li, Xiaoxiao
    Sun, Shuyang
    Feng, Wansen
    Liu, Ziwei
    Shi, Jianping
    Ouyang, Wanli
    Loy, Chen Change
    Lin, Dahua
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4969 - 4978
  • [10] Chen Kai, 2019, ARXIV190607155