Panoptic Segmentation with Convex Object Representation

被引：0

作者：

Yao, Zhicheng ^{[1
,2
]}

Wang, Sa ^{[1
,2
]}

Zhu, Jinbin ^{[1
]}

Bao, Yungang ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Inst Comp Technol, 6 Kexueyuan South Rd Zhongguancun, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, 1 Yanqihu East Rd, Beijing 101408, Peoples R China

来源：

COMPUTER JOURNAL | 2023年 / 67卷 / 06期

基金：

中国国家自然科学基金;

关键词：

deep learning; computer vision; image segmentation; panoptic segmentation; instance representation; NETWORK; FREQUENT; CLASSIFICATION; SKETCHES; FUSION;

D O I：

10.1093/comjnl/bxad119

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The accurate representation of objects holds pivotal significance in the realm of panoptic segmentation. Presently, prevalent object representation methodologies, including box-based, keypoint-based and query-based techniques, encounter a challenge known as the 'representation confusion' issue in specific scenarios, often resulting in the mislabeling of instances. In response, this paper introduces Convex Object Representation (COR), a straightforward yet highly effective approach to address this problem. COR leverages a CNN-based Euclidean Distance Transform to convert the target instance into a convex heatmap. Simultaneously, it offers a parallel embedding method for encoding the object. Subsequently, COR characterizes objects based on the distinctive embedding vectors of their convex vertices. This paper seamlessly integrates COR into a state-of-the-art query-based panoptic segmentation framework. Experimental findings validate that COR successfully mitigates the representation confusion predicament, enhancing segmentation accuracy. The COR-augmented methods exhibit notable improvements of +1.3 and +0.7 points in PQ on the Cityscapes validation and MS COCO panoptic 2017 validation datasets, respectively.

引用

页码：2009 / 2019

页数：11

共 57 条

[1]

Abbas Ahmed, 2021, ADV NEUR IN, V34

[2]

Bai M., 2016, ARXIV

[3]

Bonde U., 2020, ARXIV

[4]

Brabandere B.D., 2017, CoRR abs/1708.02551

[5] End-to-End Object Detection with Transformers [J].

Carion, Nicolas ;

Massa, Francisco ;

Synnaeve, Gabriel ;

Usunier, Nicolas ;

Kirillov, Alexander ;

Zagoruyko, Sergey .

COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229

[6]

Chen Xinlei., 2020, ARXIV

[7]

Cheng B, 2021, ADV NEUR IN, V34

[8] Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation [J].

Cheng, Bowen ;

Collins, Maxwell D. ;

Zhu, Yukun ;

Liu, Ting ;

Huang, Thomas S. ;

Adam, Hartwig ;

Chen, Liang-Chieh .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :12472-12482

[9] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[10] CenterNet: Keypoint Triplets for Object Detection [J].

Duan, Kaiwen ;

Bai, Song ;

Xie, Lingxi ;

Qi, Honggang ;

Huang, Qingming ;

Tian, Qi .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577

← 1 2 3 4 5 6 →