End-to-End Single Shot Detector Using Graph-Based Learnable Duplicate Removal

被引：1

作者：

Ding, Shuxiao ^{[1
,2
]}

Rehder, Eike ^{[1
]}

Schneider, Lukas ^{[1
]}

Cordts, Marius ^{[1
]}

Gall, Juergen ^{[2
]}

机构：

[1] Mercedes Benz AG, Stuttgart, Germany

[2] Univ Bonn, Bonn, Germany

来源：

PATTERN RECOGNITION, DAGM GCPR 2022 | 2022年 / 13485卷

关键词：

End-to-end detection; learning duplicate removal; relationship modeling;

D O I：

10.1007/978-3-031-16788-1_23

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Non-Maximum Suppression (NMS) is widely used to remove duplicates in object detection. In strong disagreement with the deep learning paradigm, NMS often remains as the only heuristic step. Learning NMS methods have been proposed that are either designed for Faster-RCNN or rely on separate networks. In contrast, learning NMS for SSD models is not well investigated. In this paper, we show that even a very simple rescoring network can be trained end-to-end with an underlying SSD model to solve the duplicate removal problem efficiently. For this, detection scores and boxes are refined from image features by modeling relations between detections in a Graph Neural Network (GNN). Our approach is applicable to the large number of object proposals in SSD using a pre-filtering head. It can easily be employed in arbitrary SSD-like models with weight-shared box predictor. Experiments on MS-COCO and KITTI show that our method improves accuracy compared with other duplicate removal methods at significantly lower inference time.

引用

页码：375 / 389

页数：15

共 28 条

[21]

Sun PZ, 2021, Arxiv, DOI arXiv:2012.05780

[22] EfficientDet: Scalable and Efficient Object Detection [J].

Tan, Mingxing ;

Pang, Ruoming ;

Le, Quoc, V .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :10778-10787

[23] Learning to Rank Proposals for Object Detection [J].

Tan, Zhiyu ;

Nie, Xuecheng ;

Qian, Qi ;

Li, Nan ;

Li, Hao .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8272-8280

[24] FCOS: Fully Convolutional One-Stage Object Detection [J].

Tian, Zhi ;

Shen, Chunhua ;

Chen, Hao ;

He, Tong .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9626-9635

[25]

Vaswani A, 2023, Arxiv, DOI [arXiv:1706.03762, 10.48550/arXiv.1706.03762]

[26]

Zhou Q, 2021, Arxiv, DOI [arXiv:2101.11782, DOI 10.1109/TMM.2023.3248966]

[27]

Zhou XY, 2019, Arxiv, DOI arXiv:1904.07850

[28]

Zhu XZ, 2021, Arxiv, DOI arXiv:2010.04159

← 1 2 3 →