SCPNet: Self-constrained parallelism network for keypoint-based lightweight object detection?

被引：10

作者：

Zhong, Xian ^{[1
,3
]}

Wang, Mengdie ^{[1
]}

Liu, Wenxuan ^{[1
]}

Yuan, Jingling ^{[1
]}

Huang, Wenxin ^{[2
]}

机构：

[1] Wuhan Univ Technol, Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China

[2] Hubei Univ, Comp Sci & Informat Engn, Wuhan 430062, Peoples R China

[3] Peking Univ, Sch Elect Engn & Comp Sci, Beijing 100091, Peoples R China

来源：

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2023年 / 90卷

基金：

中国国家自然科学基金;

关键词：

Keypoint-based lightweight object detection; Parallel multi-scale fusion; Parallel shuffle block; Self-constrained detection;

D O I：

10.1016/j.jvcir.2022.103719

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Keypoint-based object detection achieves better performance without positioning calculations and extensive prediction. However, they have heavy backbone, and high-resolution is restored using upsampling that obtain unreliable features. We propose a self-constrained parallelism keypoint-based lightweight object detection network (SCPNet), which speeds inference, drops parameters, widens receptive fields, and makes prediction accurate. Specifically, the parallel multi-scale fusion module (PMFM) with parallel shuffle blocks (PSB) adopts parallel structure to obtain reliable features and reduce depth, adopts repeated multi-scale fusion to avoid too many parallel branches. The self-constrained detection module (SCDM) has a two-branch structure, with one branch predicting corners, and employing entad offset to match high-quality corner pairs, and the other branch predicting center keypoints. The distances between the paired corners' geometric centers and the center keypoints are used for self-constrained detection. On MS-COCO 2017 and PASCAL VOC, SCPNet's results are competitive with the state-of-the-art lightweight object detection. https://github.com/mengdie-wang/SCPNet.git.

引用

页数：12

共 62 条

[1]

Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934

[2] ON TWO-DIMENSIONAL SPARSE MATRIX PARTITIONING: MODELS, METHODS, AND A RECIPE [J].

Catalyurek, Umit V. ;

Aykanat, Cevdet ;

Ucar, Bora .

SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2010, 32 (02) :656-683

[3] Deformable Convolutional Networks [J].

Dai, Jifeng ;

Qi, Haozhi ;

Xiong, Yuwen ;

Li, Yi ;

Zhang, Guodong ;

Hu, Han ;

Wei, Yichen .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773

[4]

Dong J., 2020, PROC IEEE INT C MULT, P1

[5] CenterNet: Keypoint Triplets for Object Detection [J].

Duan, Kaiwen ;

Bai, Song ;

Xie, Lingxi ;

Qi, Honggang ;

Huang, Qingming ;

Tian, Qi .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577

[6] The Pascal Visual Object Classes (VOC) Challenge [J].

Everingham, Mark ;

Van Gool, Luc ;

Williams, Christopher K. I. ;

Winn, John ;

Zisserman, Andrew .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338

[7]

Fu C., 2017, arXiv

[8]

Howard AG, 2017, Arxiv, DOI arXiv:1704.04861

[9]

Ge Z, 2021, Arxiv, DOI arXiv:2107.08430

[10] Fast R-CNN [J].

Girshick, Ross .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448

← 1 2 3 4 5 6 7 →