SCPNet: Self-constrained parallelism network for keypoint-based lightweight object detection?

被引:10
作者
Zhong, Xian [1 ,3 ]
Wang, Mengdie [1 ]
Liu, Wenxuan [1 ]
Yuan, Jingling [1 ]
Huang, Wenxin [2 ]
机构
[1] Wuhan Univ Technol, Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
[2] Hubei Univ, Comp Sci & Informat Engn, Wuhan 430062, Peoples R China
[3] Peking Univ, Sch Elect Engn & Comp Sci, Beijing 100091, Peoples R China
基金
中国国家自然科学基金;
关键词
Keypoint-based lightweight object detection; Parallel multi-scale fusion; Parallel shuffle block; Self-constrained detection;
D O I
10.1016/j.jvcir.2022.103719
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Keypoint-based object detection achieves better performance without positioning calculations and extensive prediction. However, they have heavy backbone, and high-resolution is restored using upsampling that obtain unreliable features. We propose a self-constrained parallelism keypoint-based lightweight object detection network (SCPNet), which speeds inference, drops parameters, widens receptive fields, and makes prediction accurate. Specifically, the parallel multi-scale fusion module (PMFM) with parallel shuffle blocks (PSB) adopts parallel structure to obtain reliable features and reduce depth, adopts repeated multi-scale fusion to avoid too many parallel branches. The self-constrained detection module (SCDM) has a two-branch structure, with one branch predicting corners, and employing entad offset to match high-quality corner pairs, and the other branch predicting center keypoints. The distances between the paired corners' geometric centers and the center keypoints are used for self-constrained detection. On MS-COCO 2017 and PASCAL VOC, SCPNet's results are competitive with the state-of-the-art lightweight object detection. https://github.com/mengdie-wang/SCPNet.git.
引用
收藏
页数:12
相关论文
共 62 条
[1]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[2]   ON TWO-DIMENSIONAL SPARSE MATRIX PARTITIONING: MODELS, METHODS, AND A RECIPE [J].
Catalyurek, Umit V. ;
Aykanat, Cevdet ;
Ucar, Bora .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2010, 32 (02) :656-683
[3]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[4]  
Dong J., 2020, PROC IEEE INT C MULT, P1
[5]   CenterNet: Keypoint Triplets for Object Detection [J].
Duan, Kaiwen ;
Bai, Song ;
Xie, Lingxi ;
Qi, Honggang ;
Huang, Qingming ;
Tian, Qi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577
[6]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[7]  
Fu C., 2017, arXiv
[8]  
Howard AG, 2017, Arxiv, DOI arXiv:1704.04861
[9]  
Ge Z, 2021, Arxiv, DOI arXiv:2107.08430
[10]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448