Fusing Deep Dilated Convolutions Network and Light-Weight Network for Object Detection

被引:0
|
作者
Quan Y. [1 ]
Li Z.-X. [1 ]
Zhang C.-L. [1 ]
Ma H.-F. [1 ,2 ]
机构
[1] Guangxi Key Lab of Multi-source Information Mining and Security, Guangxi Normal University, Guilin, 541004, Guangxi
[2] College of Computer Science and Engineering, Northwest Normal University, Lanzhou, 730070, Gansu
来源
关键词
Convolution neural network; Dilated convolution network; Image object detection; Light-weight network; Transfer learning;
D O I
10.3969/j.issn.0372-2112.2020.02.023
中图分类号
学科分类号
摘要
Object detection is an important research direction in the field of computer vision. In recent years, object detection has made great advances in public datasets, and there are also breakthroughs in algorithmic performance. In order to improve the accuracy and speed performance of two-stage object detection, this paper proposes a detection model based on transfer learning method that fuses the deep dilated convolutions network and the light-weight network. First, the dilated convolutions network is used to replace the convolutional residual module in the backbone network, namely deep dilated convolution network(D_dNet-65).Then, by compressing the pretrained feature map and adding an 81-class fully connected layer to replace the original two layers, namely light-weight network. Finally, the transfer learning method is introduced in the pretraining to optimize the model (D_dNet and light-weight network).The experiment was carried out on a typical data set, MSCOCO and VOC07.And the experiment shows that the method proposed in this paper has good effectiveness and scalability. © 2020, Chinese Institute of Electronics. All right reserved.
引用
收藏
页码:390 / 397
页数:7
相关论文
共 18 条
  • [11] Krizhevsky A., Sutskever I., Hinton G.E., Imagenet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, pp. 1097-1105, (2012)
  • [12] Deng Z., Li K., Zhao Q., Et al., Effective face landmark localization via single deep network
  • [13] Ghiasi G., Lin T.Y., Le Q.V., Nas-fpn: Learning scalable feature pyramid architecture for object detection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7036-7045, (2019)
  • [14] Pan S.J., Yang Q., A survey on transfer learning, IEEE Transactions on Knowledge and Data Engineering, 22, 10, pp. 1345-1359, (2010)
  • [15] Li Z., Peng C., Yu G., Et al., DetNet: A backbone network for object detection
  • [16] Shrivastava A., Sukthankar R., Malik J., Et al., Beyond skip connections: Top-down modulation for object detection
  • [17] Oquab M., Et al., Learning and transferring mid-level image representations using convolutional neural networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1717-1724, (2014)
  • [18] He K., Gkioxari G., Dollar P., Et al., Mask R-CNN, Proceedings of the IEEE International Conference on Computer Vision, pp. 2980-2988, (2017)