A New Method Based on Deep Convolutional Neural Networks for Object Detection and Classification

Cited by: 0
Authors
Yan Liu [1]
Zhu Zhuxngjie [1]
Zhang, Qiuhui [1]
Ding, Xiaotian [1]
Wang, Ruonan [1]
Han, Senyao [1]
Chi Li [1]
Affiliations
[1] Taikang Insurance Grp, Block B, Taikang Life Bldg, 15b Fuxingmennei St, Beijing, Peoples R China
Source
AATCC JOURNAL OF RESEARCH | 2021 / Vol. 8
Keywords
Computer Vision; Image Classification; Neural Networks; Object Detection; Segmentation;
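DOI
Not available
Chinese Library Classification (CLC) number
TB3 [Engineering Materials Science]; TS1 [Textile Industry, Dyeing and Finishing Industry];
Discipline classification code
0805 ; 080502 ; 0821 ;
Abstract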
Accurate object detection and classification have broad applications in industrial tasks, such as fabric defect detection and invoice detection. Previous state-of-the-art methods such as SSD and Faster R-CNN usually require careful tuning of anchor-box hyperparameters and perform poorly in specialized fields with large variations in object size and aspect ratio and complex background textures. In this study, we proposed a new accurate, robust, anchor-free method for automatic object detection and classification. First, we used a feature pyramid network (FPN) to merge feature maps of different scales extracted from a convolutional neural network (CNN), which allowed easy and robust multi-scale feature fusion. Second, we built two subnets to generate candidate region proposals from the FPN outputs, followed by another CNN that determined the categories of the regions proposed by the two subnets.
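The multi-scale fusion step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the details of the merge, so the code below assumes the commonly used FPN top-down scheme: a 1x1 lateral projection of each backbone feature map, nearest-neighbour 2x upsampling of the coarser level, and elementwise addition. The function names (`upsample2x`, `lateral`, `fpn_merge`), the channel counts, and the random weights are all hypothetical and are not taken from the paper.

```python
import numpy as np

def upsample2x(x):
    """Nearest-neighbour 2x upsampling of a (C, H, W) feature map."""
    return x.repeat(2, axis=1).repeat(2, axis=2)

def lateral(x, weight):
    """1x1 convolution written as a per-pixel channel projection.

    x: (C_in, H, W), weight: (C_out, C_in) -> result: (C_out, H, W)
    """
    return np.einsum("oc,chw->ohw", weight, x)

def fpn_merge(features, weights):
    """Top-down merge of backbone feature maps (assumed FPN-style scheme).

    features: list ordered fine to coarse; each level is assumed to have
              half the spatial resolution of the previous one.
    weights:  one (C_out, C_in) lateral projection matrix per level.
    Returns merged maps in the same fine-to-coarse order, all with C_out channels.
    """
    # Start from the coarsest level, which has no top-down input.
    merged = [lateral(features[-1], weights[-1])]
    # Walk towards finer levels, adding the upsampled coarser map each time.
    for feat, w in zip(reversed(features[:-1]), reversed(weights[:-1])):
        top_down = upsample2x(merged[0])
        merged.insert(0, lateral(feat, w) + top_down)
    return merged

# Illustrative usage with random data (hypothetical shapes).
rng = np.random.default_rng(0)
feats = [rng.normal(size=(8, 16, 16)),
         rng.normal(size=(16, 8, 8)),
         rng.normal(size=(32, 4, 4))]
projs = [rng.normal(size=(4, c)) for c in (8, 16, 32)]
for level in fpn_merge(feats, projs):
    print(level.shape)
```

Every merged level ends up with the same channel count, which is what lets later subnets share one head across scales. Whether the authors used this exact variant is not stated in the abstract.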
Pages: 37-45
Page count: 9
Related papers
30 in total
[1]  
[Anonymous], 2005, PROC IEEE COMPUT SOC, DOI DOI 10.1109/CVPR.2005.177
[2]  
Everingham M, 2012, PASCAL VISUAL OBJECT
[3]   LeukocyteMask: An automated localization and segmentation method for leukocyte in blood smear images using deep neural networks [J].
Fan, Haoyi ;
Zhang, Fengbin ;
Xi, Liang ;
Li, Zuoyong ;
Liu, Guanghai ;
Xu, Yong .
JOURNAL OF BIOPHOTONICS, 2019, 12 (07)
[4]  
Fei-Fei L, 2005, PROC CVPR IEEE, P524
[5]   Enlarging Effective Receptive Field of Convolutional Neural Networks for Better Semantic Segmentation [J].
Gu, Yifan ;
Zhong, Zuofeng ;
Wu, Shuai ;
Xu, Yong .
PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, :388-393
[6]  
Hariharan B, 2012, LECT NOTES COMPUT SC, V7575, P459, DOI 10.1007/978-3-642-33765-9_33
[7]   Mask R-CNN [J].
He, Kaiming ;
Gkioxari, Georgia ;
Dollar, Piotr ;
Girshick, Ross .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2980-2988
[8]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[9]   Binary sparse signal recovery algorithms based on logic observation [J].
Hu, Xiao-Li ;
Wen, Jiajun ;
Lai, Zhihui ;
Wong, Wai Keung ;
Shen, Linlin .
PATTERN RECOGNITION, 2019, 90 :147-160
[10]  
Lin Tsung-Yi, 2020, IEEE Trans Pattern Anal Mach Intell, V42, P318, DOI [10.1109/TPAMI.2018.2858826, 10.1109/ICCV.2017.324]