Automatic Object Detection from Digital Images by Deep Learning with Transfer Learning

被引:8
作者
Yabuki, Nobuyoshi [1 ]
Nishimura, Naoto [1 ]
Fukuda, Tomohiro [1 ]
机构
[1] Osaka Univ, Suita, Osaka 5650871, Japan
来源
ADVANCED COMPUTING STRATEGIES FOR ENGINEERING, PT I | 2018年 / 10863卷
关键词
Deep learning; Image detection; Transfer learning;
D O I
10.1007/978-3-319-91635-4_1
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
At construction sites and disaster areas, an enormous number of digital photographs are taken by engineers. Tasks such as collecting, sorting, annotating, storing, deleting, distributing these digital images, as done manually, are cumbersome, error-prone, and time-consuming. Thus, it is desirable to automate the object detection process of pictures so that engineers do not have to waste their valuable time and can improve the efficiency and accuracy. Although conventional machine learning could be a solution, it takes much time for researchers to determine features and contents of digital images, and the accuracy tends to be unsatisfactory. On the other hand, deep learning can automatically determine features and contents of various objects from digital images. Therefore, this research aims to automatically detect each object as an object and its position from digital images by using deep learning. Since deep learning usually requires a very large amount of dataset, this research has adopted deep learning with transfer learning, which enables object detection even if the dataset is not very large. Experiments were executed to detect construction machines, workers, and signboards in photographs, comparing among the conventional machine learning by feature values, deep learning with and without transfer learning. The result showed that the best performance was achieved by the deep learning with transfer learning.
引用
收藏
页码:3 / 15
页数:13
相关论文
共 13 条
[1]  
Dalal N, 2005, P 2005 IEEE COMP SOC, V1, P511
[2]  
Girshick R., 2014, P IEEE C COMP VIS PA, DOI [10.1109/CVPR.2014.81, DOI 10.1109/CVPR.2014.81, 10.1109/cvpr.2014.81]
[3]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[4]  
Levi K, 2004, PROC CVPR IEEE, P53
[5]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[6]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37
[7]  
Mitsui Tomokazu, 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, P1169, DOI 10.1109/ICCVW.2009.5457478
[8]   A Survey on Transfer Learning [J].
Pan, Sinno Jialin ;
Yang, Qiang .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (10) :1345-1359
[9]  
REDMON J, 2016, PROC CVPR IEEE, P779, DOI [DOI 10.1109/CVPR.2016.91, 10.1109/CVPR.2016.91]
[10]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149