A Befitting Image Data Crawling and Annotating System with CNN based Transfer Learning

被引:3
作者
Hwang, Kyu-hong [1 ]
Lee, Myung-jae [1 ]
Ha, Young-guk [1 ]
机构
[1] Konkuk Univ, Dept Comp Sci & Engn, Seoul, South Korea
来源
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020) | 2020年
关键词
Image Crawler; Bigdata; CNN; Transfer Learning;
D O I
10.1109/BigComp48618.2020.00-81
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the Growth of Big Data technology, Artificial Intelligence technology, especially Deep Learning, is growing rapidly. In Deep Learning technology that like Convolutional Neural Network (CNN), collecting and training a sizeable befitting training data has a significant impact on technology maturity and completeness. But it is difficult for a person to collect a sizeable training data directly, and there are some problems in collecting it automatically with an ordinary web image crawler. For example, the name of a car is often obtained from a specific place name. When these keywords are crawled, the crawler retrieves only the text metadata of the image and gets the image. As a result, the crawler collects all images of different but has the same text. Also, even if the image of the car has been collected, only some of the car's interior or only a portion of the data that is not suitable for learning is also collected. In this paper, we present a method to classify and collect only data that is suitable for learning among crawled images using deep learning networks. Also, we offer a method to automatically generate training data by detecting objects and making object coordinates necessary for CNN learning at the same time.
引用
收藏
页码:165 / 168
页数:4
相关论文
共 17 条
[1]  
[Anonymous], 1996, WWW ROBOTS SEARCH EN
[2]   A Light-Weight Permutation based Method for Data Privacy in Mobile Cloud Computing [J].
Bahrami, Mehdi ;
Singhal, Mukesh .
2015 3RD IEEE INTERNATIONAL CONFERENCE ON MOBILE CLOUD COMPUTING, SERVICES, AND ENGINEERING (MOBILECLOUD 2015), 2015, :189-196
[3]   ON CERTAIN INTEGRALS OF LIPSCHITZ-HANKEL TYPE INVOLVING PRODUCTS OF BESSEL FUNCTIONS [J].
EASON, G ;
NOBLE, B ;
SNEDDON, IN .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL AND PHYSICAL SCIENCES, 1955, 247 (935) :529-551
[4]  
Felzenswalb Pedro., 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence
[5]  
Girshick R.B., 2012, Discriminatively trained deformable part models, release 5
[6]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[7]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[8]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[9]  
Manku G S, 2007, 16 INT C WORLD WID W
[10]   Performance Evaluation of Deep Learning Tools in Docker Containers [J].
Xu, Pengfei ;
Shi, Shaohuai ;
Chu, Xiaowen .
2017 3RD INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM), 2017, :395-403