Data Augmentation on Synthetic Images for Transfer Learning using Deep CNNs

Cited by: 0

Authors:

Talukdar, Jonti [1]

Biswas, Ayon [2]

Gupta, Sanchit [3]

Affiliations:

[1] Nirma Univ, Dept Elect & Commun Engn, Ahmadabad, Gujarat, India

[2] Indian Inst Technol, Dept Elect Engn, Gandhinagar, India

[3] BITS Pilani, Dept Comp & Informat Sci, Hyderabad, India

Source:

2018 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN) | 2018

Keywords:

Data Augmentation; Transfer Learning; Synthetic Data; Deep Convolutional Neural Networks; Artificial Intelligence;

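DOI:

Not available

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract: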

Training of deep Convolutional Neural Networks (CNNs) for object detection tasks requires a huge amount of annotated data which is expensive, difficult and time-consuming to produce. This requirement can be fulfilled by automating the process of dataset generation. We utilize the approach of training deep CNNs using completely synthetically rendered data, with the focus of improving the overall transfer learning performance through online and offline data augmentation techniques. We focus on the problem of detecting packaged food products in indoor refrigerator environments. We analyze the impact of various data augmentation strategies like randomized cropping, pixel shifting, image scaling, image rotation, oversaturation, Gaussian blurring, noise addition, color inversion etc. on the overall accuracy of the object detection and increase the overall mean average precision (mAP). It is found that the use of a combination of data augmentation techniques performs best, with highest mAP of 20.54 obtained with combinations of linear augmentation techniques like scaling, shifting and scaling and rotation.
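Several of the augmentations named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' pipeline, which this record does not describe. The function names, parameters, and the zero-fill border policy are assumptions made for illustration. Images are assumed to be H x W x C `uint8` arrays. For object detection, any geometric transform (crop, shift) would also have to be applied to the bounding-box annotations, which this sketch omits.

```python
import numpy as np


def random_crop(img, crop_h, crop_w, rng):
    """Cut a randomly placed crop_h x crop_w window out of an H x W x C image."""
    h, w = img.shape[:2]
    top = rng.integers(0, h - crop_h + 1)
    left = rng.integers(0, w - crop_w + 1)
    return img[top:top + crop_h, left:left + crop_w]


def pixel_shift(img, dy, dx):
    """Translate by (dy, dx) pixels, zero-filling exposed borders.

    Assumes |dy| < H and |dx| < W (hypothetical constraint for this sketch).
    """
    h, w = img.shape[:2]
    out = np.zeros_like(img)
    # Region of the source image that remains visible after the shift.
    src = img[max(0, -dy):min(h, h - dy), max(0, -dx):min(w, w - dx)]
    y0, x0 = max(0, dy), max(0, dx)
    out[y0:y0 + src.shape[0], x0:x0 + src.shape[1]] = src
    return out


def add_gaussian_noise(img, sigma, rng):
    """Add zero-mean Gaussian noise with standard deviation sigma, then clip to [0, 255]."""
    noisy = img.astype(np.float32) + rng.normal(0.0, sigma, img.shape)
    return np.clip(noisy, 0, 255).astype(np.uint8)


def invert_colors(img):
    """Invert every channel of a uint8 image."""
    return 255 - img


# Usage: chain a few augmentations on a random stand-in image.
rng = np.random.default_rng(0)
image = rng.integers(0, 256, size=(64, 64, 3), dtype=np.uint8)
cropped = random_crop(image, 48, 48, rng)
augmented = add_gaussian_noise(pixel_shift(cropped, 2, -3), 5.0, rng)
print(augmented.shape)  # (48, 48, 3)
```

Passing an explicit random generator keeps each augmentation reproducible, which makes it easier to compare augmentation combinations of the kind the abstract reports.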


Pages: 215-219

Page count: 5

