A Deep Learning Framework Using Convolutional Neural Network for Multi-class Object Recognition

被引:0
作者
Hayat, Shaukat [1 ]
She Kun [1 ]
Zuo Tengtao [1 ]
Yue Yu [1 ]
Tu, Tianyi [1 ]
Du, Yantong [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu, Sichuan, Peoples R China
来源
2018 IEEE 3RD INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC) | 2018年
关键词
multi-class object recognition; deep learning; convolutional neural network; computer vision; bag-of-visual words (BOW); L2-SVM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object recognition is classic technique used to effectively recognize an object in the image. Technologies specifically in field of computer vision are expected to detect and recognize more complex tasks with help of local features detection methods. Over the last decade, there has been sustained increase in the number of researchers from various kind of disciplines i.e. academia, industry, security agencies and even from general public has caught an attention to explore the covered aspects of object detection and recognition concerned problems. It is further significantly amended by adopting deep learning model. In this paper, we applied deep learning to multi-class object recognition and explore convolutional neural network (CNN). The convolutional neural network is created with normalized standard initialization and trained with training set of sample images from 9 different object categories plus sample test images using widely varied dataset. All results are implemented in python tensorflow framework. We examine and compared CNN results with final feature vectors extracted from variant approaches of BOW based on linear L2-SVM classifier. Based on it, sufficient experiments verify our CNN model effectiveness and robustness with rate of 90.12% accuracy.
引用
收藏
页码:194 / 198
页数:5
相关论文
共 27 条
[11]  
[Anonymous], NEUR NETW 1999 IJCNN
[12]  
[Anonymous], 2011, AISTATS
[13]  
Csurka G., VISUAL CATEGORIZATIO
[14]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[15]  
Glorot X., 2010, P 13 INT C ART INT S, P249
[16]  
Gu M, 2014, INT CONF INFRA MILLI, DOI 10.1109/IRMMW-THz.2014.6956528
[17]  
Krizhevsky A., 2017, COMMUN ACM, V60, P84, DOI [DOI 10.1145/3065386, 10.1145/3065386]
[18]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[19]   Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories [J].
Li Fei-Fei ;
Fergus, Rob ;
Perona, Pietro .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2007, 106 (01) :59-70
[20]   Object Detection and Recognition for Assistive Robots Experimentation and Implementation [J].
Martinez-Martin, Ester ;
del Pobil, Angel P. .
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2017, 24 (03) :123-138