Object Recognition Using Deep Neural Network with Distinctive Features

被引:1
作者
Song, Hyun Chul [1 ]
Akram, Farhan [2 ]
Choi, Kwang Nam [1 ]
机构
[1] Chung Ang Univ, Dept Comp Sci & Engn, Seoul, South Korea
[2] Bioinformat Inst, Imaging Informat Div, Singapore, Singapore
来源
PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING (ICVIP 2018) | 2018年
关键词
Object recognition; bag of visual words; multi-layer perceptron; scale-invariant feature transform; weighting scheme;
D O I
10.1145/3301506.3301518
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, a new object recognition method using statistically weighting Multi-Layer Perceptron (MLP) is proposed. It uses visual distinctive features, which are computed using Bag of Visual Words (BoVW) framework. The proposed method has the following three main steps. At first it represents the images into their respective co-occurrence matrices, which are vectorized using BoVW and gives distinctive features. Then it computes weights from the histograms of visual words for each class. Finally, the statistically weighting distinctive features are applied to the testing image set to find the object class. In the proposed method, we improved MLP by introducing the weighted visual words, which are extracted by sampling the patches from the current image. From the Caltech 256 dataset, four classes namely pedestrians, cars, motorbikes and airplanes are used for the classification accuracy comparison between the MLP based artificial neural network (ANN) and the proposed method. The experimental results show that our method outperforms traditional MLP yielding an average classification accuracy of 89.60%, which is approximately 6.3% more than the compared MLP.
引用
收藏
页码:203 / 207
页数:5
相关论文
共 23 条
[1]  
Abdullah A, 2010, I C CONT AUTOMAT ROB, P1206, DOI 10.1109/ICARCV.2010.5707326
[2]  
Al-Zaydi Zeyad, 2018, International Journal of Machine Learning and Computing, V8, P268, DOI 10.18178/ijmlc.2018.8.3.698
[3]  
[Anonymous], 2007, P IEEE CVPR
[4]  
[Anonymous], 2004, P 2004WORKSHOP STAT
[5]  
[Anonymous], 2001, P 2001 IEEE COMP VIS
[6]  
Devi Y. R., 2018, INT J MACH LEARN COM, V8, P268
[7]  
Farquhar J.D. H., 2005, Improving "bag-of-keypoints" image categorisation: Generative models and pdf-kernels
[8]  
Fergus R, 2005, IEEE I CONF COMP VIS, P1816
[9]  
Griffin G., 2007, CALTECH 256 OBJECT A
[10]   Hierarchical spatial pyramid max pooling based on SIFT features and sparse coding for image classification [J].
Han, Hong ;
Han, Qiqiang ;
Li, Xiaojun ;
Gu, Jianyin .
IET COMPUTER VISION, 2013, 7 (02) :144-150