Transportation Object Detection with Bag of Visual Words Model by PLSA and MLP

被引:3
作者
Song, Hyun Chul [1 ]
Choi, Kwang Nam [1 ]
机构
[1] Chung Ang Univ, Sch Comp Sci & Engn, Chung Ang, South Korea
基金
新加坡国家研究基金会;
关键词
Transportation detection; Bag of visual words; Multi-layer perceptron; Probabilistic latent semantic analysis; Scale-invariant feature transform; FRAMEWORK; FEATURES;
D O I
10.1007/s11036-018-1075-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Visual big data is an essential and significant research topic, due to its diverse applications. In this paper, a new visual detection method for transportation is proposed based on probabilistic latent semantic analysis with visual data. We detect the distinctiveness by integrating three steps as follows: first, representing the co-ocurrence matrix of images, which were vectorized using the bag of visual words (BoVW) framework; then calculating the histograms of the visual words of each class; and finally applying the test images as the visual words. A multilayer perceptron (MLP) is used as the classification method in our system. The visual words are extracted by sampling the patches from the current image. A new topology of the neural network for the BoVW model is proposed, and management of the learning rate by reducing at specific iterations is exploited. The Probabilistic latent semantic analysis (PLSA) is compared to the MLP using the Caltech 256 datasets. The classes used include cars, motorbikes, and horses. The results of the experiment show that the MLP outperforms current methods in predicting transportation objects, and properly approximates the transportation detection function with extracted local features. It shows that the proposed method yields about 4.4% higher accuracy than the conventional PLSA for all classes.
引用
收藏
页码:1103 / 1110
页数:8
相关论文
共 33 条
[1]  
[Anonymous], P IEEE C COMP VIS
[2]  
[Anonymous], P IEEE C COMP VIS PA
[3]  
[Anonymous], 2004, P 2004WORKSHOP STAT
[4]   SURF: Speeded up robust features [J].
Bay, Herbert ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 :404-417
[5]   Social big data: Recent achievements and new challenges [J].
Bello-Orgaz, Gema ;
Jung, Jason J. ;
Camacho, David .
INFORMATION FUSION, 2016, 28 :45-59
[6]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[7]  
Bosch A, 2006, LECT NOTES COMPUT SC, V3954, P517
[8]   Internet of agents framework for connected vehicles: A case study on distributed traffic control system [J].
Bui, Khac-Hoai Nam ;
Jung, Jason J. .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 116 :89-95
[9]   Extraction of Sparse Features of Color Images in Recognizing Objects [J].
Bui, T. T. Quyen ;
Vu, Thang T. ;
Hong, Keum-Shik .
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2016, 14 (02) :616-627
[10]   Object Motion Tracking using a Moving Direction Estimate and Color Updates [J].
Chang, Samuel Henry ;
Shim, Duk-Sun ;
Kim, Hee-Young ;
Choi, Kwang-Nam .
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2012, 10 (01) :136-142