Deep Learning for Visual Indonesian Place Classification with Convolutional Neural Networks

被引：3

作者：

Chowanda, Andry ^{[1
]}

Sutoyo, Rhio ^{[1
]}

机构：

[1] Bina Nusantara Univ, Sch Comp Sci, Comp Sci Dept, Jakarta 11480, Indonesia

来源：

4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2019) : ENABLING COLLABORATION TO ESCALATE IMPACT OF RESEARCH RESULTS FOR SOCIETY | 2019年 / 157卷

关键词：

Visual Place Classification; Convolutional Neural Networks; Deep Learning; Dataset Collection; RECOGNITION; HELP;

D O I：

10.1016/j.procs.2019.08.236

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Places classification is one of the points of discussion in the computer vision and robotics community. Some renowned techniques such as local-invariant feature extractors (e.g. Scale-invariant feature transform SIFT, Speeded Up Robust Features SURF), as well as Visual BoW approach were used in place classification problems. Nowadays, deep learning methods such as Convolutional Neural Networks (CNNs) have the advantages towards computer vision problems including place classification problem. Albeit, there are several renowned datasets existed to help the community to learn the models, there is no publicly exists in places dataset for specifically places in Indonesia. This paper presents methodology to collect data of visual places in Indonesia, learn deep features from the data, and classify visual places in Indonesia. We aims to contribute a large dataset as well as deep learning models of places in Indonesian. There are more than 16K images collected and augmented to build the places (specifically places in Indonesia) dataset. The highest accuracy score achieved by the models is 92%. (C) 2019 The Authors. Published by Elsevier B.V.

引用

页码：436 / 443

页数：8

共 37 条

[1]

Ancheta Roxanne A., 2018, International Journal of Machine Learning and Computing, V8, P619, DOI 10.18178/ijmlc.2018.8.6.755

[2] SURF: Speeded up robust features [J].

Bay, Herbert ;

Tuytelaars, Tinne ;

Van Gool, Luc .

COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 :404-417

[3]

Bishop C. M., 2006, PATTERN RECOGNITION, DOI DOI 10.1117/1.2819119

[4]

Chowanda, 2018, THESIS

[5] Total recall: Automatic query expansion with a generative feature model for object retrieval [J].

Chum, Ondrej ;

Philbin, James ;

Sivic, Josef ;

Isard, Michael ;

Zisserman, Andrew .

2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, :496-+

[6]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[7] How Does the Brain Solve Visual Object Recognition? [J].

DiCarlo, James J. ;

Zoccolan, Davide ;

Rust, Nicole C. .

NEURON, 2012, 73 (03) :415-434

[8]

Jégou H, 2009, PROC CVPR IEEE, P1169, DOI 10.1109/CVPRW.2009.5206609

[9] Caffe: Convolutional Architecture for Fast Feature Embedding [J].

Jia, Yangqing ;

Shelhamer, Evan ;

Donahue, Jeff ;

Karayev, Sergey ;

Long, Jonathan ;

Girshick, Ross ;

Guadarrama, Sergio ;

Darrell, Trevor .

PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :675-678

[10] Visual Search at Pinterest [J].

Jing, Yushi ;

Liu, David ;

Kislyuk, Dmitry ;

Zhai, Andrew ;

Xu, Jiajing ;

Donahue, Jeff ;

Tavel, Sarah .

KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :1889-1898

← 1 2 3 4 →