Architecture proposal for data extraction of chart images using Convolutional Neural Network

被引:10
作者
Chagas, Paulo [1 ]
Freitas, Alexandre [1 ]
Daisuke, Rafael [1 ]
Miranda, Brunelli [1 ]
Araujo, Tiago [1 ]
Santos, Carlos [1 ]
Meiguins, Bianchi [1 ]
Morais, Jefferson [1 ]
机构
[1] Univ Fed Para, Belem, Para, Brazil
来源
2017 21ST INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV) | 2017年
关键词
chart image classification; data extraction; convolutional neural network; CLASSIFICATION;
D O I
10.1109/iV.2017.37
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Different information visualization techniques can be found in the literature due to the quantity and variety of data stored in computational systems. In this context, the classification of chart images becomes important because it allows various types of graphs to be detected automatically in different contexts, allowing a more specific processing for each type of visualization, for example, data extraction. Several techniques of image classification can be used, where the most common are based on the extraction of features of the images, and a later classification using these features. However, one technique that has been gaining prominence in the context of image classification is the Convolutional Neural Network (CNN). This technique is based on deep learning and, in a way, encapsulates the feature extraction process. In this way, the proposal of this article is to use an architecture of a client-server based model to do the chart image classification and later data extraction from this image. The main advantage is doing the CNN processing on the server side, so the application does not rely on client device limitations. For this, an image dataset was generated from the web, and it has ten classes of graphs. From the experiments done, it was seen that the use of this technique was feasible, and modifications in the architecture can be made as a proposal to improve the accuracy of the model.
引用
收藏
页码:318 / 323
页数:6
相关论文
共 21 条
[1]  
[Anonymous], ARXIV160207261
[2]  
[Anonymous], 2016, P IEEE INT S CIRC SY
[3]  
[Anonymous], 2016, P IEEE C COMP VIS PA
[4]  
[Anonymous], 2015, Recent Advances in Convolutional Neural Networks
[5]  
[Anonymous], 2015, P 8 INT C KNOWL CAPT
[6]   Graphical figure classification using data fusion for integrating text and image features [J].
Cheng, Beibei ;
Stanley, R. Joe ;
Antani, Sameer ;
Thoma, George R. .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :693-697
[7]  
Gao JL, 2010, ASSETS 2010: PROCEEDINGS OF THE 12TH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, P265
[8]  
Geral Francisco Jr, 2014, P INT C DAT AN
[9]   Visual-Based Classification of Figures from Scientific Literature [J].
Giannakopoulos, Theodoros ;
Foufoulas, Ioannis ;
Stamatogiannakis, Eleftherios ;
Dimitropoulos, Harry ;
Manola, Natalia ;
Ioannidis, Yannis .
WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, :1059-1060
[10]  
Huang Weihua, 2007, APPL COMP VIS WACV 0