Visual-Based Classification of Figures from Scientific Literature

被引:5
作者
Giannakopoulos, Theodoros [1 ]
Foufoulas, Ioannis [1 ]
Stamatogiannakis, Eleftherios [1 ]
Dimitropoulos, Harry [1 ]
Manola, Natalia [1 ]
Ioannidis, Yannis [1 ]
机构
[1] Univ Athens, Management Data Informat & Knowledge Grp, GR-10679 Athens, Greece
来源
WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB | 2015年
关键词
D O I
10.1145/2740908.2742024
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Authors of scientific publications and books use images to present a wide spectrum of information. Despite the richness of the visual content of scientific publications the figures are usually not taken into consideration in the context of text mining methodologies towards the automatic indexing and retrieval of scientific corpora. In this work, we present a system for automatic categorization of figures from scientific literature to a set of predefined classes. We have employed a wide range of visual features that achieve high discrimination ability between the adopted classes. A real-world dataset has been compiled and annotated in order to train and evaluate the proposed method using three different classification schemata.
引用
收藏
页码:1059 / 1060
页数:2
相关论文
共 10 条
[1]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[2]   USE OF HOUGH TRANSFORMATION TO DETECT LINES AND CURVES IN PICTURES [J].
DUDA, RO ;
HART, PE .
COMMUNICATIONS OF THE ACM, 1972, 15 (01) :11-&
[3]  
Fisher J. L., 1990, Proceedings. 10th International Conference on Pattern Recognition (Cat. No.90CH2898-5), P567, DOI 10.1109/ICPR.1990.118166
[4]   A fast learning algorithm for deep belief nets [J].
Hinton, Geoffrey E. ;
Osindero, Simon ;
Teh, Yee-Whye .
NEURAL COMPUTATION, 2006, 18 (07) :1527-1554
[5]   Automated analysis of images in documents for intelligent document search [J].
Lu, Xiaonan ;
Kataria, Saurabh ;
Brouwer, William J. ;
Wang, James Z. ;
Mitra, Prasenjit ;
Giles, C. Lee .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2009, 12 (02) :65-81
[6]  
Muller H., 2012, SPIE MED IMAGING
[7]   Multiresolution gray-scale and rotation invariant texture classification with local binary patterns [J].
Ojala, T ;
Pietikäinen, M ;
Mäenpää, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (07) :971-987
[8]   An overview of the tesseract OCR engine [J].
Smith, Ray .
ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, :629-633
[9]  
Theodoridis S, 2009, PATTERN RECOGNITION, 4RTH EDITION, P1
[10]   A novel figure panel classification and extraction method for document image understanding [J].
Yuan, Xiaohui ;
Ang, Dongyu .
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2014, 9 (01) :22-36