CNN based spatial classification features for clustering offline handwritten mathematical expressions

被引:26
作者
Cuong Tuan Nguyen [1 ]
Vu Tran Minh Khuong [1 ]
Hung Tuan Nguyen [1 ]
Nakagawa, Masaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Dept Comp & Informat Sci, 2-24-16 Naka Cho, Koganei, Tokyo 1848588, Japan
关键词
Clustering images; Offline handwritten; Mathematical expression; CNN; Weakly supervised learning;
D O I
10.1016/j.patrec.2019.12.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To help human markers mark a large number of answers of handwritten mathematical expressions (HMEs), clustering them makes marking more efficient and reliable. Clustering HMEs, however, faces the problem of extracting both localization and classification representation of mathematical symbols for an HME image and defining the distance between two HME images. First, we propose a method based on Convolutional Neural Networks (CNN) to extract the representations for an HME. Symbols in various scales are located and classified by a combination of features from a multi-scale CNN. We use weakly supervised training combined with symbols attention to enhance localization and classification predictions. Second, we propose a multi-level spatial distance between two representations for clustering HMEs. Experiments on CROHME 2016 and CROHME 2019 dataset show the promising results of 0.99 and 0.96 in purity, respectively. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:113 / 120
页数:8
相关论文
共 20 条
[1]  
[Anonymous], 15 IAPR INT C DOC AN
[2]  
[Anonymous], 2004, A blueprint for computer assisted assessment, DOI DOI 10.4324/9780203464687
[3]  
[Anonymous], 2016, COGSCI
[4]  
[Anonymous], IEEE COMP SOC C COMP
[5]  
Basu S., 2013, Transactions of the ACL
[6]  
Brown G.T.L., 1997, Assessing Student Learning in Higher Education
[7]  
Caron Mathilde, 2018, Deep clustering for unsupervised learning of visual features
[8]  
Chang JL, 2017, IEEE I CONF COMP VIS, P5880, DOI [10.1109/ICCV.2017.626, 10.1109/ICCV.2017.627]
[9]  
He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]
[10]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507