Online flowchart understanding by combining max-margin Markov random field with grammatical analysis

被引:0
作者
Chengcheng Wang
Harold Mouchère
Aurélie Lemaitre
Christian Viard-Gaudin
机构
[1] Microsoft (China) Co. Ltd.,
[2] UBL/University of Nantes/LS2N,undefined
[3] IRISA - Université de Rennes 2,undefined
来源
International Journal on Document Analysis and Recognition (IJDAR) | 2017年 / 20卷
关键词
Random Forest; Markov Random Field; Conditional Random Field; Handwriting Recognition; Markov Random Field Model;
D O I
暂无
中图分类号
学科分类号
摘要
Flowcharts are considered in this work as a specific 2D handwritten language where the basic strokes are the terminal symbols of a graphical language governed by a 2D grammar. In this way, they can be regarded as structured objects, and we propose to use a MRF to model them, and to allow assigning a label to each of the strokes. We use structured SVM as learning algorithm, maximizing the margin between true labels and incorrect labels. The model would automatically learn the implicit grammatical information encoded among strokes, which greatly improves the stroke labeling accuracy compared to previous researches that incorporated human prior knowledge of flowchart structure. We further complete the recognition by using grammatical analysis, which finally brings coherence to the whole flowchart recognition by labeling the relations between the detected objects.
引用
收藏
页码:123 / 136
页数:13
相关论文
共 30 条
[1]  
Awal A-M(2014)A global learning approach for an online handwritten mathematical expression recognition system Pattern Recogn. Lett. 35 68-77
[2]  
Mouchère H(2001)Random forests Mach. Learn. 45 5-32
[3]  
Viard-Gaudin C(2000)Mathematical expression recognition: a survey Int. J. Doc. Anal. Recogn. 3 3-15
[4]  
Breiman L(2006)DMOS, a generic document recognition method: application to table structure analysis in a general and in a specific way Int. J. Doc. Anal. Recogn. IJDAR 8 111-122
[5]  
Chan K-F(2014)Contextual text/non-text stroke classification in online handwritten notes with conditional random fields Pattern Recogn. 47 959-968
[6]  
Yeung D-Y(2014)Multi-class segmentation of free-form online documents with tree conditional random fields Int. J. Doc. Anal. Recogn. IJDAR 17 313-329
[7]  
Coüasnon B(2009)Cutting-plane training of structural svms Mach. Learn. 77 27-59
[8]  
Delaye A(2014)PyStruct: learning structured prediction in python J. Mach. Learn. Res. 15 2055-2060
[9]  
Liu C-L(2011)Structured learning and prediction in computer vision Found. Trends Comput. Graph. Vis. 6 185-365
[10]  
Delaye A(2014)Efficient semantic image segmentation with multi-class ranking prior Comput. Vis. Image Underst. 120 81-90