Offline handwritten mathematical expression recognition based on YOLOv5s

被引:2
|
作者
Li, Fei [1 ,2 ]
Fang, Hongbo [1 ]
Wang, Dengzhun [1 ]
Liu, Ruixin [1 ]
Hou, Qing [3 ]
Xie, Benliang [1 ,2 ]
机构
[1] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
[2] Minist Educ, Power Semicond Device Reliabil Engn Ctr, Guiyang 550025, Peoples R China
[3] Guizhou Commun Ind Serv Co Ltd, Guiyang 550002, Peoples R China
来源
VISUAL COMPUTER | 2024年 / 40卷 / 03期
基金
中国国家自然科学基金;
关键词
Offline handwritten mathematical expression recognition; Spatial attention mechanism; Bidirectional long short-term memory network; Clustering; Symbolic relation tree; COMPETITION;
D O I
10.1007/s00371-023-02859-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The error accumulation in traditional offline handwritten mathematical expression recognition (OHMER) becomes challenging, because of the two-dimensional structure and writing arbitrariness of offline handwritten mathematical formulas. In this study, an OHMER method based on YOLOv5s was proposed. First, YOLOv5s was used to recognize the symbol category and spatial location information of the expression image. Second, the spatial attention mechanism was introduced in YOLOv5s to enlarge the difference among symbol categories and improve accuracy. Then, a bidirectional long short-term memory network (BiLSTM) was introduced to give the symbols context-related information. Finally, the contextual relevance of the symbols was improved by increasing the number of BiLSTM layers, achieving an accuracy of 95.67%. A mathematical expressions relationship tree was built using the symbol recognition results. Clustering theory was used to analyze the two-dimensional structure of expressions. The recognition accuracy of expressions on the CROHME 2019 Test was 65.47%. The recognition rate of YOLOv5s_SB3CT is second only to that of PAL. However, the recognition rate of YOLOv5_SB3CT is higher than that of PAL when the error is less than three. This finding demonstrates that the proposed model is more fault-tolerant and stable than other models.
引用
收藏
页码:1439 / 1452
页数:14
相关论文
empty
未找到相关数据