Offline handwritten mathematical expression recognition based on YOLOv5s
被引:2
|
作者:
Li, Fei
论文数: 0引用数: 0
h-index: 0
机构:
Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
Minist Educ, Power Semicond Device Reliabil Engn Ctr, Guiyang 550025, Peoples R ChinaGuizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
Li, Fei
[1
,2
]
Fang, Hongbo
论文数: 0引用数: 0
h-index: 0
机构:
Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R ChinaGuizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
Fang, Hongbo
[1
]
Wang, Dengzhun
论文数: 0引用数: 0
h-index: 0
机构:
Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R ChinaGuizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
Wang, Dengzhun
[1
]
Liu, Ruixin
论文数: 0引用数: 0
h-index: 0
机构:
Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R ChinaGuizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
Liu, Ruixin
[1
]
Hou, Qing
论文数: 0引用数: 0
h-index: 0
机构:
Guizhou Commun Ind Serv Co Ltd, Guiyang 550002, Peoples R ChinaGuizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
Hou, Qing
[3
]
Xie, Benliang
论文数: 0引用数: 0
h-index: 0
机构:
Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
Minist Educ, Power Semicond Device Reliabil Engn Ctr, Guiyang 550025, Peoples R ChinaGuizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
Xie, Benliang
[1
,2
]
机构:
[1] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
[2] Minist Educ, Power Semicond Device Reliabil Engn Ctr, Guiyang 550025, Peoples R China
[3] Guizhou Commun Ind Serv Co Ltd, Guiyang 550002, Peoples R China
The error accumulation in traditional offline handwritten mathematical expression recognition (OHMER) becomes challenging, because of the two-dimensional structure and writing arbitrariness of offline handwritten mathematical formulas. In this study, an OHMER method based on YOLOv5s was proposed. First, YOLOv5s was used to recognize the symbol category and spatial location information of the expression image. Second, the spatial attention mechanism was introduced in YOLOv5s to enlarge the difference among symbol categories and improve accuracy. Then, a bidirectional long short-term memory network (BiLSTM) was introduced to give the symbols context-related information. Finally, the contextual relevance of the symbols was improved by increasing the number of BiLSTM layers, achieving an accuracy of 95.67%. A mathematical expressions relationship tree was built using the symbol recognition results. Clustering theory was used to analyze the two-dimensional structure of expressions. The recognition accuracy of expressions on the CROHME 2019 Test was 65.47%. The recognition rate of YOLOv5s_SB3CT is second only to that of PAL. However, the recognition rate of YOLOv5_SB3CT is higher than that of PAL when the error is less than three. This finding demonstrates that the proposed model is more fault-tolerant and stable than other models.