Offline handwritten mathematical expression recognition based on YOLOv5s

被引:0
作者
Fei Li
Hongbo Fang
Dengzhun Wang
Ruixin Liu
Qing Hou
Benliang Xie
机构
[1] Guizhou University,College of Big Data and Information Engineering
[2] Power Semiconductor Device Reliability Engineering Center of the Ministry of Education,undefined
[3] Guizhou Communication Industry Service Co.,undefined
[4] Ltd,undefined
来源
The Visual Computer | 2024年 / 40卷
关键词
Offline handwritten mathematical expression recognition; Spatial attention mechanism; Bidirectional long short-term memory network; Clustering; Symbolic relation tree;
D O I
暂无
中图分类号
学科分类号
摘要
The error accumulation in traditional offline handwritten mathematical expression recognition (OHMER) becomes challenging, because of the two-dimensional structure and writing arbitrariness of offline handwritten mathematical formulas. In this study, an OHMER method based on YOLOv5s was proposed. First, YOLOv5s was used to recognize the symbol category and spatial location information of the expression image. Second, the spatial attention mechanism was introduced in YOLOv5s to enlarge the difference among symbol categories and improve accuracy. Then, a bidirectional long short-term memory network (BiLSTM) was introduced to give the symbols context-related information. Finally, the contextual relevance of the symbols was improved by increasing the number of BiLSTM layers, achieving an accuracy of 95.67%. A mathematical expressions relationship tree was built using the symbol recognition results. Clustering theory was used to analyze the two-dimensional structure of expressions. The recognition accuracy of expressions on the CROHME 2019 Test was 65.47%. The recognition rate of YOLOv5s_SB3CT is second only to that of PAL. However, the recognition rate of YOLOv5_SB3CT is higher than that of PAL when the error is less than three. This finding demonstrates that the proposed model is more fault-tolerant and stable than other models.
引用
收藏
页码:1439 / 1452
页数:13
相关论文
共 52 条
  • [1] Yang C(2022)Tree-based data augmentation and mutual learning for offline handwritten mathematical expression recognition Pattern Recognit. 132 108910-1467
  • [2] Du J(2021)Development of instructional videos for the principles of 3D computer animation J. Phys.: Conf. Ser. 1737 012022-147
  • [3] Zhang JS(2002)Recognizing mathematical expressions using tree transformation IEEE Trans. Pattern. Anal. Mach. Intell. 24 1455-848
  • [4] Wu CJ(2016)An integrated grammar-based approach for mathematical expression recognition Pattern Recognit. 51 135-233
  • [5] Chen MJ(2015)Matching based ground-truth annotation for online handwritten mathematical expressions Pattern Recognit. 48 837-10
  • [6] Wu JJ(2019)Track, attend, and parse (tap): an end-to-end framework for online handwritten mathematical expression recognition IEEE Trans. Multimedia. 21 221-3265
  • [7] Pambudi S(2021)A robust and fast multispectral pedestrian detection deep network Knowl Based Syst. 227 106990-762
  • [8] Hidayatulloh I(2021)Document structure model for survey generation using neural network Front. Comput. Sci. 15 1-190
  • [9] Surjono HD(2022)A general multi-scale image classification based on shared conversion matrix routing Appl. Intell. 52 3249-508
  • [10] Sukardiyono T(2022)Contour-aware semantic segmentation network with spatial attention mechanism for medical image Vis. Comput. 38 749-144