Error correction of semantic mathematical expressions based on bayesian algorithm

被引:0
作者
Wang, Xue [1 ,2 ]
Yang, Fang [1 ,2 ]
Liu, Hongyuan [1 ,2 ]
Shi, Qingxuan [1 ,2 ]
机构
[1] Hebei Univ, Sch Cyber Secur & Comp, Baoding 071002, Peoples R China
[2] Hebei Univ, Inst Intelligent Image & Document Informat Proc, Baoding 071002, Peoples R China
关键词
error correction; mathematical expressions; Bayesian algorithm; presentation MathML; content MathML; INFERENCE;
D O I
10.3934/mbe.2022255
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The semantic information of mathematical expressions plays an important role in information retrieval and similarity calculation. However, a large number of presentational expressions in the presentation MathML format contained in electronic scientific documents do not reflect semantic information. It is a shortcut to extract semantic information using the rule mapping method to convert presentational expressions in presentation MathML format into semantic expressions in the content MathML format. However, the conversion result is prone to semantic errors because the expressions in the two formats do not have exact correspondences in grammatical structures and markups. In this study, a Bayesian error correction algorithm is proposed to correct the semantic errors in the conversion results of mathematical expressions based on the rule mapping method. In this study, the expressions in presentation MathML and content MathML in the NTCIR data set are used as the training set to optimize the parameters of the Bayesian model. The expressions in presentation MathML in the documents collected by the laboratory from the CNKI website are used as the test set to test the error correction results. The experimental results show that the average F-1 value is 0.239 with the rule mapping method, and the average F-1 value is 0.881 with the Bayesian error correction method, with the average error correction rate is 0.853.
引用
收藏
页码:5428 / 5445
页数:18
相关论文
共 42 条
  • [1] Toward perfect neural cascading architecture for grammatical error correction
    Acheampong, Kingsley Nketia
    Tian, Wenhong
    [J]. APPLIED INTELLIGENCE, 2021, 51 (06) : 3775 - 3788
  • [2] [蔡川 Cai Chuan], 2012, [计算机应用与软件, Computer Applications and Software], V29, P30
  • [3] Edit Distance for Pushdown Automata
    Chatterjee, Krishnendu
    Henzinger, Thomas A.
    Ibsen-Jensen, Rasmus
    Otop, Jan
    [J]. AUTOMATA, LANGUAGES, AND PROGRAMMING, PT II, 2015, 9135 : 121 - 133
  • [4] Cui Jia-Xu, 2018, Journal of Software, V29, P3068, DOI 10.13328/j.cnki.jos.005607
  • [5] Dhar S., 2019, INT J INNOVATIVE TEC, V8, P234, DOI [10.35940/ijitee.K1298.0981119, DOI 10.35940/IJITEE.K1298.0981119]
  • [6] Doush I.A., 2010, P 1 INT C INTELLIGEN, P1
  • [7] Semantic preserving bijective mappings for expressions involving special functions between computer algebra systems and document preparation systems
    Greiner-Petter, Andre
    Schubotz, Moritz
    Cohl, Howard S.
    Gipp, Bela
    [J]. ASLIB JOURNAL OF INFORMATION MANAGEMENT, 2019, 71 (03) : 415 - 439
  • [8] Grigore M, 2009, AS S COMP MATH MATH
  • [9] Survey of Automatic Spelling Correction
    Hladek, Daniel
    Stas, Jan
    Pleva, Matus
    [J]. ELECTRONICS, 2020, 9 (10) : 1 - 29
  • [10] Jing Y., 2020, INFORM TECHNOLOGY, V44, P143