A tree-based model with branch parallel decoding for handwritten mathematical expression recognition

Authors
Li, Zhe [1]
Yang, Wentao [1]
Qi, Hengnian [2]
Jin, Lianwen [1]
Huang, Yichao [3]
Ding, Kai [3]
Affiliations
[1] South China University of Technology, No. 381, Wushan Road, Tianhe District, Guangzhou, 510641, China
[2] Huzhou University, No. 759, Erhuandong Road, Wuxing District, Huzhou, 313000, China
[3] IntSig Information Co., No. 1268, Wanrong Road, Jing'an District, Shanghai, 200040, China
Funding
National Natural Science Foundation of China
Keywords
Benchmarking - Character recognition - Query processing - Trees (mathematics)
DOI
Not available
Abstract
Handwritten mathematical expression recognition (HMER) is a challenging task in computer vision because mathematical expressions (MEs) have a complex two-dimensional spatial structure and are written in diverse handwriting styles. The recent mainstream approach treats MEs as objects with tree structures, modeled by sequence decoders or tree decoders. These decoders recognize the symbols in an ME, and the relationships between them, in depth-first order. This results in long decoding sequences that can harm performance, particularly for MEs with complex structures. In this paper, we propose a novel tree-based model with branch parallel decoding for HMER, which parses the structure of an ME tree by explicitly predicting the relationships between symbols. In addition, a query constructing module is proposed to help the decoder decode the branches of an ME tree in parallel, thus reducing the number of decoding time steps and alleviating the problem of long-sequence attention decoding. As a result, our model outperforms existing models on three widely used benchmarks and demonstrates significant improvements in HMER performance. © 2023 Elsevier Ltd
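The effect of parallel branch decoding on the number of time steps can be illustrated with a toy count. The sketch below is not the paper's model. It assumes an idealized decoder that emits one symbol per step in depth-first order, and compares it with an idealized decoder that advances every open branch by one symbol per step. The `Node` class, the relation labels, and both function names are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One symbol in a toy ME tree (hypothetical structure, not from the paper)."""
    symbol: str
    # Each child is a (relation, Node) pair, e.g. ("sup", Node("2")).
    children: list = field(default_factory=list)


def depth_first_steps(node: Node) -> int:
    """Steps for an idealized decoder that emits one symbol per step."""
    return 1 + sum(depth_first_steps(child) for _, child in node.children)


def branch_parallel_steps(node: Node) -> int:
    """Steps for an idealized decoder that advances all branches at once.

    Under this assumption the step count equals the number of symbols
    on the longest root-to-leaf path.
    """
    return 1 + max(
        (branch_parallel_steps(child) for _, child in node.children),
        default=0,
    )


# Toy tree for x^2 + \frac{a}{b}, using assumed relation labels.
frac = Node("\\frac", [("above", Node("a")), ("below", Node("b"))])
plus = Node("+", [("right", frac)])
tree = Node("x", [("sup", Node("2")), ("right", plus)])

print(depth_first_steps(tree))      # 6 symbols, one per step
print(branch_parallel_steps(tree))  # 4 steps along the longest path
```

In this toy case the gap is small, but it widens as an expression gains more branches, which matches the abstract's claim that the benefit is largest for MEs with complex structures.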