Logical entity recognition in multi-style document page images

被引:0
作者
Mao, Song [1 ]
Xu, Zheng [2 ]
Tjahjadi, Tardi [2 ]
Thoma, George R. [1 ]
机构
[1] US Natl Lib Med, Bethesda, MD 20894 USA
[2] Univ Warwick, Sch Engn, Coventry CV4 7AL, W Midlands, England
来源
18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS | 2006年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Logical entity recognition in document page images is the essential part of a document image analysis system. A heterogeneous collection of document pages usually has many layout styles. Features extracted from same logical entities in different styles may have very different values and vice versa. Therefore, logical entity classifiers learned from a training set of multi-style document pages may not be reliable due to possible feature overlap of different logical entities in different styles. In this paper, we propose a novel method in which style information is used in both logical entity classifier training and recognition phases. In the training phase, training data are first classified into distinct styles, and a dedicated Support Vector Machine (SVM) is then learned for each style. In the recognition phase, the style of a new document page image is first identified and its logical entities are then recognized using corresponding SVM. We show in our experiments that the use of the style information significantly improves the accuracy of logical entity recognition in multi-style document page images.
引用
收藏
页码:876 / +
页数:2
相关论文
共 13 条
  • [1] A tutorial on v-support vector machines
    Chen, PH
    Lin, CJ
    Schölkopf, B
    [J]. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2005, 21 (02) : 111 - 136
  • [2] Conway A., 1993, Proceedings of the Second International Conference on Document Analysis and Recognition (Cat. No.93TH0578-5), P761, DOI 10.1109/ICDAR.1993.395626
  • [3] Dengel A, 1996, INT J IMAG SYST TECH, V7, P271, DOI 10.1002/(SICI)1098-1098(199624)7:4<271::AID-IMA2>3.0.CO
  • [4] 2-5
  • [5] Diligenti M, 2003, IEEE T PATTERN ANAL, V25, P519, DOI 10.1109/TPAMI.2003.1190578
  • [6] Kaufman L., 1990, FINDING GROUPS DATA
  • [7] Kim J, 2001, PROC SPIE, V4307, P111
  • [8] SYNTACTIC SEGMENTATION AND LABELING OF DIGITIZED PAGES FROM TECHNICAL JOURNALS
    KRISHNAMOORTHY, M
    NAGY, G
    SETH, S
    VISWANATHAN, M
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (07) : 737 - 747
  • [9] MAO S, 2005, P IEEE INT C IM PROC, V2, P510
  • [10] Niyogi D., 1995, Proceedings of the Third International Conference on Document Analysis and Recognition, P472, DOI 10.1109/ICDAR.1995.599038