A survey of document image classification: problem statement, classifier architecture and performance evaluation

Cited by: 99
Authors
Chen, Nawei [1]
Blostein, Dorothea [1]
Affiliations
[1] Queens Univ, Sch Comp, Kingston, ON K7L 3N6, Canada
Keywords
document image classification; document classifiers; document classification; document categorization; document features; feature representations; class models; classification algorithms; learning mechanisms; performance evaluation
DOI
10.1007/s10032-006-0020-2
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Document image classification is an important step in Office Automation, Digital Libraries, and other document image analysis applications. There is great diversity in document image classifiers: they differ in the problems they solve, in the use of training data to construct class models, and in the choice of document features and classification algorithms. We survey this diverse literature using three components: the problem statement, the classifier architecture, and performance evaluation. This brings to light important issues in designing a document classifier, including the definition of document classes, the choice of document features and feature representation, and the choice of classification algorithm and learning mechanism. We emphasize techniques that classify single-page typeset document images without using OCR results. Developing a general, adaptable, high-performance classifier is challenging due to the great variety of documents, the diverse criteria used to define document classes, and the ambiguity that arises due to ill-defined or fuzzy document classes.
Pages: 1-16
Page count: 16