A New Mixed Binarization Method Used in a Real Time Application of Automatic Business Document and Postal Mail Sorting

被引:0
作者
Gaceb, Djamel [1 ]
Eglin, Veronique [1 ]
Lebourgeois, Frank [1 ]
机构
[1] Inst Natl Sci Appl, LIRIS Lab, Lyon, France
关键词
Binarization; text zones location; real time processing; automatic sorting of company documents; mail; LOCALIZATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The binarization is applied in the first stage of segmentation process and has a very strong impact on the performances of the system of the automatic sorting of company documents and mail. We present in the beginning of this paper a complete study of the different existing binarization mechanisms that are developed to meet the needs of specific applications. These conventional approaches, present weaknesses that it is crucial to overcome and unfortunately they remain unsuitable for our real time application. The separation between the thresholding and the text zones location stages considerably increase the computation time and lead to an over-segmentation of the noise and of the paper texture on empty zones of the image. Indeed, none of the traditional methods (whether global or local) efficiently meets all the required conditions. We have managed to optimize this stage by applying a local threshold only near the text zones that can be located by the cumulated gradients method with the multi-resolution and mathematical morphology. We demonstrate the consistent performance of the proposed method on several types of business documents and mail with wide-ranging content and image quality.
引用
收藏
页码:179 / 188
页数:10
相关论文
共 41 条
  • [1] [Anonymous], 1998, MED IMAGE ANAL
  • [2] [Anonymous], 2006, 2006 9 INT C CONTR A
  • [3] Babaguchi N., 1990, Proceedings. 10th International Conference on Pattern Recognition (Cat. No.90CH2898-5), P51, DOI 10.1109/ICPR.1990.119329
  • [4] BADEKAS E, 2003, P 3 INT S IM SIGN PR, V2, P909
  • [5] Automatic Multilevel Thresholding Based on a Fuzzy Entropy Measure
    Bruzzese, D.
    Giani, U.
    [J]. CLASSIFICATION AND MULTIVARIATE ANALYSIS FOR COMPLEX DATA STRUCTURES, 2011, : 125 - 133
  • [6] A two-stage binarization approach for document images
    Chi, Z
    Wong, KW
    [J]. PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 275 - 278
  • [7] CHIGUSA Y, 1992, 1992 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, P2292, DOI 10.1109/ISCAS.1992.230501
  • [8] Couto P, 2010, LECT NOTES ARTIF INT, V6098, P341, DOI 10.1007/978-3-642-13033-5_35
  • [9] Nonextensive entropic image thresholding
    Esquef, I
    Albuquerque, M
    Abuquerque, M
    [J]. SIBGRAPI 2002: XV BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, PROCEEDINGS, 2002, : 402 - 402
  • [10] Adaptive binarization method for document image analysis
    Feng, ML
    Tan, YP
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 339 - 342