A Review of Document Binarization: Main Techniques, New Challenges, and Trends

被引:1
作者
Yang, Zhengxian [1 ]
Zuo, Shikai [1 ]
Zhou, Yanxi [1 ]
He, Jinlong [1 ]
Shi, Jianwen [1 ]
机构
[1] Xiamen Univ Technol, Sch Optoelect & Commun Engn, Dept Microelect, Xiamen 361024, Peoples R China
关键词
degraded document images; binarization; threshold processing; deep learning; THRESHOLD SELECTION METHOD; IMAGE BINARIZATION; NETWORK; COMBINATION; ALGORITHM; TEXT;
D O I
10.3390/electronics13071394
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Document image binarization is a challenging task, especially when it comes to text segmentation in degraded document images. The binarization, as a pre-processing step of Optical Character Recognition (OCR), is one of the most fundamental and commonly used segmentation methods. It separates the foreground text from the background of the document image to facilitate subsequent image processing. In view of the different degradation degrees of document images, researchers have proposed a variety of solutions. In this paper, we have summarized some challenges and difficulties in the field of document image binarization. Approximately 60 methods documenting image binarization techniques are mentioned, including traditional algorithms and deep learning-based algorithms. Here, we evaluated the performance of 25 image binarization techniques on the H-DIBCO2016 dataset to provide some help for future research.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] A New Mixed Binarization Method Used in a Real Time Application of Automatic Business Document and Postal Mail Sorting
    Gaceb, Djamel
    Eglin, Veronique
    Lebourgeois, Frank
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2013, 10 (02) : 179 - 188
  • [32] A Review of Deep Learning Techniques in Document Image Word Spotting
    Lalita Kumari
    Anuj Sharma
    Archives of Computational Methods in Engineering, 2022, 29 : 1085 - 1106
  • [33] A review on document image analysis techniques directly in the compressed domain
    Javed, Mohammed
    Nagabhushan, P.
    Chaudhuri, Bidyut B.
    ARTIFICIAL INTELLIGENCE REVIEW, 2018, 50 (04) : 539 - 568
  • [34] Comprehensive review on intelligent security defences in cloud: Taxonomy, security issues, ML/DL techniques, challenges and future trends
    Belal, Mohamad Mulham
    Sundaram, Divya Meena
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 9102 - 9131
  • [35] A survey on deep learning for polyp segmentation: techniques, challenges and future trends
    Jiaxin Mei
    Tao Zhou
    Kaiwen Huang
    Yizhe Zhang
    Yi Zhou
    Ye Wu
    Huazhu Fu
    Visual Intelligence, 2025, 3 (1):
  • [36] A review of wireless channel estimation techniques: challenges and solutions
    Drakshayini M.N.
    Kounte M.R.
    International Journal of Wireless and Mobile Computing, 2022, 23 (02) : 193 - 203
  • [37] A Review on Business Analytics: Definitions, Techniques, Applications and Challenges
    Liu, Shiyu
    Liu, Ou
    Chen, Junyang
    MATHEMATICS, 2023, 11 (04)
  • [38] Hate speech detection in social media: Techniques, recent trends, and future challenges
    Rawat, Anchal
    Kumar, Santosh
    Samant, Surender Singh
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2024, 16 (02)
  • [39] A comprehensive taxonomy on multimedia video forgery detection techniques: challenges and novel trends
    Walid El-Shafai
    Mona A. Fouda
    El-Sayed M. El-Rabaie
    Nariman Abd El-Salam
    Multimedia Tools and Applications, 2024, 83 : 4241 - 4307
  • [40] A comprehensive taxonomy on multimedia video forgery detection techniques: challenges and novel trends
    El-Shafai, Walid
    Fouda, Mona A.
    El-Rabaie, El-Sayed M.
    El-Salam, Nariman Abd
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (2) : 4241 - 4307