A Review of Document Binarization: Main Techniques, New Challenges, and Trends

Cited by: 1
Authors
Yang, Zhengxian [1]
Zuo, Shikai [1]
Zhou, Yanxi [1]
He, Jinlong [1]
Shi, Jianwen [1]
Affiliations
[1] Xiamen Univ Technol, Sch Optoelect & Commun Engn, Dept Microelect, Xiamen 361024, Peoples R China
Keywords
degraded document images; binarization; threshold processing; deep learning; THRESHOLD SELECTION METHOD; IMAGE BINARIZATION; NETWORK; COMBINATION; ALGORITHM; TEXT
DOI
10.3390/electronics13071394
CLC number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Document image binarization is a challenging task, especially for text segmentation in degraded document images. As a pre-processing step for Optical Character Recognition (OCR), binarization is one of the most fundamental and commonly used segmentation methods. It separates the foreground text from the background of the document image to facilitate subsequent image processing. Because document images vary in their degree of degradation, researchers have proposed a variety of solutions. In this paper, we summarize some of the challenges and difficulties in the field of document image binarization. Approximately 60 document image binarization methods are mentioned, including traditional algorithms and deep learning-based algorithms. We also evaluate the performance of 25 image binarization techniques on the H-DIBCO2016 dataset as a reference for future research.
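As a rough illustration of the threshold processing named in the keywords, the sketch below separates dark text from a light background with a single global threshold chosen by maximizing between-class variance (Otsu's criterion). It is not taken from the paper: the function names `otsu_threshold` and `binarize`, the 8-bit grayscale list-of-rows input, and the assumption that text is darker than the background are all illustrative choices.

```python
def otsu_threshold(pixels):
    """Pick a global threshold t (0..255) that maximizes between-class variance.

    Assumes `pixels` is a flat list of 8-bit grayscale integers.
    Values <= t form one class, values > t the other.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_score = 0, -1.0
    w0 = 0      # pixel count of the class at or below the threshold
    sum0 = 0    # intensity sum of that class
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue          # no pixels in the lower class yet
        w1 = total - w0
        if w1 == 0:
            break             # every pixel is already in the lower class
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        # Proportional to the between-class variance (constant factor dropped).
        score = w0 * w1 * (mean0 - mean1) ** 2
        if score > best_score:
            best_score, best_t = score, t
    return best_t


def binarize(image):
    """Return a mask with 1 for foreground (assumed dark text) and 0 for background.

    Assumes `image` is a list of rows of 8-bit grayscale integers.
    """
    flat = [p for row in image for p in row]
    t = otsu_threshold(flat)
    return [[1 if p <= t else 0 for p in row] for row in image]


# Tiny synthetic example: three dark "ink" pixels on a bright page.
sample = [
    [200, 210, 30],
    [205, 20, 215],
    [25, 220, 210],
]
mask = binarize(sample)
```

A single global threshold of this kind tends to struggle with uneven illumination, stains, or bleed-through, which is the sort of degradation that motivates the locally adaptive and learning-based methods the abstract refers to.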
Pages: 25
Related papers
  • [1] Degraded Historical Document Binarization: A Review on Issues, Challenges, Techniques, and Future Directions
    Sulaiman, Alaa
    Omar, Khairuddin
    Nasrudin, Mohammad F.
    JOURNAL OF IMAGING, 2019, 5 (04)
  • [2] Binarization Techniques for Degraded Document Images - A Review
    Jyotsna
    Chauhan, Shivani
    Sharma, Ekta
    Doegar, Amit
    2016 5TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO), 2016, : 163 - 166
  • [3] Assessing Binarization Techniques for Document Images
    Lins, Rafael Dueire
    de Almeida, Marcos Martins
    Bernardino, Rodrigo Barros
    Jesus, Darlisson
    Oliveira, Jose Mario
    PROCEEDINGS OF THE 2017 ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG 17), 2017, : 183 - 192
  • [4] Historical Document Image Binarization: A Review
    Tensmeyer C.
    Martinez T.
    SN Computer Science, 2020, 1 (3)
  • [5] ESTIMATION OF APPROPRIATE PARAMETER VALUES FOR DOCUMENT BINARIZATION TECHNIQUES
    Badekas, E.
    Papamarkos, N.
    INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2009, 24 (01) : 66 - 78
  • [6] A new binarization method for degraded document images
    Rani U.
    Kaur A.
    Josan G.
    International Journal of Information Technology, 2023, 15 (2) : 1035 - 1053
  • [7] A review of digital watermarking techniques: Current trends, challenges and opportunities
    Singh, Balkar
    Kasana, Geeta
    WEB INTELLIGENCE, 2024, 22 (04)
  • [8] Deep Learning-Based Watermarking Techniques Challenges: A Review of Current and Future Trends
    Ben Jabra, Saoussen
    Ben Farah, Mohamed
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (7) : 4339 - 4368
  • [9] A systematic literature review on sentiment analysis techniques, challenges, and future trends
    Hafiz Muhammad Usman Ali
    Qaisar Farooq
    Azhar Imran
    Khalil El Hindi
    Knowledge and Information Systems, 2025, 67 (5) : 3967 - 4034
  • [10] A new efficient binarization method: application to degraded historical document images
    Hadjadj, Zineb
    Cheriet, Mohamed
    Meziane, Abdelkrim
    Cherfa, Yazid
    SIGNAL IMAGE AND VIDEO PROCESSING, 2017, 11 (06) : 1155 - 1162