Segmentation of Meaningful Text-Regions from Camera Captured Document Images

被引:0
|
作者
Dutta, Arpita [1 ]
Garai, Arpan [1 ]
Biswas, Samit [1 ]
机构
[1] Indian Inst Engn Sci & Technol, Comp Sci & Technol, Sibpur 711103, Howrah, India
来源
PROCEEDINGS OF 2018 FIFTH INTERNATIONAL CONFERENCE ON EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT) | 2018年
关键词
Border noise; projection profile; connected component; REMOVAL;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the era of digitization, digitizing books, magazines, journals etc. are very much beneficial for humans. The scanning of a thick document and warped pages in the poor luminance cause long dark region alongside the margin. Imperfect thresholding during binarization also causes noise blocks. This paper proposes an approach for the segmentation of meaningful regions from camera captured document images based on projection profile and connected component analysis. The approach is tested on Tobacco-800 and Born-digital Warped Bangla Document Image Dataset (BdWBDID); The result is encouraging.
引用
收藏
页数:4
相关论文
共 4 条
  • [1] Rectification of Camera Captured Document Images using Component Analysis
    Banerjee, Debanshu
    Bhowal, Pratik
    Bera, Suman Kumar
    Sarkar, Ram
    2020 IEEE CALCUTTA CONFERENCE (CALCON), 2020, : 421 - 425
  • [2] Extraction of Text Regions from Complex Background in Document Images by Multilevel Clustering
    Hoai Nam Vu
    Tuan Anh Tran
    Seop, Na In
    Kim, Soo Hyung
    INTERNATIONAL JOURNAL OF NETWORKED AND DISTRIBUTED COMPUTING, 2016, 4 (01) : 11 - 21
  • [3] Automatic Extraction of Text Regions from Document Images by Multilevel Thresholding and K-means Clustering
    Hoai Nam Vu
    Tuan Anh Tran
    Na, In Seop
    Kim, Soo Hyung
    2015 IEEE/ACIS 14TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2015, : 329 - 334
  • [4] Edge Based Segmentation Approach to Extract Text from Scene Images
    Kumuda, T.
    Basavaraj, L.
    2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 706 - 710