Separation of Text from Non-Text Doodles of Poet Rabindranath Tagore's Manuscripts

Cited by: 0
Authors
Chaudhuri, B. B. [1]
Borah, Samarjeet [1]
Saraf, Ankita [1]
Goyal, Alisha [1]
Kumari, Alka [1]
Affiliation
[1] Indian Statistical Institute, CVPR Unit, Kolkata 700108, India
Source
2012 NATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION SYSTEMS (NCCCS), 2012
Keywords
Text; Non text Doodles; Rabindranath Tagore; Connected Components; pixels; Stroke Width; EXTRACTION; SEGMENTATION;
DOI
Not available
CLC classification number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
The growing popularity of internet facilities has provided a convenient and fast way to mine repositories of both historical and contemporary handwritten documents, which has led to continuous research and development in information retrieval algorithms. In such handwritten documents, graphics and images are combined with text and often overlap one another. This paper presents a technique for separating textual data from non-textual information. The technique builds on previously published work and is applied to the manuscripts of the poet Rabindranath Tagore. The approach generates connected components as its basic primitives and classifies each one as text or non-text by comparing the total number of pixels in the component with the number of its boundary pixels. A window is then generated, and further separation is performed on the basis of the stroke width computed for each window. The paper also includes a brief review of previously published work.
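The two cues named in the abstract (boundary pixels versus total pixels per connected component, and a per-window stroke width) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no thresholds, no direction for the comparison, and no stroke-width formula. The threshold `RATIO_THRESHOLD`, the rule that thin, boundary-dominated components count as text, the run-length stroke-width estimate, and all function names are assumptions made for illustration only.

```python
from collections import deque
from statistics import median

def connected_components(img):
    """Label 8-connected foreground (1) pixels; return a list of pixel sets."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for r in range(h):
        for c in range(w):
            if img[r][c] != 1 or seen[r][c]:
                continue
            seen[r][c] = True
            queue, comp = deque([(r, c)]), set()
            while queue:
                y, x = queue.popleft()
                comp.add((y, x))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and img[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            comps.append(comp)
    return comps

def boundary_ratio(comp):
    """Fraction of a component's pixels that have a 4-neighbour outside it."""
    def on_boundary(y, x):
        return any((y + dy, x + dx) not in comp
                   for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)))
    return sum(on_boundary(y, x) for y, x in comp) / len(comp)

def stroke_width(img, top, left, size):
    """Assumed estimate: median horizontal foreground run length in a window."""
    runs = []
    for row in img[top:top + size]:
        run = 0
        for v in list(row[left:left + size]) + [0]:  # sentinel closes a trailing run
            if v == 1:
                run += 1
            elif run:
                runs.append(run)
                run = 0
    return median(runs) if runs else 0

RATIO_THRESHOLD = 0.6  # hypothetical value, not taken from the paper

def classify(comp):
    """Assumed rule: boundary-dominated (thin) components are treated as text."""
    return "text" if boundary_ratio(comp) >= RATIO_THRESHOLD else "non-text"

# Toy binary image: a one-pixel-wide stroke and a filled 6x6 blob.
img = [[0] * 12 for _ in range(8)]
for r in range(1, 7):
    img[r][1] = 1
    for c in range(5, 11):
        img[r][c] = 1

labels = [classify(comp) for comp in connected_components(img)]
print(labels)
print(stroke_width(img, 0, 0, 4), stroke_width(img, 0, 4, 8))
```

In this toy image every pixel of the thin stroke lies on its boundary, while the filled blob has interior pixels that lower its ratio, so the two components fall on opposite sides of the assumed threshold. A real manuscript page would first need binarization, and the threshold and window size would have to be tuned on actual data.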
Pages: 165-169
Page count: 5