Mathematical Morphology Based Image Segmentation and Character String Extraction Using Fuzzy Inference

被引:0
作者
Chen, Jianjun [1 ]
Takagi, Noboru [1 ]
机构
[1] Toyama Prefectural Univ, Dept Intelligent Syst Design Engn, Imizu, Toyama 9390398, Japan
关键词
fuzzy inference; homogeneous region; natural scene image; text extraction; visually impaired people;
D O I
10.20965/jaciii.2015.p0544
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Signs are ubiquitous indoors and outdoors, and they are often used for finding public places and other locations. However, information on signs is inaccessible to many visually impaired people, unless represented non-visually such as with Braille, tactile graphics, or speech. Automatically reading text from signs in natural scene images is a vital application for assisting visually impaired people. However, finding text in scene images is a great challenge because it cannot be assumed that the acquired image contains only characters. Natural scene images usually contain diverse text in different sizes, styles, fonts, and colors, and complex backgrounds. Therefore, we turn to the development of a portable camera-based assistive system to aid visually impaired people reading text from natural scenery. In this paper, a new method for character string extraction from scene images is discussed. The algorithm is implemented and evaluated using a set of natural scene images. Accuracy, precision, and recall rates of the proposed method are calculated and analyzed to determine success and limitations. Recommendations for improvements are given based on the results.
引用
收藏
页码:544 / 554
页数:11
相关论文
共 23 条
[1]  
Ashida K., 2005, Transactions of the Institute of Electronics, Information and Communication Engineers D-II, VJ88-D-II, P1817
[3]   Extraction of special effects caption text events from digital video [J].
David Crandall ;
Sameer Antani ;
Rangachar Kasturi .
International Journal on Document Analysis and Recognition, 2003, 5 (2) :138-157
[4]  
Dorini L. B., 2007, P 8 INT S MATH MORPH, P101
[5]  
Duda R., 2001, PATTERN CLASSIFICATI
[6]   Improved text-detection methods for a camera-based text reading system for blind persons [J].
Ezaki, N ;
Kiyota, K ;
Minh, BT ;
Bulacu, M ;
Schomaker, L .
EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, :257-261
[7]  
Fabrizio J., 2009, P 16 IEEE INT C IM P, P2349
[8]  
Foong OM, 2011, INT PROC COMPUT SCI, V5, P488
[9]  
Gabbouj M., 1990, Communication, Control and Signal Processing. Proceedings of the 1990 Bilkent International Conference on New Trends in Communication, Control and Signal Processing, P1080
[10]  
Hanif S. M., 2007, P C WORKSH ASS TECHN