The influence of language orthographic characteristics on digital word recognition

被引:0
作者
Biller, Ofer [1 ]
El-Sana, Jihad [1 ]
Kedem, Klara [1 ]
机构
[1] Ben Gurion Univ Negev, IL-84105 Beer Sheva, Israel
关键词
LEXICAL ACCESS; FREQUENCY; SIMILARITY; RETRIEVAL;
D O I
10.1093/llc/fqu051
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
This research studies the effect of language orthographic characteristics on the performance of digital word recognition in degraded documents such as historical documents. We provide a rigorous scheme for quantifying the statistical influence of the orthographic characteristics on the quality of word recognition in such documents. We study and compare several orthographic characteristics for four natural languages and measure the effect of each individual characteristic on the digital word recognition process. To this end, we create synthetic languages, for which all characteristics, except the one we examine, are identical, and measure the performance of two word recognition algorithms on synthetic documents of these languages. We examine and summarize the influence of the values of each characteristic on the performance of these word recognition methods.
引用
收藏
页码:495 / 502
页数:8
相关论文
共 28 条
[2]   FREQUENCY AND NEIGHBORHOOD EFFECTS ON LEXICAL ACCESS - LEXICAL SIMILARITY OR ORTHOGRAPHIC REDUNDANCY [J].
ANDREWS, S .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 1992, 18 (02) :234-254
[3]   The effect of orthographic similarity on lexical retrieval: Resolving neighborhood conflicts [J].
Andrews, S .
PSYCHONOMIC BULLETIN & REVIEW, 1997, 4 (04) :439-461
[4]  
Biadsy F., 2006, P 10 INT WORKSH FRON, P1009
[5]  
Blando L. R., 1995, Proceedings of the Third International Conference on Document Analysis and Recognition, P319, DOI 10.1109/ICDAR.1995.599003
[6]  
Coltheart M., 1977, ATTENTION PERFORM, P535, DOI 10.4324/9781003309734-29
[7]  
ESAKOV J, 1994, P SOC PHOTO-OPT INS, V2181, P204, DOI 10.1117/12.171108
[8]   A segmentation-free approach for keyword search in historical typewritten documents [J].
Gatos, B ;
Konidaris, T ;
Ntzios, K ;
Pratikakis, I ;
Perantonis, SJ .
EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, :54-58
[9]   WORD-FREQUENCY AND NEIGHBORHOOD FREQUENCY-EFFECTS IN LEXICAL DECISION AND NAMING [J].
GRAINGER, J .
JOURNAL OF MEMORY AND LANGUAGE, 1990, 29 (02) :228-244
[10]  
Kanungo T., 1993, GLOBAL LOCAL DOCUMEN, P730