The OCRopus open source OCR system

被引:77
作者
Breuel, Thomas M. [1 ]
机构
[1] DFKI, Kaiserslautern, Germany
来源
DOCUMENT RECOGNITION AND RETRIEVAL XV | 2008年 / 6815卷
关键词
D O I
10.1117/12.783598
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
OCRopus is a new, open source OCR system emphasizing modularity, easy extensibility, and reuse, aimed at both the research community and large scale commercial document conversions. This paper describes the current status of the system, its general architecture, as well as the major algorithms currently being used for layout analysis and text line recognition.
引用
收藏
页数:15
相关论文
共 29 条
[1]  
[Anonymous], P 12 INT C IMPL APPL
[2]  
[Anonymous], P C DOC AN SYST KAIS
[3]  
[Anonymous], 2007, INT C DOC AN REC ICD
[4]  
[Anonymous], P 7 INT C PATT REC M
[5]  
BAIRD HS, 1994, DOCUMENT IMAGE ANAL, P17
[6]  
Breuel T. M., 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition, P821, DOI 10.1109/ICDAR.2001.953902
[7]  
Breuel TM, 2002, P SOC PHOTO-OPT INS, V4670, P20
[8]  
BREUEL TM, 2007, INT C DOC AN REC ICD
[9]  
BREUEL TM, 2008, 3 INT C COM IN PRESS
[10]  
BREUEL TM, 2003, S DOC IM UNDST TECHN