Bayesian super-resolution of text in video with a text-specific bimodal prior

被引:15
作者
Donaldson K. [1 ]
Myers G.K. [1 ]
机构
[1] SRI International, Menlo Park, CA 94025
来源
International Journal of Document Analysis and Recognition (IJDAR) | 2005年 / 7卷 / 2-3期
关键词
OCR; Super-resolution; Video;
D O I
10.1007/s10032-004-0139-y
中图分类号
学科分类号
摘要
To increase the range of sizes of video scene text recognizable by optical character recognition (OCR), we developed a Bayesian super-resolution algorithm that uses a text-specific bimodal prior. We evaluated the effectiveness of the bimodal prior, compared and in conjunction with a piecewise smoothness prior, visually and by measuring the accuracy of the OCR results on the variously super-resolved images. The bimodal prior improved the readability of 4- to 7-pixel-high scene text significantly better than bicubic interpolation and increased the accuracy of OCR results better than the piecewise smoothness prior. © Springer-Verlag 2005.
引用
收藏
页码:159 / 167
页数:8
相关论文
共 30 条
[1]  
Aradhye H., Dorai C., Shim J., Study of embedded font context and kernel space methods for improved videotext recognition, IEEE International Conference on Image Processing, (2001)
[2]  
Dorai C., Aradhye H., Shim J., End-to-end videotext recognition for multimedia content analysis, IEEE International Conference on Multimedia and Expo., (2001)
[3]  
Chaudhuri S., Super-resolution imaging, (2001)
[4]  
Park S.C., Park M.K., Moon G.K., Super-resolution image reconstruction: A technical overview, IEEE Signal Process Mag., 20, 3, pp. 21-36, (2003)
[5]  
Wu V., Manmatha R., Riseman E., Automatic text detection and ecognition, Proceedings of the Workshop on Image Understanding, pp. 707-712, (1997)
[6]  
Li H., Doermann D., Omid K., Automatic text detection and tracking in digital video, IEEE Trans. Image Process, 9, 1, pp. 147-156, (2000)
[7]  
Clark P., Mirmehdi M., Combining statistical measures to find image text regions, Proceedings of the International Conference on Pattern Recognition, pp. 450-453, (2000)
[8]  
Clark P., Mirmehdi M., Finding text regions using localised measures, Proceedings of the 11th British Machine Vision Conference, pp. 675-684, (2000)
[9]  
Mirmehdi M., Clark P., Lam J., Extracting low resolution text with an active camera for OCR, Proceedings of the 9th Spanish Symposium on Pattern Recognition and Image Processing, pp. 43-48, (2001)
[10]  
Doermann D., Liang J., Li H., Progress in camera-based document image analysis, Proceedings of the International Conference on Document Analysis and Recognition, pp. 606-616, (2003)