Bayesian super-resolution of text in video with a text-specific bimodal prior

被引：15

作者：

Donaldson K. ^{[1
]}

Myers G.K. ^{[1
]}

机构：

[1] SRI International, Menlo Park, CA 94025

来源：

International Journal of Document Analysis and Recognition (IJDAR) | 2005年 / 7卷 / 2-3期

关键词：

OCR; Super-resolution; Video;

D O I：

10.1007/s10032-004-0139-y

中图分类号：

学科分类号：

摘要：

To increase the range of sizes of video scene text recognizable by optical character recognition (OCR), we developed a Bayesian super-resolution algorithm that uses a text-specific bimodal prior. We evaluated the effectiveness of the bimodal prior, compared and in conjunction with a piecewise smoothness prior, visually and by measuring the accuracy of the OCR results on the variously super-resolved images. The bimodal prior improved the readability of 4- to 7-pixel-high scene text significantly better than bicubic interpolation and increased the accuracy of OCR results better than the piecewise smoothness prior. © Springer-Verlag 2005.

引用

页码：159 / 167

页数：8

共 30 条

[1]

Aradhye H., Dorai C., Shim J., Study of embedded font context and kernel space methods for improved videotext recognition, IEEE International Conference on Image Processing, (2001)

[2]

Dorai C., Aradhye H., Shim J., End-to-end videotext recognition for multimedia content analysis, IEEE International Conference on Multimedia and Expo., (2001)

[3]

Chaudhuri S., Super-resolution imaging, (2001)

[4]

Park S.C., Park M.K., Moon G.K., Super-resolution image reconstruction: A technical overview, IEEE Signal Process Mag., 20, 3, pp. 21-36, (2003)

[5]

Wu V., Manmatha R., Riseman E., Automatic text detection and ecognition, Proceedings of the Workshop on Image Understanding, pp. 707-712, (1997)

[6]

Li H., Doermann D., Omid K., Automatic text detection and tracking in digital video, IEEE Trans. Image Process, 9, 1, pp. 147-156, (2000)

[7]

Clark P., Mirmehdi M., Combining statistical measures to find image text regions, Proceedings of the International Conference on Pattern Recognition, pp. 450-453, (2000)

[8]

Clark P., Mirmehdi M., Finding text regions using localised measures, Proceedings of the 11th British Machine Vision Conference, pp. 675-684, (2000)

[9]

Mirmehdi M., Clark P., Lam J., Extracting low resolution text with an active camera for OCR, Proceedings of the 9th Spanish Symposium on Pattern Recognition and Image Processing, pp. 43-48, (2001)

[10]

Doermann D., Liang J., Li H., Progress in camera-based document image analysis, Proceedings of the International Conference on Document Analysis and Recognition, pp. 606-616, (2003)

← 1 2 3 →