Optical Character Recognition Guided Image Super Resolution

被引：0

作者：

Hildebrandt, Philipp ^{[1
]}

Schulze, Maximilian ^{[1
]}

Cohen, Sarel ^{[2
]}

Doskoc, Vanja ^{[1
]}

Saabni, Raid ^{[3
]}

Friedrich, Tobias ^{[1
]}

机构：

[1] Univ Potsdam, Hasso Plattner Inst, Potsdam, Germany

[2] Acad Coll Tel Aviv Yaffo, Tel Aviv, Israel

[3] Acad Coll Tel Aviv Yaffo, Triangle R&D Ctr, Tel Aviv, Israel

来源：

PROCEEDINGS OF THE 2022 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2022 | 2022年

关键词：

optical character recognition; image super-resolution; deep learning; unfocused images;

D O I：

10.1145/3558100.3563841

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recognizing disturbed text in real-life images is a difficult problem, as information that is missing due to low resolution or out-of-focus text has to be recreated. Combining text super-resolution and optical character recognition deep learning models can be a valuable tool to enlarge and enhance text images for better readability, as well as recognize text automatically afterwards. We achieve improved peak signal-to-noise ratio and text recognition accuracy scores over a state-of-the-art text super-resolution model TBSRN on the real-world low-resolution dataset TextZoom while having a smaller theoretical model size due to the usage of quantization techniques. In addition, we show how different training strategies influence the performance of the resulting model.

引用

页数：4

共 18 条

[1] NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].

Agustsson, Eirikur ;

Timofte, Radu .

2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131

[2]

Banner R, 2019, Arxiv, DOI arXiv:1810.05723

[3] Low-Complexity Single-Image Super-Resolution based on Nonnegative Neighbor Embedding [J].

Bevilacqua, Marco ;

Roumy, Aline ;

Guillemot, Christine ;

Morel, Marie-Line Alberi .

PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,

[4] Scene Text Telescope: Text-Focused Scene Image Super-Resolution [J].

Chen, Jingye ;

Li, Bin ;

Xue, Xiangyang .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12021-12030

[5]

Chen XX, 2020, Arxiv, DOI [arXiv:2005.03492, DOI 10.48550/ARXIV.2005.03492]

[6] Learning a Deep Convolutional Network for Image Super-Resolution [J].

Dong, Chao ;

Loy, Chen Change ;

He, Kaiming ;

Tang, Xiaoou .

COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 :184-199

[7]

Gholami A., 2021, arXiv, DOI [DOI 10.48550/ARXIV.2103.13630, 10.48550/arXiv.2103.13630]

[8]

He KM, 2015, Arxiv, DOI arXiv:1512.03385

[9]

He Y, 2021, Arxiv, DOI arXiv:2112.12916

[10]

Huang JB, 2015, PROC CVPR IEEE, P5197, DOI 10.1109/CVPR.2015.7299156

← 1 2 →