Optical Character Recognition Guided Image Super Resolution

被引:0
作者
Hildebrandt, Philipp [1 ]
Schulze, Maximilian [1 ]
Cohen, Sarel [2 ]
Doskoc, Vanja [1 ]
Saabni, Raid [3 ]
Friedrich, Tobias [1 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst, Potsdam, Germany
[2] Acad Coll Tel Aviv Yaffo, Tel Aviv, Israel
[3] Acad Coll Tel Aviv Yaffo, Triangle R&D Ctr, Tel Aviv, Israel
来源
PROCEEDINGS OF THE 2022 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2022 | 2022年
关键词
optical character recognition; image super-resolution; deep learning; unfocused images;
D O I
10.1145/3558100.3563841
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing disturbed text in real-life images is a difficult problem, as information that is missing due to low resolution or out-of-focus text has to be recreated. Combining text super-resolution and optical character recognition deep learning models can be a valuable tool to enlarge and enhance text images for better readability, as well as recognize text automatically afterwards. We achieve improved peak signal-to-noise ratio and text recognition accuracy scores over a state-of-the-art text super-resolution model TBSRN on the real-world low-resolution dataset TextZoom while having a smaller theoretical model size due to the usage of quantization techniques. In addition, we show how different training strategies influence the performance of the resulting model.
引用
收藏
页数:4
相关论文
共 18 条
[1]   NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].
Agustsson, Eirikur ;
Timofte, Radu .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131
[2]  
Banner R, 2019, Arxiv, DOI arXiv:1810.05723
[3]   Low-Complexity Single-Image Super-Resolution based on Nonnegative Neighbor Embedding [J].
Bevilacqua, Marco ;
Roumy, Aline ;
Guillemot, Christine ;
Morel, Marie-Line Alberi .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
[4]   Scene Text Telescope: Text-Focused Scene Image Super-Resolution [J].
Chen, Jingye ;
Li, Bin ;
Xue, Xiangyang .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12021-12030
[5]  
Chen XX, 2020, Arxiv, DOI [arXiv:2005.03492, DOI 10.48550/ARXIV.2005.03492]
[6]   Learning a Deep Convolutional Network for Image Super-Resolution [J].
Dong, Chao ;
Loy, Chen Change ;
He, Kaiming ;
Tang, Xiaoou .
COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 :184-199
[7]  
Gholami A., 2021, arXiv, DOI [DOI 10.48550/ARXIV.2103.13630, 10.48550/arXiv.2103.13630]
[8]  
He KM, 2015, Arxiv, DOI arXiv:1512.03385
[9]  
He Y, 2021, Arxiv, DOI arXiv:2112.12916
[10]  
Huang JB, 2015, PROC CVPR IEEE, P5197, DOI 10.1109/CVPR.2015.7299156