Deep learning for terahertz image denoising in nondestructive historical document analysis

被引:14
作者
Dutta, Balaka [1 ]
Root, Konstantin [2 ]
Ullmann, Ingrid [2 ]
Wagner, Fabian [1 ]
Mayr, Martin [1 ]
Seuret, Mathias [1 ]
Thies, Mareike [1 ]
Stromer, Daniel [1 ,3 ]
Christlein, Vincent [1 ]
Schuer, Jan [2 ]
Maier, Andreas [1 ]
Huang, Yixing [4 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Pattern Recognit Lab, Erlangen, Germany
[2] Friedrich Alexander Univ Erlangen Nurnberg, Inst Microwaves & Photon, Erlangen, Germany
[3] Siemens Healthcare GmbH, Erlangen, Germany
[4] Friedrich Alexander Univ Erlangen Nurnberg, Univ Hosp Erlangen, Dept Radiat Oncol, Erlangen, Germany
基金
欧洲研究理事会;
关键词
NETWORK;
D O I
10.1038/s41598-022-26957-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Historical documents contain essential information about the past, including places, people, or events. Many of these valuable cultural artifacts cannot be further examined due to aging or external influences, as they are too fragile to be opened or turned over, so their rich contents remain hidden. Terahertz (THz) imaging is a nondestructive 3D imaging technique that can be used to reveal the hidden contents without damaging the documents. As noise or imaging artifacts are predominantly present in reconstructed images processed by standard THz reconstruction algorithms, this work intends to improve THz image quality with deep learning. To overcome the data scarcity problem in training a supervised deep learning model, an unsupervised deep learning network (CycleGAN) is first applied to generate paired noisy THz images from clean images (clean images are generated by a handwriting generator). With such synthetic noisy-to-clean paired images, a supervised deep learning model using Pix2pixGAN is trained, which is effective to enhance real noisy THz images. After Pix2pixGAN denoising, 99% characters written on one-side of the Xuan paper can be clearly recognized, while 61% characters written on one-side of the standard paper are sufficiently recognized. The average perceptual indices of Pix2pixGAN processed images are 16.83, which is very close to the average perceptual index 16.19 of clean handwriting images. Our work has important value for THz-imaging-based nondestructive historical document analysis.
引用
收藏
页数:11
相关论文
共 63 条
[1]   A method and system for enhancing the resolution of terahertz imaging [J].
Ahi, Kiarash .
MEASUREMENT, 2019, 138 :614-619
[2]   Mathematical Modeling of THz Point Spread Function and Simulation of THz Imaging Systems [J].
Ahi, Kiarash .
IEEE TRANSACTIONS ON TERAHERTZ SCIENCE AND TECHNOLOGY, 2017, 7 (06) :747-754
[3]  
Atito S., 2021, P ICCV, P9650
[4]  
Batson J, 2019, PR MACH LEARN RES, V97
[5]   The 2018 PIRM Challenge on Perceptual Image Super-Resolution [J].
Blau, Yochai ;
Mechrez, Roey ;
Timofte, Radu ;
Michaeli, Tomer ;
Zelnik-Manor, Lihi .
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT V, 2019, 11133 :334-355
[6]   Terahertz imaging with compressed sensing and phase retrieval [J].
Chan, Wai Lam ;
Moravec, Matthew L. ;
Baraniuk, Richard G. ;
Mittleman, Daniel M. .
OPTICS LETTERS, 2008, 33 (09) :974-976
[7]   HINet: Half Instance Normalization Network for Image Restoration [J].
Chen, Liangyu ;
Lu, Xin ;
Zhang, Jie ;
Chu, Xiaojie ;
Chen, Chengpeng .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, :182-192
[8]   Synthetic data in machine learning for medicine and healthcare [J].
Chen, Richard J. ;
Lu, Ming Y. ;
Chen, Tiffany Y. ;
Williamson, Drew F. K. ;
Mahmood, Faisal .
NATURE BIOMEDICAL ENGINEERING, 2021, 5 (06) :493-497
[9]   Improving signal-to-noise ratio of a terahertz signal using a WaveNet-based neural network [J].
Choi, Hyunkook ;
Kim, Sangmin ;
Maeng, Inhee ;
Son, Joo-Hiuk ;
Park, Hochong .
OPTICS EXPRESS, 2022, 30 (04) :5473-5485
[10]   Terahertz and Cultural Heritage Science: Examination of Art and Archaeology [J].
Cosentino, Antonino .
TECHNOLOGIES, 2016, 4 (01)