Deep learning of cuneiform sign detection with weak supervision using transliteration alignment

被引:18
作者
Dencker, Tobias [1 ]
Klinkisch, Pablo [1 ]
Maul, Stefan M. [2 ]
Ommer, Bjoern [1 ]
机构
[1] Heidelberg Univ, Heidelberg Collaboratory Image Proc, Interdisciplinary Ctr Sci Comp, Heidelberg, Germany
[2] Heidelberg Univ, Inst Assyriol, Dept Languages & Cultures Near East, Heidelberg, Germany
关键词
D O I
10.1371/journal.pone.0243039
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The cuneiform script provides a glimpse into our ancient history. However, reading age-old clay tablets is time-consuming and requires years of training. To simplify this process, we propose a deep-learning based sign detector that locates and classifies cuneiform signs in images of clay tablets. Deep learning requires large amounts of training data in the form of bounding boxes around cuneiform signs, which are not readily available and costly to obtain in the case of cuneiform script. To tackle this problem, we make use of existing transliterations, a sign-by-sign representation of the tablet content in Latin script. Since these do not provide sign localization, we propose a weakly supervised approach: We align tablet images with their corresponding transliterations to localize the transliterated signs in the tablet image, before using these localized signs in place of annotations to re-train the sign detector. A better sign detector in turn boosts the quality of the alignments. We combine these steps in an iterative process that enables training a cuneiform sign detector from transliterations only. While our method works weakly supervised, a small number of annotations further boost the performance of the cuneiform sign detector which we evaluate on a large collection of clay tablets from the Neo-Assyrian period. To enable experts to directly apply the sign detector in their study of cuneiform texts, we additionally provide a web application for the analysis of clay tablets with a trained cuneiform sign detector.
引用
收藏
页数:21
相关论文
共 48 条
[1]  
Adams RG, 1925, TRANSLATION ROSETTA
[2]   Unwrapping and visualizing cuneiform tablets [J].
Anderson, SE ;
Levoy, M .
IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2002, 22 (06) :82-88
[3]  
Berg-Kirkpatrick T., 2013, Long Papers, V1, P207
[4]  
Bogacz B, 2016, INT CONF FRONT HAND, P301, DOI [10.1109/ICFHR.2016.59, 10.1109/ICFHR.2016.0064]
[5]  
Bogasz B, 2015, P 20 COMP VIS WINT W
[6]  
Borger R, 2004, MESOPOTAMISCHES ZEIC
[7]   Handwriting Recognition of Historical Documents with few labeled data [J].
Chammas, Edgard ;
Mokbel, Chafic ;
Likforman-Sulem, Laurence .
2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, :43-48
[8]  
Charpin Dominique., 2010, Reading and Writing in Babylon
[9]  
Cohen A, 2004, MOL B INT U, P135
[10]   Characterization of the cuneiform signs by the use of a multifunctional optoelectronic device [J].
Demoli, N ;
Gruber, H ;
Dahms, U ;
Wernicke, G .
APPLIED OPTICS, 1996, 35 (29) :5811-5820