SEMANTIC SEARCH BASED ON NATURAL LANGUAGE PROCESSING - A NUMISMATIC EXAMPLE

被引:2
作者
Klinger, Patricia [1 ]
Gampe, Sebastian [1 ]
Tolle, Karsten [1 ]
Peter, Ulrike [1 ]
机构
[1] Goethe Univ Frankfurt, Berlin Brandenburg Akad Wissensch, Frankfurt, Germany
来源
JOURNAL OF ANCIENT HISTORY AND ARCHAEOLOGY | 2018年 / 5卷 / 03期
关键词
Natural Language Processing; Ontology; Numismatics; Iconography;
D O I
10.14795/j.v5i3.334
中图分类号
K85 [文物考古];
学科分类号
0601 ;
摘要
Iconographic representations on ancient artifacts are described in many existing databases and literature as human readable text. We applied Natural Language Processing (NLP) approaches in order to extract the semantics out of these textual descriptions and in this way enable semantic searches over them. This allows more sophisticated requests compared to the common existing keyword searches. As we show in our experiments based on numismatic datasets, the approach is generic in the sense that once the system is trained on one dataset, it can be applied without any further manual work also to datasets that have similar content. Of course, additional adaptions would further improve the results. Since the approach requires manual work only during the training phase, it can easily be applied to huge datasets without manual work and therefore without major extra costs. In fact, in our experience bigger datasets generate even better results because there is more data for training. Since our approach is not bound to a certain domain and the numismatic datasets are just an example, it could serve as a blueprint for many other areas. It could also help to build bridges between disciplines since textual iconographic descriptions are to be found also for pottery, sculpture and elsewhere.
引用
收藏
页码:68 / 79
页数:12
相关论文
共 10 条
[1]  
Babcock J, 2016, MASTERING PREDICTIVE
[2]  
Bird S., 2009, NATURAL LANGUAGE PRO
[3]  
Bishop C. M., 2006, PATTERN RECOGNITION
[4]   An Innovative Cloud-Based System for the Diachronic Analysis in Numismatics [J].
Celesti, Antonio ;
Salamone, Grazia ;
Sapienza, Anna ;
Spinelli, Marianna ;
Puglisi, Mariangela ;
Caltabiano, Maria .
ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2017, 10 (04)
[5]  
Graf Fritz, 2009, APOLLO
[6]  
Kemkes M., 2013, CARACALLA KAISER TYR, P7
[7]   Review of Relation Extraction Methods: What Is New Out There? [J].
Konstantinova, Natalia .
ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, 2014, 436 :15-28
[8]  
Lambrinoudakis W., 1984, LIMC 2, P183
[9]  
Maass M., 1993, ANTIKE DELPHI ORAKEL
[10]  
Pedregosa F, 2011, J MACH LEARN RES, V12, P2825