Connecting a French Dictionary from the Beginning of the 20th Century to Wikidata

被引:0
作者
Nugues, Pierre [1 ]
机构
[1] Lund Univ, Lund, Sweden
来源
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022年
基金
瑞典研究理事会;
关键词
entity annotation; entity linking; digital humanities;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Petit Larousse illustr ' e is a French dictionary first published in 1905. Its division in two main parts on language and on history and geography corresponds to a major milestone in French lexicography as well as a repository of general knowledge from this period. Although the value of many entries from 1905 remains intact, some descriptions now have a dimension that is more historical than contemporary. They are nonetheless significant to analyze and understand cultural representations from this time. A comparison with more recent information or a verification of these entries would require a tedious manual work. In this paper, we describe a new lexical resource, where we connected all the dictionary entries of the history and geography part to current data sources. For this, we linked each of these entries to a wikidata identifier. Using the wikidata links, we can automate more easily the identification, comparison, and verification of historically-situated representations. We give a few examples on how to process wikidata identifiers and we carried out a small analysis of the entities described in the dictionary to outline possible applications. The resource, i.e. the annotation of 20,245 dictionary entries with wikidata links, is available from GitHub (https://github.com/pnugues/petit_larousse_1905/).
引用
收藏
页码:2548 / 2555
页数:8
相关论文
共 18 条
[1]  
Bohbot H., 2018, P 11 INT C LANG RES
[2]  
Bollacker Kurt., P 2008 ACM SIGMOD IN, DOI DOI 10.1145/1376616.1376746
[3]  
Botha JA, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P7833
[4]  
Chen MD, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P421
[5]  
Chervel A., 2008, HIST ENSEIGNEMENT FR
[6]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[7]  
DARNTON R, 1971, DAEDALUS, V100, P214
[8]  
Escarpit, 1958, SOCIOLOGIE LIT
[9]  
Hamdi A., 2021, P 44 INT ACM SIGIR C
[10]  
Hoffart Johannes., 2011, Proceedings of the Conference on Empirical Methods in Natural Language Processing. EMNLP '11, P782