Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation

Cited by: 2
Authors
Scharpf, Philipp [1]
Schubotz, Moritz [2]
Gipp, Bela [3]
Affiliations
[1] Univ Konstanz, Constance, Germany
[2] FIZ Karlsruhe, Karlsruhe, Germany
[3] Univ Wuppertal, Wuppertal, Germany
Source
WEB CONFERENCE 2021: COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2021) | 2021
Keywords
Entity Linking; Wikipedia; Wikidata; Recommender Systems
DOI
10.1145/3442442.3452348
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Mathematical information retrieval (MathIR) applications such as semantic formula search and question answering systems rely on knowledge bases that link mathematical expressions to their natural language names. For database population, mathematical formulae need to be annotated and linked to semantic concepts, which is very time-consuming. In this paper, we present our approach to structure and speed up this process by using an application-driven strategy and an AI-aided system. We evaluate the quality and time savings of AI-generated formula and identifier annotation recommendations on a test selection of Wikipedia articles from the physics domain. Moreover, we evaluate the community acceptance of Wikipedia formula entity links and Wikidata item creation and population to ground the formula semantics. Our evaluation shows that the AI guidance was able to significantly speed up the annotation process by a factor of 1.4 for formulae and 2.4 for identifiers. Our contributions were accepted in 88% of the edited Wikipedia articles and 67% of the Wikidata items. The "AnnoMathTeX" annotation recommender system is hosted by Wikimedia at annomathtex.wmflabs.org. In the future, our data refinement pipeline will be integrated seamlessly into the Wikimedia user interfaces.
Pages: 602-609
Page count: 8