The Missing Science of Knowledge Curation (Improving incentives for large-scale knowledge curation)

被引:1
|
作者
Paritosh, Praveen [1 ]
机构
[1] Google, Mountain View, CA 94043 USA
来源
COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018) | 2018年
关键词
D O I
10.1145/3184558.3191551
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dictionaries, encyclopedias, knowledge graphs, annotated corpora, library classification systems and world maps are all examples of human-curated knowledge resources that have been highly valuable to science as well as amortized across multiple large-scale systems in practice. Many of these were started and built even before a crowdsourcing research community existed. While the last decade has seen unprecedented growth in research and practice in building crowdsourcing systems to do increasingly complex tasks at scale, many of these resources are still woefully incomplete-lacking coverage in languages and subject matter domains. Moreover, many knowledge resources needed to fill other semantic gaps for artificial intelligence systems simply don't exist or aren't being built. Why? I argue that we don't have the right incentives, and that in order to improve the incentives, we have some fundamental scientific questions to answer. While building a large knowledge resource, we have little more than intuitions when it comes to estimating the reusability, maintainability, and long-term value of the effort. These make it difficult to fund or manage such projects, often requiring herculean personalities or fortunate businesses. Building or expanding a resource is often not seen as "sexy," which results in lack of resources to answer those questions in any principled manner. These problems begin to outline a new science of curation, making progress on which could help improve the discussion around and funding for building sorely needed knowledge resources.
引用
收藏
页码:1105 / 1106
页数:2
相关论文
共 50 条
  • [31] Domain knowledge and data quality perceptions in genome curation work
    Huang, Hong
    JOURNAL OF DOCUMENTATION, 2015, 71 (01) : 116 - 142
  • [32] Autonomic Curation of Crowdsourced Knowledge: The Case of Career Data Management
    Patelli, Alina
    Lewis, Peter R.
    Wang, Hai
    Nabney, Ian
    Bennett, David
    Lucas, Ralph
    Cole, Alex
    2016 INTERNATIONAL CONFERENCE ON CLOUD AND AUTONOMIC COMPUTING (ICCAC), 2016, : 40 - 49
  • [33] Developing a Body of Knowledge for the Management of Large-Scale International Science Projects
    Chaiy, Soeil
    Ciarlette, Dan
    Cross, Ben
    Manwani, Sharm
    Iandoli, Luca
    Shore, Barry
    Strawbridge, Carl
    Zollo, Giuseppe
    PROCEEDINGS OF PICMET 09 - TECHNOLOGY MANAGEMENT IN THE AGE OF FUNDAMENTAL CHANGE, VOLS 1-5, 2009, : 1438 - +
  • [34] Community Intelligence in Knowledge Curation: An Application to Managing Scientific Nomenclature
    Dai, Lin
    Xu, Chao
    Tian, Ming
    Sang, Jian
    Zou, Dong
    Li, Ang
    Liu, Guocheng
    Chen, Fei
    Wu, Jiayan
    Xiao, Jingfa
    Wang, Xumin
    Yu, Jun
    Zhang, Zhang
    PLOS ONE, 2013, 8 (02):
  • [35] Ontoclick: a web browser extension to facilitate biomedical knowledge curation
    Xu, Anthony
    Venkateswaran, Aravind
    Zhou, Lianguizi
    Zankl, Andreas
    BIOINFORMATICS, 2022, 38 (01) : 301 - 302
  • [37] Mining large-scale knowledge sources for case adaptation knowledge
    Leake, David
    Powell, Jay
    CASE-BASED REASONING RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2007, 4626 : 209 - +
  • [38] Deep Learning-Assisted Peak Curation for Large-Scale LC-MS Metabolomics
    Gloaguen, Yoann
    Kirwan, Jennifer A.
    Beule, Dieter
    ANALYTICAL CHEMISTRY, 2022, 94 (12) : 4930 - 4937
  • [39] DC Proposal: Capturing Knowledge Evolution and Expertise in Community-Driven Knowledge Curation Platforms
    Ziaimatin, Hasti
    SEMANTIC WEB - ISWC 2011, PT II, 2011, 7032 : 381 - 388
  • [40] Trust, but Verify: Predicting Contribution Quality for Knowledge Base Construction and Curation
    Tan, Chun How
    Agichtein, Eugene
    Ipeirotis, Panos
    Gabrilovich, Evgeniy
    WSDM'14: PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2014, : 553 - 562