The Missing Science of Knowledge Curation (Improving incentives for large-scale knowledge curation)

被引：1

作者：

Paritosh, Praveen ^{[1
]}

机构：

[1] Google, Mountain View, CA 94043 USA

来源：

COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018) | 2018年

关键词：

D O I：

10.1145/3184558.3191551

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Dictionaries, encyclopedias, knowledge graphs, annotated corpora, library classification systems and world maps are all examples of human-curated knowledge resources that have been highly valuable to science as well as amortized across multiple large-scale systems in practice. Many of these were started and built even before a crowdsourcing research community existed. While the last decade has seen unprecedented growth in research and practice in building crowdsourcing systems to do increasingly complex tasks at scale, many of these resources are still woefully incomplete-lacking coverage in languages and subject matter domains. Moreover, many knowledge resources needed to fill other semantic gaps for artificial intelligence systems simply don't exist or aren't being built. Why? I argue that we don't have the right incentives, and that in order to improve the incentives, we have some fundamental scientific questions to answer. While building a large knowledge resource, we have little more than intuitions when it comes to estimating the reusability, maintainability, and long-term value of the effort. These make it difficult to fund or manage such projects, often requiring herculean personalities or fortunate businesses. Building or expanding a resource is often not seen as "sexy," which results in lack of resources to answer those questions in any principled manner. These problems begin to outline a new science of curation, making progress on which could help improve the discussion around and funding for building sorely needed knowledge resources.

引用

页码：1105 / 1106

页数：2

共 50 条

[31] Domain knowledge and data quality perceptions in genome curation work
Huang, Hong
JOURNAL OF DOCUMENTATION, 2015, 71 (01) : 116 - 142
[32] Autonomic Curation of Crowdsourced Knowledge: The Case of Career Data Management
Patelli, Alina
Lewis, Peter R.
Wang, Hai
Nabney, Ian
Bennett, David
Lucas, Ralph
Cole, Alex
2016 INTERNATIONAL CONFERENCE ON CLOUD AND AUTONOMIC COMPUTING (ICCAC), 2016, : 40 - 49
[33] Developing a Body of Knowledge for the Management of Large-Scale International Science Projects
Chaiy, Soeil
Ciarlette, Dan
Cross, Ben
Manwani, Sharm
Iandoli, Luca
Shore, Barry
Strawbridge, Carl
Zollo, Giuseppe
PROCEEDINGS OF PICMET 09 - TECHNOLOGY MANAGEMENT IN THE AGE OF FUNDAMENTAL CHANGE, VOLS 1-5, 2009, : 1438 - +
[34] Community Intelligence in Knowledge Curation: An Application to Managing Scientific Nomenclature
Dai, Lin
Xu, Chao
Tian, Ming
Sang, Jian
Zou, Dong
Li, Ang
Liu, Guocheng
Chen, Fei
Wu, Jiayan
Xiao, Jingfa
Wang, Xumin
Yu, Jun
Zhang, Zhang
PLOS ONE, 2013, 8 (02):
[35] Ontoclick: a web browser extension to facilitate biomedical knowledge curation
Xu, Anthony
Venkateswaran, Aravind
Zhou, Lianguizi
Zankl, Andreas
BIOINFORMATICS, 2022, 38 (01) : 301 - 302
[36] Knowledge Extraction Framework for Building a Large-scale Knowledge Base
Kim, Haklae (haklaekim@gmail.com), 2016, European Alliance for Innovation (03)
[37] Mining large-scale knowledge sources for case adaptation knowledge
Leake, David
Powell, Jay
CASE-BASED REASONING RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2007, 4626 : 209 - +
[38] Deep Learning-Assisted Peak Curation for Large-Scale LC-MS Metabolomics
Gloaguen, Yoann
Kirwan, Jennifer A.
Beule, Dieter
ANALYTICAL CHEMISTRY, 2022, 94 (12) : 4930 - 4937
[39] DC Proposal: Capturing Knowledge Evolution and Expertise in Community-Driven Knowledge Curation Platforms
Ziaimatin, Hasti
SEMANTIC WEB - ISWC 2011, PT II, 2011, 7032 : 381 - 388
[40] Trust, but Verify: Predicting Contribution Quality for Knowledge Base Construction and Curation
Tan, Chun How
Agichtein, Eugene
Ipeirotis, Panos
Gabrilovich, Evgeniy
WSDM'14: PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2014, : 553 - 562

← 1 2 3 4 5 →