Creating return on investment for large-scale metadata creation

被引：0

作者：

Urberg M. ^{[1
]}

机构：

[1] Seattle, WA

来源：

Information Services and Use | 2021年 / 41卷 / 1-2期

关键词：

Algorithmic bias; Discovery; Historical bias; Humanities research; Machine learning; Metadata;

D O I：

10.3233/ISU-210117

中图分类号：

学科分类号：

摘要：

The scholarly communications industry is turning its attention to large-scale metadata creation for enhancing discovery of content. Algorithms used to train machine learning are powerful, but need to be used carefully. Several ethical and technological challenges need to be faced head-on to use of machine learning without exacerbating bias, racism, and discrimination. This article highlights the specific needs of humanities research to address historical bias and curtail algorithmic bias in creating metadata for machine learning. It also argues that the return on investment for large-scale metadata creation begins with building transparency into metadata creation and handling. © 2021 - The authors. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (CC BY-NC 4.0).

引用

页码：53 / 60

页数：7

共 50 条

[21] Processing large-scale data with Apache Spark
Ko, Seyoon
Won, Joong-Ho
KOREAN JOURNAL OF APPLIED STATISTICS, 2016, 29 (06) : 1077 - 1094
[22] Ensemble Learning for Large-Scale Workload Prediction
Singh, Nidhi
Rao, Shrisha
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (02) : 149 - 165
[23] Large-Scale Machine Learning and Neuroimaging in Psychiatry
Thompson, Paul
BIOLOGICAL PSYCHIATRY, 2018, 83 (09) : S51 - S51
[24] Optimization Methods for Large-Scale Machine Learning
Bottou, Leon
Curtis, Frank E.
Nocedal, Jorge
SIAM REVIEW, 2018, 60 (02) : 223 - 311
[25] Large-Scale Experiments with NP Chunking of Polish
Radziszewski, Adam
Pawlaczek, Adam
TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 143 - 149
[26] Large-scale exploration and analysis of drug combinations
Li, Peng
Huang, Chao
Fu, Yingxue
Wang, Jinan
Wu, Ziyin
Ru, Jinlong
Zheng, Chunli
Guo, Zihu
Chen, Xuetong
Zhou, Wei
Zhang, Wenjuan
Li, Yan
Chen, Jianxin
Lu, Aiping
Wang, Yonghua
BIOINFORMATICS, 2015, 31 (12) : 2007 - 2016
[27] Evaluating the consistency of large-scale pharmacogenomic studies
Rahman, Raziur
Dhruba, Saugato Rahman
Matlock, Kevin
De-Niz, Carlos
Ghosh, Souparno
Pal, Ranadip
BRIEFINGS IN BIOINFORMATICS, 2019, 20 (05) : 1734 - 1753
[28] Understanding Large-Scale Dynamic Purchase Behavior
Jacobs, Bruno
Fok, Dennis
Donkers, Bas
MARKETING SCIENCE, 2021, 40 (05) : 844 - 870
[29] Creating large scale probabilistic boundaries using Gaussian Processes
Ball, Adrian
Silversides, Katherine L.
Chlingaryan, Anna
Melkumyan, Arman
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 199
[30] Large-scale comparison of machine learning algorithms for target prediction of natural products
Liang, Lu
Liu, Ye
Kang, Bo
Wang, Ru
Sun, Meng-Yu
Wu, Qi
Meng, Xiang-Fei
Lin, Jian-Ping
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)

← 1 2 3 4 5 →