Creating return on investment for large-scale metadata creation

Author
Urberg M. [1]
Affiliation
[1] Seattle, WA
Keywords
Algorithmic bias; Discovery; Historical bias; Humanities research; Machine learning; Metadata
DOI
10.3233/ISU-210117
Abstract
The scholarly communications industry is turning its attention to large-scale metadata creation for enhancing discovery of content. Algorithms used to train machine learning are powerful, but need to be used carefully. Several ethical and technological challenges need to be faced head-on in order to use machine learning without exacerbating bias, racism, and discrimination. This article highlights the specific needs of humanities research to address historical bias and curtail algorithmic bias in creating metadata for machine learning. It also argues that the return on investment for large-scale metadata creation begins with building transparency into metadata creation and handling. © 2021 - The authors. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (CC BY-NC 4.0).
Pages: 53-60
Page count: 7