Creating return on investment for large-scale metadata creation

被引:0
作者
Urberg M. [1 ]
机构
[1] Seattle, WA
关键词
Algorithmic bias; Discovery; Historical bias; Humanities research; Machine learning; Metadata;
D O I
10.3233/ISU-210117
中图分类号
学科分类号
摘要
The scholarly communications industry is turning its attention to large-scale metadata creation for enhancing discovery of content. Algorithms used to train machine learning are powerful, but need to be used carefully. Several ethical and technological challenges need to be faced head-on to use of machine learning without exacerbating bias, racism, and discrimination. This article highlights the specific needs of humanities research to address historical bias and curtail algorithmic bias in creating metadata for machine learning. It also argues that the return on investment for large-scale metadata creation begins with building transparency into metadata creation and handling. © 2021 - The authors. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (CC BY-NC 4.0).
引用
收藏
页码:53 / 60
页数:7
相关论文
共 50 条
  • [41] Large-Scale Nodes Classification With Deep Aggregation Network
    Li, Jiangtao
    Wu, Jianshe
    He, Weiquan
    Zhou, Peng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (06) : 2560 - 2572
  • [42] An investigation of complex fuzzy sets for large-scale learning
    Sobhi, Sayedabbas
    Dick, Scott
    FUZZY SETS AND SYSTEMS, 2023, 471
  • [43] A Survey of Approximate Quantile Computation on Large-Scale Data
    Chen, Zhiwei
    Zhang, Aoqian
    IEEE ACCESS, 2020, 8 (08): : 34585 - 34597
  • [44] Machine learning for large-scale crop yield forecasting
    Paudel, Dilli
    Boogaard, Hendrik
    de Wit, Allard
    Janssen, Sander
    Osinga, Sjoukje
    Pylianidis, Christos
    Athanasiadis, Ioannis N.
    AGRICULTURAL SYSTEMS, 2021, 187
  • [45] Compressed linear algebra for large-scale machine learning
    Ahmed Elgohary
    Matthias Boehm
    Peter J. Haas
    Frederick R. Reiss
    Berthold Reinwald
    The VLDB Journal, 2018, 27 : 719 - 744
  • [46] A review of Nystrom methods for large-scale machine learning
    Sun, Shiliang
    Zhao, Jing
    Zhu, Jiang
    INFORMATION FUSION, 2015, 26 : 36 - 48
  • [47] A Novel Clustering Algorithm for Large-Scale Graph Processing
    Qu, Zhaoyang
    Ding, Wei
    Qu, Nan
    Yan, Jia
    Wang, Ling
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 349 - 358
  • [48] Modernizing Analytics for Melanoma with a Large-Scale Research Dataset
    Richter, Aaron N.
    Khoshgoftaar, Taghi M.
    2017 IEEE 18TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI 2017), 2017, : 551 - 558
  • [49] Ensemble learning for large-scale crowd flow prediction
    Karbovskii, Vladislav
    Lees, Michael
    Presbitero, Alva
    Kurilkin, Alexey
    Voloshin, Daniil
    Derevitskii, Ivan
    Karsakov, Andrey
    Sloot, Peter M. A.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [50] Kernel methods for large-scale genomic data analysis
    Wang, Xuefeng
    Xing, Eric P.
    Schaid, Daniel J.
    BRIEFINGS IN BIOINFORMATICS, 2015, 16 (02) : 183 - 192