Data-Centric Artificial Intelligence

被引:25
作者
Jakubik, Johannes [1 ]
Voessing, Michael [1 ]
Kuehl, Niklas [2 ]
Walk, Jannis [1 ]
Satzger, Gerhard [1 ]
机构
[1] Karlsruhe Inst Technol, Kaiserstr 12, D-76131 Karlsruhe, Germany
[2] Univ Bayreuth, Univ Str 30, D-95447 Bayreuth, Germany
关键词
Data-centric artificial intelligence; Data quality; Data work; BIG DATA; ANALYTICS;
D O I
10.1007/s12599-024-00857-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data-centric artificial intelligence (data-centric AI) represents an emerging paradigm that emphasizes the importance of enhancing data systematically and at scale to build effective and efficient AI-based systems. The novel paradigm complements recent model-centric AI, which focuses on improving the performance of AI-based systems based on changes in the model using a fixed set of data. The objective of this article is to introduce practitioners and researchers from the field of Business and Information Systems Engineering (BISE) to data-centric AI. The paper defines relevant terms, provides key characteristics to contrast the paradigm of data-centric AI with the model-centric one, and introduces a framework to illustrate the different dimensions of data-centric AI. In addition, an overview of available tools for data-centric AI is presented and this novel paradigm is differenciated from related concepts. Finally, the paper discusses the longer-term implications of data-centric AI for the BISE community.
引用
收藏
页码:507 / 515
页数:9
相关论文
共 43 条
[21]   Machine learning: Trends, perspectives, and prospects [J].
Jordan, M. I. ;
Mitchell, T. M. .
SCIENCE, 2015, 349 (6245) :255-260
[22]  
Kaggle, 2023, KAGGLE COMPETITIONS
[23]   Artificial intelligence and machine learning [J].
Kuehl, Niklas ;
Schemmer, Max ;
Goutier, Marc ;
Satzger, Gerhard .
ELECTRONIC MARKETS, 2022, 32 (04) :2235-2244
[24]   Accumulating Design Knowledge with Reference Models: Insights from 12 Years' Research into Data Management [J].
Legner, Christine ;
Pentek, Tobias ;
Otto, Boris .
JOURNAL OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2020, 21 (03) :735-770
[25]  
Lin Q., 2022, PMLR, V164, P1789
[26]  
McInnes L, 2020, Arxiv, DOI [arXiv:1802.03426, 10.21105/joss.00861, DOI 10.21105/JOSS.00861]
[27]  
Ng A, 2021, DATA CENTRIC AI WORK
[28]  
Ng A, 2022, DATA CENTRIC AI COMP
[29]  
Northcutt C. G., 2021, arXiv
[30]   Designing a multi-sided data platform: findings from the International Data Spaces case [J].
Otto, Boris ;
Jarke, Matthias .
ELECTRONIC MARKETS, 2019, 29 (04) :561-580