Data-centric AI: Techniques and Future Perspectives

被引:9
|
作者
Zha, Daochen [1 ]
Lai, Kwei-Herng [2 ]
Yang, Fan [3 ]
Zou, Na [4 ]
Gao, Huiji [1 ]
Hu, Xia [2 ]
机构
[1] Airbnb Inc, San Francisco, CA 94103 USA
[2] Rice Univ, Houston, TX USA
[3] Wake Forest Univ, Winston Salem, NC USA
[4] Texas A&M Univ, College Stn, TX USA
关键词
D O I
10.1145/3580305.3599553
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The role of data in AI has been significantly magnified by the emerging concept of data-centric AI. In contrast to the traditional model-centric paradigm, which focuses on developing more effective models given fixed datasets, data-centric AI emphasizes the systematic engineering of data in building AI systems. However, as a new concept, many critical aspects of data-centric AI remain ambiguous, such as its definitions, associated tasks, algorithms, challenges, and benchmarks. This tutorial aims to review and discuss this emerging field, with a particular focus on the three general data-centric AI goals: training data development, inference data development, and data maintenance. The objective of this tutorial is threefold: (1) to formally categorize the field of data-centric AI using a goal-driven taxonomy and discuss the needs and challenges of each goal, (2) to comprehensively review the state-of-the-art techniques, and (3) to discuss the future perspectives and open research directions to inspire further innovations in this field.
引用
收藏
页码:5839 / 5840
页数:2
相关论文
共 50 条
  • [31] Unpacking data-centric geotechnics
    Phoon, Kok-Kwang
    Ching, Jianye
    Cao, Zijun
    UNDERGROUND SPACE, 2022, 7 (06) : 967 - 989
  • [32] GitWorkflow for Active Learning: A Development Methodology Proposal for Data-Centric AI Projects
    Stieler, Fabian
    Bauer, Bernhard
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING, ENASE 2023, 2023, : 202 - 213
  • [33] Data-centric decision support
    Kulhavy, R
    PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 3395 - 3400
  • [34] Data-Centric Mobile Crowdsensing
    Jiang, Changkun
    Gao, Lin
    Duan, Lingjie
    Huang, Jianwei
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2018, 17 (06) : 1275 - 1288
  • [35] Cognitive Data-Centric Systems
    Chang, Leland
    PROCEEDINGS OF THE GREAT LAKES SYMPOSIUM ON VLSI 2017 (GLSVLSI' 17), 2017, : 1 - 1
  • [36] Data-Centric Security for the IoT
    Schreckling, Daniel
    Parra, Juan David
    Doukas, Charalampos
    Posegga, Joachim
    INTERNET OF THINGS: IOT INFRASTRUCTURES, IOT 360, PT II, 2016, 170 : 77 - 86
  • [37] A Data-Centric Approach to Synchronization
    Dolby, Julian
    Hammer, Christian
    Marino, Daniel
    Tip, Frank
    Vaziri, Mandana
    Vitek, Jan
    ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2012, 34 (01):
  • [38] Orchestrating Data-Centric Workflows
    Barker, Adam
    Weissman, Jon B.
    van Hemert, Jano
    CCGRID 2008: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, VOLS 1 AND 2, PROCEEDINGS, 2008, : 210 - 217
  • [39] Data-Centric Intelligent Computing
    Jun Shen
    Chih-Cheng Hung
    Ghassan Beydoun
    Yan Li
    William Guo
    International Journal of Computational Intelligence Systems, 2018, 11 : 616 - 617
  • [40] Data-Centric Artificial Intelligence
    Jakubik, Johannes
    Voessing, Michael
    Kuehl, Niklas
    Walk, Jannis
    Satzger, Gerhard
    BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2024, 66 (04) : 507 - 515