Opportunities and Challenges in Data-Centric AI

被引:8
|
作者
Kumar, Sushant [1 ]
Datta, Sumit [2 ]
Singh, Vishakha [1 ]
Singh, Sanjay Kumar [1 ]
Sharma, Ritesh [3 ]
机构
[1] Indian Inst Technol BHU, Dept Comp Sci & Engn, Varanasi 221005, India
[2] Digital Univ Kerala Formerly IIITM Kerala, Sch Elect Syst & Automat, Thiruvananthapuram 695317, India
[3] Manipal Acad Higher Educ, Manipal Inst Technol, Dept Informat & Commun Technol, Manipal 576104, Karnataka, India
关键词
Artificial intelligence; model-centric AI; data-centric AI; data;
D O I
10.1109/ACCESS.2024.3369417
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Artificial intelligence (AI) systems are trained to solve complex problems and learn to perform specific tasks by using large volumes of data, such as prediction, classification, recognition, decision-making, etc. In the past three decades, AI research has focused mostly on the model-centric approach compared to the data-centric approach. In the model-centric approach, the focus is to improve the code or model architecture to enhance performance, whereas in data-centric AI, the focus is to improve the dataset to enhance performance. Data is food for AI. As a result, there has been a recent push in the AI community toward data-centric AI from model-centric AI. This paper provides a comprehensive and critical analysis of the current state of research in data-centric AI, presenting insights into the latest developments in this rapidly evolving field. By emphasizing the importance of data in AI, the paper identifies the key challenges and opportunities that must be addressed to improve the effectiveness of AI systems. Finally, this paper gives some recommendations for research opportunities in data-centric AI.
引用
收藏
页码:33173 / 33189
页数:17
相关论文
共 50 条
  • [21] Data-Centric Green AI An Exploratory Empirical Study
    Verdecchia, Roberto
    Cruz, Luis
    Sallou, June
    Lin, Michelle
    Wickenden, James
    Hotellier, Estelle
    2022 INTERNATIONAL CONFERENCE ON ICT FOR SUSTAINABILITY (ICT4S 2022), 2022, : 35 - 45
  • [22] A review on data-centric decision tools for offshore wind operation and maintenance activities: Challenges and opportunities
    Hadjoudj, Yannis
    Pandit, Ravi
    ENERGY SCIENCE & ENGINEERING, 2023, 11 (04) : 1501 - 1515
  • [23] Data-centric AI to Improve Early Detection of Mental Illness
    Wang, Alex X.
    Chukova, Stefanka S.
    Simpson, Colin R.
    Nguyen, Binh P.
    2023 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP, SSP, 2023, : 369 - 373
  • [24] Challenges of Information Retrieval and Evaluation in Data-Centric Biology
    Yu, Yi-Kuo
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2011, 15 (04) : 239 - 240
  • [25] A Data-centric AI Framework for Automating Exploratory Data Analysis and Data Quality Tasks
    Patel, Hima
    Guttula, Shanmukha
    Gupta, Nitin
    Hans, Sandeep
    Mittal, Ruhi Sharma
    Lokesh, N.
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2023, 15 (04):
  • [26] From Concept to Implementation: The Data-Centric Development Process for AI in Industry
    Luley, Paul-Philipp
    Deriu, Jan M.
    Yan, Peng
    Schatte, Gerrit A.
    Stadelmann, Thilo
    2023 10TH IEEE SWISS CONFERENCE ON DATA SCIENCE, SDS, 2023, : 73 - 76
  • [27] Data-centric Edge-AI: A Symbolic Representation Use Case
    Ilager, Shashikant
    De Maio, Vincenzo
    Lujic, Ivan
    Brandic, Ivona
    2023 IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING AND COMMUNICATIONS, EDGE, 2023, : 301 - 308
  • [28] ydata-profiling: Accelerating data-centric AI with high-quality data
    Clemente, Fabiana
    Ribeiro, Goncalo Martins
    Quemy, Alexandre
    Santos, Miriam Seoane
    Pereira, Ricardo Cardoso
    Barros, Alex
    NEUROCOMPUTING, 2023, 554
  • [29] Towards Data-centric Decision Making for Smart Infrastructure: Data and Its Challenges
    Droo, Didem Gurdur
    Schooling, Jennifer
    IFAC PAPERSONLINE, 2020, 53 (03): : 90 - 94
  • [30] Reimagining Synthetic Tabular Data Generation through Data-Centric AI: A Comprehensive Benchmark
    Hansen, Lasse
    Seedat, Nabeel
    van der Schaar, Mihaela
    Petrovic, Andrija
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,