Opportunities and Challenges in Data-Centric AI

Cited by: 9
Authors
Kumar, Sushant [1]
Datta, Sumit [2]
Singh, Vishakha [1]
Singh, Sanjay Kumar [1]
Sharma, Ritesh [3]
Affiliations
[1] Indian Inst Technol BHU, Dept Comp Sci & Engn, Varanasi 221005, India
[2] Digital Univ Kerala Formerly IIITM Kerala, Sch Elect Syst & Automat, Thiruvananthapuram 695317, India
[3] Manipal Acad Higher Educ, Manipal Inst Technol, Dept Informat & Commun Technol, Manipal 576104, Karnataka, India
Keywords
Artificial intelligence; model-centric AI; data-centric AI; data
DOI
10.1109/ACCESS.2024.3369417
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
Artificial intelligence (AI) systems are trained on large volumes of data to solve complex problems and to perform specific tasks such as prediction, classification, recognition, and decision-making. Over the past three decades, AI research has focused mostly on the model-centric approach rather than the data-centric approach. In the model-centric approach, the focus is on improving the code or model architecture to enhance performance, whereas in data-centric AI, the focus is on improving the dataset. Data is food for AI, and as a result there has been a recent push in the AI community from model-centric AI toward data-centric AI. This paper provides a comprehensive and critical analysis of the current state of research in data-centric AI and presents insights into the latest developments in this rapidly evolving field. By emphasizing the importance of data in AI, the paper identifies the key challenges and opportunities that must be addressed to improve the effectiveness of AI systems. Finally, it offers recommendations for research opportunities in data-centric AI.
Pages: 33173-33189
Page count: 17
Related papers
50 in total
  • [1] A Data-Centric AI Paradigm for Socio-Industrial and Global Challenges
    Majeed, Abdul
    Hwang, Seong Oun
    ELECTRONICS, 2024, 13 (11)
  • [2] Data collection and quality challenges in deep learning: a data-centric AI perspective
    Steven Euijong Whang
    Yuji Roh
    Hwanjun Song
    Jae-Gil Lee
    The VLDB Journal, 2023, 32 : 791 - 813
  • [3] Data collection and quality challenges in deep learning: a data-centric AI perspective
    Whang, Steven Euijong
    Roh, Yuji
    Song, Hwanjun
    Lee, Jae-Gil
    VLDB JOURNAL, 2023, 32 (04) : 791 - 813
  • [4] dcbench: A Benchmark for Data-Centric AI Systems
    Eyuboglu, Sabri
    Karlas, Bojan
    Re, Christopher
    Zhang, Ce
    Zou, James
    PROCEEDINGS OF THE 6TH WORKSHOP ON DATA MANAGEMENT FOR END-TO-END MACHINE LEARNING, DEEM 2022, 2022
  • [5] Data-Centric and Model-Centric AI: Twin Drivers of Compact and Robust Industry 4.0 Solutions
    Hamid, Oussama H.
    APPLIED SCIENCES-BASEL, 2023, 13 (05)
  • [6] Potential Impact of Data-Centric AI on Society
    Kumar, Sushant
    Sharma, Ritesh
    Singh, Vishakha
    Tiwari, Shrikant
    Singh, Sanjay Kumar
    Datta, Sumit
    IEEE TECHNOLOGY AND SOCIETY MAGAZINE, 2023, 42 (03) : 98 - 107
  • [7] Data-centric Engineering: integrating simulation, machine learning and statistics. Challenges and opportunities
    Pan, Indranil
    Mason, Lachlan R.
    Matar, Omar K.
    CHEMICAL ENGINEERING SCIENCE, 2022, 249
  • [8] Data-Centric Green AI: An Exploratory Empirical Study
    Verdecchia, Roberto
    Cruz, Luis
    Sallou, June
    Lin, Michelle
    Wickenden, James
    Hotellier, Estelle
    2022 INTERNATIONAL CONFERENCE ON ICT FOR SUSTAINABILITY (ICT4S 2022), 2022, : 35 - 45
  • [9] A data-centric approach for ethical and trustworthy AI in journalism
    Dierickx, Laurence
    Opdahl, Andreas Lothe
    Khan, Sohail Ahmed
    Linden, Carl-Gustav
    Guerrero Rojas, Diana Carolina
    ETHICS AND INFORMATION TECHNOLOGY, 2024, 26 (04)
  • [10] Enhancing Collaboration and Agility in Data-Centric AI Projects
    Stieler, Fabian
    Bauer, Bernhard
    EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING, ENASE 2023, 2024, 2028 : 321 - 343