Opportunities and Challenges in Data-Centric AI

被引:8
|
作者
Kumar, Sushant [1 ]
Datta, Sumit [2 ]
Singh, Vishakha [1 ]
Singh, Sanjay Kumar [1 ]
Sharma, Ritesh [3 ]
机构
[1] Indian Inst Technol BHU, Dept Comp Sci & Engn, Varanasi 221005, India
[2] Digital Univ Kerala Formerly IIITM Kerala, Sch Elect Syst & Automat, Thiruvananthapuram 695317, India
[3] Manipal Acad Higher Educ, Manipal Inst Technol, Dept Informat & Commun Technol, Manipal 576104, Karnataka, India
关键词
Artificial intelligence; model-centric AI; data-centric AI; data;
D O I
10.1109/ACCESS.2024.3369417
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Artificial intelligence (AI) systems are trained to solve complex problems and learn to perform specific tasks by using large volumes of data, such as prediction, classification, recognition, decision-making, etc. In the past three decades, AI research has focused mostly on the model-centric approach compared to the data-centric approach. In the model-centric approach, the focus is to improve the code or model architecture to enhance performance, whereas in data-centric AI, the focus is to improve the dataset to enhance performance. Data is food for AI. As a result, there has been a recent push in the AI community toward data-centric AI from model-centric AI. This paper provides a comprehensive and critical analysis of the current state of research in data-centric AI, presenting insights into the latest developments in this rapidly evolving field. By emphasizing the importance of data in AI, the paper identifies the key challenges and opportunities that must be addressed to improve the effectiveness of AI systems. Finally, this paper gives some recommendations for research opportunities in data-centric AI.
引用
收藏
页码:33173 / 33189
页数:17
相关论文
共 50 条
  • [31] Towards Unlocking the Hidden Potentials of the Data-Centric AI Paradigm in the Modern Era
    Majeed, Abdul
    Hwang, Seong Oun
    APPLIED SYSTEM INNOVATION, 2024, 7 (04)
  • [32] Data-centric automated data mining
    Campos, MM
    Stengard, PJ
    Milenova, BL
    ICMLA 2005: FOURTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2005, : 97 - 104
  • [33] Rapidly predicting Kohn–Sham total energy using data-centric AI
    Hasan Kurban
    Mustafa Kurban
    Mehmet M. Dalkilic
    Scientific Reports, 12
  • [34] RDF Data-Centric Storage
    Levandoski, Justin J.
    Mokbel, Mohamed F.
    2009 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, VOLS 1 AND 2, 2009, : 911 - 918
  • [35] Data-centric challenges with the application and adoption of artificial intelligence for drug discovery
    Ghislat, Ghita
    Hernandez-Hernandez, Saiveth
    Piwajanusorn, Chayanit
    Ballester, Pedro J.
    EXPERT OPINION ON DRUG DISCOVERY, 2024, 19 (11) : 1297 - 1307
  • [36] Data-Centric and Model-Centric AI: Twin Drivers of Compact and Robust Industry 4.0 Solutions
    Hamid, Oussama H.
    APPLIED SCIENCES-BASEL, 2023, 13 (05):
  • [37] Unpacking data-centric geotechnics
    Phoon, Kok-Kwang
    Ching, Jianye
    Cao, Zijun
    UNDERGROUND SPACE, 2022, 7 (06) : 967 - 989
  • [38] GitWorkflow for Active Learning: A Development Methodology Proposal for Data-Centric AI Projects
    Stieler, Fabian
    Bauer, Bernhard
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING, ENASE 2023, 2023, : 202 - 213
  • [39] Data-centric decision support
    Kulhavy, R
    PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 3395 - 3400
  • [40] Data-Centric Mobile Crowdsensing
    Jiang, Changkun
    Gao, Lin
    Duan, Lingjie
    Huang, Jianwei
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2018, 17 (06) : 1275 - 1288