Opportunities and Challenges in Data-Centric AI

Cited by: 9
Authors
Kumar, Sushant [1]
Datta, Sumit [2]
Singh, Vishakha [1]
Singh, Sanjay Kumar [1]
Sharma, Ritesh [3]
Affiliations
[1] Indian Inst Technol BHU, Dept Comp Sci & Engn, Varanasi 221005, India
[2] Digital Univ Kerala Formerly IIITM Kerala, Sch Elect Syst & Automat, Thiruvananthapuram 695317, India
[3] Manipal Acad Higher Educ, Manipal Inst Technol, Dept Informat & Commun Technol, Manipal 576104, Karnataka, India
Keywords
Artificial intelligence; model-centric AI; data-centric AI; data
DOI
10.1109/ACCESS.2024.3369417
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Artificial intelligence (AI) systems are trained on large volumes of data to solve complex problems and to perform specific tasks such as prediction, classification, recognition, and decision-making. Over the past three decades, AI research has focused mostly on the model-centric approach rather than the data-centric approach. In the model-centric approach, the focus is on improving the code or model architecture to enhance performance, whereas in data-centric AI, the focus is on improving the dataset. Because data is food for AI, there has been a recent push in the AI community from model-centric AI toward data-centric AI. This paper provides a comprehensive and critical analysis of the current state of research in data-centric AI, presenting insights into the latest developments in this rapidly evolving field. By emphasizing the importance of data in AI, the paper identifies the key challenges and opportunities that must be addressed to improve the effectiveness of AI systems. Finally, the paper offers recommendations for research opportunities in data-centric AI.
Pages: 33173-33189
Number of pages: 17