Data-Centric Green AI An Exploratory Empirical Study

被引:20
|
作者
Verdecchia, Roberto [1 ]
Cruz, Luis [2 ]
Sallou, June [3 ]
Lin, Michelle [4 ]
Wickenden, James [5 ]
Hotellier, Estelle [6 ]
机构
[1] Vrije Univ Amsterdam, Amsterdam, Netherlands
[2] Delft Univ Technol, Delft, Netherlands
[3] Univ Rennes, Rennes, France
[4] McGill Univ, Montreal, PQ, Canada
[5] Univ Bristol, Bristol, England
[6] Inria, Villeneuve dAscq, France
关键词
Energy Efficiency; Artificial Intelligence; Green AI; Data-centric; Empirical Experiment; ENERGY-CONSUMPTION;
D O I
10.1109/ICT4S55073.2022.00015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the growing availability of large-scale datasets, and the popularization of affordable storage and computational capabilities, the energy consumed by AI is becoming a growing concern. To address this issue, in recent years, studies have focused on demonstrating how AI energy efficiency can be improved by tuning the model training strategy. Nevertheless, how modifications applied to datasets can impact the energy consumption of AI is still an open question. To fill this gap, in this exploratory study, we evaluate if data-centric approaches can be utilized to improve AI energy efficiency. To achieve our goal, we conduct an empirical experiment, executed by considering 6 different AI algorithms, a dataset comprising 5,574 data points, and two dataset modifications (number of data points and number of features). Our results show evidence that, by exclusively conducting modifications on datasets, energy consumption can be drastically reduced (up to 92.16%), often at the cost of a negligible or even absent accuracy decline. As additional introductory results, we demonstrate how, by exclusively changing the algorithm used, energy savings up to two orders of magnitude can be achieved. In conclusion, this exploratory investigation empirically demonstrates the importance of applying data-centric techniques to improve AI energy efficiency. Our results call for a research agenda that focuses on data-centric techniques, to further enable and democratize Green AI.
引用
收藏
页码:35 / 45
页数:11
相关论文
共 50 条
  • [1] Data-Centric AI
    Malerba, Donato
    Pasquadibisceglie, Vincenzo
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, 62 (06) : 1493 - 1502
  • [2] The Principles of Data-Centric AI
    Jarrahi, Mohammad Hossein
    Memariani, Ali
    Guha, Shion
    COMMUNICATIONS OF THE ACM, 2023, 66 (08) : 84 - 92
  • [3] A Data-centric AI Framework for Automating Exploratory Data Analysis and Data Quality Tasks
    Patel, Hima
    Guttula, Shanmukha
    Gupta, Nitin
    Hans, Sandeep
    Mittal, Ruhi Sharma
    Lokesh, N.
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2023, 15 (04):
  • [4] Data-centric AI: Perspectives and Challenges
    Zha, Daochen
    Bhat, Zaid Pervaiz
    Lai, Kwei-Herng
    Yang, Fan
    Hu, Xia
    PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 945 - 948
  • [5] Opportunities and Challenges in Data-Centric AI
    Kumar, Sushant
    Datta, Sumit
    Singh, Vishakha
    Singh, Sanjay Kumar
    Sharma, Ritesh
    IEEE ACCESS, 2024, 12 (33173-33189) : 33173 - 33189
  • [6] Knowledge sharing and protection in data-centric collaborations: An exploratory study
    Zeiringer, Johannes P.
    Thalmann, Stefan
    KNOWLEDGE MANAGEMENT RESEARCH & PRACTICE, 2022, 20 (03) : 436 - 448
  • [7] dcbench: A Benchmark for Data-Centric AI Systems
    Eyuboglu, Sabri
    Karlas, Bojan
    Re, Christopher
    Zhang, Ce
    Zou, James
    PROCEEDINGS OF THE 6TH WORKSHOP ON DATA MANAGEMENT FOR END-TO-END MACHINE LEARNING, DEEM 2022, 2022,
  • [8] Potential Impact of Data-Centric AI on Society
    Kumar, Sushant
    Sharma, Ritesh
    Singh, Vishakha
    Tiwari, Shrikant
    Singh, Sanjay Kumar
    Datta, Sumit
    IEEE TECHNOLOGY AND SOCIETY MAGAZINE, 2023, 42 (03) : 98 - 107
  • [9] Data-centric AI: Techniques and Future Perspectives
    Zha, Daochen
    Lai, Kwei-Herng
    Yang, Fan
    Zou, Na
    Gao, Huiji
    Hu, Xia
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5839 - 5840
  • [10] Data-Centric AI for Healthcare Fraud Detection
    Johnson J.M.
    Khoshgoftaar T.M.
    SN Computer Science, 4 (4)