Integrating Generative AI into Information Systems Research: A Framework for Synthetic Data Evaluation

被引:0
作者
Rossello, Nicolas Bono [1 ]
Simonofski, Anthony [1 ]
Rossello, Lluc Bono [2 ]
Castiaux, Annick [1 ]
机构
[1] Univ Namur, Namur, Belgium
[2] Univ Libre Bruxelles, Brussels, Belgium
来源
PROCEEDINGS OF THE 58TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES | 2025年
关键词
Synthetic data; Data quality; Control Theory; Information Systems Research;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative AI is paving its way into the research process. Among the plethora of available generative AI solutions, the generation of synthetic data is one of the most controversial. The current division of opinion and the lack of formal approach to AI use in research create a situation of conflicting bad practices and under-used potential. This work aims to add nuance and structure to this research practice by providing a general framework to evaluate the use of synthetic data in different stages of the research process, based on the objective and methods of generation. Relying on a breakout literature review, we explore the fields of Data quality management and Control theory to transfer method theories from these fields to help us build the framework. The resulting conceptual framework provides an iterative scheme where, based on the desired properties of the data and its comparison to the synthetic result, the researcher can improve the outcome of the generation process and, equivalently, formally present the properties that make this data suitable for research.
引用
收藏
页码:7195 / 7204
页数:10
相关论文
共 34 条
[1]  
Arnold C., 2020, PREPRINT
[2]   Methodologies for Data Quality Assessment and Improvement [J].
Batini, Carlo ;
Cappiello, Cinzia ;
Francalanci, Chiara ;
Maurino, Andrea .
ACM COMPUTING SURVEYS, 2009, 41 (03)
[3]   Data-Driven Computationally Intensive Theory Development [J].
Berente, Nicholas ;
Seidel, Stefan ;
Safadi, Hani .
INFORMATION SYSTEMS RESEARCH, 2019, 30 (01) :50-64
[4]   Theorizing Through Literature Reviews: The Miner-Prospector Continuum [J].
Breslin, Dermot ;
Gatrell, Caroline .
ORGANIZATIONAL RESEARCH METHODS, 2023, 26 (01) :139-167
[5]  
Cai L., 2015, Data Sci. J., V14, DOI [10.5334/dsj-2015-002, DOI 10.5334/DSJ-2015-002]
[6]   Synthetic data in machine learning for medicine and healthcare [J].
Chen, Richard J. ;
Lu, Ming Y. ;
Chen, Tiffany Y. ;
Williamson, Drew F. K. ;
Mahmood, Faisal .
NATURE BIOMEDICAL ENGINEERING, 2021, 5 (06) :493-497
[7]  
Christie M. A., 2005, Error Analysis and Simulations of Complex Phenomena
[8]   An Overview of Data Quality Frameworks [J].
Cichy, Corinna ;
Rass, Stefan .
IEEE ACCESS, 2019, 7 :24634-24648
[9]  
Conte R., 2001, Computational & Mathematical Organization Theory, V7, P183, DOI [10.1023/A:1012919018402, DOI 10.1023/A:1012919018402]
[10]  
Dai L., 2022, SSRN Electronic Journal