Comparing Generative AI Literature Reviews Versus Human-Led Systematic Literature Reviews: A Case Study on Big Data Research

被引:0
作者
Tosi, Davide [1 ]
机构
[1] Univ Insubria, Dept Theoret & Appl Sci, I-20110 Varese, Italy
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Big Data; Artificial intelligence; Real-time systems; Accuracy; Manuals; Generative AI; Finance; Scalability; AI-assisted research; big data; generative AI; large language models; systematic literature review; MANAGEMENT;
D O I
10.1109/ACCESS.2025.3554504
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Generative Artificial Intelligence (GenAI) and Large Language Models (LLMs) are transforming research methodologies, including Systematic Literature Reviews (SLRs). While traditional, human-led SLRs are labor-intensive, AI-driven approaches promise efficiency and scalability. However, the reliability and accuracy of AI-generated literature reviews remain uncertain. This study investigates the performance of GPT-4-powered Consensus in conducting an SLR on Big Data research, comparing its results with a manually conducted SLR. To evaluate Consensus, we analyzed its ability to detect relevant studies, extract key insights, and synthesize findings. Our human-led SLR identified 32 primary studies (PSs) and 207 related works, whereas Consensus detected 22 PSs, with 16 overlapping with the manual selection and 5 false positives. The AI-selected studies had an average citation count of 202 per study, significantly higher than the 64.4 citations per study in the manual SLR, indicating a possible bias toward highly cited papers. However, none of the 32 PSs selected manually were included in the AI-generated results, highlighting recall and selection accuracy limitations. Key findings reveal that Consensus accelerates literature retrieval but suffers from hallucinations, reference inaccuracies, and limited critical analysis. Specifically, it failed to capture nuanced research challenges and missed important application domains. Precision, recall, and F1 scores of the AI-selected studies were 76.2%, 38.1%, and 50.6%, respectively, demonstrating that while AI retrieves relevant papers with high precision, it lacks comprehensiveness. To mitigate these limitations, we propose a hybrid AI-human SLR framework, where AI enhances search efficiency while human reviewers ensure rigor and validity. While AI can support literature reviews, human oversight remains essential for ensuring accuracy and depth. Future research should assess AI-assisted SLRs across multiple disciplines to validate generalizability and explore domain-specific LLMs for improved performance.
引用
收藏
页码:56210 / 56219
页数:10
相关论文
共 36 条
  • [1] Privacy-aware Big Data Analytics as a service for public health policies in smart cities
    Anisetti, Marco
    Ardagna, Claudio
    Bellandi, Valerio
    Cremonini, Marco
    Frati, Fulvio
    Damiani, Ernesto
    [J]. SUSTAINABLE CITIES AND SOCIETY, 2018, 39 : 68 - 77
  • [2] Bolanos F, 2024, Arxiv, DOI [arXiv:2402.08565, 10.48550/arXiv.2402.08565, DOI 10.48550/ARXIV.2402.08565]
  • [3] Bozkurt A., 2021, Eurasia Proc. Sci. Technol. Eng. Math., V24, P177
  • [4] Castillo-Segura P., 2023, P WORLD ENG ED FOR G, P1, DOI [10.1109/weef-gedc59520.2023.10344098, DOI 10.1109/WEEF-GEDC59520.2023.10344098]
  • [5] Big Data and Predictive Analytics for Business Intelligence: A Bibliographic Study (2000-2021)
    Chen, Yili
    Li, Congdong
    Wang, Han
    [J]. FORECASTING, 2022, 4 (04): : 767 - 786
  • [7] Medical Internet of Things and Big Data in Healthcare
    Dimitrov, Dimiter V.
    [J]. HEALTHCARE INFORMATICS RESEARCH, 2016, 22 (03) : 156 - 163
  • [8] How to optimize the systematic review process using AI tools
    Fabiano, Nicholas
    Gupta, Arnav
    Bhambra, Nishaant
    Luu, Brandon
    Wong, Stanley
    Maaz, Muhammad
    Fiedorowicz, Jess G.
    Smith, Andrew L.
    Solmi, Marco
    [J]. JCPP ADVANCES, 2024, 4 (02):
  • [9] BIG DATA AND DATA SCIENCE METHODS FOR MANAGEMENT RESEARCH
    George, Gerard
    Osinga, Ernst C.
    Lavie, Dovev
    Scott, Brent A.
    [J]. ACADEMY OF MANAGEMENT JOURNAL, 2016, 59 (05) : 1493 - 1507
  • [10] Retail business analytics: Customer visit segmentation using market basket data
    Griva, Anastasia
    Bardaki, Cleopatra
    Pramatari, Katerina
    Papakiriakopoulos, Dimitris
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 100 : 1 - 16