Thematic Trends on Data Quality Studies in Big Data Analytics: A Review

被引:0
作者
Chikon, Nazliah [1 ]
Abdul-Rahman, Shuzlina [1 ]
Aris, Syaripah Ruzaini Syed [1 ]
机构
[1] Univ Teknol MARA, Coll Comp Informat & Math, Shah Alam 40000, Selangor, Malaysia
来源
PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY | 2025年 / 33卷 / 03期
关键词
Artificial intelligence; big data analytics; data analytics; data quality; governance;
D O I
10.47836/pjst.33.3.07; 10.47836/pjst.33.3.07
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Data quality has become a critical issue in research and practice in the era of exponential data generation and increasing reliance on big data analytics (BDA) across industries. This study conducts a thematic analysis of literature published between 2020 and 2024 to examine the prevailing trends, challenges, and advancements in data quality studies within the domain of BDA. Guided by the systematic thematic review methodology, the research analysed 34 peer-reviewed studies identified from SCOPUS and Web of Science (WoS) databases, using qualitative data analysis tools such as ATLAS.ti. The findings reveal five major themes: Ontology and Data Quality Frameworks, Big Data Analytics in Various Industries, Machine Learning and AI Integration, Governance and Data Stewardship, and Tools and Techniques for Data Analysis. These themes highlight a shift towards interdisciplinary approaches, integrating advanced technologies like Artificial Intelligence (AI) and the Internet of Things (IoT) to address data quality issues. Limitations include potential selection bias from database restrictions and the exclusion of subscription-based journals, which may limit the generalisability of the findings. The study contributes to the theory by providing a comprehensive synthesis of data quality trends and their implications across various sectors. Methodologically, it demonstrates the utility of thematic analysis for consolidating diverse research. Practically, the insights inform data practitioners and policymakers on governance and technological strategies for ensuring data integrity. This review is original in its systematic exploration of thematic trendsin data quality, offering a valuable roadmap for future research and addressing the critical intersection of data quality and BDA.
引用
收藏
页码:1231 / 1256
页数:26
相关论文
共 41 条
[1]  
Al-madhrahi Zaeem, 2022, International Journal of Advanced Computer Science and Applications, V13, P461, DOI 10.14569/ijacsa.2022.0130657
[2]   BIGOWL4DQ: Ontology-driven approach for Big Data quality meta-modelling, selection and reasoning [J].
Barba-Gonzalez, Cristobal ;
Caballero, Ismael ;
Varela-Vaca, Angel Jesus ;
Cruz-Lemus, Jose A. ;
Gomez-Lopez, Maria Teresa ;
Navas-Delgado, Ismael .
INFORMATION AND SOFTWARE TECHNOLOGY, 2024, 167
[3]   From Big Data to Deep Data to Support People Analytics for Employee Attrition Prediction [J].
Ben Yahia, Nesrine ;
Hlel, Jihen ;
Colomo-Palacios, Ricardo .
IEEE ACCESS, 2021, 9 (09) :60447-60458
[4]   Advanced data analytics for ship performance monitoring under localized operational conditions [J].
Bui, Khanh Q. ;
Perera, Lokukaluge P. .
OCEAN ENGINEERING, 2021, 235
[5]   Efficient Detection of Environmental Violators: A Big Data Approach [J].
Chang, Xiangyu ;
Huang, Yinghui ;
Li, Mei ;
Bo, Xin ;
Kumar, Subodha .
PRODUCTION AND OPERATIONS MANAGEMENT, 2021, 30 (05) :1246-1270
[6]   Data, attitudinal and organizational determinants of big data analytics systems use [J].
Chen, Charlie ;
Choi, Hoon Seok ;
Ractham, Peter .
COGENT BUSINESS & MANAGEMENT, 2022, 9 (01)
[7]  
Chen Yuling, 2023, Wireless Communications and Mobile Computing, DOI [10.1155/2023/6217495, 10.1155/2023/6217495]
[8]  
Clarke V, 2013, PSYCHOLOGIST, V26, P120
[9]   Leveraging internet of things and big data analytics initiatives in European and American firms: Is data quality a way to extract business value? [J].
Corte-Real, Nadine ;
Ruivo, Pedro ;
Oliveira, Tiago .
INFORMATION & MANAGEMENT, 2020, 57 (01)
[10]   Artificial intelligence in supply chain decision-making: an environmental, social, and governance triggering and technological inhibiting protocol [J].
Hao, Xinyue ;
Demir, Emrah .
JOURNAL OF MODELLING IN MANAGEMENT, 2024, 19 (02) :605-629