A Data Quality Multidimensional Model for Social Media Analysis

Cited by: 0
Authors
Aramburu, Maria Jose [1]
Berlanga, Rafael [2]
Lanza-Cruz, Indira [2]
Affiliations
[1] Univ Jaume 1, Dept Engn & Ciencia Comp, Castellon de La Plana 12071, Spain
[2] Univ Jaume 1, Dept Llenguatges & Sistemes Informat, Castellon de La Plana 12071, Spain
Keywords
Data quality; Social media data; Business intelligence; Text analytics; Decision-making; Analytics; Credibility; Twitter
DOI
10.1007/s12599-023-00840-9
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Social media platforms have become a new source of useful information for companies. Ensuring the business value of social media first requires an analysis of the quality of the relevant data and then the development of practical business intelligence solutions. This paper aims at building high-quality datasets for social business intelligence (SoBI). The proposed method offers an integrated and dynamic approach to identify the relevant quality metrics for each analysis domain. This method employs a novel multidimensional data model for the construction of cubes with impact measures for various quality metrics. In this model, quality metrics and indicators are organized in two main axes. The first one concerns the kind of facts to be extracted, namely: posts, users, and topics. The second axis refers to the quality perspectives to be assessed, namely: credibility, reputation, usefulness, and completeness. Additionally, quality cubes include a user-role dimension so that quality metrics can be evaluated in terms of the user business roles. To demonstrate the usefulness of this approach, the authors have applied their method to two separate domains: automotive business and natural disasters management. Results show that the trade-off between quantity and quality for social media data is focused on a small percentage of relevant users. Thus, data filtering can be easily performed by simply ranking the posts according to the quality metrics identified with the proposed method. As far as the authors know, this is the first approach that integrates both the extraction of analytical facts and the assessment of social media data quality in the same framework.
Pages: 667-689
Number of pages: 23