A Data Quality Multidimensional Model for Social Media Analysis

被引:0
|
作者
Aramburu, Maria Jose [1 ]
Berlanga, Rafael [2 ]
Lanza-Cruz, Indira [2 ]
机构
[1] Univ Jaume 1, Dept Deengn & Ciencia Comp, Castellon de La Plana 12071, Spain
[2] Univ Jaume 1, Dept Llenguatges & Sistemes Informat, Castellon de La Plana 12071, Spain
关键词
Data quality; Social media data; Business intelligence; Text analytics; DECISION-MAKING; ANALYTICS; CREDIBILITY; TWITTER;
D O I
10.1007/s12599-023-00840-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social media platforms have become a new source of useful information for companies. Ensuring the business value of social media first requires an analysis of the quality of the relevant data and then the development of practical business intelligence solutions. This paper aims at building high-quality datasets for social business intelligence (SoBI). The proposed method offers an integrated and dynamic approach to identify the relevant quality metrics for each analysis domain. This method employs a novel multidimensional data model for the construction of cubes with impact measures for various quality metrics. In this model, quality metrics and indicators are organized in two main axes. The first one concerns the kind of facts to be extracted, namely: posts, users, and topics. The second axis refers to the quality perspectives to be assessed, namely: credibility, reputation, usefulness, and completeness. Additionally, quality cubes include a user-role dimension so that quality metrics can be evaluated in terms of the user business roles. To demonstrate the usefulness of this approach, the authors have applied their method to two separate domains: automotive business and natural disasters management. Results show that the trade-off between quantity and quality for social media data is focused on a small percentage of relevant users. Thus, data filtering can be easily performed by simply ranking the posts according to the quality metrics identified with the proposed method. As far as the authors know, this is the first approach that integrates both the extraction of analytical facts and the assessment of social media data quality in the same framework.
引用
收藏
页码:667 / 689
页数:23
相关论文
共 50 条
  • [11] Fulmqa: a fuzzy logic-based model for social media data quality assessment
    Oumaima Reda
    Ahmed Zellou
    Social Network Analysis and Mining, 13
  • [12] Effective Text Data Preprocessing Technique for Sentiment Analysis in Social Media Data
    Pradha, Saurav
    Halgamuge, Malka N.
    Nguyen Tran Quoc Vinh
    PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 108 - 115
  • [13] Analysis of Epidemic Outbreak in Delhi Using Social Media Data
    Swain, Sweta
    Seeja, K. R.
    INFORMATION, COMMUNICATION AND COMPUTING TECHNOLOGY, 2017, 750 : 25 - 34
  • [14] Survey on Data Analysis in Social Media: A Practical Application Aspect
    Hou, Qixuan
    Han, Meng
    Cai, Zhipeng
    BIG DATA MINING AND ANALYTICS, 2020, 3 (04): : 259 - 279
  • [15] The effect of social media marketing on voting intention; an application of multidimensional panel data
    Moslehpour, Massoud
    Tiwari, Aviral Kumar
    Pourfaez, Sahand Ebrahimi
    INTERNATIONAL JOURNAL OF EMERGING MARKETS, 2024,
  • [16] A Model of Preprocessing For Social Media Data Extraction
    Abidin, Dodo Zaenal
    Nurmaini, Siti
    Malik, Reza Firsandaya
    Jasmir
    Rasywir, Errissya
    Pratama, Yovi
    2019 INTERNATIONAL CONFERENCE ON INFORMATICS, MULTIMEDIA, CYBER AND INFORMATION SYSTEM (ICIMCIS), 2019, : 67 - 72
  • [17] Data Quality in Social Media Analytics for Operations and Supply Chain Performance Management
    Siekmann, Fabian
    Kinra, Aseem
    Kotzab, Herbert
    DYNAMICS IN LOGISTICS (LDIC 2022), 2022, : 104 - 116
  • [18] Mobilizing Social Media Data: Reflections of a Researcher Mediating between Data and Organization
    Garcia, Adriana Alvarado
    Wong-Villacres, Marisol
    Miceli, Milagros
    Hernandez, Benjamin
    Le Dantec, Christopher A.
    PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2023, 2023,
  • [19] TALISON - Tensor Analysis of Social Media Data
    Kao, Anne
    Ferng, William
    Poteet, Stephen
    Quach, Lesley
    Tjoelker, Rod
    2013 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS: BIG DATA, EMERGENT THREATS, AND DECISION-MAKING IN SECURITY INFORMATICS, 2013, : 137 - 142
  • [20] Forensic Analysis of Heterogeneous Social Media Data
    Nikolaidou, Aikaterini
    Lazaridis, Michalis
    Semertzidis, Theodoros
    Axenopoulos, Apostolos
    Daras, Petros
    KEOD: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL 2: KEOD, 2019, : 343 - 350