T-CREo: A Twitter Credibility Analysis Framework

被引:9
作者
Cardinale, Yudith [1 ,2 ]
Dongo, Irvin [1 ,3 ]
Robayo, German [2 ]
Cabeza, David [2 ]
Aguilera, Ana [4 ]
Medina, Sergio [2 ]
机构
[1] Univ Catolica San Pablo, Elect & Elect Engn Dept, Arequipa 04001, Peru
[2] Univ Simon Bolivar, Dept Computac & Tecnol Informac, Caracas 1080, Venezuela
[3] Univ Bordeaux, ESTIA Inst Technol, F-64210 Bidart, France
[4] Univ Valparaiso, Fac Ingn, Escuela Ingn Informat, Valparaiso 2340000, Chile
关键词
Social networking (online); Blogs; Real-time systems; Computer architecture; Analytical models; Adaptation models; Service-oriented architecture; API; credibilty; fake news; information sources; twitter; web scraping;
D O I
10.1109/ACCESS.2021.3060623
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social media and other platforms on Internet are commonly used to communicate and generate information. In many cases, this information is not validated, which makes it difficult to use and analyze. Although there exist studies focused on information validation, most of them are limited to specific scenarios. Thus, a more general and flexible architecture is needed, that can be adapted to user/developer requirements and be independent of the social media platform. We propose a framework to automatically and in real-time perform credibility analysis of posts on social media, based on three levels of credibility: Text, User, and Social. The general architecture of our framework is composed of a front-end, a light client proposed as a web plug-in for any browser; a back-end that implements the logic of the credibility model; and a third-party services module. We develop a first version of the proposed system, called T-CREo (Twitter CREdibility analysis framework) and evaluate its performance and scalability. In summary, the main contributions of this work are: the general framework design; a credibility model adaptable to various social networks, integrated into the framework; and T-CREo as a proof of concept that demonstrates the framework applicability and allows evaluating its performance for unstructured information sources; results show that T-CREo qualifies as a highly scalable real-time service. The future work includes the improvement of T-CREo implementation, to provide a robust architecture for the development of third-party applications, as well as the extension of the credibility model for considering bots detection, semantic analysis and multimedia analysis.
引用
收藏
页码:32498 / 32516
页数:19
相关论文
共 35 条
[11]   Twitter Heron: Towards Extensible Streaming Engines [J].
Fu, Maosong ;
Agrawal, Ashvin ;
Floratou, Avrilia ;
Graham, Bill ;
Jorgensen, Andrew ;
Li, Runhang ;
Lu, Neng ;
Ramasamy, Karthik ;
Rao, Sriram ;
Wang, Cong .
2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, :1165-1172
[12]  
Giovanetti R, 2016, PROCEEDINGS OF THE 2016 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING ASONAM 2016, P677
[13]  
Goonetilleke O., 2014, ACM SIGKDD Explorations Newsletter, V16, P11, DOI [10.1145/2674026.2674029, DOI 10.1145/2674026.2674029]
[14]  
Gupta A, 2014, LECT NOTES COMPUT SC, V8851, P228, DOI 10.1007/978-3-319-13734-6_16
[15]   A Hybrid Approach for Fake News Detection in Twitter Based on User Features and Graph Embedding [J].
Hamdi, Tarek ;
Slimi, Hamda ;
Bounhas, Ibrahim ;
Slimani, Yahya .
DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY (ICDCIT 2020), 2020, 11969 :266-280
[16]  
Hernandez-Suarez, 2018, WEB SCRAPING METHODO
[17]  
Hernandez-Suarez A, 2018, P SOMET, P453
[18]  
Idrees AM, 2019, INT J ADV COMPUT SC, V10, P311
[19]  
Iftene A, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P6166
[20]   Towards a Service-Oriented Architecture for Pre-processing Crowd-Sourced Sentiment from Twitter [J].
Jarrett, Julian ;
Hemmings-Jarrett, Kimberley ;
Blake, M. Brian .
2019 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2019), 2019, :163-171