Meta-learning for fake news detection surrounding the Syrian war

被引:10
作者
Abu Salem, Fatima K. [1 ]
Al Feel, Roaa [1 ]
Elbassuoni, Shady [1 ]
Ghannam, Hiyam [1 ]
Jaber, Mohamad [2 ]
Farah, May [3 ]
机构
[1] Amer Univ Beirut, Dept Comp Sci, Beirut, Lebanon
[2] Google, Zurich, Switzerland
[3] Amer Univ Beirut, Dept Sociol Anthropol & Media Studies, Beirut, Lebanon
来源
PATTERNS | 2021年 / 2卷 / 11期
关键词
DSML 3: Development/pre-production: Data science output has been rolled out/validated across multiple domains/problems; fake news detection; feature importance; feature selection; machine learning; meta-learning; Syrian war;
D O I
10.1016/j.patter.2021.100369
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, we pursue the automatic detection of fake news reporting on the Syrian war using machine learning and meta-learning. The proposed approach is based on a suite of features that include a given article's linguistic style; its level of subjectivity, sensationalism, and sectarianism; the strength of its attribution; and its consistency with other news articles from the same "media camp". To train our models, we use FA-KES, a fake news dataset about the Syrian war. A suite of basic machine learning models is explored, as well as the model-agnostic meta-learning algorithm (MAML) suitable for few-shot learning, using datasets of amodest size. Feature-importance analysis confirms that the collected features specific to the Syrian war are indeed very important predictors for the output label. The meta-learning model achieves the best performance, improving upon the baseline approaches that are trained exclusively on text features in FA-KES.
引用
收藏
页数:14
相关论文
共 28 条
[1]  
Abu Salem F.K., 2019, P ICWSM 19, P573
[2]  
[Anonymous], 2005, UAI, DOI DOI 10.5555/3020336.3020388
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   Characterizing semantic space: Neighborhood effects in word recognition [J].
Buchanan, L ;
Westbury, C ;
Burgess, C .
PSYCHONOMIC BULLETIN & REVIEW, 2001, 8 (03) :531-544
[5]  
Chawla N.V., 2006, P AAAI WORKSH 06
[6]  
Chung C.K., 2011, Applied Natural Language Processing: Identification, Investigation and Resolution, P206, DOI DOI 10.4018/978-1-60960-741-8.CH012
[7]  
Elhadad Mohamed K., 2020, Advances on P2P, Parallel, Grid, Cloud and Internet Computing. Proceedings of the 14th International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC-2019). Lecture Notes in Networks and Systems (LNNS 96), P914, DOI 10.1007/978-3-030-33509-0_86
[8]  
Finn C., 2018, MODEL AGNOSTIC METAL
[9]  
Finn C, 2017, PR MACH LEARN RES, V70
[10]   Fake News vs Satire: A Dataset and Analysis [J].
Golbeck, Jennifer ;
Mauriello, Matthew ;
Auxier, Brooke ;
Bhanushali, Keval H. ;
Bonk, Christopher ;
Bouzaghrane, Mohamed Amine ;
Buntain, Cody ;
Chanduka, Riya ;
Cheakalos, Paul ;
Everett, Jeannine B. ;
Falak, Waleed ;
Gieringer, Carl ;
Graney, Jack ;
Hoffman, Kelly M. ;
Huth, Lindsay ;
Ma, Zhenye ;
Jha, Mayanka ;
Khan, Misbah ;
Kori, Varsha ;
Lewis, Elo ;
Mirano, George ;
Mohn, William T. ;
Mussenden, Sean ;
Nelson, Tammie M. ;
Mcwillie, Sean ;
Pant, Akshat ;
Shetye, Priya ;
Shrestha, Rusha ;
Steinheimer, Alexandra ;
Subramanian, Aditya ;
Visnansky, Gina .
WEBSCI'18: PROCEEDINGS OF THE 10TH ACM CONFERENCE ON WEB SCIENCE, 2018, :17-21