Detecting Fake News With Weak Social Supervision

被引:18
作者
Shu, Kai [1 ]
Dumais, Susan [2 ]
Awadallah, Ahmed Hassan [2 ]
Liu, Huan [1 ]
机构
[1] Arizona State Univ, Tempe, AZ 85281 USA
[2] Microsoft Res, Redmond, WA 98052 USA
关键词
Social media; Social networking; Weak supervision;
D O I
10.1109/MIS.2020.2997781
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Limited labeled data are becoming one of the largest bottlenecks for supervised learning systems. This is especially the case for many real-world tasks, where large-scale labeled examples are either too expensive to acquire or unavailable due to privacy or data access constraints. Weak supervision has shown to be effective in mitigating the scarcity of labeled data by leveraging weak labels or injecting constraints from heuristic rules and/or extrinsic knowledge sources. Social media has little labeled data but possesses unique characteristics that make it suitable for generating weak supervision, resulting in a new type of weak supervision, i.e., weak social supervision. In this article, we illustrate how various aspects of social media can be used as weak social supervision. Specifically, we use the recent research on fake news detection as the use case, where social engagements are abundant but annotated examples are scarce, to show that weak social supervision is effective when facing the labeled data scarcity problem. This article opens the door to learning with weak social supervision for similar emerging tasks when labeled data are limited.
引用
收藏
页码:96 / 103
页数:8
相关论文
共 16 条
[1]   What Makes Online Content Viral? [J].
Berger, Jonah ;
Milkman, Katherine L. .
JOURNAL OF MARKETING RESEARCH, 2012, 49 (02) :192-205
[2]  
Castillo C., 2011, P 20 INT C WORLD WID, P675, DOI 10.1145/1963405.1963500
[3]   Disinformation on the Web: Impact, Characteristics, and Detection of Wikipedia Hoaxes [J].
Kumar, Srijan ;
West, Robert ;
Leskovec, Jure .
PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, :591-602
[4]   Making Deep Neural Networks Robust to Label Noise: a Loss Correction Approach [J].
Patrini, Giorgio ;
Rozza, Alessandro ;
Menon, Aditya Krishna ;
Nock, Richard ;
Qu, Lizhen .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2233-2241
[5]  
Pennebaker JW, 2015, LIWC 2015 operator's manual. Linguistic Inquiry and Word Count: LIWC 2015
[6]  
Quattrociocchi W., 2016, SOCIAL SCI RES NETWO
[7]  
Rubin Victoria, 2015, P HAW INT C SYST SCI
[8]  
Shu K., 2020, LECT NOTES SOCIAL NE
[9]   dEFEND: Explainable Fake News Detection [J].
Shu, Kai ;
Cui, Limeng ;
Wang, Suhang ;
Lee, Dongwon ;
Liu, Huan .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :395-405
[10]   Beyond News Contents: The Role of Social Context for Fake New Detection [J].
Shu, Kai ;
Wang, Suhang ;
Liu, Huan .
PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, :312-320