FINDING PEOPLE WITH EMOTIONAL DISTRESS IN ONLINE SOCIAL MEDIA: A DESIGN COMBINING MACHINE LEARNING AND RULE-BASED CLASSIFICATION

被引:77
作者
Chau, Michael [1 ]
Li, Tim M. H. [2 ]
Wong, Paul W. C. [2 ]
Xu, Jennifer J. [3 ]
Yip, Paul S. F. [2 ,4 ]
Chen, Hsinchun [5 ]
机构
[1] Univ Hong Kong, Fac Business & Econ, Pokfulam, Hong Kong, Peoples R China
[2] Univ Hong Kong, Dept Social Work & Social Adm, Pokfulam, Hong Kong, Peoples R China
[3] Bentley Univ, Comp Informat Syst, Waltham, MA 02452 USA
[4] Univ Hong Kong, Fac Social Sci, HKJC Ctr Suicide Res & Prevent, Pokfulam, Hong Kong, Peoples R China
[5] Univ Arizona, Dept Management Informat Syst, Tucson, AZ 85721 USA
关键词
Social media; emotional distress; suicide research; design science; classification; PSYCHOLOGICAL MECHANISMS; GENETIC ALGORITHM; LANGUAGE USE; SUICIDE; HEALTH; VALIDITY; SCIENCE; LEISURE; WORDS; MODEL;
D O I
10.25300/MISQ/2020/14110
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many people face problems of emotional distress. Early detection of high-risk individuals is the key to prevent suicidal behavior. There is increasing evidence that the Internet and social media provide clues of people's emotional distress. In particular, some people leave messages showing emotional distress or even suicide notes on the Internet. Identifying emotionally distressed people and examining their posts on the Internet are important steps for health and social work professionals to provide assistance, but the process is very timeconsuming and ineffective if conducted manually using standard search engines. Following the design science approach, we present the design of a system called KAREN, which identifies individuals who blog about their emotional distress in the Chinese language, using a combination of machine learning classification and rulebased classification with rules obtained from experts. A controlled experiment and a user study were conducted to evaluate system performance in searching and analyzing blogs written by people who might be emotionally distressed. The results show that the proposed system achieved better classification performance than the benchmark methods and that professionals perceived the system to be more useful and effective for identifying bloggers with emotional distress than benchmark approaches.
引用
收藏
页码:933 / 955
页数:23
相关论文
共 80 条
[1]  
Abbasi Ahmed, 2007, 2007 IEEE Intelligence and Security Informatics, P282, DOI 10.1109/ISI.2007.379486
[2]   Affect analysis of web forums and blogs using correlation ensembles [J].
Abbasi, Ahmed ;
Chen, Hsinchun ;
Thoms, Sven ;
Fu, Tianjun .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (09) :1168-1180
[3]  
Abbasi A, 2008, MIS QUART, V32, P811
[4]  
[Anonymous], 1979, INFORM RETRIEVAL
[5]  
[Anonymous], 2013, P SIGCHI C HUM FACT, DOI DOI 10.1145/2470654.2466447
[6]  
[Anonymous], GENETIC ALGORITHMS S
[7]  
[Anonymous], 2012, SENTIMENT ANAL OPINI
[8]  
[Anonymous], 2003, PRACTICAL GUIDE SUPP
[9]  
[Anonymous], 1998, SPRINGER INT SER ENG
[10]  
Balahur A., 2006, P WORKSH INT AN PROC, P2216