Suicidal ideation prediction in twitter data using machine learning techniques

被引:30
作者
Kumar, E. Rajesh [1 ]
Rao, K. V. S. N. Rama [1 ]
Nayak, Soumya Ranjan [2 ]
Chandra, Ramesh [3 ,4 ]
机构
[1] Koneru Lakshmaiah Educ Fdn, Dept Comp Sci & Engn, Guntur 522502, Andhra Pradesh, India
[2] Amity Univ, Amity Sch Engn & Technol, Noida 201303, Uttar Pradesh, India
[3] Norwegian Univ Sci & Technol, Cyber Phys Syst Lab, Dept ICT & Nat Sci, Alesund, Norway
[4] Amity Univ Rajasthan, Amity Inst Informat Technol, Jaipur, Rajasthan, India
关键词
Suicidal ideation; Classification; Sentiment analysis; Suicidal detection; Feature selection;
D O I
10.1080/09720502.2020.1721674
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
People prefer new technology by using online social media as a communication channels to express their suicidal thoughts. Primary identification and detection are viewed as an effective approach to avoid suicidal attempt and suicidal ideation- two basic hazards causing effective suicide. This paper exhibits different techniques to comprehend suicidal ideation through online user contents in particularly by considering twitter data for past last two years as an objective of early detection by means of sentiment analysis and supervised leaning methods. Analysing the text descriptions and users language exposes rich knowledge that can be utilized as a primary cautioning system for suicidal detection. To identify tweets exhibiting suicidal ideation, several features are extracted and a set of features are proposed for training the model over the dataset by using ensemble and baseline classifiers. Based on the outcome of baseline classifier; improved ensemble random forest (RF) algorithm achieved an accuracy of 0.99% compared to other classification methods for suicidal prediction with tweets containing suicidal thought is better when compared to the existing system. Such experimentation and monitoring may help individual and population-wide prevention by counseling and informing to suicidal research and policy. The experimental analysis expresses the feasibility of the methodology used by providing a benchmark for suicidal detection on online social network: Twitter.
引用
收藏
页码:117 / 125
页数:9
相关论文
共 15 条
[1]  
Andrews LJB, 2017, J STAT MANAG SYST, V20, P659, DOI 10.1080/09720510.2017.1395186
[2]   Adolescent Suicide Statements on MySpace [J].
Cash, Scottye J. ;
Thelwall, Michael ;
Peck, Sydney N. ;
Ferrell, Jared Z. ;
Bridge, Jeffrey A. .
CYBERPSYCHOLOGY BEHAVIOR AND SOCIAL NETWORKING, 2013, 16 (03) :166-174
[3]  
Cerel J., 2008, J MACHINE LEARNING R, V3, P993
[4]  
Choudhury MA, 2015, HANDBOOK OF RESEARCH ON ISLAMIC BUSINESS ETHICS, P85
[5]   Analysing the connectivity and communication of suicidal users on twitter [J].
Colombo, Gualtiero B. ;
Burnap, Pete ;
Hodorog, Andrei ;
Scourfield, Jonathan .
COMPUTER COMMUNICATIONS, 2016, 73 :291-300
[6]  
Huang H.Y., 2016, P CHI C HUM FACT COM
[7]   Sentiment classification of twitter data belonging to renewable energy using machine learning [J].
Jain, Achin ;
Jain, Vanita .
JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2019, 40 (02) :521-533
[8]   Tracking Suicide Risk Factors Through Twitter in the US [J].
Jashinsky, Jared ;
Burton, Scott H. ;
Hanson, Carl L. ;
West, Josh ;
Giraud-Carrier, Christophe ;
Barnes, Michael D. ;
Argyle, Trenton .
CRISIS-THE JOURNAL OF CRISIS INTERVENTION AND SUICIDE PREVENTION, 2014, 35 (01) :51-59
[9]   Ground truth study on fractal dimension of color images of similar texture [J].
Nayak, Soumya ;
Khandual, Asimananda ;
Mishra, Jibitesh .
JOURNAL OF THE TEXTILE INSTITUTE, 2018, 109 (09) :1159-1167
[10]   A modified triangle box-counting with precision in error fit [J].
Nayak, Soumya Ranjan ;
Mishra, Jibitesh .
JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2018, 39 (01) :113-128