Applying text mining methods to suicide research

被引:7
作者
Cheng, Qijin [1 ]
Lui, Carrie S. M. [2 ]
机构
[1] Chinese Univ Hong Kong, Dept Social Work, Hong Kong, Peoples R China
[2] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Peoples R China
关键词
LANGUAGE; RISK;
D O I
10.1111/sltb.12680
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
Objective To introduce the research methods of computerized text mining and its possible applications in suicide research and to demonstrate the procedures of applying a specific text mining area, document classification, to a suicide-related study. Method A systematic search of academic papers that applied text mining methods to suicide research was conducted. Relevant papers were reviewed focusing on their research objectives and sources of data. Furthermore, a case of using natural language processing and document classification methods to analyze a large amount of suicide news was elaborated to showcase the methods. Results Eighty-six papers using text mining methods for suicide research have been published since 2001. The most common research objective (72.1%) was to classify which documents exhibit suicide risk or were written by suicidal people. The most frequently used data source was online social media posts (45.3%), followed by e-healthcare records (25.6%). For the news classification case, the top three classifiers trained for classification tasks achieved 84% or higher accuracy. Conclusions Computerized text mining methods can help to scale up content analysis capacity and efficiency and uncover new insights and perspectives for suicide research.
引用
收藏
页码:137 / 147
页数:11
相关论文
共 71 条
  • [1] Abdelali A., 2016, P 2016 C N AM CHAPT, VVolume 2016, P11, DOI 10.18653/v1/N16-3003
  • [2] Toward Automatic Risk Assessment to Support Suicide Prevention
    Adamou, Marios
    Antoniou, Grigoris
    Greasidou, Elissavet
    Lagani, Vincenzo
    Charonyktakis, Paulos
    Tsamardinos, Ioannis
    Doyle, Michael
    [J]. CRISIS-THE JOURNAL OF CRISIS INTERVENTION AND SUICIDE PREVENTION, 2019, 40 (04) : 249 - 256
  • [3] Aggarwal C.C., 2012, MINING TEXT DATA, V2012, DOI [10.1007/978, DOI 10.1007/978]
  • [4] Monitoring Suicidal Patients in Primary Care Using Electronic Health Records
    Anderson, Heather D.
    Pace, Wilson D.
    Brandt, Elias
    Nielsen, Rodney D.
    Allen, Richard R.
    Libby, Anne M.
    West, David R.
    Valuck, Robert J.
    [J]. JOURNAL OF THE AMERICAN BOARD OF FAMILY MEDICINE, 2015, 28 (01) : 65 - 71
  • [5] [Anonymous], 2014, INT C MACHINE LEARNI
  • [6] Long short-term memory
    Hochreiter, S
    Schmidhuber, J
    [J]. NEURAL COMPUTATION, 1997, 9 (08) : 1735 - 1780
  • [7] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [8] Calzolari N, 2008, SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008
  • [9] [曹莹 Cao Ying], 2013, [自动化学报, Acta Automatica Sinica], V39, P745
  • [10] Identification of suicidal behavior among psychiatrically hospitalized adolescents using natural language processing and machine learning of electronic health records
    Carson, Nicholas J.
    Mullin, Brian
    Sanchez, Maria Jose
    Lu, Frederick
    Yang, Kelly
    Menezes, Michelle
    Le Cook, Benjamin
    [J]. PLOS ONE, 2019, 14 (02):