A systematic review of text mining approaches applied to various application areas in the biomedical domain

被引:17
作者
Cheerkoot-Jalim, Sudha [1 ]
Khedo, Kavi Kumar [2 ]
机构
[1] Univ Mauritius, Dept Informat & Commun Technol, Reduit, Mauritius
[2] Univ Mauritius, Dept Digital Technol, Reduit, Mauritius
关键词
Biomedical literature; Biomedical text mining; Kitchenham methodology; Text mining Techniques; Text mining tools; ADVERSE DRUG EVENTS; SOCIAL MEDIA; CLINICAL TEXT; EXTRACTION; PHARMACOVIGILANCE; DISCOVERY; COVERAGE; CRITERIA; DISEASE; RECORDS;
D O I
10.1108/JKM-09-2019-0524
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Purpose This work shows the results of a systematic literature review on biomedical text mining. The purpose of this study is to identify the different text mining approaches used in different application areas of the biomedical domain, the common tools used and the challenges of biomedical text mining as compared to generic text mining algorithms. This study will be of value to biomedical researchers by allowing them to correlate text mining approaches to specific biomedical application areas. Implications for future research are also discussed. Design/methodology/approach The review was conducted following the principles of the Kitchenham method. A number of research questions were first formulated, followed by the definition of the search strategy. The papers were then selected based on a list of assessment criteria. Each of the papers were analyzed and information relevant to the research questions were extracted. Findings It was found that researchers have mostly harnessed data sources such as electronic health records, biomedical literature, social media and health-related forums. The most common text mining technique was natural language processing using tools such as MetaMap and Unstructured Information Management Architecture, alongside the use of medical terminologies such as Unified Medical Language System. The main application area was the detection of adverse drug events. Challenges identified included the need to deal with huge amounts of text, the heterogeneity of the different data sources, the duality of meaning of words in biomedical text and the amount of noise introduced mainly from social media and health-related forums. Originality/value To the best of the authors' knowledge, other reviews in this area have focused on either specific techniques, specific application areas or specific data sources. The results of this review will help researchers to correlate most relevant and recent advances in text mining approaches to specific biomedical application areas by providing an up-to-date and holistic view of work done in this research area. The use of emerging text mining techniques has great potential to spur the development of innovative applications, thus considerably impacting on the advancement of biomedical research.
引用
收藏
页码:642 / 668
页数:27
相关论文
共 73 条
  • [1] DiseaSE: A biomedical text analytics system for disease symptom extraction and characterization
    Abulaish, Muhammad
    Parwez, Md Aslam
    Jahiruddin
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 100
  • [2] Impact of Predicting Health Care Utilization Via Web Search Behavior: A Data-Driven Analysis
    Agarwal, Vibhu
    Zhang, Liangliang
    Zhu, Josh
    Fang, Shiyuan
    Cheng, Tim
    Hong, Chloe
    Shah, Nigam H.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2016, 18 (09)
  • [3] Text mining brain imaging reports
    Alex, Beatrice
    Grover, Claire
    Tobin, Richard
    Sudlow, Cathie
    Mair, Grant
    Whiteley, William
    [J]. JOURNAL OF BIOMEDICAL SEMANTICS, 2019, 10 (Suppl 1)
  • [4] Allahyari M., 2017, ARXIV PREPRINT ARXIV
  • [5] Amrit C., 2016, ARXIV PREPRINT ARXIV
  • [6] Mining free-text medical records for companion animal enteric syndrome surveillance
    Anholt, R. M.
    Berezowski, J.
    Jamal, I.
    Ribble, C.
    Stephen, C.
    [J]. PREVENTIVE VETERINARY MEDICINE, 2014, 113 (04) : 417 - 422
  • [7] Arnold P., 2015, DATENBANKSYSTEME BUS
  • [8] Bendig Regina B., 2009, Issues in Science & Technology Librarianship
  • [9] A Case Study of Data Quality in Text Mining Clinical Progress Notes
    Berndt, Donald J.
    Mccart, James A.
    Finch, Dezon K.
    Luther, Stephen L.
    [J]. ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2015, 6 (01)
  • [10] Getting evidence into practice-implementation science for paediatricians
    Blair, Mitch
    [J]. ARCHIVES OF DISEASE IN CHILDHOOD, 2014, 99 (04) : 307 - 309