Deep Learning Based Biomedical Literature Classification Using Criteria of Scientific Rigor

被引:5
作者
Afzal, Muhammad [1 ]
Park, Beom Joo [2 ]
Hussain, Maqbool [1 ]
Lee, Sungyoung [2 ]
机构
[1] Sejong Univ, Dept Software, Seoul 05006, South Korea
[2] Kyung Hee Univ, Dept Comp Sci & Engn, Ubiquitous Comp Lab, Yongin 446701, Gyeonggi Do, South Korea
关键词
healthcare; deep learning; evidence-based medicine; biomedical literature; health information management; IDENTIFICATION;
D O I
10.3390/electronics9081253
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A major blockade to support the evidence-based clinical decision-making is accurately and efficiently recognizing appropriate and scientifically rigorous studies in the biomedical literature. We trained a multi-layer perceptron (MLP) model on a dataset with two textual features, title and abstract. The dataset consisting of 7958 PubMed citations classified in two classes: scientific rigor and non-rigor, is used to train the proposed model. We compare our model with other promising machine learning models such as Support Vector Machine (SVM), Decision Tree, Random Forest, and Gradient Boosted Tree (GBT) approaches. Based on the higher cumulative score, deep learning was chosen and was tested on test datasets obtained by running a set of domain-specific queries. On the training dataset, the proposed deep learning model obtained significantly higher accuracy and AUC of 97.3% and 0.993, respectively, than the competitors, but was slightly lower in the recall of 95.1% as compared to GBT. The trained model sustained the performance of testing datasets. Unlike previous approaches, the proposed model does not require a human expert to create fresh annotated data; instead, we used studies cited in Cochrane reviews as a surrogate for quality studies in a clinical topic. We learn that deep learning methods are beneficial to use for biomedical literature classification. Not only do such methods minimize the workload in feature engineering, but they also show better performance on large and noisy data.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 21 条
[1]   Context-aware grading of quality evidences for evidence-based decision-making [J].
Afzal, Muhammad ;
Hussain, Maqbool ;
Haynes, Robert Brian ;
Lee, Sungyoung .
HEALTH INFORMATICS JOURNAL, 2019, 25 (02) :429-445
[2]  
Anderlucci L., 2019, ARXIV190207068
[3]  
[Anonymous], **NON-TRADITIONAL**
[4]  
[Anonymous], **NON-TRADITIONAL**
[5]   'Scoping the scope' of a cochrane review [J].
Armstrong, Rebecca ;
Hall, Belinda J. ;
Doyle, Jodie ;
Waters, Elizabeth .
JOURNAL OF PUBLIC HEALTH, 2011, 33 (01) :147-150
[6]   Automatic identification of high impact articles in PubMed to support clinical decision making [J].
Bian, Jiantao ;
Morid, Mohammad Amin ;
Jonnalagadda, Siddhartha ;
Luo, Gang ;
Del Fiol, Guilherme .
JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 73 :95-103
[7]   Building deep learning models for evidence classification from the open access biomedical literature [J].
Burns, Gully A. ;
Li, Xiangci ;
Peng, Nanyun .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2019,
[8]   A Deep Learning Method to Automatically Identify Reports of Scientifically Rigorous Clinical Research from the Biomedical Literature: Comparative Analytic Study [J].
Del Fiol, Guilherme ;
Michelson, Matthew ;
Iorio, Alfonso ;
Cotoi, Chris ;
Haynes, R. Brian .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2018, 20 (06)
[9]   Evidence based medicine manifesto for better healthcare [J].
Heneghan, Carl ;
Mahtani, Kamal R. ;
Goldacre, Ben ;
Godlee, Fiona ;
Macdonald, Helen ;
Jarvies, Duncan .
BMJ-BRITISH MEDICAL JOURNAL, 2017, 357
[10]  
Khan Aurangzeb, 2010, Journal of Advances in Information Technology, V1, P4, DOI 10.4304/jait.1.1.4-20