Binary classification of Lupus scientific articles applying deep ensemble model on text data

被引：0

作者：

Samami, Maryam ^{[1
]}

Soure, Elham Mousazade ^{[2
]}

机构：

[1] Islamic Azad Univ, Dept Comp Engn, Sari Branch, Sari, Iran

[2] Islamic Azad Univ, Dept Comp, Mashhad Branch, Mashhad, Razavi Khorasan, Iran

来源：

2019 SEVENTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION PROCESSING AND COMMUNICATIONS (ICDIPC 2019) | 2019年

关键词：

Lupus; Deep learning; Text classification; PubMed; LSTM; Ensemble model; DIAGNOSIS;

D O I：

10.1109/icdipc.2019.8723787

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Lupus as a chronic autoimmune disease is a difficult disease to diagnose, where more than 60 percent of people with lupus were incorrectly diagnosed. Accurate and early diagnosis of lupus is a daunting challenge, where its signs and symptoms usually mimic some other diseases and may appear as either permanent or temporary signs. On the other hand, tons of scientific articles are published, accumulating valuable information, such as lupus prevention, diagnosis, and treatment plans. The availability of big data scientific articles plus deep learning text analytics methods makes it now possible to harness this data to advance Lupus research in general, providing timely fashion and high-quality information for Lupus diagnosis and treatment. In this work, we develop deep learning text categorization techniques on top of the PubMed articles to automatically classify Lupus scientific articles, demonstrating the potential for mining large-scale scientific articles with real-time update by new articles published in a daily basis. Using ensemble deep learning models help us improve the weakness of individual deep learning models when it comes to diagnostic classification. In the proposed deep ensemble model, Majority scheme is used for the output results of applied individual models, LSTM, CuDNNGRU, RNN and CNN, aiming to predict a class per each sample.

引用

页码：12 / 17

页数：6

共 25 条

[1] Abdar M., 2018, Journal of AI and Data Mining, V6, P277, DOI 10.22044/JADM.2017.4673.1555
[2] Mass appraisal of residential apartments: An application of Random forest for valuation and a CART-based approach for model diagnostics
Antipov, Evgeny A.
Pokryshevskaya, Elena B.
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (02) : 1772 - 1778
[3] Enhancing deep learning sentiment analysis with ensemble techniques in social applications
Araque, Oscar
Corcuera-Platas, Ignacio
Sanchez-Rada, J. Fernando
Iglesias, Carlos A.
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 77 : 236 - 246
[4] Natural language processing in text mining for structural modeling of protein complexes
Badal, Varsha D.
Kundrotas, Petras J.
Vakser, Ilya A.
[J]. BMC BIOINFORMATICS, 2018, 19
[5] Caution! Warning Labels About Alcohol and Pregnancy: Unintended Consequences and Questionable Effectiveness
Bell, Emily
Zizzo, Natalie
Racine, Eric
[J]. AMERICAN JOURNAL OF BIOETHICS, 2015, 15 (03) : 18 - 20
[6] Chung J, 2014, ARXIV
[7] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[8] Hassoon M, 2017, 2017 INTERNATIONAL CONFERENCE ON COMPUTER AND APPLICATIONS (ICCA), P299, DOI 10.1109/COMAPP.2017.8079783
[9] Jiang ZC, 2015, IEEE INT C BIOINFORM, P625, DOI 10.1109/BIBM.2015.7359756
[10] Biosorption of Lead(II) by Arthrobacter sp 25: Process Optimization and Mechanism
Jin, Yu
Wang, Xin
Zang, Tingting
Hu, Yang
Hu, Xiaojing
Ren, Guangming
Xu, Xiuhong
Qu, Juanjuan
[J]. JOURNAL OF MICROBIOLOGY AND BIOTECHNOLOGY, 2016, 26 (08) : 1428 - 1438

← 1 2 3 →