Binary classification of Lupus scientific articles applying deep ensemble model on text data

被引:0
作者
Samami, Maryam [1 ]
Soure, Elham Mousazade [2 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Sari Branch, Sari, Iran
[2] Islamic Azad Univ, Dept Comp, Mashhad Branch, Mashhad, Razavi Khorasan, Iran
来源
2019 SEVENTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION PROCESSING AND COMMUNICATIONS (ICDIPC 2019) | 2019年
关键词
Lupus; Deep learning; Text classification; PubMed; LSTM; Ensemble model; DIAGNOSIS;
D O I
10.1109/icdipc.2019.8723787
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Lupus as a chronic autoimmune disease is a difficult disease to diagnose, where more than 60 percent of people with lupus were incorrectly diagnosed. Accurate and early diagnosis of lupus is a daunting challenge, where its signs and symptoms usually mimic some other diseases and may appear as either permanent or temporary signs. On the other hand, tons of scientific articles are published, accumulating valuable information, such as lupus prevention, diagnosis, and treatment plans. The availability of big data scientific articles plus deep learning text analytics methods makes it now possible to harness this data to advance Lupus research in general, providing timely fashion and high-quality information for Lupus diagnosis and treatment. In this work, we develop deep learning text categorization techniques on top of the PubMed articles to automatically classify Lupus scientific articles, demonstrating the potential for mining large-scale scientific articles with real-time update by new articles published in a daily basis. Using ensemble deep learning models help us improve the weakness of individual deep learning models when it comes to diagnostic classification. In the proposed deep ensemble model, Majority scheme is used for the output results of applied individual models, LSTM, CuDNNGRU, RNN and CNN, aiming to predict a class per each sample.
引用
收藏
页码:12 / 17
页数:6
相关论文
共 25 条
  • [1] Abdar M., 2018, Journal of AI and Data Mining, V6, P277, DOI 10.22044/JADM.2017.4673.1555
  • [2] Mass appraisal of residential apartments: An application of Random forest for valuation and a CART-based approach for model diagnostics
    Antipov, Evgeny A.
    Pokryshevskaya, Elena B.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (02) : 1772 - 1778
  • [3] Enhancing deep learning sentiment analysis with ensemble techniques in social applications
    Araque, Oscar
    Corcuera-Platas, Ignacio
    Sanchez-Rada, J. Fernando
    Iglesias, Carlos A.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 77 : 236 - 246
  • [4] Natural language processing in text mining for structural modeling of protein complexes
    Badal, Varsha D.
    Kundrotas, Petras J.
    Vakser, Ilya A.
    [J]. BMC BIOINFORMATICS, 2018, 19
  • [5] Caution! Warning Labels About Alcohol and Pregnancy: Unintended Consequences and Questionable Effectiveness
    Bell, Emily
    Zizzo, Natalie
    Racine, Eric
    [J]. AMERICAN JOURNAL OF BIOETHICS, 2015, 15 (03) : 18 - 20
  • [6] Chung J, 2014, ARXIV
  • [7] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
  • [8] Hassoon M, 2017, 2017 INTERNATIONAL CONFERENCE ON COMPUTER AND APPLICATIONS (ICCA), P299, DOI 10.1109/COMAPP.2017.8079783
  • [9] Jiang ZC, 2015, IEEE INT C BIOINFORM, P625, DOI 10.1109/BIBM.2015.7359756
  • [10] Biosorption of Lead(II) by Arthrobacter sp 25: Process Optimization and Mechanism
    Jin, Yu
    Wang, Xin
    Zang, Tingting
    Hu, Yang
    Hu, Xiaojing
    Ren, Guangming
    Xu, Xiuhong
    Qu, Juanjuan
    [J]. JOURNAL OF MICROBIOLOGY AND BIOTECHNOLOGY, 2016, 26 (08) : 1428 - 1438