Automatic Classification of Pulmonary Tuberculosis and Sarcoidosis based on Random Forest

被引:0
作者
Wu, Yuanli [1 ]
Wang, Hong [1 ]
Wu, Fei [1 ]
机构
[1] 309th Hosp Chinese PLA, Informat Dept, Beijing, Peoples R China
来源
2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI) | 2017年
关键词
machine learning; random forest; desease classification;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
With the accumulation of medical data and rapid development of artificial intelligence, machine learning has entered the medical field, and especially has been widely adopted in disease diagnosis. The essence of disease identification is classification. In this article, we used the medical data of hospitalized patients in our hospital to train random forest classifiers to make disease differentiation between pulmonary tuberculosis and sarcoidosis. Since there were various medical data formats for patients, and these data spreaded in many isolated medical systems, feature selection was difficult for disease classification. We made feature selection automatically only on laboratory result based on some strategies. Using the laboratory result data set, we performed classification with an average AUC of 81% automatically without doctor's intervention. The results of random forest model gave the importance score of each feature, which provided a basis for early diagnosis and optimization of diagnostic processes of pulmonary tuberculosis and sarcoidosis.
引用
收藏
页数:5
相关论文
共 8 条
[1]  
Aixiang C., 2014, COMPUTER SCI, V41, P103
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   Sarcoidosis Clinical Presentation, Immunopathogenesis, and Therapeutics [J].
Iannuzzi, Michael C. ;
Fontana, Joseph R. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2011, 305 (04) :391-399
[4]  
Mantzaris DH, 2008, IEEE INT C BIOINF BI, P793
[5]   A comparison of random forest and its Gini importance with standard chemometric methods for the feature selection and classification of spectral data [J].
Menze, Bjoern H. ;
Kelm, B. Michael ;
Masuch, Ralf ;
Himmelreich, Uwe ;
Bachert, Peter ;
Petrich, Wolfgang ;
Hamprecht, Fred A. .
BMC BIOINFORMATICS, 2009, 10
[6]  
Wei L. F., 2011, P INT MULT ENG COMP, V1, P1
[7]   A comparative study of ensemble learning approaches in the classification of breast cancer metastasis [J].
Zhang, Wangshu ;
Zeng, Feng ;
Wu, Xuebing ;
Zhang, Xuegong ;
Jiang, Rui .
2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, :242-+
[8]  
zhongshun K., 2016, CHINESE J ANTITUBERC, V38, P443