Differential Diagnosis of Pleural Effusion Using Machine Learning

被引:4
作者
Kim, Na Young [1 ]
Jang, Boa [2 ,4 ]
Gu, Kang-Mo [5 ]
Park, Young Sik [3 ]
Kim, Young-Gon [2 ,6 ,8 ]
Cho, Jaeyoung [3 ,7 ,9 ]
机构
[1] Hallym Univ Dongtan Sacred Heart Hosp, Div Pulm Allergy & Crit Care Med, Dept Internal Med, Hwaseong, South Korea
[2] Seoul Natl Univ Hosp, Dept Transdisciplinary Med, Seoul, South Korea
[3] Seoul Natl Univ Hosp, Div Pulm & Crit Care Med, Dept Internal Med, Seoul, South Korea
[4] Seoul Natl Univ, Coll Engn, Interdisciplinary Program Bioengn, Seoul, South Korea
[5] Chung Ang Univ Coll Med, Dept Internal Med, Seoul, South Korea
[6] Seoul Natl Univ Coll Med, Dept Med, Seoul, South Korea
[7] Seoul Natl Univ Coll Med, Dept Internal Med, Seoul, South Korea
[8] Seoul Natl Univ Hosp, Dept Transdisciplinary Med, 101 Daehak Ro, Seoul, South Korea
[9] Seoul Natl Univ Hosp, Div Pulm & Crit Care Med, Dept Internal Med, 101 Daehak Ro, Seoul 03080, South Korea
关键词
differential diagnosis; machine learning; pleural; effusion; TUBERCULOUS PLEURISY; FLUID; YIELD; CULTURE;
D O I
10.1513/AnnalsATS.202305-410OC
中图分类号
R56 [呼吸系及胸部疾病];
学科分类号
摘要
Rationale: Differential diagnosis of pleural effusion is challenging in clinical practice. Objectives: We aimed to develop a machine learning model to classify the five common causes of pleural effusions. Methods: This retrospective study collected 49 features from clinical information, blood, and pleural fluid of adult patients who underwent diagnostic thoracentesis between October 2013 and December 2018. Pleural effusions were classified into the following five categories: transudative, malignant, parapneumonic, tuberculous, and other. The performance of five different classifiers, including multinomial logistic regression, support vector machine, random forest, extreme gradient boosting, and light gradient boosting machine (LGB), was evaluated in terms of accuracy and area under the receiver operating characteristic curve through fivefold cross-validation. Hybrid feature selection was applied to determine the most relevant features for classifying pleural effusion. Results: We analyzed 2,253 patients (training set, n = 1,459; validation set, n = 365; extra-validation set, n = 429) and found that the LGB model achieved the best performance in both validation and extra-validation sets. After feature selection, the accuracy of the LGB model with the selected 18 features was equivalent to that with all 49 features (mean6standard deviation): 0.81860.012 and 0.77760.007 in the validation and extra-validation sets, respectively. The model's mean area under the receiver operating characteristic curve was as high as 0.93060.042 and 0.91660.044 in the validation and extra-validation sets, respectively. In our model, pleural lactate dehydrogenase, protein, and adenosine deaminase levels were the most important factors for classifying pleural effusions. Conclusions: Our LGB model showed satisfactory performance for differential diagnosis of the common causes of pleural effusions. This model could provide clinicians with valuable information regarding the major differential diagnoses of pleural diseases.
引用
收藏
页码:211 / 217
页数:7
相关论文
共 29 条
  • [1] Management of malignant pleural effusions
    Antony, VB
    Loddenkemper, R
    Astoul, P
    Boutin, C
    Goldstraw, P
    Hott, J
    Panadero, FR
    Sahn, SA
    [J]. EUROPEAN RESPIRATORY JOURNAL, 2001, 18 (02) : 402 - 419
  • [2] A Randomized Trial Comparing the Diagnostic Yield of Rigid and Semirigid Thoracoscopy in Undiagnosed Pleural Effusions
    Dhooria, Sahajal
    Singh, Navneet
    Aggarwal, Ashutosh N.
    Gupta, Dheeraj
    Agarwal, Ritesh
    [J]. RESPIRATORY CARE, 2014, 59 (05) : 756 - 764
  • [3] Multiple imputation using nearest neighbor methods
    Faisal, Shahla
    Tutz, Gerhard
    [J]. INFORMATION SCIENCES, 2021, 570 : 500 - 516
  • [4] Pleural Disease
    Feller-Kopman, David
    Light, Richard
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2018, 378 (08) : 740 - 751
  • [5] Diagnostic accuracy of adenosine deaminase for pleural tuberculosis in a low prevalence setting: A machine learning approach within a 7-year prospective multi-center study
    Garcia-Zamalloa, Alberto
    Vicente, Diego
    Arnay, Rafael
    Arrospide, Arantzazu
    Taboada, Jorge
    Castilla-Rodriguez, Ivan
    Aguirre, Urko
    Mugica, Nekane
    Aldama, Ladislao
    Aguinagalde, Borja
    Jimenez, Montserrat
    Bikuna, Edurne
    Begona Basauri, Miren
    Alonso, Marta
    Perez-Trallero, Emilio
    [J]. PLOS ONE, 2021, 16 (11):
  • [6] Diagnosis and treatment of tuberculous pleural effusion in 2006
    Gopi, Arun
    Madhavan, Sethu M.
    Sharma, Surendra K.
    Sahn, Steven A.
    [J]. CHEST, 2007, 131 (03) : 880 - 889
  • [7] Investigation of a unilateral pleural effusion in adults: British Thoracic Society pleural disease guideline 2010
    Hooper, Clare
    Lee, Y. C. Gary
    Maskell, Nick
    [J]. THORAX, 2010, 65 : 4 - 17
  • [8] Jamshidian M., 2007, Handbook of Latent Variable and Related Models., P21, DOI [DOI 10.1016/B978-044452044-9/50005-7, 10.1016/B978-044452044-9/50005-7, DOI 10.1016/S1871-0301(06)01002-X]
  • [9] Decreased annual risk of tuberculosis infection in South Korean healthcare workers using interferon-gamma release assay between 1986 and 2005
    Lee, Eun Hye
    Son, Nak-Hoon
    Kwak, Se Hyun
    Choi, Ji Soo
    Kim, Min Chul
    Seol, Chang Hwan
    Kim, Sung-Ryeol
    Park, Byung Hoon
    Kang, Young Ae
    [J]. BMC INFECTIOUS DISEASES, 2021, 21 (01)
  • [10] Classification of pleural effusions using deep learning visual models: contrastive-loss
    Lee, Jang Ho
    Choi, Chang-Min
    Park, Namu
    Park, Hyung Jun
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)