Development and Interpretation of Multiple Machine Learning Models for Predicting Postoperative Delayed Remission of Acromegaly Patients During Long-Term Follow-Up

被引:17
作者
Dai, Congxin [1 ]
Fan, Yanghua [1 ]
Li, Yichao [2 ]
Bao, Xinjie [1 ]
Li, Yansheng [2 ]
Su, Mingliang [2 ]
Yao, Yong [1 ]
Deng, Kan [1 ]
Xing, Bing [1 ]
Feng, Feng [3 ]
Feng, Ming [1 ]
Wang, Renzhi [1 ]
机构
[1] Chinese Acad Med Sci & Peking Union Med Coll, Peking Union Med Coll Hosp, Dept Neurosurg, Beijing, Peoples R China
[2] DHC Mediway Technol Co Ltd, Beijing, Peoples R China
[3] Chinese Acad Med Sci & Peking Union Med Coll, Peking Union Med Coll Hosp, Dept Radiol, Beijing, Peoples R China
来源
FRONTIERS IN ENDOCRINOLOGY | 2020年 / 11卷
关键词
acromegaly; delayed remission; machine learning; LIME; SHAP; PITUITARY-ADENOMAS; CANCER PROGNOSIS; DIAGNOSIS; DISEASE;
D O I
10.3389/fendo.2020.00643
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background:Some patients with acromegaly do not reach the remission standard in the short term after surgery but achieve remission without additional postoperative treatment during long-term follow-up; this phenomenon is defined as postoperative delayed remission (DR). DR may complicate the interpretation of surgical outcomes in patients with acromegaly and interfere with decision-making regarding postoperative adjuvant therapy. Objective:We aimed to develop and validate machine learning (ML) models for predicting DR in acromegaly patients who have not achieved remission within 6 months of surgery. Methods:We enrolled 306 acromegaly patients and randomly divided them into training and test datasets. We used the recursive feature elimination (RFE) algorithm to select features and applied six ML algorithms to construct DR prediction models. The performance of these ML models was validated using receiver operating characteristics analysis. We used permutation importance, SHapley Additive exPlanations (SHAP), and local interpretable model-agnostic explanation (LIME) algorithms to determine the importance of the selected features and interpret the ML models. Results:Fifty-five (17.97%) acromegaly patients met the criteria for DR, and five features (post-1w rGH, post-1w nGH, post-6m rGH, post-6m IGF-1, and post-6m nGH) were significantly associated with DR in both the training and the test datasets. After the RFE feature selection, the XGboost model, which comprised the 15 important features, had the greatest discriminatory ability (area under the curve = 0.8349, sensitivity = 0.8889, Youden's index = 0.6842). The XGboost model showed good discrimination ability and provided significantly better estimates of DR of patients with acromegaly compared with using only the Knosp grade. The results obtained from permutation importance, SHAP, and LIME algorithms showed that post-6m IGF-1 is the most important feature in XGboost algorithm prediction and showed the reliability and the clinical practicability of the XGboost model in DR prediction. Conclusions:ML-based models can serve as an effective non-invasive approach to predicting DR and could aid in determining individual treatment and follow-up strategies for acromegaly patients who have not achieved remission within 6 months of surgery.
引用
收藏
页数:14
相关论文
共 51 条
[1]  
Al Shalabi L., 2006, Journal of Computer Sciences, V2, P735, DOI 10.3844/jcssp.2006.735.739
[2]   Clinical applications of machine learning in cardiovascular disease and its relevance to cardiac imaging [J].
Al'Aref, Subhi J. ;
Anchouche, Khalil ;
Singh, Gurpreet ;
Slomka, Piotr J. ;
Kolli, Kranthi K. ;
Kumar, Amit ;
Pandey, Mohit ;
Maliakal, Gabriel ;
van Rosendael, Alexander R. ;
Beecy, Ashley N. ;
Berman, Daniel S. ;
Leipsic, Jonathan ;
Nieman, Koen ;
Andreini, Daniele ;
Pontone, Gianluca ;
Schoepf, U. Joseph ;
Shaw, Leslee J. ;
Chang, Hyuk-Jae ;
Narula, Jagat ;
Bax, Jeroen J. ;
Guan, Yuanfang ;
Min, James K. .
EUROPEAN HEART JOURNAL, 2019, 40 (24) :1975-+
[3]   Permutation importance: a corrected feature importance measure [J].
Altmann, Andre ;
Tolosi, Laura ;
Sander, Oliver ;
Lengauer, Thomas .
BIOINFORMATICS, 2010, 26 (10) :1340-1347
[4]  
Batista G.E., 2004, ACM SIGKDD Explor. Newsl, V6, P20, DOI [DOI 10.1145/1007730.1007735, 10.1145/1007730.1007735]
[5]  
Batista GEAPA, 2003, APPL ARTIF INTELL, V17, P519, DOI [10.1080/713827181, 10.1080/08839510390219309]
[6]   Molecular markers for bladder cancer: the road to a multimarker approach [J].
Birkhahn, Marc ;
Mitra, Anirban P. ;
Cote, Richard J. .
EXPERT REVIEW OF ANTICANCER THERAPY, 2007, 7 (12) :1717-1727
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   Predicting lung cancer prognosis using machine learning [J].
Burki, Talha Khan .
LANCET ONCOLOGY, 2016, 17 (10) :E421-E421
[9]  
Chawla NV, 2005, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, P853, DOI 10.1007/0-387-25465-X_40
[10]   Key feature selection and risk prediction for lane-changing behaviors based on vehicles' trajectory data [J].
Chen, Tianyi ;
Shi, Xiupeng ;
Wong, Yiik Diew .
ACCIDENT ANALYSIS AND PREVENTION, 2019, 129 (156-169) :156-169