Construction and Evaluation of Robust Interpretation Models for Breast Cancer Metastasis Prediction

被引:3
作者
Adnan, Nahim [1 ]
Zand, Maryam [1 ]
Huang, Tim H. M. [2 ]
Ruan, Jianhua [1 ]
机构
[1] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
[2] Univ Texas Hlth Sci Ctr San Antonio, Dept Mol Med, San Antonio, TX 78230 USA
关键词
Predictive models; Prediction algorithms; Biological system modeling; Computational modeling; Metastasis; Perturbation methods; Breast cancer; Cancer metastasis; interpretable machine learning; feature engineering; performance evaluation;
D O I
10.1109/TCBB.2021.3120673
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Interpretability of machine learning (ML) models represents the extent to which a model's decision-making process can be understood by model developers and/or end users. Transcriptomics-based cancer prognosis models, for example, while achieving good accuracy, are usually hard to interpret, due to the high-dimensional feature space and the complexity of models. As interpretability is critical for the transparency and fairness of ML models, several algorithms have been proposed to improve the interpretability of arbitrary classifiers. However, evaluation of these algorithms often requires substantial domain knowledge. Here, we propose a breast cancer metastasis prediction model using a very small number of biologically interpretable features, and a simple yet novel model interpretation approach that can provide personalized interpretations. In addition, we contributed, to the best of our knowledge, the first method to quantitatively compare different interpretation algorithms. Experimental results show that our model not only achieved competitive prediction accuracy, but also higher inter-classifier interpretation consistency than state-of-the-art interpretation methods. Importantly, our interpretation results can improve the generalizability of the prediction models. Overall, this work provides several novel ideas to construct and evaluate interpretable ML models that can be valuable to both the cancer machine learning community and related application domains.
引用
收藏
页码:1344 / 1353
页数:10
相关论文
共 42 条
[1]   Comparative evaluation of network features for the prediction of breast cancer metastasis [J].
Adnan, Nahim ;
Liu, Zhijie ;
Huang, Tim H. M. ;
Ruan, Jianhua .
BMC MEDICAL GENOMICS, 2020, 13 (Suppl 5)
[2]   Visualizing the effects of predictor variables in black box supervised learning models [J].
Apley, Daniel W. ;
Zhu, Jingyu .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2020, 82 (04) :1059-1086
[3]   Mitochondrial Flexibility of Breast Cancers: A Growth Advantage and a Therapeutic Opportunity [J].
Avagliano, Angelica ;
Ruocco, Maria Rosaria ;
Aliotta, Federica ;
Belviso, Immacolata ;
Accurso, Antonello ;
Masone, Stefania ;
Montagnani, Stefania ;
Arcucci, Alessandro .
CELLS, 2019, 8 (05)
[4]   Network Dissection: Quantifying Interpretability of Deep Visual Representations [J].
Bau, David ;
Zhou, Bolei ;
Khosla, Aditya ;
Oliva, Aude ;
Torralba, Antonio .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3319-3327
[5]   THE ROLE OF CELL-ADHESION MOLECULES IN CANCER INVASION AND METASTASIS [J].
BEHRENS, J .
BREAST CANCER RESEARCH AND TREATMENT, 1993, 24 (03) :175-184
[6]  
Buitinck L., 2013, ECML PKDD WORKSH LAN, P108
[7]   Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission [J].
Caruana, Rich ;
Lou, Yin ;
Gehrke, Johannes ;
Koch, Paul ;
Sturm, Marc ;
Elhadad, Noemie .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :1721-1730
[8]  
Doshi-Velez F., 2017, arXiv
[9]   Primary Cilium in Cancer Hallmarks [J].
Fabbri, Lucilla ;
Bost, Frederic ;
Mazure, Nathalie M. .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (06)
[10]  
Fisher A, 2019, J MACH LEARN RES, V20