Overlooked pitfalls in multi-class machine learning classification in radiation oncology and how to avoid them

被引：4

作者：

Chatterjee, Avishek ^{[1
]}

Vallieres, Martin ^{[1
]}

Seuntjens, Jan ^{[1
]}

机构：

[1] McGill Univ, Med Phys Unit, Montreal, PQ, Canada

来源：

PHYSICA MEDICA-EUROPEAN JOURNAL OF MEDICAL PHYSICS | 2020年 / 70卷

基金：

加拿大自然科学与工程研究理事会; 加拿大健康研究院;

关键词：

Machine Learning; Multi-class classification; Radiomics; Surrogate marker; RADIOMICS; FEATURES;

D O I：

10.1016/j.ejmp.2020.01.009

中图分类号：

R8 [特种医学]; R445 [影像诊断学];

学科分类号：

1002 ; 100207 ; 1009 ;

摘要：

In radiation oncology, Machine Learning classification publications are typically related to two outcome classes, e.g. the presence or absence of distant metastasis. However, multi-class classification problems also have great clinical relevance, e.g., predicting the grade of a treatment complication following lung irradiation. This work comprised two studies aimed at making work in this domain less prone to statistical blindsides. In multi-class classification, AUC is not defined, whereas correlation coefficients are. It may seem like solely quoting the correlation coefficient value (in lieu of the AUC value) is a suitable choice. In the first study, we illustrated using Monte Carlo (MC) models why this choice is misleading. We also considered the special case where the multiple classes are not ordinal, but nominal, and explained why Pearson or Spearman correlation coefficients are not only providing incomplete information but are actually meaningless. The second study concerned surrogate biomarkers for a clinical endpoint, which have purported benefits including potential for early assessment, being inexpensive, and being non-invasive. Using a MC experiment, we showed how conclusions derived from surrogate markers can be misleading. The simulated endpoint was radiation toxicity (scale of 0-5). The surrogate marker was the true toxicity grade plus a noise term. Five patient cohorts were simulated, including one control. Two of the cohorts were designed to have a statistically significant difference in toxicity. Under 1000 repeated experiments using the biomarker, these two cohorts were often found to be statistically indistinguishable, with the fraction of such occurrences rising with the level of noise.

引用

页码：96 / 100

页数：5

共 50 条

[21] Linear Multi-class Classification Support Vector Machine
Xu, Yan
Shao, Yuanhai
Tian, Yingjie
Deng, Naiyang
CUTTING-EDGE RESEARCH TOPICS ON MULTIPLE CRITERIA DECISION MAKING, PROCEEDINGS, 2009, 35 : 635 - +
[22] A Novel Incremental Class Learning Technique for Multi-class Classification
Er, Meng Joo
Yalavarthi, Vijaya Krishna
Wang, Ning
Venkatesan, Rajasekar
ADVANCES IN NEURAL NETWORKS - ISNN 2016, 2016, 9719 : 474 - 481
[23] A Multi-class Classification for Detection of IoT Network Attacks Using Machine Learning Models
Ashok, Gadde
Serath, Kommula
Kumar, T. Gireesh
DISTRIBUTED COMPUTING AND INTELLIGENT TECHNOLOGY, ICDCIT 2024, 2024, 14501 : 167 - 178
[24] Multi-class sentiment classification on Bengali social media comments using machine learning
Haque R.
Islam N.
Tasneem M.
Das A.K.
International Journal of Cognitive Computing in Engineering, 2023, 4 : 21 - 35
[25] Multi-class Cell Line Classification using Digital Holographic Microscopy and Machine Learning
Sun, Anyu
Van Lam
Thuc Phan
Chang, Lin-Ching
Nehmetallah, George
Raub, Christopher
BIG DATA IV: LEARNING, ANALYTICS, AND APPLICATIONS, 2022, 12097
[26] Multi-class classification of COVID-19 documents using machine learning algorithms
Gollam Rabby
Petr Berka
Journal of Intelligent Information Systems, 2023, 60 : 571 - 591
[27] Multi-class classification of COVID-19 documents using machine learning algorithms
Rabby, Gollam
Berka, Petr
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2023, 60 (02) : 571 - 591
[28] A algorithm to incremental learning with Support Vector Machine and its application in multi-class classification
Zhao Ying
Wan Fuyong
2006 CHINESE CONTROL CONFERENCE, VOLS 1-5, 2006, : 943 - +
[29] Multi-class Text Classification Using Machine Learning Models for Online Drug Reviews
Joshi, Shreehar
Abdelfattah, Eman
2021 IEEE WORLD AI IOT CONGRESS (AIIOT), 2021, : 262 - 267
[30] Integrated Multi-Class Classification and Prediction of GPCR Allosteric Modulators by Machine Learning Intelligence
Hou, Tianling
Bian, Yuemin
McGuire, Terence
Xie, Xiang-Qun
BIOMOLECULES, 2021, 11 (06)

← 1 2 3 4 5 →