PToPI: A Comprehensive Review, Analysis, and Knowledge Representation of Binary Classification Performance Measures/Metrics

被引:19
|
作者
Canbek G. [1 ,2 ]
Taskaya Temizel T. [2 ]
Sagiroglu S. [3 ]
机构
[1] Pointr, Ankara
[2] Informatics Institute Middle East Technical University, Ankara
[3] Computer Engineering Department, Gazi University, Ankara
关键词
Classification; Knowledge representation; Machine learning; Performance evaluation; Performance measures; Performance metrics; Periodic table;
D O I
10.1007/s42979-022-01409-1
中图分类号
学科分类号
摘要
Although few performance evaluation instruments have been used conventionally in different machine learning-based classification problem domains, there are numerous ones defined in the literature. This study reviews and describes performance instruments via formally defined novel concepts and clarifies the terminology. The study first highlights the issues in performance evaluation via a survey of 78 mobile-malware classification studies and reviews terminology. Based on three research questions, it proposes novel concepts to identify characteristics, similarities, and differences of instruments that are categorized into ‘performance measures’ and ‘performance metrics’ in the classification context for the first time. The concepts reflecting the intrinsic properties of instruments such as canonical form, geometry, duality, complementation, dependency, and leveling, aim to reveal similarities and differences of numerous instruments, such as redundancy and ground-truth versus prediction focuses. As an application of knowledge representation, we introduced a new exploratory table called PToPI (Periodic Table of Performance Instruments) for 29 measures and 28 metrics (69 instruments including variant and parametric ones). Visualizing proposed concepts, PToPI provides a new relational structure for the instruments including graphical, probabilistic, and entropic ones to see their properties and dependencies all in one place. Applications of the exploratory table in six examples from different domains in the literature have shown that PToPI aids overall instrument analysis and selection of the proper performance metrics according to the specific requirements of a classification problem. We expect that the proposed concepts and PToPI will help researchers comprehend and use the instruments and follow a systematic approach to classification performance evaluation and publication. © 2022, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 38 条
  • [1] Binary Classification Performance Measures/Metrics: A Comprehensive Visualized Roadmap to Gain New Insights
    Canbek, Gurol
    Sagiroglu, Seref
    Temizel, Tugba Taskaya
    Baykal, Nazife
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 821 - 826
  • [2] Exploring Symmetry of Binary Classification Performance Metrics
    Luque, Amalia
    Carrasco, Alejandro
    Martin, Alejandro
    Ramon Lama, Juan
    SYMMETRY-BASEL, 2019, 11 (01):
  • [3] An Analysis of Performance Metrics for Imbalanced Classification
    Gaudreault, Jean-Gabriel
    Branco, Paula
    Gama, Joao
    DISCOVERY SCIENCE (DS 2021), 2021, 12986 : 67 - 77
  • [4] The impact of class imbalance in classification performance metrics based on the binary confusion matrix
    Luque, Amalia
    Carrasco, Alejandro
    Martin, Alejandro
    de las Heras, Ana
    PATTERN RECOGNITION, 2019, 91 : 216 - 231
  • [5] BenchMetrics: a systematic benchmarking method for binary classification performance metrics
    Canbek, Gurol
    Temizel, Tugba Taskaya
    Sagiroglu, Seref
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (21) : 14623 - 14650
  • [6] A systematic analysis of performance measures for classification tasks
    Sokolova, Marina
    Lapalme, Guy
    INFORMATION PROCESSING & MANAGEMENT, 2009, 45 (04) : 427 - 437
  • [7] BenchMetrics: a systematic benchmarking method for binary classification performance metrics
    Gürol Canbek
    Tugba Taskaya Temizel
    Seref Sagiroglu
    Neural Computing and Applications, 2021, 33 : 14623 - 14650
  • [8] Evaluation Metrics for Wind Power Forecasts: A Comprehensive Review and Statistical Analysis of Errors
    Piotrowski, Pawel
    Rutyna, Inajara
    Baczynski, Dariusz
    Kopyt, Marcin
    ENERGIES, 2022, 15 (24)
  • [9] Exploring Evaluation Metrics for Binary Classification in Data Analysis: the Worthiness Benchmark Concept
    Shirdel, Mohammad
    Di Mauro, Mario
    Liotta, Antonio
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2024, 2024, 14912 : 120 - 125
  • [10] Review and analysis of performance metrics of droplet microfluidics systems
    Rosenfeld, Liat
    Lin, Tiras
    Derda, Ratmir
    Tang, Sindy K. Y.
    MICROFLUIDICS AND NANOFLUIDICS, 2014, 16 (05) : 921 - 939