On Machine Learning with Imbalanced Data and Research Quality Evaluation Methodologies

被引:0
作者
Lipitakis, Anastasia-Dimitra [1 ]
Lipitakis, Evangelia A. E. C. [2 ]
机构
[1] Univ Patras, Dept Math, Patras 26504, Hellas, Greece
[2] Univ Kent, Kent Business Sch, Canterbury CT2 7PE, Kent, England
来源
2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), VOL 1 | 2014年
关键词
Bibliometric Indicators; Business Intelligence; Citation Analysis; Computational Intelligence; Data Mining; Learning Algorithms; Imbalanced Data; Machine Learning; Quantitative Methods; Research Quality Evaluation; FIRM PERFORMANCE; ROTATION FOREST; E-BUSINESS; CLASSIFICATION; INTELLIGENCE; PREDICTION; STRATEGY;
D O I
10.1109/CSCI.2014.81
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article a synoptic review of machine learning techniques with imbalanced data and a class of corresponding learning algorithms is presented. This class of algorithms includes the meta-algorithms: Cost sensitive, Metacost, Rotation forest-cost sensitive, rotation forest-smote. Four learning algorithms (with base classifiers J48 and part processing with F-measure and a predetermined imbalanced data set) are compared in the computational environment WEKA leading to comparative numerical results. The basic concepts of research quality evaluation methodologies are presented, an adaptive citation qualitative-quantitative approach and advanced bibliometric indicators are given. Basic components of research quality performance such as research journal cited publications, citing publications and research quality evaluations at various academic levels are considered and corresponding numerical results are given. An alternative approach using certain machine learning algorithms with imbalanced data in the case of research quality evaluation methodologies is proposed.
引用
收藏
页码:451 / 457
页数:7
相关论文
共 50 条
[21]   A new concordant partial AUC and partial c statistic for imbalanced data in the evaluation of machine learning algorithms [J].
André M. Carrington ;
Paul W. Fieguth ;
Hammad Qazi ;
Andreas Holzinger ;
Helen H. Chen ;
Franz Mayr ;
Douglas G. Manuel .
BMC Medical Informatics and Decision Making, 20
[22]   A new concordant partial AUC and partial c statistic for imbalanced data in the evaluation of machine learning algorithms [J].
Carrington, Andre M. ;
Fieguth, Paul W. ;
Qazi, Hammad ;
Holzinger, Andreas ;
Chen, Helen H. ;
Mayr, Franz ;
Manuel, Douglas G. .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (01)
[23]   Metric Learning from Imbalanced Data [J].
Gautheron, Leo ;
Habrard, Amaury ;
Morvant, Emilie ;
Sebban, Marc .
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, :923-930
[24]   Handling imbalanced data in supervised machine learning for lithological mapping using remote sensing and airborne geophysical data [J].
Nugroho, Hary ;
Wikantika, Ketut ;
Bijaksana, Satria ;
Saepuloh, Asep .
OPEN GEOSCIENCES, 2023, 15 (01)
[25]   Addressing imbalanced data for machine learning based mineral prospectivity mapping [J].
Farahnakian, Fahimeh ;
Sheikh, Javad ;
Zelioli, Luca ;
Nidhi, Dipak ;
Seppa, Iiro ;
Ilo, Rami ;
Nevalainen, Paavo ;
Heikkonen, Jukka .
ORE GEOLOGY REVIEWS, 2024, 174
[26]   An evaluation of the robustness of MTS for imbalanced data [J].
Su, Chao-Ton ;
Hsiao, Yu-Hsiang .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (10) :1321-1332
[27]   Machine learning applications in Alzheimer's disease research: a comprehensive analysis of data sources, methodologies, and insights [J].
Rezaie, Zahra ;
Banad, Yaser .
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
[28]   A review of the application of machine learning in water quality evaluation [J].
Zhu, Mengyuan ;
Wang, Jiawei ;
Yang, Xiao ;
Zhang, Yu ;
Zhang, Linyu ;
Ren, Hongqiang ;
Wu, Bing ;
Ye, Lin .
ECO-ENVIRONMENT & HEALTH, 2022, 1 (02) :107-116
[29]   Integrating Data Selection and Extreme Learning Machine for Imbalanced Data [J].
Mahdiyah, Umi ;
Irawan, M. Isa ;
Imah, Elly Matul .
INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2015), 2015, 59 :221-229
[30]   Classification of Imbalanced Immunotherapy and Health-Related Data Utilising Novel Machine Learning Experiments [J].
Mahmoud, Ahsanullah Yunas .
ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, UKCI 2022, 2024, 1454 :158-169