From failure to fusion: A survey on learning from bad machine learning models

被引：0

作者：

Naser, M. Z. ^{[1
,2
]}

机构：

[1] Clemson Univ, Sch Civil & Environm Engn & Earth Sci, Anderson, SC 29625 USA

[2] Clemson Univ, Artificial Intelligence Res Inst Sci & Engn AIRISE, Clemson, SC USA

来源：

INFORMATION FUSION | 2025年 / 120卷

关键词：

Models; Machine learning; Engineering; Review; Artificial intelligence;

D O I：

10.1016/j.inffus.2025.103122

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Machine learning (ML) models are ubiquitous across diverse applications; however, only a fraction achieves optimal performance, often leading to the deployment of a singular model while dismissing others as experimental failures. This paper challenges this commonly accepted practice by systematically investigating the utility of suboptimal ML models. We posit that these models encapsulate valuable information regarding data biases, architectural limitations, and systemic misalignments, which can be leveraged to enhance overall system performance. Central to our approach is the integration of information fusion techniques, which combine heterogeneous data sources to robustly analyze and contextualize the errors and biases present in underperforming models. Our methodology includes advanced negative knowledge distillation, as well as error-based curriculum learning frameworks that are derived from multiple data modalities. We propose a comprehensive debugging framework that utilizes meta-learning for failure detection and correction to enable continuous improvement through rigorous cross-validation and iterative refinement. This study stresses the importance of documenting negative outcomes to promote transparency and foster interdisciplinary collaboration to build resilient and generalizable ML systems, particularly in information fusion. We advocate for a paradigm shift in the ML community and urge both researchers and institutions to systematically harness the insights derived from socalled "failed" models. We then conclude this paper by discussing several challenges and possible pathways for future research.

引用

页数：12

共 124 条

[1] Abadi D., 2009, Bulletin of the IEEE computer society Technical committee ondata engineering
[2] Abbott D., 2014, Applied predictive analytics: principles and techniques for the professional data analyst
[3] Dataset Shift in Machine Learning
Adams, Niall
[J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2010, 173 : 274 - 274
[4] Error-Based Noise Filtering During Neural Network Training
Alharbi, Fahad
Hindi, Khalil El
Al-Ahmadi, Saad
[J]. IEEE ACCESS, 2020, 8 : 156996 - 157004
[5] Aliferis C., 2024, Artificial Intelligence and Machine Learning in Health Care and Medical Sciences: Best Practices and Pitfalls, P477, DOI [DOI 10.1007/978-3-031-39355-6_10, 10.1007/978-3-031-39355-6_10]
[6] [Anonymous], 2009, Proceedings of the 26th Annual International Conference on Machine Learning, DOI [DOI 10.1145/1553374.1553380, 10.1145/1553374.155338]
[7] Bach F, 2012, OPTIMIZATION FOR MACHINE LEARNING, P19
[8] Baik S, 2020, ADV NEUR IN, V33
[9] Bansal T., 2022, P MACH LEARN RES
[10] When something goes wrong: Who is responsible for errors in ML decision-making?
Berber, Andrea
Sreckovic, Sanja
[J]. AI & SOCIETY, 2024, 39 (04) : 1891 - 1903

← 1 2 3 4 5 6 7 8 9 10 →