Estimating non-overfitted convex production technologies: A stochastic machine learning approach

被引：0

作者：

Guillen, Maria D. ^{[1
]}

Charles, Vincent ^{[2
]}

Aparicio, Juan ^{[1
,3
]}

机构：

[1] Miguel Hernandez Univ Elche, Ctr Operat Res, Avda Univ S-N, Elche 03202, Spain

[2] Queens Univ Belfast, Queens Business Sch, Belfast BT9 5EE, North Ireland

[3] ValgrAI Valencian Grad Sch & Res Network Artificia, Joint Res Unit, Camino Vera S-N, Valencia 46022, Spain

来源：

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH | 2025年 / 323卷 / 01期

关键词：

Data Envelopment Analysis; Technical efficiency measurement; Stochastic gradient boosting; Machine learning; DATA ENVELOPMENT ANALYSIS; MEASURING EFFICIENCY; BOOTSTRAP; MODELS; DEA;

D O I：

10.1016/j.ejor.2024.11.030

中图分类号：

C93 [管理学];

学科分类号：

12 ; 1201 ; 1202 ; 120202 ;

摘要：

Overfitting is a classical statistical issue that occurs when a model fits a particular observed data sample too closely, potentially limiting its generalizability. While Data Envelopment Analysis (DEA) is a powerful nonparametric method for assessing the relative efficiency of decision-making units (DMUs), its reliance on the minimal extrapolation principle can lead to concerns about overfitting, particularly when the goal extends beyond evaluating the specific DMUs in the sample to making broader inferences. In this paper, we propose an adaptation of Stochastic Gradient Boosting to estimate production possibility sets that mitigate overfitting while satisfying shape constraints such as convexity and free disposability. Our approach is not intended to replace DEA but to complement it, offering an additional tool for scenarios where generalization is important. Through simulation experiments, we demonstrate that the proposed method performs well compared to DEA, especially in high-dimensional settings. Furthermore, the new machine learning-based technique is compared to the Corrected Concave Non-parametric Least Squares (C2NLS), showing competitive performance. We also illustrate how the usual efficiency measures in DEA can be implemented under our approach. Finally, we provide an empirical example based on data from the Program for International Student Assessment (PISA) to demonstrate the applicability of the new method.

引用

页码：224 / 240

页数：17

共 50 条

[1] Two-Stage Monitoring of Patients in Intensive Care Unit for Sepsis Prediction Using Non-Overfitted Machine Learning Models
Abromavicius, Vytautas
Plonis, Darius
Tarasevicius, Deividas
Serackis, Arturas
ELECTRONICS, 2020, 9 (07) : 1 - 14
[2] Estimating scale economies in non-convex production models
Cesaroni, Giovanni
Kerstens, Kristiaan
Van de Woestyne, Ignace
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2017, 68 (11) : 1442 - 1451
[3] A robust stochastic quasi-Newton algorithm for non-convex machine learning
Liu, Hanger
Liang, Yuqing
Liu, Jinlan
Xu, Dongpo
APPLIED INTELLIGENCE, 2025, 55 (07)
[4] An adaptation of Random Forest to estimate convex non-parametric production technologies: an empirical illustration of efficiency measurement in education
Espana, Victor J.
Aparicio, Juan
Barber, Xavier
INTERNATIONAL TRANSACTIONS IN OPERATIONAL RESEARCH, 2025, 32 (05) : 2523 - 2546
[5] Estimating non-convex production sets - imposing convex input sets and output sets in data envelopment analysis
Post, T
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2001, 131 (01) : 132 - 142
[6] Measuring environmental inefficiency through machine learning: An approach based on efficiency analysis trees and by-production technology
Guillen, Maria D.
Aparicio, Juan
Kapelko, Magdalena
Esteve, Miriam
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2025, 321 (02) : 529 - 542
[7] Stochastic Approach for System Identification using Machine Learning
Abdufattokhov, Shokhjakhon
Muhiddinov, Behzod
2019 DYNAMICS OF SYSTEMS, MECHANISMS AND MACHINES (DYNAMICS), 2019,
[8] Stochastic nonlocal damage analysis by a machine learning approach
Feng, Yuan
Wang, Qihan
Wu, Di
Gao, Wei
Tin-Loi, Francis
COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2020, 372
[9] Machine Learning Approach to Production Order Planning A Paper to the Implementation of a Machine Learning Algorithm in Production
Mielke J.
Winkler H.
ZWF Zeitschrift fuer Wirtschaftlichen Fabrikbetrieb, 2022, 117 (06): : 384 - 389
[10] A supervised machine learning approach for estimating plate interface locking: Application to Central Chile
Barra, Sebastian
Moreno, Marcos
Ortega-Culaciati, Francisco
Benavente, Roberto
Araya, Rodolfo
Bedford, Jonathan
Calisto, Ignacia
PHYSICS OF THE EARTH AND PLANETARY INTERIORS, 2024, 352

← 1 2 3 4 5 →