Estimating non-overfitted convex production technologies: A stochastic machine learning approach

Authors
Guillen, Maria D. [1]
Charles, Vincent [2]
Aparicio, Juan [1, 3]
Affiliations
[1] Miguel Hernandez Univ Elche, Ctr Operat Res, Avda Univ S-N, Elche 03202, Spain
[2] Queens Univ Belfast, Queens Business Sch, Belfast BT9 5EE, Northern Ireland
[3] ValgrAI Valencian Grad Sch & Res Network Artificia, Joint Res Unit, Camino Vera S-N, Valencia 46022, Spain
Keywords
Data Envelopment Analysis; Technical efficiency measurement; Stochastic gradient boosting; Machine learning; Measuring efficiency; Bootstrap; Models; DEA
DOI
10.1016/j.ejor.2024.11.030
Chinese Library Classification
C93 [Management Science]
Discipline classification codes
12; 1201; 1202; 120202
Abstract
Overfitting is a classical statistical issue that occurs when a model fits a particular observed data sample too closely, potentially limiting its generalizability. While Data Envelopment Analysis (DEA) is a powerful nonparametric method for assessing the relative efficiency of decision-making units (DMUs), its reliance on the minimal extrapolation principle can lead to concerns about overfitting, particularly when the goal extends beyond evaluating the specific DMUs in the sample to making broader inferences. In this paper, we propose an adaptation of Stochastic Gradient Boosting to estimate production possibility sets that mitigate overfitting while satisfying shape constraints such as convexity and free disposability. Our approach is not intended to replace DEA but to complement it, offering an additional tool for scenarios where generalization is important. Through simulation experiments, we demonstrate that the proposed method performs well compared to DEA, especially in high-dimensional settings. Furthermore, the new machine learning-based technique is compared to the Corrected Concave Non-parametric Least Squares (C2NLS), showing competitive performance. We also illustrate how the usual efficiency measures in DEA can be implemented under our approach. Finally, we provide an empirical example based on data from the Program for International Student Assessment (PISA) to demonstrate the applicability of the new method.
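For orientation, the standard DEA estimate that the abstract contrasts against can be written as follows. This is a textbook sketch under variable returns to scale, not the estimator proposed in the paper. The notation is an assumption introduced here: n observed DMUs with input vectors x_i and output vectors y_i, m inputs, s outputs, and intensity weights lambda_i. By the minimal extrapolation principle, this is the smallest set that contains all observations and satisfies convexity and free disposability, which is why it hugs the sample so closely.

```latex
% Assumed notation (not from the source): (x_i, y_i), i = 1..n, are the observed DMUs.
\[
\hat{T}_{\mathrm{DEA}}
  = \Bigl\{ (x, y) \in \mathbb{R}_{+}^{m} \times \mathbb{R}_{+}^{s} :
      x \ge \sum_{i=1}^{n} \lambda_i x_i,\;
      y \le \sum_{i=1}^{n} \lambda_i y_i,\;
      \sum_{i=1}^{n} \lambda_i = 1,\;
      \lambda_i \ge 0,\ i = 1, \dots, n \Bigr\}
\]

% Output-oriented radial efficiency of a unit (x_0, y_0) relative to a technology T:
\[
\phi(x_0, y_0) = \max \bigl\{ \phi \ge 0 : (x_0, \phi\, y_0) \in T \bigr\}
\]
```

Any other estimate of the production possibility set that is convex and freely disposable can be substituted for T in the second expression, which is the general sense in which the usual DEA efficiency measures carry over to an alternative technology estimate.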
Pages: 224-240
Page count: 17