Estimating non-overfitted convex production technologies: A stochastic machine learning approach

被引:0
|
作者
Guillen, Maria D. [1 ]
Charles, Vincent [2 ]
Aparicio, Juan [1 ,3 ]
机构
[1] Miguel Hernandez Univ Elche, Ctr Operat Res, Avda Univ S-N, Elche 03202, Spain
[2] Queens Univ Belfast, Queens Business Sch, Belfast BT9 5EE, North Ireland
[3] ValgrAI Valencian Grad Sch & Res Network Artificia, Joint Res Unit, Camino Vera S-N, Valencia 46022, Spain
关键词
Data Envelopment Analysis; Technical efficiency measurement; Stochastic gradient boosting; Machine learning; DATA ENVELOPMENT ANALYSIS; MEASURING EFFICIENCY; BOOTSTRAP; MODELS; DEA;
D O I
10.1016/j.ejor.2024.11.030
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Overfitting is a classical statistical issue that occurs when a model fits a particular observed data sample too closely, potentially limiting its generalizability. While Data Envelopment Analysis (DEA) is a powerful nonparametric method for assessing the relative efficiency of decision-making units (DMUs), its reliance on the minimal extrapolation principle can lead to concerns about overfitting, particularly when the goal extends beyond evaluating the specific DMUs in the sample to making broader inferences. In this paper, we propose an adaptation of Stochastic Gradient Boosting to estimate production possibility sets that mitigate overfitting while satisfying shape constraints such as convexity and free disposability. Our approach is not intended to replace DEA but to complement it, offering an additional tool for scenarios where generalization is important. Through simulation experiments, we demonstrate that the proposed method performs well compared to DEA, especially in high-dimensional settings. Furthermore, the new machine learning-based technique is compared to the Corrected Concave Non-parametric Least Squares (C2NLS), showing competitive performance. We also illustrate how the usual efficiency measures in DEA can be implemented under our approach. Finally, we provide an empirical example based on data from the Program for International Student Assessment (PISA) to demonstrate the applicability of the new method.
引用
收藏
页码:224 / 240
页数:17
相关论文
共 50 条
  • [41] A machine learning approach for country-level deployment of greenhouse gas removal technologies
    Asibor, Jude O.
    Clough, Peter T.
    Nabavi, Seyed Ali
    Manovic, Vasilije
    INTERNATIONAL JOURNAL OF GREENHOUSE GAS CONTROL, 2023, 130
  • [42] Stochastic three-term conjugate gradient method with variance technique for non-convex learning
    Ouyang, Chen
    Lu, Chenkaixiang
    Zhao, Xiong
    Huang, Ruping
    Yuan, Gonglin
    Jiang, Yiyan
    STATISTICS AND COMPUTING, 2024, 34 (03)
  • [43] Machine learning approach for estimating the human-related VOC emissions in a university classroom
    Jialong Liu
    Rui Zhang
    Jianyin Xiong
    Building Simulation, 2023, 16 : 915 - 925
  • [44] Towards Trustworthy Machine Learning in Production: An Overview of the Robustness in MLOps Approach
    Bayram, Firas
    Ahmed, Bestoun s.
    ACM COMPUTING SURVEYS, 2025, 57 (05)
  • [45] Innovative composite machine learning approach for biodiesel production in public vehicles
    Yang, Yun
    Gao, Lizhen
    Abbas, Mohamed
    Elkamchouchi, Dalia H.
    Alkhalifah, Tamim
    Alturise, Fahad
    Ponnore, Joffin Jose
    ADVANCES IN ENGINEERING SOFTWARE, 2023, 184
  • [46] Solar photovoltaic panel production in Mexico: A novel machine learning approach
    Lopez-Flores, Francisco Javier
    Ramirez-Marquez, Cesar
    Rubio-Castro, Eusiel
    Ponce-Ortega, Jose Mari
    ENVIRONMENTAL RESEARCH, 2024, 246
  • [47] A Machine Learning Approach to Estimating Student Mastery by Predicting Feedback Request and Solving Time in Online Learning System
    Kannan, N.
    Yeh, Charles Y. C.
    Chou, Chih-Yueh
    Chan, Tak-Wai
    29TH INTERNATIONAL CONFERENCE ON COMPUTERS IN EDUCATION (ICCE 2021), VOL I, 2021, : 241 - 250
  • [48] A Machine Learning Approach for Non-Invasive Diagnosis of Metabolic Syndrome
    Datta, Suparno
    Schraplau, Anne
    da Cruz, Harry Freitas
    Sachs, Jan Philipp
    Mayer, Frank
    Boettinger, Erwin
    2019 IEEE 19TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2019, : 933 - 940
  • [49] Estimating and predicting the human development index with uncertain data: a common weight fuzzy benefit-of-the-doubt machine learning approach
    Omrani, Hashem
    Yang, Zijiang
    Imanirad, Raha
    ANNALS OF OPERATIONS RESEARCH, 2024,
  • [50] A new machine learning approach for estimating shear wave velocity profile using borelog data
    Joshi, Anushka
    Raman, Balasubramanian
    Mohan, C. Krishna
    Cenkeramaddi, Linga Reddy
    SOIL DYNAMICS AND EARTHQUAKE ENGINEERING, 2024, 177