Estimating non-overfitted convex production technologies: A stochastic machine learning approach

被引:0
|
作者
Guillen, Maria D. [1 ]
Charles, Vincent [2 ]
Aparicio, Juan [1 ,3 ]
机构
[1] Miguel Hernandez Univ Elche, Ctr Operat Res, Avda Univ S-N, Elche 03202, Spain
[2] Queens Univ Belfast, Queens Business Sch, Belfast BT9 5EE, North Ireland
[3] ValgrAI Valencian Grad Sch & Res Network Artificia, Joint Res Unit, Camino Vera S-N, Valencia 46022, Spain
关键词
Data Envelopment Analysis; Technical efficiency measurement; Stochastic gradient boosting; Machine learning; DATA ENVELOPMENT ANALYSIS; MEASURING EFFICIENCY; BOOTSTRAP; MODELS; DEA;
D O I
10.1016/j.ejor.2024.11.030
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Overfitting is a classical statistical issue that occurs when a model fits a particular observed data sample too closely, potentially limiting its generalizability. While Data Envelopment Analysis (DEA) is a powerful nonparametric method for assessing the relative efficiency of decision-making units (DMUs), its reliance on the minimal extrapolation principle can lead to concerns about overfitting, particularly when the goal extends beyond evaluating the specific DMUs in the sample to making broader inferences. In this paper, we propose an adaptation of Stochastic Gradient Boosting to estimate production possibility sets that mitigate overfitting while satisfying shape constraints such as convexity and free disposability. Our approach is not intended to replace DEA but to complement it, offering an additional tool for scenarios where generalization is important. Through simulation experiments, we demonstrate that the proposed method performs well compared to DEA, especially in high-dimensional settings. Furthermore, the new machine learning-based technique is compared to the Corrected Concave Non-parametric Least Squares (C2NLS), showing competitive performance. We also illustrate how the usual efficiency measures in DEA can be implemented under our approach. Finally, we provide an empirical example based on data from the Program for International Student Assessment (PISA) to demonstrate the applicability of the new method.
引用
收藏
页码:224 / 240
页数:17
相关论文
共 50 条
  • [21] A Machine Learning Approach to Detecting Low Medication State with Wearable Technologies
    Cheon, Andy
    Jung, Stephanie Yeoju
    Prather, Collin
    Sarmiento, Matthew
    Wong, Kevin
    Woodbridge, Diane Myung-kyung
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 4252 - 4255
  • [22] Predicting and Explaining Pig Iron Production on Charcoal Blast Furnaces: A Machine Learning Approach
    Melo Lima, Marcio Salles
    Eryarsoy, Enes
    Delen, Dursun
    INFORMS JOURNAL ON APPLIED ANALYTICS, 2021, 51 (03): : 213 - 235
  • [23] A machine learning approach for estimating surface tension based on pendant drop images
    Soori, Tejaswi
    Rassoulinejad-Mousavi, Seyed Moein
    Zhang, Lige
    Rokoni, Arif
    Sun, Ying
    FLUID PHASE EQUILIBRIA, 2021, 538
  • [24] A Machine Learning Approach to Estimating Solar Radiation Shading Rates in Mountainous Areas
    Xu, Luting
    Li, Yanru
    Wang, Xiao
    Liu, Lei
    Ma, Ming
    Yang, Junhui
    Al-Dahidi, Sameer
    Louzazni, Mohamed
    Madhiarasan, Manoharan
    SUSTAINABILITY, 2024, 16 (02)
  • [25] Machine learning model based on non-convex penalized huberized-SVM
    Wang P.
    Guo J.
    Li L.-F.
    Journal of Electronic Science and Technology, 2024, 22 (01)
  • [26] A machine learning approach to estimating the error in satellite sea surface temperature retrievals
    Kumar, Chirag
    Podesta, Guillermo
    Kilpatrick, Katherine
    Minnett, Peter
    REMOTE SENSING OF ENVIRONMENT, 2021, 255
  • [27] A machine learning-based approach for estimating and testing associations with multivariate outcomes
    Benkeser, David
    Mertens, Andrew
    Colford, John M.
    Hubbard, Alan
    Arnold, Benjamin F.
    Stein, Aryeh
    van der Laan, Mark J.
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2021, 17 (01) : 7 - 21
  • [28] A machine learning approach for resource mapping analysis of greenhouse gas removal technologies
    Asibor, Jude O.
    Clough, Peter T.
    Nabavi, Seyed Ali
    Manovic, Vasilije
    ENERGY AND CLIMATE CHANGE, 2023, 4
  • [29] Implementing Machine Learning Methods in Estimating the Size of the Non-observed Economy
    Shami, Labib
    Lazebnik, Teddy
    COMPUTATIONAL ECONOMICS, 2024, 63 (04) : 1459 - 1476
  • [30] A stochastic machine learning based approach for observability enhancement of automated smart grids
    Min, Li
    Alnowibet, Khalid Abdulaziz
    Alrasheedi, Adel Fahad
    Moazzen, Farid
    Awwad, Emad Mahrous
    Mohamed, Mohamed A.
    SUSTAINABLE CITIES AND SOCIETY, 2021, 72