Explainable AI-driven evaluation of plant protein rheology using tree-based and Gaussian process machine learning models

被引:0
作者
Yilmaz, Mustafa Tahsin [1 ,2 ]
Badurayq, Salman [3 ]
Polat, Kemal [4 ]
Milyani, Ahmad H. [5 ,6 ]
Alkabaa, Abdulaziz S. [1 ]
Gul, Osman [7 ]
Saricaoglu, Furkan Turker [8 ]
机构
[1] King Abdulaziz Univ, Fac Engn, Dept Ind Engn, Jeddah, Saudi Arabia
[2] King Abdulaziz Univ, Ctr Res Excellence Artificial Intelligence & Data, Jeddah, Saudi Arabia
[3] Pladis Global, Phase 3, Jeddah, Saudi Arabia
[4] King Abdulaziz Univ, Fac Engn, Dept Elect & Comp Engn, Jeddah, Saudi Arabia
[5] King Abdulaziz Univ, Ctr Excellence Intelligent Engn Syst CEIES, Jeddah, Saudi Arabia
[6] Kastamonu Univ, Fac Engn & Architecture, Dept Food Engn, Kastamonu, Turkiye
[7] Bursa Tech Univ, Fac Engn & Nat Sci, Dept Food Engn, Bursa, Turkiye
[8] Abant Izzet Baysal Univ, Fac Engn, Dept Elect & Elect Engn, TR-14280 Bolu, Turkiye
关键词
Explainable artificial intelligence; Sesame protein isolates; Steady shear rheology; Tree-based machine learning models; Gaussian Process regressor; HIGH-PRESSURE; BEHAVIOR; AGGREGATION; TEMPERATURE; PREDICTION; ISOLATE; MEAL; ANN;
D O I
10.1016/j.asej.2025.103565
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this study, we conducted a comparative analysis of the explainability of Decision Tree Regressor (DTR) and Gaussian Process Regressor (GPR) models in predicting the shear stress and viscosity of sesame protein isolate (SPI) systems, employing explainable machine learning (EML) techniques to elucidate complex, nonlinear relationships among processing parameters. SPI samples were processed across pressure levels ranging from 0 to 100 MPa and ion concentration (IC) values from 0 to 200 mM. DTR model accurately predicted shear stress (R2 = 0.999), while a GPR model achieved high performance for viscosity prediction (R2 = 0.9925). Formally, the modeling task is framed as learning a predicting mapping function f : Rp -> R, where x is an element of Rp denotes the vector of predictors (pressure, IC, shear rate) and y is an element of R is the target variable (shear stress or viscosity), by minimizing a loss function such as mean squared error. Interpretation of model predictions using SHapley Additive exPlanations (SHAP), permutation importance, and partial dependence analysis revealed that pressure and IC are the most influential factors affecting shear stress and viscosity, with pressure inducing protein conformational changes that impact rheological properties. The shear rate exhibited a lesser direct impact within the systems examined. Partial Dependence Plots (PDPs) from the DTR model revealed strong, nearly linear positive relationships between pressure and shear stress, while the GPR model depicted more nuanced responses, highlighting the models' differing sensitivities. Variance-Based Sensitivity Indices (VBSIs) further quantified these influences, with pressure and IC showing higher sensitivity scores in the DTR model compared to the GPR model. Permutation importance and SHAP interaction analyses corroborated these results, emphasizing the dominant role of pressure and IC, both independently and interactively, in determining shear stress. In contrast, viscosity predictions were influenced by more distributed and subtle interactions among all features. Employing explainable machine learning techniques enables a comprehensive understanding of feature relevance in complex, nonlinear rheological systems, facilitating the elucidation of viscosity development in sesame protein systems through rheological indices. This approach ensures no bias toward formulation composition and applied pressure, offering valuable insights for optimizing formulation and processing conditions in food applications to enhance the functional properties of SPI-based products.
引用
收藏
页数:21
相关论文
共 81 条
[31]   Machine learning-based prediction of Clostridium growth in pork meat using explainable artificial intelligence [J].
Ince, Volkan ;
Bader-El-Den, Mohamed ;
Alderton, Jack ;
Arabikhan, Farzad ;
Sari, Omer Faruk ;
Sansom, Annette .
JOURNAL OF FOOD SCIENCE AND TECHNOLOGY-MYSORE, 2025,
[32]  
Janzing DMinorics L, 2020, PMLR
[33]   Rheology-Based Classification of Foods for the Elderly by Machine Learning Analysis [J].
Jeong, Sungmin ;
Kim, Heesu ;
Lee, Suyong .
APPLIED SCIENCES-BASEL, 2021, 11 (05) :1-10
[34]   Multicollinearity and misleading statistical results [J].
Kim, Jong Hae .
KOREAN JOURNAL OF ANESTHESIOLOGY, 2019, 72 (06) :558-569
[35]   Rheological properties of soy protein isolate solution for fibers and films [J].
Liu, Pengchao ;
Xu, Helan ;
Zhao, Yi ;
Yang, Yiqi .
FOOD HYDROCOLLOIDS, 2017, 64 :149-156
[36]   Development of a liquid nutritional supplement using a Sesamum indicum L. protein isolate [J].
López, G ;
Flores, I ;
Gálvez, A ;
Quirasco, M ;
Farrés, A .
LEBENSMITTEL-WISSENSCHAFT UND-TECHNOLOGIE-FOOD SCIENCE AND TECHNOLOGY, 2003, 36 (01) :67-74
[37]  
Lundberg SM, 2017, ADV NEUR IN, V30
[38]   Impact of high-pressure homogenization on physico-chemical, structural, and rheological properties of quinoa protein isolates [J].
Luo, Lan ;
Cheng, Lirong ;
Zhang, Ruijia ;
Yang, Zhi .
FOOD STRUCTURE-NETHERLANDS, 2022, 32
[39]  
Bishop CM, 2013, Arxiv, DOI arXiv:1301.3838
[40]  
Lundberg SM, 2019, Arxiv, DOI arXiv:1802.03888