A Comparative Analysis of XGBoost and Neural Network Models for Predicting Some Tomato Fruit Quality Traits from Environmental and Meteorological Data

被引:16
作者
M'hamdi, Oussama [1 ,2 ]
Takacs, Sandor [1 ]
Palotas, Gabor [3 ]
Ilahy, Riadh [4 ]
Helyes, Lajos [1 ]
Pek, Zoltan [1 ]
机构
[1] Hungarian Univ Agr & Life Sci, Inst Hort Sci, Pater K Str 1, H-2100 Godollo, Hungary
[2] Hungarian Univ Agr & Life Sci, Doctoral Sch Plant Sci, Pater K Str 1, H-2100 Godollo, Hungary
[3] Univer Prod Zrt, Szolnok Ut 35, H-6000 Kecskemet, Hungary
[4] Univ Carthage, Natl Agr Res Inst Tunisia INRAT, Lab Hort, Ariana 1004, Tunisia
来源
PLANTS-BASEL | 2024年 / 13卷 / 05期
关键词
tomato quality; extreme gradient boosting; artificial neural network; prediction; shapley additive explanations; CROP YIELD; LYCOPENE CONTENT; CLIMATE-CHANGE; SUGAR CONTENT; GENOTYPE; COLOR; CLASSIFICATION; AGRICULTURE; TEMPERATURE; METABOLISM;
D O I
10.3390/plants13050746
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
The tomato as a raw material for processing is globally important and is pivotal in dietary and agronomic research due to its nutritional, economic, and health significance. This study explored the potential of machine learning (ML) for predicting tomato quality, utilizing data from 48 cultivars and 28 locations in Hungary over 5 seasons. It focused on degrees Brix, lycopene content, and colour (a/b ratio) using extreme gradient boosting (XGBoost) and artificial neural network (ANN) models. The results revealed that XGBoost consistently outperformed ANN, achieving high accuracy in predicting degrees Brix (R-2 = 0.98, RMSE = 0.07) and lycopene content (R-2 = 0.87, RMSE = 0.61), and excelling in colour prediction (a/b ratio) with a R-2 of 0.93 and RMSE of 0.03. ANN lagged behind particularly in colour prediction, showing a negative R-2 value of -0.35. Shapley additive explanation's (SHAP) summary plot analysis indicated that both models are effective in predicting degrees Brix and lycopene content in tomatoes, highlighting different aspects of the data. SHAP analysis highlighted the models' efficiency (especially in degrees Brix and lycopene predictions) and underscored the significant influence of cultivar choice and environmental factors like climate and soil. These findings emphasize the importance of selecting and fine-tuning the appropriate ML model for enhancing precision agriculture, underlining XGBoost's superiority in handling complex agronomic data for quality assessment.
引用
收藏
页数:22
相关论文
共 135 条
[1]   State-of-the-art in artificial neural network applications: A survey [J].
Abiodun, Oludare Isaac ;
Jantan, Aman ;
Omolara, Abiodun Esther ;
Dada, Kemi Victoria ;
Mohamed, Nachaat AbdElatif ;
Arshad, Humaira .
HELIYON, 2018, 4 (11)
[2]  
Agbemafle R., 2014, Food Sci. Qual. Manag, V34, P10
[3]   Quantification of sugars and organic acids in tomato fruits [J].
Agius, Carlos ;
von Tucher, Sabine ;
Poppenberger, Brigitte ;
Rozhon, Wilfried .
METHODSX, 2018, 5 :537-550
[4]   Plant responses to environmental stresses-from gene to biotechnology [J].
Ahanger, Mohammad Abass ;
Akram, Nudrat Aisha ;
Ashraf, Muhammad ;
Alyemeni, Mohammed Nasser ;
Wijaya, Leonard ;
Ahmad, Parvaiz .
AOB PLANTS, 2017, 9
[5]   Enabling interpretable machine learning for biological data with reliability scores [J].
Ahlquist, K. D. ;
Sugden, Lauren ;
Ramachandran, Sohini .
PLOS COMPUTATIONAL BIOLOGY, 2023, 19 (05)
[6]   The effectiveness of using mathematical modeling in assessing the quality of food products [J].
Akimov, S. S. ;
Lebedev, S., V ;
Grechkina, V. V. ;
Miroshnikova, M. S. ;
Topuria, G. M. .
INTERNATIONAL CONFERENCE ON WORLD TECHNOLOGICAL TRENDS IN AGRIBUSINESS, 2021, 624
[7]   Cultivar choice provides options for local production of organic and conventionally produced tomatoes with higher quality and antioxidant content [J].
Aldrich, Heather Troxell ;
Salandanan, Karen ;
Kendall, Patricia ;
Bunning, Marisa ;
Stonaker, Frank ;
Kuelen, Oktay ;
Stushnoff, Cecil .
JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE, 2010, 90 (15) :2548-2555
[8]   Predictive non-linear modeling of complex data by artificial neural networks [J].
Almeida, JS .
CURRENT OPINION IN BIOTECHNOLOGY, 2002, 13 (01) :72-76
[9]   Climate variability, agricultural technologies adoption, and productivity in rural Nigeria: a plot-level analysis [J].
Amankwah, Akuffo .
AGRICULTURE & FOOD SECURITY, 2023, 12 (01)
[10]   Recognizing intertwined patterns using a network of spiking pattern recognition platforms [J].
Amiri, Masoud ;
Jafari, Amir Homayoun ;
Makkiabadi, Bahador ;
Nazari, Soheila .
SCIENTIFIC REPORTS, 2022, 12 (01)