Machine learning and multiple linear regression models can predict ascorbic acid and polyphenol contents, and antioxidant activity in strawberries

被引:0
|
作者
Zushi, Kazufumi [1 ]
Yamamoto, Miyu [1 ]
Matsuura, Momoka [1 ]
Tsutsuki, Kan [2 ]
Yonehana, Asumi [1 ]
Imamura, Ren [1 ]
Takahashi, Hiromi [1 ]
Kirimura, Masaaki [1 ]
机构
[1] Univ Miyazaki, Fac Agr, Dept Agr & Environm Sci, Miyazaki 8892192, Japan
[2] Univ Miyazaki, Grad Sch Agr, Miyazaki 8892192, Japan
基金
日本学术振兴会;
关键词
antioxidant compounds; artificial neural network; environmental conditions; Lasso regression; stepwise regression; QUALITY; IMPACT;
D O I
10.1002/jsfa.13906
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
BACKGROUND: Strawberry is a rich source of antioxidants, including ascorbic acid (ASA) and polyphenols, which have numerous health benefits. fi ts. Antioxidant content and activity are often determined manually using laboratory equipment, which is destructive and time-consuming. This study constructs a prediction model for antioxidant compounds utilizing machine learning (ML) and multiple linear regression based on environmental, plant growth and agronomic fruit quality-related parameters as well as antioxidant levels. These were studied in three farms at two-week intervals during two years of cultivation. RESULTS: During the ML model screening, artificial fi cial neural network (ANN)-boosted models displayed a moderate coefficient fi cient of determination (R2) R 2 ) at 0.68-0.78 - 0.78 and relative root mean square error (RRMSE) at 3.8-4.8% - 4.8% in polyphenols and total ASA levels, as well as a high R 2 of 0.96 and low RRMSE at < 3.0% in antioxidant activity. Additionally, we developed variable selection models regarding the antioxidant activity, and variables two and fi ve (environmental parameters and leaf length, respectively) with high accuracy were selected. The linear regression analysis between the actual and predicted data of antioxidants in the ANN-boosted models revealed high fi tness with all parameters in almost all training, validation and test sets. Furthermore, environmental parameters are essential in developing such reliable models. CONCLUSION: We conclude that ANN-boosted, stepwise and double-Lasso regression models can predict antioxidant compounds with enhanced accuracy, and the relevant parameters can be easily acquired on-site without the need for any specific fi c equipment. (c) 2024 Society of Chemical Industry.
引用
收藏
页码:1159 / 1169
页数:11
相关论文
共 13 条
  • [1] Machine learning and regression models to predict multiple tree stem volumes for teak
    Tavares Junior, Ivaldo da Silva
    Mazzini de Souza, Jianne Rafaela
    de Sousa Lopes, Lucas Sergio
    Fardin, Leonardo Pereira
    Casas, Gianmarco Goycochea
    de Oliveira Neto, Ricardo Rodrigues
    Leite, Rodrigo Vieira
    Leite, Helio Garcia
    SOUTHERN FORESTS-A JOURNAL OF FOREST SCIENCE, 2021, 83 (04) : 294 - 302
  • [2] Integrating Multiple Linear Regression Analysis and Machine Learning Models to Predict the Bearing Capacity of Strip Footings on Sandy Clay Slopes
    Mase, Lindung Zalbuin
    Misliniyati, Rena
    Muharama, Nia Afriantialina
    Supriani, Fepy
    Ahmad, Debby Ariansyah
    Fernanda, Ryan
    Chauhan, Vinay Bhushan
    Chaiyaput, Salisa
    TRANSPORTATION INFRASTRUCTURE GEOTECHNOLOGY, 2025, 12 (02)
  • [3] Multiple regression models and Computer Vision Systems to predict antioxidant activity and total phenols in pigmented carrots
    Pace, Bernardo
    Cefola, Maria
    Renna, Floriana
    Renna, Massimiliano
    Serio, Francesco
    Attolico, Giovanni
    JOURNAL OF FOOD ENGINEERING, 2013, 117 (01) : 74 - 81
  • [4] Evaluation of Multiple Linear Regression and Machine Learning Approaches to Predict Soil Compaction and Shear Stress Based on Electrical Parameters
    Pentos, Katarzyna
    Mbah, Jasper Tembeck
    Pieczarka, Krzysztof
    Niedbala, Gniewko
    Wojciechowski, Tomasz
    APPLIED SCIENCES-BASEL, 2022, 12 (17):
  • [5] Use of kinetic, Weibull and PLSR models to predict the retention of ascorbic acid, total phenols and antioxidant activity during storage of pasteurized pineapple juice
    Zheng, Hong
    Lu, Hongfei
    LWT-FOOD SCIENCE AND TECHNOLOGY, 2011, 44 (05) : 1273 - 1281
  • [6] Can machine learning algorithms perform better than multiple linear regression in predicting nitrogen excretion from lactating dairy cows
    Xianjiang Chen
    Huiru Zheng
    Haiying Wang
    Tianhai Yan
    Scientific Reports, 12
  • [7] Can machine learning algorithms perform better than multiple linear regression in predicting nitrogen excretion from lactating dairy cows
    Chen, Xianjiang
    Zheng, Huiru
    Wang, Haiying
    Yan, Tianhai
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [8] Dual ensemble approach to predict rice heading date by integrating multiple rice phenology models and machine learning-based genetic parameter regression models
    Kawakita, Satoshi
    Yamasaki, Masanori
    Teratani, Ryo
    Yabe, Shiori
    Kajiya-Kanegae, Hiromi
    Yoshida, Hiroe
    Fushimi, Erina
    Nakagawa, Hiroshi
    AGRICULTURAL AND FOREST METEOROLOGY, 2024, 344
  • [9] Managing Rockfall Hazard on Strategic Linear Stakes: How Can Machine Learning Help to Better Predict Periods of Increased Rockfall Activity?
    Chanut, Marie-Aurelie
    Courteille, Hermann
    Levy, Clara
    Atto, Abdourrahmane
    Meignan, Lucas
    Trouve, Emmanuel
    Gasc-Barbier, Muriel
    SUSTAINABILITY, 2024, 16 (09)
  • [10] Machine learning and linear regression models to predict catchment- level base cation weathering rates across the southern Appalachian Mountain region, USA
    Povak, Nicholas A.
    Hessburg, Paul F.
    McDonnell, Todd C.
    Reynolds, Keith M.
    Sullivan, Timothy J.
    Salter, R. Brion
    Cosby, Bernard J.
    WATER RESOURCES RESEARCH, 2014, 50 (04) : 2798 - 2814