New Partially Linear Regression and Machine Learning Models Applied to Agronomic Data

被引:1
|
作者
Rodrigues, Gabriela M. [1 ]
Ortega, Edwin M. M. [1 ]
Cordeiro, Gauss M. [2 ]
机构
[1] Univ Sao Paulo, Dept Exact Sci, BR-13418900 Piracicaba, Brazil
[2] Univ Fed Pernambuco, Dept Stat, BR-50670901 Recife, Brazil
关键词
agronomic experimentation; cross validation; decision tree; maximum likelihood estimation; random forest; residual analysis; CROSS-VALIDATION; CLASSIFICATION; TREE;
D O I
10.3390/axioms12111027
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Regression analysis can be appropriate to describe a nonlinear relationship between the response variable and the explanatory variables. This article describes the construction of a partially linear regression model with two systematic components based on the exponentiated odd log-logistic normal distribution. The parameters are estimated by the penalized maximum likelihood method. Simulations for some parameter settings and sample sizes empirically prove the accuracy of the estimators. The superiority of the proposed regression model over other regression models is shown by means of agronomic experimentation data. The predictive performance of the new model is compared with two machine learning techniques: decision trees and random forests. These methods achieved similar prediction performance, i.e., none stands out as a better predictor. In this sense, the objective of the research is to choose the best method. If the objective is only predictive, the decision tree can be used due to its simplicity. For inference purposes, the regression model is recommended, which can provide much more information regarding the relationship of the variables under study.
引用
收藏
页数:18
相关论文
共 50 条