Sugarcane Yield Grade Prediction Using Random Forest with Forward Feature Selection and Hyper-parameter Tuning

被引:23
作者
Charoen-Ung, Phusanisa [1 ]
Mittrapiyanuruk, Pradit [1 ]
机构
[1] Srinakharinwirot Univ, Dept Comp Sci, Fac Sci, Bangkok, Thailand
来源
RECENT ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY 2018 | 2019年 / 769卷
关键词
Yield grade prediction; Machine learning; Random forest; Forward feature selection; Hyper-parameter tuning;
D O I
10.1007/978-3-319-93692-5_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a Random Forest (RF) based method for predicting the sugarcane yield grade of a farmer plot. The dataset used in this work is obtained from a set of sugarcane plots around a sugar mill in Thailand. The number of records in the train dataset and the test dataset are 8,765 records and 3,756 records, respectively. We propose a forward feature selection in conjunction with hyper-parameter tuning for training the random forest classifier. The accuracy of our method is 71.88%. We compare the accuracy of our method with two non-machine-learning baselines. The first baseline is to use the actual yield of the last year as the prediction. The second baseline is that the target yield of each plot is manually predicted by human expert. The accuracies of these baselines are 51.52% and 65.50%, respectively. The results on accuracy indicate that our proposed method can be used for aiding the decision making of sugar mill operation planning.
引用
收藏
页码:33 / 42
页数:10
相关论文
共 9 条
[1]  
[Anonymous], 1997, Machine Learning: Proceedings of the Fourteenth International Conference (ICML'97)
[2]   The effect of tuning, feature engineering, and feature selection in data mining applied to rainfed sugarcane yield modelling [J].
Bocca, Felipe F. ;
Antunes Rodrigues, Luiz Henrique .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2016, 128 :67-76
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   Accurate prediction of sugarcane yield using a random forest algorithm [J].
Everingham, Yvette ;
Sexton, Justin ;
Skocaj, Danielle ;
Inman-Bamber, Geoff .
AGRONOMY FOR SUSTAINABLE DEVELOPMENT, 2016, 36 (02)
[5]   Attribute Selection Impact on Linear and Nonlinear Regression Models for Crop Yield Prediction [J].
Gonzalez-Sanchez, Alberto ;
Frausto-Solis, Juan ;
Ojeda-Bustamante, Waldo .
SCIENTIFIC WORLD JOURNAL, 2014,
[6]   From spreadsheets to sugar content modeling: A data mining approach [J].
Gravina de Oliveira, Monique Pires ;
Bocca, Felipe Ferreira ;
Antunes Rodrigues, Luiz Henrique .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2017, 132 :14-20
[7]  
Pedregosa F, 2011, J MACH LEARN RES, V12, P2825
[8]   Feature Selection for Wheat Yield Prediction [J].
Russ, Georg ;
Kruse, Rudolf .
RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, :465-478
[9]   Harvest scheduling algorithm to equalize supplier benefits: A case study from the Thai sugar cane industry [J].
Thuankaewsing, S. ;
Khamjan, S. ;
Piewthongngam, K. ;
Pathumnakul, S. .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2015, 110 :42-55