An Ensemble Machine Learning Framework for Cotton Crop Yield Prediction Using Weather Parameters: A Case Study of Pakistan

被引:2
作者
Haider, Syed Tahseen [1 ,2 ]
Ge, Wenping [2 ]
Li, Jianqiang [1 ]
Rehman, Saif Ur [3 ]
Imran, Azhar [4 ]
Sharaf, Mohamed Abdel Fattah [5 ]
Haider, Syed Muhammad [3 ]
机构
[1] Beijing Univ Technol, Sch Software Engn, Beijing 100124, Peoples R China
[2] Xinjiang Univ, Sch Comp Sci & Technol, Urumqi 830046, Xinjiang, Peoples R China
[3] Univ Inst Informat Technol, Pir Mehr Ali Shah Arid Agr Univ, Rawalpindi 46300, Pakistan
[4] Air Univ, Fac Comp & Artificial Intelligence, Islamabad 44000, Pakistan
[5] King Saud Univ, Coll Engn, Ind Engn Dept, Riyadh 11421, Saudi Arabia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Cotton; Meteorology; Predictive models; Production; Agriculture; Crop yield; Machine learning; Climate change; Plant diseases; machine learning; climatic change; yield; weather; CHINA;
D O I
10.1109/ACCESS.2024.3454511
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In Pakistan, agriculture is one of the most common and least lucrative professions. It provides between 18% and 25% of Pakistan's overall gross domestic product (GDP). The majority of Pakistan's crops, like cotton, are completely weather-dependent. Regarding this, farmers are constantly attempting to implement new techniques and technology to boost crop yields. Technology-based approaches to crop yield analysis, such as machine learning (ML) and data mining, are causing a boom in the agricultural sector by altering the revenue scenario through the growth of the best crop. By utilizing ML algorithms to analyze agriculture climatic data, it is possible to increase crop yields. The proposed research was carried out in two dimensions. Initially, field observations were made to determine the effects of daily variations in meteorological parameters, such as rainfall, temperature, and wind, on plant growth and development at each phonological stage of cotton crop production. Throughout the Kharif Seasons 2005-2020, various phonological stages of the cotton crop grown in the fields of the Ayyub Agriculture Research Institute in Faisalabad (Central Punjab) were monitored using meteorological and phonological observations, as well as soil data. Finally, the cotton prediction framework as Random Forest Extreme Gradient (RFXG) has been proposed to predict cotton production based on observed data. RFXG concentrates on the quantification of machine learning algorithms and their practical application. The workings of RFXG have been divided into two phases. In the very first phase of data collection, preprocessing, attribute selection, and data splitting have been presented. In the following phase, prediction and evaluation were developed. The comparative results show that the prediction results of the proposed RFXG using the optimization algorithm are significantly improved by 0.05 RMSE (Root Mean Square Error) in comparison to the traditional Extreme Gradient Boost (XGB) model, which has a RMSE of 0.07. Proposed technique also compared with some baseline approaches of cotton predication. Comparison shows that proposed technique achieves better results as compared to baseline approaches. The proposed RFXG model (ensemble-based method) can bag, stack, and boost, making it fast and efficient predications as compared with existing approaches. Bagging averages, the results of numerous decision tree fit to different subsets of the same dataset to increase accuracy. The proposed study will be very useful in the future to close the gap between the current yield obtained and the potential yield of this cultivar, which is grown in Pakistan and other cotton-growing locations.
引用
收藏
页码:124045 / 124061
页数:17
相关论文
共 30 条
[1]   Flow-Aware Forwarding in SDN Datacenters Using a Knapsack-PSO-Based Solution [J].
Abdollahi, Sahar ;
Deldari, Arash ;
Asadi, Hamid ;
Montazerolghaem, AhmadReza ;
Mazinani, Sayyed Majid .
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (03) :2902-2914
[2]   Rainfall trend analysis and weather forecast accuracy in selected parts of Khyber Pakhtunkhwa, Pakistan [J].
Alam, Fakhri ;
Salam, Muhammad ;
Khalil, Nasir Ahmad ;
Khan, Owais ;
Khan, Masaud .
SN APPLIED SCIENCES, 2021, 3 (05)
[3]   Machine learning algorithms for forecasting the incidence of Coffea arabica pests and diseases [J].
Aparecido, Lucas Eduardo de Oliveira ;
Rolim, Glauco de Souza ;
De Moraes, Jose Reinaldo da Silva ;
Costa, Cicero Teixeira Silva ;
de Souza, Paulo Sergio .
INTERNATIONAL JOURNAL OF BIOMETEOROLOGY, 2020, 64 (04) :671-688
[4]   Autoencoders and their applications in machine learning: a survey [J].
Berahmand, Kamal ;
Daneshfar, Fatemeh ;
Salehi, Elaheh Sadat ;
Li, Yuefeng ;
Xu, Yue .
ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (02)
[5]   Random forests for classification in ecology [J].
Cutler, D. Richard ;
Edwards, Thomas C., Jr. ;
Beard, Karen H. ;
Cutler, Adele ;
Hess, Kyle T. .
ECOLOGY, 2007, 88 (11) :2783-2792
[6]   Autumn Crop Yield Prediction using Data-Driven Approaches:- Support Vector Machines, Random Forest, and Deep Neural Network Methods [J].
Dang, Chaoya ;
Liu, Ying ;
Yue, Hui ;
Qian, JiaXin ;
Zhu, Rong .
CANADIAN JOURNAL OF REMOTE SENSING, 2021, 47 (02) :162-181
[7]  
Durgabai R. P. L., 2019, PROC INT C COMPUTATI, P37
[8]  
Gandhi N, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON TECHNOLOGICAL INNOVATIONS IN ICT FOR AGRICULTURE AND RURAL DEVELOPMENT (TIAR), P105, DOI 10.1109/TIAR.2016.7801222
[9]   Prediction of Winter Wheat Yield Based on Multi-Source Data and Machine Learning in China [J].
Han, Jichong ;
Zhang, Zhao ;
Cao, Juan ;
Luo, Yuchuan ;
Zhang, Liangliang ;
Li, Ziyue ;
Zhang, Jing .
REMOTE SENSING, 2020, 12 (02)
[10]  
Imran A., 2019, COTTON CROP DEV FAIS