Investigation on the use of ensemble learning and big data in crop identification

被引:6
|
作者
Ahmed, Sayed [1 ]
Mahmoud, Amira S. [1 ]
Farg, Eslam [1 ]
Mohamed, Amany M. [1 ]
Moustafa, Marwa S. [1 ]
Abutaleb, Khaled [1 ]
Saleh, Ahmed M. [1 ]
AbdelRahman, Mohamed A. E. [1 ]
AbdelSalam, Hisham M. [2 ]
Arafat, Sayed M. [1 ]
机构
[1] Natl Author Remote Sensing & Space Sci NARSS, Cairo, Egypt
[2] Cairo Univ, Fac Comp & Artificial Intelligence, Giza, Egypt
关键词
Big data; Crop identification; Ensemble learning; DB Framework; Apache spark;
D O I
10.1016/j.heliyon.2023.e13339
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The agriculture sector in Egypt faces several problems, such as climate change, water storage, and yield variability. The comprehensive capabilities of Big Data (BD) can help in tackling the uncertainty of food supply occurs due to several factors such as soil erosion, water pollution, climate change, socio-cultural growth, governmental regulations, and market fluctuations. Crop identification and monitoring plays a vital role in modern agriculture. Although several machine learning models have been utilized in identifying crops, the performance of ensemble learning has not been investigated extensively. The massive volume of satellite imageries has been established as a big data problem forcing to deploy the proposed solution using big data technologies to manage, store, analyze, and visualize satellite data. In this paper, we have developed a weighted voting mechanism for improving crop classification performance in a large scale, based on ensemble learning and big data schema. Built upon Apache Spark, the popular DB Framework, the proposed approach was tested on El Salheya, Ismaili governate. The proposed ensemble approach boosted accuracy by 6.5%, 1.9%, 4.4%, 4.9%, 4.7% in precision, recall, F-score, Overall Accuracy (OA), and Matthews correlation coefficient (MCC) metrics respectively. Our findings confirm the generalization of the proposed crop identification approach at a large-scale setting.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] An Ensemble Random Forest Algorithm for Insurance Big Data Analysis
    Wu, Ziming
    Lin, Weiwei
    Zhang, Zilong
    Wen, Angzhan
    Lin, Longxin
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) AND IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC), VOL 1, 2017, : 531 - 536
  • [22] Landslide Susceptibility Mapping: Machine and Ensemble Learning Based on Remote Sensing Big Data
    Kalantar, Bahareh
    Ueda, Naonori
    Saeidi, Vahideh
    Ahmadi, Kourosh
    Halin, Alfian Abdul
    Shabani, Farzin
    REMOTE SENSING, 2020, 12 (11)
  • [23] Multi-step forecasting for big data time series based on ensemble learning
    Galicia, A.
    Talavera-Llames, R.
    Troncoso, A.
    Koprinska, I.
    Martinez-Alvarez, F.
    KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 830 - 841
  • [24] Data analytics for crop management: a big data view
    Nabila Chergui
    Mohand Tahar Kechadi
    Journal of Big Data, 9
  • [25] Data analytics for crop management: a big data view
    Chergui, Nabila
    Kechadi, Mohand Tahar
    JOURNAL OF BIG DATA, 2022, 9 (01)
  • [26] Self-paced ensemble and big data identification: a classification of substantial imbalance computational analysis
    Shahzadi Bano
    Weimei Zhi
    Baozhi Qiu
    Muhammad Raza
    Nabila Sehito
    Mian Muhammad Kamal
    Ghadah Aldehim
    Nuha Alruwais
    The Journal of Supercomputing, 2024, 80 : 9848 - 9869
  • [27] Self-paced ensemble and big data identification: a classification of substantial imbalance computational analysis
    Bano, Shahzadi
    Zhi, Weimei
    Qiu, Baozhi
    Raza, Muhammad
    Sehito, Nabila
    Kamal, Mian Muhammad
    Aldehim, Ghadah
    Alruwais, Nuha
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (07) : 9848 - 9869
  • [28] The use of Big Data in politcs, or the politcs of Big Data
    Ardini, Claudia
    Nahum Mirad, Heraldo
    COMUNICACION Y HOMBRE, 2020, (16): : 225 - 240
  • [29] Prediction of crop yield using big data
    Wu Fan
    Chen Chong
    Guo Xiaoling
    Yu Hua
    Wang Juyun
    2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2015, : 255 - 260
  • [30] Fuzzy Divergence Weighted Ensemble Clustering With Spectral Learning Based on Random Projections for Big Data
    Lahmar, Ines
    Zaier, Aida
    Yahia, Mohamed
    Ali, Tarig
    Boaullegue, Ridha
    IEEE ACCESS, 2024, 12 : 20197 - 20208