A modified genetic algorithm and weighted principal component analysis based feature selection and extraction strategy in agriculture

Cited by: 18
Authors
Shastry, K. Aditya [1]
Sanjay, H. A. [1]
Affiliations
[1] Nitte Meenakshi Inst Technol, Bengaluru 64, India
Keywords
Feature selection; Feature extraction; Hybrid; Genetic Algorithm; Weighted-Principal Component Analysis
DOI
10.1016/j.knosys.2021.107460
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Data pre-processing transforms raw data into a format suitable for machine learning (ML) techniques. Feature selection (FS) and feature extraction (FeExt) are significant components of data pre-processing. FS identifies the relevant features that enhance the accuracy of a model. Since agricultural data contain diverse features related to climate, soil, and fertilizer, FS is especially important, as irrelevant features may adversely affect the predictions of the resulting model. FeExt, in turn, derives new attributes from the existing ones. These new features retain the information of the original attributes, minus the redundancy. With these points in mind, this work proposes a hybrid feature selection and feature extraction strategy for agricultural data sets. A modified Genetic Algorithm (m-GA) was developed by designing a fitness function based on "Mutual Information" (MutInf) and "Root Mean Square Error" (RtMSE) to choose the features that most affect the target attribute (crop yield in this case). The selected features were then subjected to feature extraction using "weighted principal component analysis" (wgt-PCA). The extracted features were fed into different ML models, viz. "Regression" (Reg), "Artificial Neural Networks" (ArtNN), "Adaptive Neuro Fuzzy Inference System" (ANFIS), "Ensemble of Trees" (EnT), and "Support Vector Regression" (SuVR). Trials on 3 benchmark and 8 real-world farming datasets showed that the developed hybrid feature selection and extraction technique yielded significant improvements in R², RtMSE, and "mean absolute error" (MAE) compared with FS and FeExt methods such as Correlation Analysis (CA), Singular Value Decomposition (SiVD), Genetic Algorithm (GA), and wgt-PCA. © 2021 Published by Elsevier B.V.
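The two-stage pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not state how MutInf and RtMSE are combined in the fitness function, how the m-GA differs from a standard GA, or how the wgt-PCA weights are defined, so everything below is an assumption made for illustration: the fitness is taken as the mean mutual information of the selected features minus a normalised RMSE of a linear fit, the GA is a plain elitist one with uniform crossover and bit-flip mutation, and the PCA weights are supplied by the caller. All function names and parameters (`mutual_info`, `fitness`, `select_features`, `weighted_pca`, `alpha`, `p_mut`) are hypothetical and do not come from the paper.

```python
import numpy as np

def mutual_info(x, y, bins=8):
    """Plug-in histogram estimate of mutual information (nats) between two 1-D arrays."""
    joint, _, _ = np.histogram2d(x, y, bins=bins)
    pxy = joint / joint.sum()
    px = pxy.sum(axis=1, keepdims=True)   # marginal of x, shape (bins, 1)
    py = pxy.sum(axis=0, keepdims=True)   # marginal of y, shape (1, bins)
    nz = pxy > 0                          # skip empty cells to avoid log(0)
    return float(np.sum(pxy[nz] * np.log(pxy[nz] / (px @ py)[nz])))

def rmse_of_linear_fit(X, y):
    """Training RMSE of an ordinary least-squares fit with an intercept."""
    A = np.column_stack([X, np.ones(len(y))])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return float(np.sqrt(np.mean((A @ coef - y) ** 2)))

def fitness(mask, X, y, alpha=1.0):
    """ASSUMED fitness: mean MI of selected features minus alpha * RMSE / std(y)."""
    if not mask.any():
        return -np.inf                    # an empty feature set is never acceptable
    mi = np.mean([mutual_info(X[:, j], y) for j in np.flatnonzero(mask)])
    return mi - alpha * rmse_of_linear_fit(X[:, mask], y) / np.std(y)

def select_features(X, y, pop=20, gens=30, p_mut=0.1, seed=0):
    """Plain elitist GA over boolean feature masks (not the paper's m-GA)."""
    rng = np.random.default_rng(seed)
    n_feat = X.shape[1]
    population = rng.random((pop, n_feat)) < 0.5
    n_keep = pop // 2
    for _ in range(gens):
        scores = np.array([fitness(m, X, y) for m in population])
        parents = population[np.argsort(scores)[::-1][:n_keep]]   # keep the best half
        a = parents[rng.integers(0, n_keep, pop - n_keep)]
        b = parents[rng.integers(0, n_keep, pop - n_keep)]
        children = np.where(rng.random(a.shape) < 0.5, a, b)      # uniform crossover
        children ^= rng.random(children.shape) < p_mut            # bit-flip mutation
        population = np.vstack([parents, children])
    scores = np.array([fitness(m, X, y) for m in population])
    return population[int(np.argmax(scores))]

def weighted_pca(X, weights, n_components):
    """ASSUMED weighting: scale standardised columns by sqrt(weight), then PCA via SVD."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    Zw = Z * np.sqrt(np.asarray(weights, dtype=float))
    _, _, vt = np.linalg.svd(Zw, full_matrices=False)
    return Zw @ vt[:n_components].T       # component scores, highest variance first
```

In this sketch, `select_features(X, y)` returns a boolean mask over the columns of `X`, and `weighted_pca(X[:, mask], w, k)` projects the selected columns onto `k` components. One plausible choice for `w`, again an assumption, is the mutual information of each selected feature with the target. The resulting scores would then be passed to a downstream regressor.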
Pages: 18