Machine learning;
Intelligent water drop algorithm;
Hybrid feature selection;
High dimensional datasets;
Medical applications;
OPTIMIZATION;
ALGORITHM;
ENSEMBLE;
SEARCH;
D O I:
10.1016/j.compbiolchem.2022.107809
中图分类号:
Q [生物科学];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
Classifying microarray datasets, which usually contains many noise genes that degrade the performance of classifiers and decrease classification accuracy rate, is a competitive research topic. Feature selection (FS) is one of the most practical ways for finding the most optimal subset of genes that increases classification's accuracy for diagnostic and prognostic prediction of tumor cancer from the microarray datasets. This means that we always need to develop more efficient FS methods, that select only optimal or close-to-optimal subset of features to improve classification performance. In this paper, we propose a hybrid FS method for microarray data processing, that combines an ensemble filter with an Improved Intelligent Water Drop (IIWD) algorithm as a wrapper by adding one of three local search (LS) algorithms: Tabu search (TS), Novel LS algorithm (NLSA), or Hill Climbing (HC) in each iteration from IWD, and using a correlation coefficient filter as a heuristic undesirability (HUD) for next node selection in the original IWD algorithm. The effects of adding three different LS algorithms to the proposed IIWD algorithm have been evaluated through comparing the performance of the proposed ensemble filter-IIWD-based wrapper without adding any LS algorithms named (PHFS-IWD) FS method versus its performance when adding a specific LS algorithm from (TS, NLSA or HC) in FS methods named, (PHFS-IWDTS, PHFS-IWDNLSA, and PHFS-IWDHC), respectively. Naive Bayes(NB) classifier with five microarray datasets have been deployed for evaluating and comparing the proposed hybrid FS methods. Results show that using LS algorithms in each iteration from the IWD algorithm improves F-score value with an average equal to 5% compared with PHFS-IWD. Also, PHFS-IWDNLSA improves the F-score value with an average of 4.15% over PHFS-IWDTS, and 5.67% over PHFS-IWDHC while PHFS-IWDTS outperformed PHFS-IWDHC with an average of increment equal to 1.6%. On the other hand, the proposed hybrid-based FS methods improve accuracy with an average equal to 8.92% in three out of five datasets and decrease the number of genes with a percentage of 58.5% in all five datasets compared with six of the most recent state-of-the-art FS methods.
机构:
Univ Hawaii Manoa, Dept Civil Environm & Construct, Engn & Water Resources Res Ctr, Honolulu, HI USAUniv Hawaii Manoa, Dept Civil Environm & Construct, Engn & Water Resources Res Ctr, Honolulu, HI USA
Lee, Jinwook
Bateni, Sayed M.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Hawaii Manoa, Dept Civil Environm & Construct, Engn & Water Resources Res Ctr, Honolulu, HI USAUniv Hawaii Manoa, Dept Civil Environm & Construct, Engn & Water Resources Res Ctr, Honolulu, HI USA
Bateni, Sayed M.
Jun, Changhyun
论文数: 0引用数: 0
h-index: 0
机构:
Chung Ang Univ, Coll Engn, Dept Civil & Environm Engn, Seoul, South KoreaUniv Hawaii Manoa, Dept Civil Environm & Construct, Engn & Water Resources Res Ctr, Honolulu, HI USA
Jun, Changhyun
Heggy, Essam
论文数: 0引用数: 0
h-index: 0
机构:
Univ Southern Calif, Ming Hsieh Dept Elect & Comp Engn, Los Angeles, CA USA
CALTECH, NASA Jet Prop Lab, Pasadena, CA USAUniv Hawaii Manoa, Dept Civil Environm & Construct, Engn & Water Resources Res Ctr, Honolulu, HI USA
Heggy, Essam
Jamei, Mehdi
论文数: 0引用数: 0
h-index: 0
机构:
Univ Prince Edward Isl, Canadian Ctr Climate Change & Adaptat, St Peters, PE, Canada
Shahid Chamran Univ Ahvaz, Fac Civil Engn & Architecture, Dept Civil Engn, Ahvaz, Iran
Al Ayen Univ, Sci Res Ctr, New Era & Dev Civil Engn Res Grp, Thi Qar, IraqUniv Hawaii Manoa, Dept Civil Environm & Construct, Engn & Water Resources Res Ctr, Honolulu, HI USA
Jamei, Mehdi
Kim, Dongkyun
论文数: 0引用数: 0
h-index: 0
机构:
Hongik Univ, Dept Civil & Environm Engn, Seoul, South KoreaUniv Hawaii Manoa, Dept Civil Environm & Construct, Engn & Water Resources Res Ctr, Honolulu, HI USA
Kim, Dongkyun
Ghafouri, Hamid Reza
论文数: 0引用数: 0
h-index: 0
机构:
Shahid Chamran Univ Ahvaz, Fac Civil Engn & Architecture, Dept Civil Engn, Ahvaz, IranUniv Hawaii Manoa, Dept Civil Environm & Construct, Engn & Water Resources Res Ctr, Honolulu, HI USA
Ghafouri, Hamid Reza
Deenik, Jonathan L.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Hawaii Manoa, Dept Trop Plant & Soil Sci, Honolulu, HI USAUniv Hawaii Manoa, Dept Civil Environm & Construct, Engn & Water Resources Res Ctr, Honolulu, HI USA
机构:
Suez University,Department of Computer Science, Faculty of Computers and InformationSuez University,Department of Computer Science, Faculty of Computers and Information
Ahmed M. Elshewey
Rasha Y. Youssef
论文数: 0引用数: 0
h-index: 0
机构:
Suez University,Department of Information Systems, Faculty of Computers and InformationSuez University,Department of Computer Science, Faculty of Computers and Information
Rasha Y. Youssef
Hazem M. El-Bakry
论文数: 0引用数: 0
h-index: 0
机构:
Mansoura University,Department of Information Systems, Faculty of Computers and InformationSuez University,Department of Computer Science, Faculty of Computers and Information
Hazem M. El-Bakry
Ahmed M. Osman
论文数: 0引用数: 0
h-index: 0
机构:
Suez University,Department of Information Systems, Faculty of Computers and InformationSuez University,Department of Computer Science, Faculty of Computers and Information
机构:
China Univ Petr, State Key Lab Heavy Oil Proc, Beijing 102249, Peoples R ChinaChina Univ Petr, State Key Lab Heavy Oil Proc, Beijing 102249, Peoples R China
Li, Qing
Zhang, Mengxuan
论文数: 0引用数: 0
h-index: 0
机构:
China Univ Petr, State Key Lab Heavy Oil Proc, Beijing 102249, Peoples R ChinaChina Univ Petr, State Key Lab Heavy Oil Proc, Beijing 102249, Peoples R China
Zhang, Mengxuan
Shi, Xiaogang
论文数: 0引用数: 0
h-index: 0
机构:
China Univ Petr, State Key Lab Heavy Oil Proc, Beijing 102249, Peoples R ChinaChina Univ Petr, State Key Lab Heavy Oil Proc, Beijing 102249, Peoples R China
Shi, Xiaogang
Lan, Xingying
论文数: 0引用数: 0
h-index: 0
机构:
China Univ Petr, State Key Lab Heavy Oil Proc, Beijing 102249, Peoples R ChinaChina Univ Petr, State Key Lab Heavy Oil Proc, Beijing 102249, Peoples R China
Lan, Xingying
Guo, Xuqiang
论文数: 0引用数: 0
h-index: 0
机构:
China Univ Petr, State Key Lab Heavy Oil Proc, Beijing 102249, Peoples R ChinaChina Univ Petr, State Key Lab Heavy Oil Proc, Beijing 102249, Peoples R China
Guo, Xuqiang
Guan, Yunlong
论文数: 0引用数: 0
h-index: 0
机构:
PetroChina, Dushanzi Petrochem Co, Karamay 833699, Xinjiang, Peoples R ChinaChina Univ Petr, State Key Lab Heavy Oil Proc, Beijing 102249, Peoples R China