Feature selection for software effort estimation with localized neighborhood mutual information

被引:11
作者
Liu, Qin [1 ]
Xiao, Jiakai [2 ]
Zhu, Hongming [1 ]
机构
[1] Tongji Univ, Sch Software Engn, Shanghai 201804, Peoples R China
[2] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2019年 / 22卷 / Suppl 3期
关键词
Feature selection; Case based reasoning; Neighborhood mutual information; Software effort estimation;
D O I
10.1007/s10586-018-1884-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is usually employed before applying case based reasoning (CBR) for Software Effort Estimation (SEE). Unfortunately, most feature selection methods treat CBR as a black box method so there is no guarantee on the appropriateness of CBR on selected feature subset. The key to solve the problem is to measure the appropriateness of CBR assumption for a given feature set. In this paper, a measure called localized neighborhood mutual information (LNI) is proposed for this purpose and a greedy method called LNI based feature selection (LFS) is designed for feature selection. Experiment with leave-one-out cross validation (LOOCV) on 6 benchmark datasets demonstrates that: (1) CBR makes effective estimation with the LFS selected subset compared with a randomized baseline method. Compared with three representative feature selection methods, (2) LFS achieves optimal MAR value on 3 out of 6 datasets with a 14% average improvement and (3) LFS achieves optimal MMRE on 5 out of 6 datasets with a 24% average improvement.
引用
收藏
页码:S6953 / S6961
页数:9
相关论文
共 50 条
[41]   Fast binary feature selection with conditional mutual information [J].
Fleuret, F .
JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 :1531-1555
[42]   Feature selection based on mutual information with correlation coefficient [J].
Hongfang Zhou ;
Xiqian Wang ;
Rourou Zhu .
Applied Intelligence, 2022, 52 :5457-5474
[43]   Feature selection using mutual information in CT colonography [J].
Ong, Ju Lynn ;
Seghouane, Abd-Krim .
PATTERN RECOGNITION LETTERS, 2011, 32 (02) :337-341
[44]   An Improved Feature Selection for Categorization Based on Mutual Information [J].
Liu, Haifeng ;
Su, Zhan ;
Yao, Zeqing ;
Liu, Shousheng .
WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, 5854 :80-87
[45]   Effective feature selection scheme using mutual information [J].
Huang, D ;
Chow, TWS .
NEUROCOMPUTING, 2005, 63 :325-343
[46]   Feature selection using Joint Mutual Information Maximisation [J].
Bennasar, Mohamed ;
Hicks, Yulia ;
Setchi, Rossitza .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) :8520-8532
[47]   Mutual information-based feature selection for radiomics [J].
Oubel, Estanislao ;
Beaumont, Hubert ;
Iannessi, Antoine .
MEDICAL IMAGING 2016: PACS AND IMAGING INFORMATICS: NEXT GENERATION AND INNOVATIONS, 2016, 9789
[48]   Using Mutual Information for Feature Selection in Programmatic Advertising [J].
Ciesielczyk, Michal .
2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2017, :290-295
[49]   A review of feature selection methods based on mutual information [J].
Vergara, Jorge R. ;
Estevez, Pablo A. .
NEURAL COMPUTING & APPLICATIONS, 2014, 24 (01) :175-186
[50]   Mutual Information Based Feature Selection for Fingerprint Identification [J].
Adjimi, Ahlem ;
Hacine-Gharbi, Abdenour ;
Ravier, Philippe ;
Mostefai, Messaoud .
INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2019, 43 (02) :187-198