Feature selection for software effort estimation with localized neighborhood mutual information

被引:11
|
作者
Liu, Qin [1 ]
Xiao, Jiakai [2 ]
Zhu, Hongming [1 ]
机构
[1] Tongji Univ, Sch Software Engn, Shanghai 201804, Peoples R China
[2] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2019年 / 22卷 / Suppl 3期
关键词
Feature selection; Case based reasoning; Neighborhood mutual information; Software effort estimation;
D O I
10.1007/s10586-018-1884-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is usually employed before applying case based reasoning (CBR) for Software Effort Estimation (SEE). Unfortunately, most feature selection methods treat CBR as a black box method so there is no guarantee on the appropriateness of CBR on selected feature subset. The key to solve the problem is to measure the appropriateness of CBR assumption for a given feature set. In this paper, a measure called localized neighborhood mutual information (LNI) is proposed for this purpose and a greedy method called LNI based feature selection (LFS) is designed for feature selection. Experiment with leave-one-out cross validation (LOOCV) on 6 benchmark datasets demonstrates that: (1) CBR makes effective estimation with the LFS selected subset compared with a randomized baseline method. Compared with three representative feature selection methods, (2) LFS achieves optimal MAR value on 3 out of 6 datasets with a 14% average improvement and (3) LFS achieves optimal MMRE on 5 out of 6 datasets with a 24% average improvement.
引用
收藏
页码:S6953 / S6961
页数:9
相关论文
共 50 条
  • [1] Feature selection for software effort estimation with localized neighborhood mutual information
    Qin Liu
    Jiakai Xiao
    Hongming Zhu
    Cluster Computing, 2019, 22 : 6953 - 6961
  • [2] Multi-label feature selection based on neighborhood mutual information
    Lin, Yaojin
    Hu, Qinghua
    Liu, Jinghua
    Chen, Jinkun
    Duan, Jie
    APPLIED SOFT COMPUTING, 2016, 38 : 244 - 256
  • [3] A study of mutual information based feature selection for case based reasoning in software cost estimation
    Li, Y. F.
    Xie, M.
    Go, T. N.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 5921 - 5931
  • [4] Mutual information for feature selection: estimation or counting?
    Nguyen H.B.
    Xue B.
    Andreae P.
    Evolutionary Intelligence, 2016, 9 (3) : 95 - 110
  • [5] Partial label feature selection via label disambiguation and neighborhood mutual information
    Ding, Jinfei
    Qian, Wenbin
    Li, Yihui
    Yang, Wenji
    Huang, Jintao
    INFORMATION SCIENCES, 2024, 680
  • [6] FEATURE SELECTION BASED ON STATISTICAL ESTIMATION OF MUTUAL INFORMATION
    Kozhevin, A. A.
    SIBERIAN ELECTRONIC MATHEMATICAL REPORTS-SIBIRSKIE ELEKTRONNYE MATEMATICHESKIE IZVESTIYA, 2021, 18 : 720 - 728
  • [7] Threshold based Neighborhood Selection for Case-Based Reasoning in Software Effort Estimation
    Liu, Qin
    Xiao, Jiakai
    Zhu, Hongming
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS, ELECTRONICS AND CONTROL (ICCSEC), 2017, : 258 - 262
  • [8] Software Development Effort Estimation Using Feature Selection Techniques
    Hosni, Mohamed
    Idri, Ali
    NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_18), 2018, 303 : 439 - 452
  • [9] Feature Selection via Label Enhancement and Weighted Neighborhood Mutual Information for Multilabel Data
    Sun, Lin
    Guo, Jiaqi
    Wu, Xuejiao
    Xu, Jiucheng
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT II, ICIC 2024, 2024, 14876 : 470 - 480
  • [10] A Mutual Information-Based Hybrid Feature Selection Method for Software Cost Estimation Using Feature Clustering
    Liu, Qin
    Shi, Shihai
    Zhu, Hongming
    Xiao, Jiakai
    2014 IEEE 38TH ANNUAL INTERNATIONAL COMPUTERS, SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2014, : 27 - 32