A simulation study of the model evaluation criterion MMRE

被引:271
作者
Foss, T
Stensrud, E
Kitchenham, B
Myrtveit, I
机构
[1] Norwegian Sch Management B1, N-1301 Sandvika, Norway
[2] Myrtveit Strensrud ANS, N-0752 Oslo, Norway
[3] Univ Keele, Dept Comp Sci, Keele ST5 5BG, Staffs, England
关键词
mean magnitude of relative error; software metrics; simulation; regression analysis; prediction models; software cost estimation; software engineering; empirical software engineering; prediction accuracy;
D O I
10.1109/TSE.2003.1245300
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The Mean Magnitude of Relative Error, MMRE, is probably the most widely used evaluation criterion for-assessing the performance of competing software prediction models. One purpose of MMRE is to assist us to select the best model. In this paper, we have performed a simulation study demonstrating that MMRE does not always select the best model. Our findings cast some doubt on the conclusions of any study of competing software prediction models that used MMRE as a basis of model comparison. We therefore recommend not using MMRE to evaluate and compare prediction models. At present, we do not have any universal replacement for MMRE Meanwhile, we therefore recommend using a combination of theoretical justification of the models that are proposed together with other metrics proposed in this paper.
引用
收藏
页码:985 / 995
页数:11
相关论文
共 49 条
  • [1] A simulation tool for efficient analogy based cost estimation
    Angelis L.
    Stamelos I.
    [J]. Empirical Software Engineering, 2000, 5 (1) : 35 - 68
  • [2] [Anonymous], P 7 INT SOFTW METR S
  • [3] Briand L. C., 1999, Proceedings of the 1999 International Conference on Software Engineering (IEEE Cat. No.99CB37002), P303, DOI 10.1109/ICSE.1999.841021
  • [4] Briand L. C., 2000, Proceedings of the 2000 International Conference on Software Engineering. ICSE 2000 the New Millennium, P377, DOI 10.1109/ICSE.2000.870428
  • [5] A PATTERN-RECOGNITION APPROACH FOR SOFTWARE ENGINEERING DATA-ANALYSIS
    BRIAND, LC
    BASILI, VR
    THOMAS, WM
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1992, 18 (11) : 931 - 942
  • [6] COBRA: A hybrid method for software cost estimation, benchmarking, and risk assessment
    Briand, LC
    El Emam, K
    Bomarius, F
    [J]. PROCEEDINGS OF THE 1998 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, 1998, : 390 - 399
  • [7] DEVELOPING INTERPRETABLE MODELS WITH OPTIMIZED SET REDUCTION FOR IDENTIFYING HIGH-RISK SOFTWARE COMPONENTS
    BRIAND, LC
    BASILI, VR
    HETMANSKI, CJ
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1993, 19 (11) : 1028 - 1044
  • [8] Using simulation for assessing the real impact of test-coverage on defect-coverage
    Briand, LC
    Pfahl, D
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2000, 49 (01) : 60 - 70
  • [9] Briand LC, 1999, P 21 INT C SOFTW ENG, DOI DOI 10.1145/302405.302647
  • [10] BRIAND LC, IN PRESS ENCY SOFTWA