A Multicriteria Statistical Based Comparison Methodology for Evaluating Evolutionary Algorithms

Cited by: 21
Authors
Carrano, Eduardo G. [1 ]
Wanner, Elizabeth F. [1 ]
Takahashi, Ricardo H. C. [2 ]
Affiliations
[1] Ctr Fed Educ Tecnol Minas Gerais, Dept Comp Engn, BR-30480000 Belo Horizonte, MG, Brazil
[2] Univ Fed Minas Gerais, Dept Math, BR-31270901 Belo Horizonte, MG, Brazil
Keywords
Algorithm evaluation; evolutionary algorithms; multicriteria statistical comparison; EFFICIENCY; BOOTSTRAP;
DOI
10.1109/TEVC.2010.2069567
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a statistical based comparison methodology for performing evolutionary algorithm comparison under multiple merit criteria. The analysis of each criterion is based on the progressive construction of a ranking of the algorithms under analysis, with the determination of significance levels for each ranking step. The multicriteria analysis is based on the aggregation of the different criteria rankings via a non-dominance analysis that identifies the algorithms that constitute the efficient set. In order to avoid correlation effects, a principal component analysis pre-processing is performed. Bootstrapping techniques allow the evaluation of merit criteria data with arbitrary probability distribution functions. The algorithm ranking in each criterion is built progressively, using either ANOVA or first order stochastic dominance. The resulting ranking is checked using a permutation test that detects possible inconsistencies in the ranking, leading to the execution of more algorithm runs, which refine the ranking confidence. As a by-product, the permutation test also delivers p-values for the ordering between each two algorithms that have adjacent rank positions. A comparison of the proposed method with other methodologies has been performed using reference probability distribution functions (PDFs). The proposed methodology has always reached the correct ranking with fewer samples and, in the case of non-Gaussian PDFs, the proposed methodology has worked well, while the other methods have not even been able to detect some PDF differences. The application of the proposed method is illustrated in benchmark problems.
Pages: 848-870
Number of pages: 23
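
Two of the steps named in the abstract, the permutation test between algorithms in adjacent rank positions and the non-dominance aggregation of per-criterion rankings, can be illustrated with a minimal Python sketch. This is not the authors' procedure: the abstract does not state the test statistic or the aggregation details, so the mean-difference statistic, the "lower is better" convention, and the names `permutation_p_value` and `efficient_set` are all assumptions made for illustration.

```python
import random
from statistics import mean


def permutation_p_value(a, b, n_perm=2000, seed=0):
    """One-sided permutation test (illustrative, assumed statistic).

    H0: samples a and b come from the same distribution.
    H1: mean(a) < mean(b), i.e. algorithm a is better when lower is better.
    """
    rng = random.Random(seed)
    observed = mean(b) - mean(a)
    pooled = list(a) + list(b)
    n_a = len(a)
    count = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)  # random relabeling of the pooled runs
        diff = mean(pooled[n_a:]) - mean(pooled[:n_a])
        if diff >= observed:
            count += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return (count + 1) / (n_perm + 1)


def efficient_set(ranks):
    """Return the algorithms whose rank vectors are non-dominated.

    ranks maps an algorithm name to a tuple of rank positions,
    one per criterion (assumption: a lower rank is better).
    """
    def dominates(u, v):
        no_worse = all(x <= y for x, y in zip(u, v))
        strictly_better = any(x < y for x, y in zip(u, v))
        return no_worse and strictly_better

    return sorted(
        name
        for name, r in ranks.items()
        if not any(dominates(other, r)
                   for other_name, other in ranks.items()
                   if other_name != name)
    )


if __name__ == "__main__":
    # Hypothetical merit values from repeated runs of two algorithms.
    runs_a = [1.00, 1.10, 0.90, 1.05, 0.95, 1.00, 1.02, 0.98]
    runs_b = [2.00, 2.10, 1.90, 2.05, 1.95, 2.00, 2.02, 1.98]
    print(permutation_p_value(runs_a, runs_b))

    # Hypothetical rank positions under two criteria.
    print(efficient_set({"A": (1, 2), "B": (2, 1), "C": (3, 3)}))
```

In this sketch a small p-value supports placing the first algorithm ahead of the second in the ranking, while a large one would suggest collecting more runs before fixing the order. Algorithms that rank worse than some other algorithm on every criterion are left out of the efficient set.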