Towards a methodology to search for near-optimal representations in classification problems

被引：0

作者：

del Valle, M ^{[1
]}

Sánchez, B

Lago-Fernández, LF

Corbacho, FJ

机构：

[1] Univ Autonoma Madrid, Escuela Politecn Super, E-28049 Madrid, Spain

[2] Telefon Invest & Desarrollo, Madrid 28043, Spain

[3] Cognodata Consulting, Madrid 28010, Spain

来源：

ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING APPLICATIONS: A BIOINSPIRED APPROACH, PT 2, PROCEEDINGS | 2005年 / 3562卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper provides a first step towards a methodology that allows the search for near-optimal representations in classification problems by combining feature transformations from an initial family of basis functions. The original representation for the problem data may not be the most appropriate, and therefore it might be necessary to search for a new representation space that is closer to the structure of the problem to be solved. The outcome of this search is critical for the successful solution of the problem. For instance, if the objective function has certain global statistical properties, such as periodicity, it will be hard for methods based on local pattern information to capture the underlying structure and, hence, afford generalization capabilities. Conversely, once this optimal representation is found, most of the problems may be solved by a linear method. Hence, the key is to find the proper representation. As a proof of concept we present a particular problem where the class distributions have a very intricate overlap on the space of original attributes. For this problem, the proposed algorithm finds a representation based on the trigonometric basis that provides a solution where some of the classical learning methods, e.g. multilayer perceptrons and decision trees, fail. The methodology is composed by a discrete search within the space of basis functions and a linear mapping performed by a Fisher discriminant. We play special emphasis on the first part. Finding the optimal combination of basis functions is a difficult problem because of its non-gradient nature and the large number of possible combinations. We rely on the global search capabilities of a genetic algorithm to scan the space of function compositions.

引用

页码：291 / 299

页数：9

共 9 条

[1]

Bishop C. M., 1996, Neural networks for pattern recognition

[2]

DUA RO, 2001, PATTERN CLASSIFICATI, P84

[3] Wrappers for feature subset selection [J].

Kohavi, R ;

John, GH .

ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :273-324

[4]

Lago-Fernández LF, 2002, LECT NOTES COMPUT SC, V2415, P631

[5]

LEVINE D, 1996, TRANL9518

[6] LEARNING AND GENERALIZATION CHARACTERISTICS OF THE RANDOM VECTOR FUNCTIONAL-LINK NET [J].

PAO, YH ;

PARK, GH ;

SOBAJIC, DJ .

NEUROCOMPUTING, 1994, 6 (02) :163-180

[7]

QUINLAN JR, 1992, C4 5 PROGRAM MACHINE

[8] Evolution of functional link networks [J].

Sierra, A ;

Macías, JA ;

Corbacho, F .

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2001, 5 (01) :54-65

[9]

Vapnik V., 1998, STAT LEARNING THEORY, V1, P2

← 1 →