Feature Selection and Kernel Design via Linear Programming

被引：0

作者：

Fung, Glenn ^{[1
]}

Rosales, Romer ^{[1
]}

Rao, R. Bharat ^{[1
]}

机构：

[1] Siemens Med Solut, Malvern, PA USA

来源：

20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2007年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The definition of object (e. g., data point) similarity is critical to the performance of many machine learning algorithms, both in terms of accuracy and computational efficiency. However, it is often the case that a similarity function is unknown or chosen by hand. This paper introduces a formulation that given relative similarity comparisons among triples of points of the form object i is more like object j than object k, it constructs a kernel function that preserves the given relationships. Our approach is based on learning a kernel that is a combination of functions taken from a set of base functions (these could be kernels as well). The formulation is based on defining an optimization problem that can be solved using linear programming instead of a semi-definite program usually required for kernel learning. We show how to construct a convex problem from the given set of similarity comparisons and then arrive to a linear programming formulation by employing a subset of the positive definite matrices. We extend this formulation to consider representation/evaluation efficiency based on formulating a novel form of feature selection using kernels (that is not much more expensive to solve). Using publicly available data, we experimentally demonstrate how the formulation introduced in this paper shows excellent performance in practice by comparing it with a baseline method and a related state-of-the art approach, in addition of being much more efficient computationally.

引用

页码：786 / 791

页数：6

共 14 条

[1]

[Anonymous], 2003, P ADV NEUR INF PROC

[2]

ATHITSOS V, 2004, COMPUTER VISION PATT

[3]

BENNETT K, 2002, P KNOWL DISC DAT MIN

[4] OCCAM RAZOR [J].

BLUMER, A ;

EHRENFEUCHT, A ;

HAUSSLER, D ;

WARMUTH, MK .

INFORMATION PROCESSING LETTERS, 1987, 24 (06) :377-380

[5]

COHEN W. W., 1998, ADV NEURAL INFORM PR, V10

[6]

Cristianini N., 2000, Intelligent Data Analysis: An Introduction

[7]

Fung Glenn, 2004, ICML

[8]

Golub G. H., 1996, MATRIX COMPUTATIONS

[9]

GRAEPEL T, 2002, P INT C NEUR NETW IC

[10]

KilianQ Weinberger, 2004, P 21 INT C MACH LEAR, V106

← 1 2 →