Domain Adaptation with Good Edit Similarities: a Sparse Way to deal with Scaling and Rotation Problems in Image Classification

Cited by: 1
Authors
Habrard, Amaury [1]
Peyrache, Jean-Philippe [2]
Sebban, Marc [2]
Affiliations
[1] Univ Aix Marseille, Lab Informat Fondamentale, CNRS, UMR 6166, F-13453 Marseille 13, France
[2] Univ Saint Etienne, Lab Hubert Curien, CNRS, UMR 5516, F-13453 Marseille 13, France
Source
2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011) | 2011
Keywords
Domain Adaptation; Edit Distance; Sparse Learning;
DOI
10.1109/ICTAI.2011.35
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In many real-life applications, the available source training information is either too small or not representative enough of the underlying target test problem. In the past few years, a new line of machine learning research called Domain Adaptation (DA) has been developed to overcome such situations, giving rise to many adaptation algorithms and theoretical results in the form of generalization bounds. In this paper, a novel contribution is proposed in the form of a DA algorithm dealing with string-structured data, inspired by the DA support vector machine (SVM) technique introduced in [Bruzzone et al., PAMI 2010]. To ensure the convergence of SVM-based learning, the similarity functions involved in the process must be valid kernels, i.e. positive semi-definite (PSD) and symmetric. However, in the string-based context considered in this paper, this condition is often not satisfied. Indeed, it has been proven that most string similarity functions based on the edit distance are not PSD. To overcome this drawback, we make use of the theory of learning with good similarity functions introduced by Balcan et al., which (i) does not require a valid kernel to learn well and (ii) allows us to induce sparser models. We take advantage of this theoretical framework to propose a new DA algorithm using good edit similarity functions. Using a suitable string representation of handwritten digits, we show that our new algorithm deals very effectively with the scaling and rotation problems usually encountered in image classification.
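As a rough illustration of the ingredients the abstract names, the following sketch computes a unit-cost edit distance, turns it into a bounded similarity, and represents a string by its similarities to a set of landmark strings. In the framework of Balcan et al. a linear classifier would then be learned on such features, with sparsity coming from keeping few landmarks. Everything here is an assumption made for illustration: the function names, the unit edit costs, the particular transform `2 * exp(-d) - 1`, and the example strings are not taken from this record, and the paper's actual similarity and algorithm may differ.

```python
import math


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance with unit costs (illustrative choice)."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def edit_similarity(a: str, b: str) -> float:
    """Map the edit distance to a similarity in (-1, 1].

    The transform is an assumption; nothing requires it to be PSD.
    """
    return 2.0 * math.exp(-edit_distance(a, b)) - 1.0


def landmark_features(x: str, landmarks: list) -> list:
    """Represent x by its similarity to each landmark string."""
    return [edit_similarity(x, l) for l in landmarks]


# Hypothetical strings standing in for string-encoded digit contours.
landmarks = ["0012", "7765", "3344"]
features = landmark_features("0013", landmarks)
```

Because the classifier works on explicit similarity features rather than on a kernel matrix, the similarity does not have to be positive semi-definite, which is the property the abstract relies on.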
Pages: 181-188
Number of pages: 8
Related papers
19 in total
[1] [Anonymous], P NIPS 2007
[2] [Anonymous], 2006, Advances in neural information processing systems
[3] [Anonymous], P 12 ANN C COMP LEAR
[4] Balcan M., 2008, COLT, P287
[5] Ben-David, Shai; Blitzer, John; Crammer, Koby; Kulesza, Alex; Pereira, Fernando; Vaughan, Jennifer Wortman. A theory of learning from different domains [J]. MACHINE LEARNING, 2010, 79 (1-2): 151-175
[6] Blitzer J., 2006, Advances in Neural Information Processing Systems, V19, P137
[7] Blitzer J., 2006, P C EMPIRICAL METHOD, P120, DOI 10.3115/1610075.1610094
[8] Bruzzone, Lorenzo; Marconcini, Mattia. Domain Adaptation Problems: A DASVM Classification Technique and a Circular Validation Strategy [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (05): 770-787
[9] Cortes C, 2004, J MACH LEARN RES, V5, P1035
[10] Daume H, 2007, P 45 ANN M ASS COMP, V45, P256