Synthetic pattern generation for imbalanced learning in image retrieval

被引:19
作者
Piras, Luca [1 ]
Giacinto, Giorgio [1 ]
机构
[1] Univ Cagliari, Dept Elect & Elect Engn, Cagliari, Italy
关键词
Imbalanced learning; Small sample-size; Artificial pattern injection; Image retrieval; Relevance feedback; RELEVANCE-FEEDBACK;
D O I
10.1016/j.patrec.2012.08.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays very large archives of digital images are easily produced thanks to the wide availability of digital cameras, that are often embedded into a number of portable devices. One of the ways of exploring an image archive is to search for similar images. Relevance feedback mechanisms can be employed to refine the search, as the most similar images according to a set of visual features may not contain the same semantic concepts according to the users' needs. Relevance feedback allows users to label the images returned by the system as being relevant or not. Then, this labelled set is used to learn the characteristics of relevant images. As the number of images provided to users to receive feedback is usually quite small, and relevant images typically represent a tiny fraction, it turns out that the learning problem is heavily imbalanced. in order to reduce this imbalance, this paper proposes the use of techniques aimed at artificially increasing the number of examples of the relevant class. The new examples are generated as new points in the feature space so that they are in agreement with the local distribution of the available relevant examples. The locality of the proposed approach makes it quite suited to relevance feedback techniques based on the Nearest-Neighbor (NN) paradigm. The effectiveness of the proposed approach is assessed on two image datasets and comparisons with editing techniques that eliminate redundancies in non-relevant examples are also reported. (c) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:2198 / 2205
页数:8
相关论文
共 27 条
[1]  
[Anonymous], 2008, P 16 INT C MULTIMEDI, DOI DOI 10.1145/1459359.1459577
[2]  
[Anonymous], 2003, ISOIEC1593832003
[3]  
Batista G. E., 2004, ACM SIGKDD Explor. Newslett., P20, DOI [10.1145/1007730.1007735, DOI 10.1145/1007730.1007735]
[4]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[5]  
Chatzichristofis SA, 2008, LECT NOTES COMPUT SC, V5008, P312
[6]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[7]  
Chen YQ, 2001, IEEE IMAGE PROC, P34, DOI 10.1109/ICIP.2001.958946
[8]   Image retrieval: Ideas, influences, and trends of the new age [J].
Datta, Ritendra ;
Joshi, Dhiraj ;
Li, Jia ;
Wang, James Z. .
ACM COMPUTING SURVEYS, 2008, 40 (02)
[9]  
DUIN RPW, 2004, WIC WINT S EINDH NET
[10]   Bayesian relevance feedback for content-based image retrieval [J].
Giacinto, G ;
Roli, F .
PATTERN RECOGNITION, 2004, 37 (07) :1499-1508