Web Multimedia Object Classification Using Cross-Domain Correlation Knowledge

Cited by: 17

Authors:

Lu, Wenting [1]

Li, Jingxuan [2]

Li, Tao [2]

Guo, Weidong [1]

Zhang, Honggang [3]

Guo, Jun [3]

Affiliations:

[1] Capital Univ Econ & Business, Beijing 100070, Peoples R China

[2] Florida Int Univ, Miami, FL 33199 USA

[3] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2013 / Vol. 15 / Issue 08

Funding:

National Natural Science Foundation of China;

Keywords:

Bag-of-Visual-Phrases Model; Correlation Knowledge; Cross-Domain; Multimedia Object Classification; Transfer Learning;

DOI:

10.1109/TMM.2013.2280895

Chinese Library Classification (CLC):

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Given a collection of web images with the corresponding textual descriptions, in this paper, we propose a novel cross-domain learning method to classify these web multimedia objects by transferring the correlation knowledge among different information sources. Here, the knowledge is extracted from unlabeled objects through unsupervised learning and applied to perform supervised classification tasks. To mine more meaningful correlation knowledge, instead of using commonly used visual words in the traditional bag-of-visual-words (BoW) model, we discover higher level visual components (words and phrases) to incorporate the spatial and semantic information into our image representation model, i.e., bag-of-visual-phrases (BoP). By combining the enriched visual components with the textual words, we calculate the frequently co-occurring pairs among them to construct a cross-domain correlated graph in which the correlation knowledge is mined. After that, we investigate two different strategies to apply such knowledge to enrich the feature space where the supervised classification is performed. By transferring such knowledge, our cross-domain transfer learning method can not only handle large scale web multimedia objects, but also deal with the situation that the textual descriptions of a small portion of web images are missing. Empirical experiments on two different datasets of web multimedia objects are conducted to demonstrate the efficacy and effectiveness of our proposed cross-domain transfer learning method.
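The co-occurrence step mentioned in the abstract can be pictured with a minimal sketch. This is not the authors' implementation: the function name `cooccurrence_graph`, the token names, and the `min_count` threshold are hypothetical, and the sketch assumes each object has already been reduced to a list of visual components (words or phrases) and a list of textual words.

```python
from collections import Counter
from itertools import product


def cooccurrence_graph(objects, min_count=2):
    """Build a weighted cross-domain edge set from co-occurrence counts.

    objects: list of (visual_tokens, text_tokens) pairs, one per web object.
    Returns a dict mapping (visual_token, text_token) to the number of
    objects in which both appear, keeping pairs seen at least min_count times.
    """
    counts = Counter()
    for visual, text in objects:
        # Count each cross-domain pair at most once per object.
        for v, t in product(set(visual), set(text)):
            counts[(v, t)] += 1
    return {pair: c for pair, c in counts.items() if c >= min_count}


# Toy data: "vw_*" stands for a visual word, "vp_*" for a visual phrase.
objects = [
    (["vp_3", "vw_17"], ["beach", "sea"]),
    (["vp_3", "vw_40"], ["beach", "sand"]),
    (["vw_17"], ["sea"]),
]
graph = cooccurrence_graph(objects, min_count=2)
```

Only pairs that co-occur in at least `min_count` objects survive as edges; how the paper actually weights or thresholds its correlated graph is not stated in the abstract.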

Citation

Pages: 1920-1929

Page count: 10
