Adaptive Semi-Supervised Feature Selection for Cross-Modal Retrieval

Cited by: 124

Authors:

Yu, En [1]

Sun, Jiande [1]

Li, Jing [2,3]

Chang, Xiaojun [4]

Han, Xian-Hua [5]

Hauptmann, Alexander G. [6]

Affiliations:

[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250014, Shandong, Peoples R China

[2] Shandong Management Univ, Sch Mech & Elect Engn, Jinan 250014, Shandong, Peoples R China

[3] Shandong Normal Univ, Jinan 250014, Shandong, Peoples R China

[4] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia

[5] Yamaguchi Univ, Grad Sch Sci & Technol Innovat, Yamaguchi 7538511, Japan

[6] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2019 / Vol. 21 / Issue 05

Keywords:

Semi-supervised; cross-modal retrieval; feature selection; representation

DOI:

10.1109/TMM.2018.2877127

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

In order to exploit the abundant information in unlabeled data and to help analyze the correlation among heterogeneous data, we propose a semi-supervised model named adaptive semi-supervised feature selection for cross-modal retrieval. First, we use semantic regression to strengthen the neighboring relationship between data with the same semantics, and the correlation between heterogeneous data is optimized by preserving pairwise closeness when learning the common latent space. Second, we adopt a graph-based constraint to predict accurate labels for unlabeled data; it also keeps the geometric structure consistent between the label space and the feature space of heterogeneous data in the common latent space. Finally, an efficient joint optimization algorithm is proposed to update the mapping matrices and the label matrix for unlabeled data simultaneously and iteratively. It pushes samples from different classes far apart while keeping samples from the same class as close as possible. Meanwhile, the ℓ2,1-norm constraint is used for feature selection and outlier reduction when the mapping matrices are learned. In addition, we propose learning different mapping matrices for different sub-tasks to emphasize the semantic and structural information of the query data. Experimental results on three datasets demonstrate that our method performs better than state-of-the-art methods.
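
The abstract's use of the ℓ2,1-norm for feature selection can be illustrated with a minimal sketch. It assumes the common row-wise convention (the ℓ2,1-norm is the sum of the Euclidean norms of the rows of a mapping matrix) and assumes each row corresponds to one input feature; this record does not give the paper's exact formulation. The matrix `W` and the helper names `l21_norm` and `rank_features` are hypothetical and chosen only for illustration.

```python
import math

def l21_norm(W):
    """Sum of the Euclidean norms of the rows of W (given as a list of rows)."""
    return sum(math.sqrt(sum(v * v for v in row)) for row in W)

def rank_features(W):
    """Return row (feature) indices sorted by descending Euclidean row norm."""
    norms = [math.sqrt(sum(v * v for v in row)) for row in W]
    return sorted(range(len(W)), key=lambda i: norms[i], reverse=True)

# Hypothetical mapping matrix: 4 input features, 2 latent dimensions.
# The values are illustrative only and do not come from the paper.
W = [
    [3.0, 4.0],  # row norm 5
    [0.0, 0.0],  # row norm 0: this feature contributes nothing
    [1.0, 0.0],  # row norm 1
    [0.0, 2.0],  # row norm 2
]

print(l21_norm(W))       # 8.0
print(rank_features(W))  # [0, 3, 2, 1]
```

Under this convention, penalizing the ℓ2,1-norm tends to drive whole rows toward zero, so features whose rows keep a large norm can be read as the selected ones.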

Citation

Pages: 1276-1288

Page count: 13

