Adaptive Semi-Supervised Feature Selection for Cross-Modal Retrieval

被引:124
作者
Yu, En [1 ]
Sun, Jiande [1 ]
Li, Jing [2 ,3 ]
Chang, Xiaojun [4 ]
Han, Xian-Hua [5 ]
Hauptmann, Alexander G. [6 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250014, Shandong, Peoples R China
[2] Shandong Management Univ, Sch Mech & Elect Engn, Jinan 250014, Shandong, Peoples R China
[3] Shandong Normal Univ, Jinan 250014, Shandong, Peoples R China
[4] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia
[5] Yamaguchi Univ, Grad Sch Sci & Technol Innovat, Yamaguchi 7538511, Japan
[6] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
关键词
Semi-supervised; cross-modal retrieval; feature selection; REPRESENTATION;
D O I
10.1109/TMM.2018.2877127
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Inorder to exploit the abundant potential information of the unlabeled data and contribute to analyzing the correlation among heterogeneous data, we propose the semi-supervised model named adaptive semi-supervised feature selection for cross-modal retrieval. First, we utilize the semantic regression to strengthen the neighboring relationship between the data with the same semantic. And the correlation between heterogeneous data can be optimized via keeping the pairwise closeness when learning the common latent space. Second, we adopt the graph-based constraint to predict accurate labels for unlabeled data, and it can also keep the geometric structure consistency between the label space and the feature space of heterogeneous data in the common latent space. Finally, an efficient joint optimization algorithm is proposed to update the mapping matrices and the label matrix for unlabeled data simultaneously and iteratively. It makes samples from different classes to be far apart, while the samples from same class lie as close as possible. Meanwhile, the l(2,1)-norm constraint is used for feature selection and outlier reduction when the mapping matrices are learned. In addition, we propose learning different mapping matrices corresponding to different sub-tasks to emphasize the semantic and structural information of query data. Experiment results on three datasets demonstrate that our method performs better than the state-of-the-art methods.
引用
收藏
页码:1276 / 1288
页数:13
相关论文
共 45 条
  • [1] [Anonymous], 2003, P ACM INT C MULT ACM
  • [2] [Anonymous], P 3 INT C LEARNING R
  • [3] [Anonymous], 2017, P IEEE C COMP VIS PA
  • [4] [Anonymous], 2013, P 3 ACM INT C MULT R
  • [5] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [6] Generalized Multi-View Embedding for Visual Recognition and Cross-Modal Retrieval
    Cao, Guanqun
    Iosifidis, Alexandros
    Chen, Ke
    Gabbouj, Moncef
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (09) : 2542 - 2555
  • [7] Multi-View Nonparametric Discriminant Analysis for Image Retrieval and Recognition
    Cao, Guanqun
    Iosifidis, Alexandros
    Gabbouj, Moncef
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (10) : 1537 - 1541
  • [8] Correlation Autoencoder Hashing for Supervised Cross-Modal Search
    Cao, Yue
    Long, Mingsheng
    Wang, Jianmin
    Zhu, Han
    [J]. ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 197 - 204
  • [9] Semisupervised Feature Analysis by Mining Correlations Among Multiple Tasks
    Chang, Xiaojun
    Yang, Yi
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (10) : 2294 - 2305
  • [10] Semantic Pooling for Complex Event Analysis in Untrimmed Videos
    Chang, Xiaojun
    Yu, Yao-Liang
    Yang, Yi
    Xing, Eric P.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (08) : 1617 - 1632