Image classification with multi-view multi-instance metric learning

被引:17
作者
Tang, Jingjing [1 ,2 ]
Li, Dewei [3 ]
Tian, Yingjie [3 ,4 ,5 ]
机构
[1] Southwestern Univ Finance & Econ, Fac Business Adm, Sch Business Adm, Chengdu 611130, Peoples R China
[2] Southwestern Univ Finance & Econ, Inst Big Data, Chengdu 611130, Peoples R China
[3] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China
[4] Univ Chinese Acad Sci, Sch Econ & Management, Beijing 100190, Peoples R China
[5] Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China
关键词
Metric learning; Multi-view learning; Multi-instance learning; Image classification; FACE-RECOGNITION; BAG; FRAMEWORK; FEATURES; MACHINE;
D O I
10.1016/j.eswa.2021.116117
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image classification is a critical and meaningful task in image retrieval, recognition and object detection. In this paper, three-side efforts are taken to accomplish this task. First, visual features with multi-instance representation are extracted to characterize the image due to the merits of bag-of-words representations. And a new distance function is designed for bags, which measures the relationship between bags more precisely. Second, the idea of multi-view learning is implemented since multiple views encourage the classifier to be more consistent and accurate. Last but not least, the metric learning technique is explored by optimizing the joint conditional probability to pursue view-dependent metrics and the importance weights of the newly designed distance in multi-view scenario. Therefore, we propose a multi-view multi-instance metric learning method named MVMIML for image classification, which integrates the advantages of the multi-view multi instance representation and metric learning into a unified framework. To solve MVMIML, we adopt the alternate iteration optimization algorithm and analyze the corresponding computational complexity. Numerical experiments verify the advantages of the new distance function and the effectiveness of MVMIML.
引用
收藏
页数:13
相关论文
共 56 条
  • [1] Ahonen T, 2004, LECT NOTES COMPUT SC, V3021, P469
  • [2] Akaho S., 2006, A kernel method for canonical correlation analysis, P263
  • [3] [Anonymous], 2010, Omnipress
  • [4] SURF: Speeded up robust features
    Bay, Herbert
    Tuytelaars, Tinne
    Van Gool, Luc
    [J]. COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 : 404 - 417
  • [5] Bosch A, 2007, IEEE I CONF COMP VIS, P1863
  • [6] An ensemble approach to multi-view multi-instance learning
    Cano, Alberto
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 136 : 46 - 57
  • [7] Multiple instance learning with bag dissimilarities
    Cheplygina, Veronika
    Tax, David M. J.
    Loog, Marco
    [J]. PATTERN RECOGNITION, 2015, 48 (01) : 264 - 275
  • [8] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [9] Davis J.V., 2007, INT C MACHINE LEARNI, P209
  • [10] Solving the multiple instance problem with axis-parallel rectangles
    Dietterich, TG
    Lathrop, RH
    LozanoPerez, T
    [J]. ARTIFICIAL INTELLIGENCE, 1997, 89 (1-2) : 31 - 71