Multi-label video classification via coupling attentional multiple instance learning with label relation graph *

被引:12
|
作者
Li, Xuewei [1 ]
Wu, Hongjun [1 ]
Li, Mengzhu [1 ]
Liu, Hongzhe [1 ]
机构
[1] Beijing Union Univ, Beijing Key Lab Informat Serv Engn, Beijing 100101, Peoples R China
关键词
Multi-label video classification; Multiple instance learning; Attentional feature learning; Label relation graph;
D O I
10.1016/j.patrec.2022.01.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label video classification is a challenging problem in pattern recognition field, as it is difficult to grasp the occurring localizations of a huge number of labels in videos. To solve this problem, we propose a general framework named MALL-CNN, i.e., Multi-Attention Label Relation Learning Convolutional Neural Network. MALL-CNN not only builds the correspondences between labels and videos by an attention mechanism, but also captures label co-occurrence by a graph learning approach. Specifically, we introduce multiple instance learning to composite a set of frame-level features into a video-level feature. Then, video-level feature is mapped into the content-aware category representations in an improved attentional manner. Further, these representations are enhanced by a series of label relation graphs, which transform global label relationships to the label relationships of current video. With the three processes, frame feature aggregation, video feature mapping, and label relationship construction can be achieved in MALL-CNN for multi-label video classification. Extensive experiments on real-world scene benchmark Youtube-8M verify that MALL-CNN with only frame feature surpasses the state of the arts with multi modal features as well as ensemble models.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:53 / 59
页数:7
相关论文
共 50 条
  • [1] Instance-Aware Deep Graph Learning for Multi-Label Classification
    Wang, Yun
    Zhang, Tong
    Zhou, Chuanwei
    Cui, Zhen
    Yang, Jian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 90 - 99
  • [2] Partial Multi-Label Learning via Exploiting Instance and Label Correlations
    Liang, Weichao
    Gao, Guangliang
    Chen, Lei
    Wang, Youquan
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 19 (01)
  • [3] Learning Local Instance Constraint for Multi-label Classification
    Luo, Shang
    Wu, Xiaofeng
    Wang, Bin
    Zhang, Liming
    IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 284 - 294
  • [4] Handling Label Noise in Video Classification via Multiple Instance Learning
    Leung, Thomas
    Song, Yang
    Zhang, John
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 2056 - 2063
  • [5] Learning Video Features for Multi-label Classification
    Garg, Shivam
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 325 - 337
  • [6] Incorporating Instance Correlations in Multi-label Classification via Label-Space
    de Abreu, Iuri Bonna M.
    Mantovani, Rafael G.
    Cerri, Ricardo
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 581 - 588
  • [7] Multi-Label Classification with Label Graph Superimposing
    Wang, Ya
    He, Dongliang
    Li, Fu
    Long, Xiang
    Zhou, Zhichao
    Ma, Jinwen
    Wen, Shilei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12265 - 12272
  • [8] Multi-task multi-label multiple instance learning
    Shen, Yi
    Fan, Jian-ping
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2010, 11 (11): : 860 - 871
  • [9] Multi-task multi-label multiple instance learning
    Yi SHENJianping FANDepartment of Computer ScienceUniversity of North Carolina at Charlotte USA
    Journal of Zhejiang University-Science C(Computers & Electronics), 2010, 11 (11) : 860 - 871