Multi-label video classification via coupling attentional multiple instance learning with label relation graph *

被引:12
作者
Li, Xuewei [1 ]
Wu, Hongjun [1 ]
Li, Mengzhu [1 ]
Liu, Hongzhe [1 ]
机构
[1] Beijing Union Univ, Beijing Key Lab Informat Serv Engn, Beijing 100101, Peoples R China
关键词
Multi-label video classification; Multiple instance learning; Attentional feature learning; Label relation graph;
D O I
10.1016/j.patrec.2022.01.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label video classification is a challenging problem in pattern recognition field, as it is difficult to grasp the occurring localizations of a huge number of labels in videos. To solve this problem, we propose a general framework named MALL-CNN, i.e., Multi-Attention Label Relation Learning Convolutional Neural Network. MALL-CNN not only builds the correspondences between labels and videos by an attention mechanism, but also captures label co-occurrence by a graph learning approach. Specifically, we introduce multiple instance learning to composite a set of frame-level features into a video-level feature. Then, video-level feature is mapped into the content-aware category representations in an improved attentional manner. Further, these representations are enhanced by a series of label relation graphs, which transform global label relationships to the label relationships of current video. With the three processes, frame feature aggregation, video feature mapping, and label relationship construction can be achieved in MALL-CNN for multi-label video classification. Extensive experiments on real-world scene benchmark Youtube-8M verify that MALL-CNN with only frame feature surpasses the state of the arts with multi modal features as well as ensemble models.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:53 / 59
页数:7
相关论文
共 36 条
  • [21] Seeing Is Believing: Video Classification for Computed Tomographic Colonography Using Multiple-Instance Learning
    Wang, Shijun
    McKenna, Matthew T.
    Nguyen, Tan B.
    Burns, Joseph E.
    Petrick, Nicholas
    Sahiner, Berkman
    Summers, Ronald M.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2012, 31 (05) : 1141 - 1153
  • [22] Multiple instance learning combined with label invariant synthetic data for guiding systematic prostate biopsy: a feasibility study
    Golara Javadi
    Samareh Samadi
    Sharareh Bayat
    Mehran Pesteie
    Mohammad H. Jafari
    Samira Sojoudi
    Claudia Kesch
    Antonio Hurtado
    Silvia Chang
    Parvin Mousavi
    Peter Black
    Purang Abolmaesumi
    International Journal of Computer Assisted Radiology and Surgery, 2020, 15 : 1023 - 1031
  • [23] Multiple instance learning combined with label invariant synthetic data for guiding systematic prostate biopsy: a feasibility study
    Javadi, Golara
    Samadi, Samareh
    Bayat, Sharareh
    Pesteie, Mehran
    Jafari, Mohammad H.
    Sojoudi, Samira
    Kesch, Claudia
    Hurtado, Antonio
    Chang, Silvia
    Mousavi, Parvin
    Black, Peter
    Abolmaesumi, Purang
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2020, 15 (06) : 1023 - 1031
  • [24] A multi-resolution model for histopathology image classification and localization with multiple instance learning
    Li, Jiayun
    Li, Wenyuan
    Sisk, Anthony
    Ye, Huihui
    Wallace, W. Dean
    Speier, William
    Arnold, Corey W.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 131
  • [25] Multi-scale relational graph convolutional network for multiple instance learning in histopathology images
    Bazargani, Roozbeh
    Fazli, Ladan
    Gleave, Martin
    Goldenberg, Larry
    Bashashati, Ali
    Salcudean, Septimiu
    MEDICAL IMAGE ANALYSIS, 2024, 96
  • [26] Event recognition in personal photo collections via multiple instance learning-based classification of multiple images
    Ahmad, Kashif
    Conci, Nicola
    Boato, Giulia
    De Natale, Francesco G. B.
    JOURNAL OF ELECTRONIC IMAGING, 2017, 26 (06)
  • [27] BREAST TUMOR IMAGE CLASSIFICATION IN BRIGHT CHALLENGE VIA MULTIPLE INSTANCE LEARNING AND DEEP TRANSFORMERS
    Zhan, Yangen
    Bian, Hao
    Chen, Yang
    Li, Xiu
    Zhang, Yongbing
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING CHALLENGES (IEEE ISBI 2022), 2022,
  • [28] MULTI-SCALE BLOCKS BASED IMAGE EMOTION CLASSIFICATION USING MULTIPLE INSTANCE LEARNING
    Rao, Tianrong
    Xu, Min
    Liu, Huiying
    Wang, Jinqiao
    Burnett, Ian
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 634 - 638
  • [29] Classification and quantification of glomerular spike-like projections via deep residual multiple instance learning with multi-scale annotation
    Chen, Yilin
    Liu, Xueyu
    Hao, Fang
    Zheng, Wen
    Zhou, Xiaoshuang
    Li, Ming
    Wu, Yongfei
    Wang, Chen
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 76529 - 76549
  • [30] Multi-Scale Task Multiple Instance Learning for the Classification of Digital Pathology Images with Global Annotations
    Marini, Niccolo
    Otalora, Sebastian
    Ciompi, Francesco
    Silvello, Gianmaria
    Marchesin, Stefano
    Vatrano, Simona
    Buttafuoco, Genziana
    Atzori, Manfredo
    Mueller, Henning
    MICCAI WORKSHOP ON COMPUTATIONAL PATHOLOGY, VOL 156, 2021, 156 : 170 - +