Multi-label video classification via coupling attentional multiple instance learning with label relation graph *

被引:12
作者
Li, Xuewei [1 ]
Wu, Hongjun [1 ]
Li, Mengzhu [1 ]
Liu, Hongzhe [1 ]
机构
[1] Beijing Union Univ, Beijing Key Lab Informat Serv Engn, Beijing 100101, Peoples R China
关键词
Multi-label video classification; Multiple instance learning; Attentional feature learning; Label relation graph;
D O I
10.1016/j.patrec.2022.01.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label video classification is a challenging problem in pattern recognition field, as it is difficult to grasp the occurring localizations of a huge number of labels in videos. To solve this problem, we propose a general framework named MALL-CNN, i.e., Multi-Attention Label Relation Learning Convolutional Neural Network. MALL-CNN not only builds the correspondences between labels and videos by an attention mechanism, but also captures label co-occurrence by a graph learning approach. Specifically, we introduce multiple instance learning to composite a set of frame-level features into a video-level feature. Then, video-level feature is mapped into the content-aware category representations in an improved attentional manner. Further, these representations are enhanced by a series of label relation graphs, which transform global label relationships to the label relationships of current video. With the three processes, frame feature aggregation, video feature mapping, and label relationship construction can be achieved in MALL-CNN for multi-label video classification. Extensive experiments on real-world scene benchmark Youtube-8M verify that MALL-CNN with only frame feature surpasses the state of the arts with multi modal features as well as ensemble models.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:53 / 59
页数:7
相关论文
共 38 条
[21]   Seeing Is Believing: Video Classification for Computed Tomographic Colonography Using Multiple-Instance Learning [J].
Wang, Shijun ;
McKenna, Matthew T. ;
Nguyen, Tan B. ;
Burns, Joseph E. ;
Petrick, Nicholas ;
Sahiner, Berkman ;
Summers, Ronald M. .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2012, 31 (05) :1141-1153
[22]   Multiple instance learning combined with label invariant synthetic data for guiding systematic prostate biopsy: a feasibility study [J].
Golara Javadi ;
Samareh Samadi ;
Sharareh Bayat ;
Mehran Pesteie ;
Mohammad H. Jafari ;
Samira Sojoudi ;
Claudia Kesch ;
Antonio Hurtado ;
Silvia Chang ;
Parvin Mousavi ;
Peter Black ;
Purang Abolmaesumi .
International Journal of Computer Assisted Radiology and Surgery, 2020, 15 :1023-1031
[23]   Multiple instance learning combined with label invariant synthetic data for guiding systematic prostate biopsy: a feasibility study [J].
Javadi, Golara ;
Samadi, Samareh ;
Bayat, Sharareh ;
Pesteie, Mehran ;
Jafari, Mohammad H. ;
Sojoudi, Samira ;
Kesch, Claudia ;
Hurtado, Antonio ;
Chang, Silvia ;
Mousavi, Parvin ;
Black, Peter ;
Abolmaesumi, Purang .
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2020, 15 (06) :1023-1031
[24]   A multi-resolution model for histopathology image classification and localization with multiple instance learning [J].
Li, Jiayun ;
Li, Wenyuan ;
Sisk, Anthony ;
Ye, Huihui ;
Wallace, W. Dean ;
Speier, William ;
Arnold, Corey W. .
COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 131
[25]   Multi-scale relational graph convolutional network for multiple instance learning in histopathology images [J].
Bazargani, Roozbeh ;
Fazli, Ladan ;
Gleave, Martin ;
Goldenberg, Larry ;
Bashashati, Ali ;
Salcudean, Septimiu .
MEDICAL IMAGE ANALYSIS, 2024, 96
[26]   Event recognition in personal photo collections via multiple instance learning-based classification of multiple images [J].
Ahmad, Kashif ;
Conci, Nicola ;
Boato, Giulia ;
De Natale, Francesco G. B. .
JOURNAL OF ELECTRONIC IMAGING, 2017, 26 (06)
[27]   BREAST TUMOR IMAGE CLASSIFICATION IN BRIGHT CHALLENGE VIA MULTIPLE INSTANCE LEARNING AND DEEP TRANSFORMERS [J].
Zhan, Yangen ;
Bian, Hao ;
Chen, Yang ;
Li, Xiu ;
Zhang, Yongbing .
2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING CHALLENGES (IEEE ISBI 2022), 2022,
[28]   MULTI-SCALE BLOCKS BASED IMAGE EMOTION CLASSIFICATION USING MULTIPLE INSTANCE LEARNING [J].
Rao, Tianrong ;
Xu, Min ;
Liu, Huiying ;
Wang, Jinqiao ;
Burnett, Ian .
2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, :634-638
[29]   Classification and quantification of glomerular spike-like projections via deep residual multiple instance learning with multi-scale annotation [J].
Chen, Yilin ;
Liu, Xueyu ;
Hao, Fang ;
Zheng, Wen ;
Zhou, Xiaoshuang ;
Li, Ming ;
Wu, Yongfei ;
Wang, Chen .
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) :76529-76549
[30]   Multi-Scale Task Multiple Instance Learning for the Classification of Digital Pathology Images with Global Annotations [J].
Marini, Niccolo ;
Otalora, Sebastian ;
Ciompi, Francesco ;
Silvello, Gianmaria ;
Marchesin, Stefano ;
Vatrano, Simona ;
Buttafuoco, Genziana ;
Atzori, Manfredo ;
Mueller, Henning .
MICCAI WORKSHOP ON COMPUTATIONAL PATHOLOGY, VOL 156, 2021, 156 :170-+