Collaboration based multi-modal multi-label learning

被引:0
作者
Yi Zhang
Yinlong Zhu
Zhecheng Zhang
Chongjung Wang
机构
[1] Nanjing University,Department of Computer Science and Technology, State Key Laboratory for Novel Software Technology
来源
Applied Intelligence | 2022年 / 52卷
关键词
Multi-modal; Multi-label; Collaboration; Label correlations;
D O I
暂无
中图分类号
学科分类号
摘要
Complex objects can be represented as multiple modal features and associated with multiple labels. The major challenge of complex object classification is how to jointly utilize heterogeneous modals in a mutually beneficial way. Besides, how to effectively utilize label correlations is also a challenging issue. Previous methods model the label correlations by requiring that any two label-specific classifiers behave similarly on the same modal if the associated labels are similar. To address the above challenges, we propose a novel modal-oriented deep learning framework named Collaboration based Multi-modal Multi-label Learning (CoM3L). With the help of memory structure in LSTM, CoM3L handles modalities sequentially, which predicts next modal to be extracted and learns label correlations simultaneously. On the one hand, CoM3L can extract the most useful modal sequence, which extracts different modal sequences for different instances. On the other hand, for each label, CoM3L combines the collaboration between its own prediction and the prediction of other labels. Extensive experiments on 5 multi-modal multi-label datasets validate the effectiveness of the proposed CoM3L approach.
引用
收藏
页码:14204 / 14217
页数:13
相关论文
共 54 条
[1]  
Boutell MR(2004)Learning multi-label scene classification Pattern Recognition 37 1757-1771
[2]  
Luo J(2008)Multilabel classification via calibrated label ranking Mach Learn 73 133-153
[3]  
Shen X(2015)A tutorial on multilabel learning ACM Computing Surveys (CSUR) 47 52-275
[4]  
Brown CM(2021)Multi-Label Classification Review and Opportunities J Netw Intell 6 255-1780
[5]  
Fürnkranz J(1997)Long short-term memory Neural Comput 9 1735-889
[6]  
Hüllermeier E(2017)Joint feature selection and classification for multilabel learning IEEE Trans Cybern 48 876-364
[7]  
Mencía EL(2018)Exploiting feature and class relationships in video categorization with regularized deep neural networks IEEE Trans Pattern Anal Mach Intell 40 352-367
[8]  
Brinker K(2010)Learning to detect a salient object IEEE Trans Pattern Anal Mach Intell 33 353-766
[9]  
Gibaja E(2011)Classifier chains for multi-label classification Mach Learn 85 333-2048
[10]  
Ventura S(2010)Harvesting image databases from the web IEEE Trans Pattern Anal Mach Intell 33 754-1837