Survey on Multi-Output Learning

被引:164
作者
Xu, Donna [1 ]
Shi, Yaxin [1 ]
Tsang, Ivor W. [1 ]
Ong, Yew-Soon [2 ]
Gong, Chen [3 ]
Shen, Xiaobo [3 ]
机构
[1] Univ Technol Sydney, Fac Engn & Informat Technol, Ctr Artificial Intelligence, Ultimo, NSW 2007, Australia
[2] Nanyang Technol Univ, Data Sci & Artificial Intelligence Res Ctr, Sch Comp Sci & Engn, Singapore 639798, Singapore
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
关键词
Task analysis; Data models; Crowdsourcing; Supervised learning; Machine learning; Social networking (online); Tools; extreme classification; label distribution; multi-output learning; output label representation; structured output prediction; NEURAL-NETWORK; CONCEPT DRIFT; IMAGE; SEGMENTATION; RECOGNITION; CLASSIFICATION; CONSISTENCY; PREDICTION; RANKING; MODEL;
D O I
10.1109/TNNLS.2019.2945133
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The aim of multi-output learning is to simultaneously predict multiple outputs given an input. It is an important learning problem for decision-making since making decisions in the real world often involves multiple complex factors and criteria. In recent times, an increasing number of research studies have focused on ways to predict multiple outputs at once. Such efforts have transpired in different forms according to the particular multi-output learning problem under study. Classic cases of multi-output learning include multi-label learning, multi-dimensional learning, multi-target regression, and others. From our survey of the topic, we were struck by a lack in studies that generalize the different forms of multi-output learning into a common framework. This article fills that gap with a comprehensive review and analysis of the multi-output learning paradigm. In particular, we characterize the four Vs of multi-output learning, i.e., volume, velocity, variety, and veracity, and the ways in which the four Vs both benefit and bring challenges to multi-output learning by taking inspiration from big data. We analyze the life cycle of output labeling, present the main mathematical definitions of multi-output learning, and examine the field's key challenges and corresponding solutions as found in the literature. Several model evaluation metrics and popular data repositories are also discussed. Last but not least, we highlight some emerging challenges with multi-output learning from the perspective of the four Vs as potential research directions worthy of further studies.
引用
收藏
页码:2409 / 2429
页数:21
相关论文
共 309 条
[1]  
Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911
[2]   An Efficient Over-sampling Approach Based on Mean Square Error Back-propagation for Dealing with the Multi-class Imbalance Problem [J].
Alejo, R. ;
Garcia, V. ;
Pacheco-Sanchez, J. H. .
NEURAL PROCESSING LETTERS, 2015, 42 (03) :603-617
[3]   Hierarchical Image Segmentation Using Correlation Clustering [J].
Alush, Amir ;
Goldberger, Jacob .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (06) :1358-1367
[4]  
Anava O, 2015, PR MACH LEARN RES, V37, P2191
[5]  
[Anonymous], 2008, P ECAI MIN SOC DAT W
[6]  
[Anonymous], IEEE T PATTERN ANAL
[7]  
[Anonymous], 2009, P INT C NEUR INF PRO
[8]  
[Anonymous], 2011, Advances in Neural Information Processing Systems
[9]  
[Anonymous], 2013, P 26 INT C NEURAL IN
[10]  
[Anonymous], 2005, P MACH TRANSL SUMM 1