Task weighting based on particle filter in deep multi-task learning with a view to uncertainty and performance

被引：3

作者：

Aghajanzadeh, Emad ^{[1
,2
]}

Bahraini, Tahereh ^{[2
,3
]}

Mehrizi, Amir Hossein ^{[2
,3
]}

Yazdi, Hadi Sadoghi ^{[1
,3
]}

机构：

[1] Ferdowsi Univ Mashhad, Dept Comp Engn, Mashhad, Iran

[2] Ferdowsi Univ Mashhad, Ctr Excellence Soft Comp & Intelligent Informat Pr, Mashhad, Iran

[3] Ferdowsi Univ Mashhad, Dept Elect Engn, Mashhad, Iran

来源：

PATTERN RECOGNITION | 2023年 / 140卷

关键词：

Multi task learning; Uncertainty; Hyper -parameter tuning; Deep learning; Particle filter; Bayesian estimation;

D O I：

10.1016/j.patcog.2023.109587

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently multi-task learning (MTL) has been widely used in different applications to build more robust models by sharing knowledge across several related tasks. However, one challenge that arises is the vari-ability in the learning pace of different tasks causing the inefficiency of naively training all tasks. There-fore, it is of great importance to consider some coefficients to balance tasks in the process of learning, but, due to the large search space and the significance of setting them properly, conventional search methods such as grid or random search are no longer effective. In this paper, we propose a learning mechanism for these coefficients based on the high efficiency of the particle filter (PF) algorithm to deal with nonlinear search problems. PF considers each state of the tasks' coefficients as a particle and recur-sively converges coefficients to an optimum point. While in most previous works coefficients were evalu-ated to only increase performance, to address the recent concerns related to applying AI in real-world ap-plications, we also incorporate uncertainty alongside our method to prevent learning coefficients leading to unstable outcomes. This mechanism is independent of the models main learning process and can be easily added to every learning system without changing its training algorithm. Extensive experiments on real-world data sets demonstrate the superiority of the proposed method over the state-of-the-art meth-ods on both performance and uncertainty. We also proved the acceptable performance of the method using Cramer Rao lower bound theory.(c) 2023 Elsevier Ltd. All rights reserved.

引用

页数：13

共 50 条

[21] A Comparison of Loss Weighting Strategies for Multi task Learning in Deep Neural Networks
Gong, Ting
Lee, Tyler
Stephenson, Cory
Renduchintala, Venkata
Padhy, Suchismita
Ndirango, Anthony
Keskin, Gokce
Elibol, Oguz H.
[J]. IEEE ACCESS, 2019, 7 : 141627 - 141632
[22] Multi-task Deep Reinforcement Learning for Scalable Parallel Task Scheduling
Zhang, Lingxin
Qi, Qi
Wang, Jingyu
Sun, Haifeng
Liao, Jianxin
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2992 - 3001
[23] UMT-Net: A Uniform Multi-Task Network With Adaptive Task Weighting
Chen, Sihan
Zheng, Lianqing
Huang, Libo
Bai, Jie
Zhu, Xichan
Ma, Zhixiong
[J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 2304 - 2317
[24] Deep Model Based Transfer and Multi-Task Learning for Biological Image Analysis
Zhang, Wenlu
Li, Rongjian
Zeng, Tao
Sun, Qian
Kumar, Sudhir
Ye, Jieping
Ji, Shuiwang
[J]. KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 1475 - 1484
[25] Deep Model Based Transfer and Multi-Task Learning for Biological Image Analysis
Zhang, Wenlu
Li, Rongjian
Zeng, Tao
Sun, Qian
Kumar, Sudhir
Ye, Jieping
Ji, Shuiwang
[J]. IEEE TRANSACTIONS ON BIG DATA, 2020, 6 (02) : 322 - 333
[26] Vanishing Point Detection and Rail Segmentation Based on Deep Multi-Task Learning
Li, Xingxin
Zhu, Liqiang
Yu, Zujun
Guo, Baoqing
Wan, Yanqin
[J]. IEEE ACCESS, 2020, 8 : 163015 - 163025
[27] EEG-Based Motor Imagery Classification with Deep Multi-Task Learning
Song, Yaguang
Wang, Danli
Yue, Kang
Zheng, Nan
Shen, Zuo-Jun Max
[J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[28] Dermoscopic attributes classification using deep learning and multi-task learning
Saitov, Irek
Polevaya, Tatyana
Filchenkov, Andrey
[J]. 9TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE, YSC2020, 2020, 178 : 328 - 336
[29] Multi-task deep learning for multi-parameter elastic inversion
Li, Duo
Jiang, Peng
Yang, Senlin
Zhang, Fengkai
[J]. ACTA GEOPHYSICA, 2025, : 2443 - 2460
[30] Ask the GRU: Multi-task Learning for Deep Text Recommendations
Bansal, Trapit
Belanger, David
McCallum, Andrew
[J]. PROCEEDINGS OF THE 10TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'16), 2016, : 107 - 114

← 1 2 3 4 5 →