Task weighting based on particle filter in deep multi-task learning with a view to uncertainty and performance

被引:3
作者
Aghajanzadeh, Emad [1 ,2 ]
Bahraini, Tahereh [2 ,3 ]
Mehrizi, Amir Hossein [2 ,3 ]
Yazdi, Hadi Sadoghi [1 ,3 ]
机构
[1] Ferdowsi Univ Mashhad, Dept Comp Engn, Mashhad, Iran
[2] Ferdowsi Univ Mashhad, Ctr Excellence Soft Comp & Intelligent Informat Pr, Mashhad, Iran
[3] Ferdowsi Univ Mashhad, Dept Elect Engn, Mashhad, Iran
关键词
Multi task learning; Uncertainty; Hyper -parameter tuning; Deep learning; Particle filter; Bayesian estimation;
D O I
10.1016/j.patcog.2023.109587
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently multi-task learning (MTL) has been widely used in different applications to build more robust models by sharing knowledge across several related tasks. However, one challenge that arises is the vari-ability in the learning pace of different tasks causing the inefficiency of naively training all tasks. There-fore, it is of great importance to consider some coefficients to balance tasks in the process of learning, but, due to the large search space and the significance of setting them properly, conventional search methods such as grid or random search are no longer effective. In this paper, we propose a learning mechanism for these coefficients based on the high efficiency of the particle filter (PF) algorithm to deal with nonlinear search problems. PF considers each state of the tasks' coefficients as a particle and recur-sively converges coefficients to an optimum point. While in most previous works coefficients were evalu-ated to only increase performance, to address the recent concerns related to applying AI in real-world ap-plications, we also incorporate uncertainty alongside our method to prevent learning coefficients leading to unstable outcomes. This mechanism is independent of the models main learning process and can be easily added to every learning system without changing its training algorithm. Extensive experiments on real-world data sets demonstrate the superiority of the proposed method over the state-of-the-art meth-ods on both performance and uncertainty. We also proved the acceptable performance of the method using Cramer Rao lower bound theory.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A Comparison of Loss Weighting Strategies for Multi task Learning in Deep Neural Networks
    Gong, Ting
    Lee, Tyler
    Stephenson, Cory
    Renduchintala, Venkata
    Padhy, Suchismita
    Ndirango, Anthony
    Keskin, Gokce
    Elibol, Oguz H.
    [J]. IEEE ACCESS, 2019, 7 : 141627 - 141632
  • [22] Multi-task Deep Reinforcement Learning for Scalable Parallel Task Scheduling
    Zhang, Lingxin
    Qi, Qi
    Wang, Jingyu
    Sun, Haifeng
    Liao, Jianxin
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2992 - 3001
  • [23] UMT-Net: A Uniform Multi-Task Network With Adaptive Task Weighting
    Chen, Sihan
    Zheng, Lianqing
    Huang, Libo
    Bai, Jie
    Zhu, Xichan
    Ma, Zhixiong
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 2304 - 2317
  • [24] Deep Model Based Transfer and Multi-Task Learning for Biological Image Analysis
    Zhang, Wenlu
    Li, Rongjian
    Zeng, Tao
    Sun, Qian
    Kumar, Sudhir
    Ye, Jieping
    Ji, Shuiwang
    [J]. KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 1475 - 1484
  • [25] Deep Model Based Transfer and Multi-Task Learning for Biological Image Analysis
    Zhang, Wenlu
    Li, Rongjian
    Zeng, Tao
    Sun, Qian
    Kumar, Sudhir
    Ye, Jieping
    Ji, Shuiwang
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2020, 6 (02) : 322 - 333
  • [26] Vanishing Point Detection and Rail Segmentation Based on Deep Multi-Task Learning
    Li, Xingxin
    Zhu, Liqiang
    Yu, Zujun
    Guo, Baoqing
    Wan, Yanqin
    [J]. IEEE ACCESS, 2020, 8 : 163015 - 163025
  • [27] EEG-Based Motor Imagery Classification with Deep Multi-Task Learning
    Song, Yaguang
    Wang, Danli
    Yue, Kang
    Zheng, Nan
    Shen, Zuo-Jun Max
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [28] Dermoscopic attributes classification using deep learning and multi-task learning
    Saitov, Irek
    Polevaya, Tatyana
    Filchenkov, Andrey
    [J]. 9TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE, YSC2020, 2020, 178 : 328 - 336
  • [29] Multi-task deep learning for multi-parameter elastic inversion
    Li, Duo
    Jiang, Peng
    Yang, Senlin
    Zhang, Fengkai
    [J]. ACTA GEOPHYSICA, 2025, : 2443 - 2460
  • [30] Ask the GRU: Multi-task Learning for Deep Text Recommendations
    Bansal, Trapit
    Belanger, David
    McCallum, Andrew
    [J]. PROCEEDINGS OF THE 10TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'16), 2016, : 107 - 114