Task weighting based on particle filter in deep multi-task learning with a view to uncertainty and performance

被引：4

作者：

Aghajanzadeh, Emad ^{[1
,2
]}

Bahraini, Tahereh ^{[2
,3
]}

Mehrizi, Amir Hossein ^{[2
,3
]}

Yazdi, Hadi Sadoghi ^{[1
,3
]}

机构：

[1] Ferdowsi Univ Mashhad, Dept Comp Engn, Mashhad, Iran

[2] Ferdowsi Univ Mashhad, Ctr Excellence Soft Comp & Intelligent Informat Pr, Mashhad, Iran

[3] Ferdowsi Univ Mashhad, Dept Elect Engn, Mashhad, Iran

来源：

PATTERN RECOGNITION | 2023年 / 140卷

关键词：

Multi task learning; Uncertainty; Hyper -parameter tuning; Deep learning; Particle filter; Bayesian estimation;

D O I：

10.1016/j.patcog.2023.109587

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently multi-task learning (MTL) has been widely used in different applications to build more robust models by sharing knowledge across several related tasks. However, one challenge that arises is the vari-ability in the learning pace of different tasks causing the inefficiency of naively training all tasks. There-fore, it is of great importance to consider some coefficients to balance tasks in the process of learning, but, due to the large search space and the significance of setting them properly, conventional search methods such as grid or random search are no longer effective. In this paper, we propose a learning mechanism for these coefficients based on the high efficiency of the particle filter (PF) algorithm to deal with nonlinear search problems. PF considers each state of the tasks' coefficients as a particle and recur-sively converges coefficients to an optimum point. While in most previous works coefficients were evalu-ated to only increase performance, to address the recent concerns related to applying AI in real-world ap-plications, we also incorporate uncertainty alongside our method to prevent learning coefficients leading to unstable outcomes. This mechanism is independent of the models main learning process and can be easily added to every learning system without changing its training algorithm. Extensive experiments on real-world data sets demonstrate the superiority of the proposed method over the state-of-the-art meth-ods on both performance and uncertainty. We also proved the acceptable performance of the method using Cramer Rao lower bound theory.(c) 2023 Elsevier Ltd. All rights reserved.

引用

页数：13

共 50 条

[41] Deep multi-task learning for image/video distortions identification [J].

Ameur, Zoubida ;

Fezza, Sid Ahmed ;

Hamidouche, Wassim .

NEURAL COMPUTING & APPLICATIONS, 2022, 34 (24) :21607-21623

[42] Neural Demographic Prediction in Social Media with Deep Multi-view Multi-task Learning [J].

Lai, Yantong ;

Su, Yijun ;

Xue, Cong ;

Zha, Daren .

DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT II, 2021, 12682 :271-279

[43] Argumentation Mining Based on Multi-task Joint Learning [J].

Liao X. ;

Ni J. ;

Wei J. ;

Wu Y. ;

Chen G. .

Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (12) :1072-1079

[44] A Comparison of Multi-task Learning and Single-Task Learning Approaches [J].

Marquet, Thomas ;

Oswald, Elisabeth .

APPLIED CRYPTOGRAPHY AND NETWORK SECURITY WORKSHOPS, ACNS 2023 SATELLITE WORKSHOPS, ADSC 2023, AIBLOCK 2023, AIHWS 2023, AIOTS 2023, CIMSS 2023, CLOUD S&P 2023, SCI 2023, SECMT 2023, SIMLA 2023, 2023, 13907 :121-138

[45] Algorithm for Stereo Matching Based on Multi-Task Learning [J].

Wang Yufeng ;

Wang Hongwei ;

Liu Yu ;

Yang Mingquan ;

Quan Jicheng .

LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (04)

[46] Hateful Memes Detection Based on Multi-Task Learning [J].

Ma, Zhiyu ;

Yao, Shaowen ;

Wu, Liwen ;

Gao, Song ;

Zhang, Yunqi .

MATHEMATICS, 2022, 10 (23)

[47] Moisture Content Measurement of Yarn based on Deep Multi-task Learning [J].

Wu, Yizhi ;

Li, Hongyan .

PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, :68-72

[48] A multi-task deep learning based vulnerability severity prediction method [J].

Shan, Chun ;

Zhang, Ziyi ;

Zhou, Siyi .

2023 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING, CLOUDNET, 2023, :307-315

[49] A Deep Neural Networks Based on Multi-task Learning and Its Application [J].

Zhao, Mengru ;

Zhang, Yuxian ;

Qiao, Likui ;

Sun, Deyuan .

2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, :6201-6206

[50] Task Variance Regularized Multi-Task Learning [J].

Mao, Yuren ;

Wang, Zekai ;

Liu, Weiwei ;

Lin, Xuemin ;

Hu, Wenbin .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) :8615-8629

← 1 2 3 4 5 →