ConvMTL: Multi-task Learning via Self-supervised Learning for Simultaneous Dense Predictions

被引：0

作者：

Iyer, Vijayasri ^{[1
]}

Thangavel, Senthil Kumar ^{[1
]}

Nalluri, Madhusudana Rao ^{[2
]}

Chang, Maiga ^{[3
]}

机构：

[1] Amrita Vishwa Vidyapeetham, Amrita Sch Comp, Dept Comp Sci & Engn, Coimbatore 641112, Tamil Nadu, India

[2] Amrita Vishwa Vidyapeetham, Amrita Sch Comp, Dept Comp Sci & Engn, Amaravati 522503, India

[3] Athabasca Univ, Sch Comp Informat & Syst, Athabasca, AB, Canada

来源：

COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I | 2024年 / 2009卷

关键词：

Multi-task Learning; Transfer Learning; Deep Learning; Computer Vision; Autonomous Driving;

D O I：

10.1007/978-3-031-58181-6_38

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Perception systems in autonomous vehicles are required to perform multiple scene-understanding tasks under tight constraints of latency and power. Single-task neural networks can become unscalable when the number of tasks increases in the perception stack. Multi-task learning has been shown to improve parameter efficiency and enable models to learn more generalizable task representations compared to single-task neural networks. This work explores a novel convolutional multi-task neural network architecture that simultaneously performs two dense prediction tasks, semantic segmentation and depth estimation. A self-supervised ResNet-50 backbone is used as the basis of the proposed network, along with a multi-scale feature fusion module and a dense decoder. The model uses a simple weighted loss function with an informed search algorithm identifying the optimal parameters. The performance of the proposed model on the segmentation task is assessed using the mean Intersection of Union (mIoU) and pixel accuracy. In contrast, absolute and relative errors assess the depth estimation task. The obtained results for segmentation and depth estimation are mIoU of 73.81%, pixel accuracy of 93.52%, an absolute error of 0.130, and a relative error of 29.05. The model's performance is comparable to existing multitask algorithms on the Cityscapes dataset, using only 2975 training samples.

引用

页码：455 / 466

页数：12

共 50 条

[1] Multi-task Self-Supervised Visual Learning
Doersch, Carl
Zisserman, Andrew
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2070 - 2079
[2] A Multi-Task Dense Network with Self-Supervised Learning for Retinal Vessel Segmentation
Tu, Zhonghao
Zhou, Qian
Zou, Hua
Zhang, Xuedong
ELECTRONICS, 2022, 11 (21)
[3] Anomaly Detection in Video via Self-Supervised and Multi-Task Learning
Georgescu, Mariana-Iuliana
Barbalau, Antonio
Ionescu, Radu Tudor
Khan, Fahad Shahbaz
Popescu, Marius
Shah, Mubarak
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12737 - 12747
[4] Multi-task Semantic Matching with Self-supervised Learning
Chen Y.
Qiu X.
Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2022, 58 (01): : 83 - 90
[5] Multi-task Self-Supervised Adaptation for Reinforcement Learning
Wu, Keyu
Chen, Zhenghua
Wu, Min
Xiang, Shili
Jin, Ruibing
Zhang, Le
Li, Xiaoli
2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 15 - 20
[6] Multi-Task Self-Supervised Learning for Disfluency Detection
Wang, Shaolei
Che, Wanxiang
Liu, Qi
Qin, Pengda
Liu, Ting
Wang, William Yang
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9193 - 9200
[7] Self-supervised multi-task learning for medical image analysis
Yu, Huihui
Dai, Qun
PATTERN RECOGNITION, 2024, 150
[8] MULTI-TASK SELF-SUPERVISED LEARNING FOR ROBUST SPEECH RECOGNITION
Ravanelli, Mirco
Zhong, Jianyuan
Pascual, Santiago
Swietojanski, Pawel
Monteiro, Joao
Trmal, Jan
Bengio, Yoshua
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6989 - 6993
[9] A MULTI-TASK SELF-SUPERVISED LEARNING FRAMEWORK FOR SCOPY IMAGES
Li, Yuexiang
Chen, Jiawei
Zheng, Yefeng
2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2020), 2020, : 2005 - 2009
[10] Multi-task self-supervised learning for human activity detection
Saeed, Aaqib
Ozcelebi, Tanir
Lukkien, Johan
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 2019, 3 (02)

← 1 2 3 4 5 →