Simultaneously Learning Semantic Segmentation and Depth Estimation from Omnidirectional Image

被引:0
作者
Yokota A. [1 ]
Li S. [1 ]
Kamio T. [1 ]
Kosaku T. [1 ]
机构
[1] Graduate School of Information Sciences, Hiroshima City University, 3-4-1, Ozuka-higashi, Asaminami-ku, Hiroshima
关键词
depth estimation; multi-task learning; omnidirectional image; semantic segmentation;
D O I
10.1541/ieejeiss.144.560
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-task learning, the goal is to improve the generalization performance of the model by exploiting the information shared across tasks. In this paper, we propose a neural network that simultaneously learns depth estimation and semantic segmentation of the environment from omnidirectional images captured by an omnidirectional camera. Our proposed neural network is developed by modifying UniFuse network, which was originally developed for depth estimation from omnidirectional images, to simultaneously learn depth estimation and semantic segmentation of the environment by exploiting the features shared between depth estimation and semantic segmentation tasks. In the experiments, the proposed method was evaluated with the well-known Stanford 2D3D Dataset. High accuracy for the two tasks was not obtained with a single network. However, if either of the two tasks was prioritized in learning, the synergistic effect of the two tasks with shared feature maps would improve accuracy, resulting in better results than a single-task network. It showed the effectiveness of simultaneously learning semantic segmentation and depth estimation from omnidirectional images. © 2024 The Institute of Electrical Engineers of Japan.
引用
收藏
页码:560 / 567
页数:7
相关论文
共 50 条
[21]   Depth-Guided Texture Diffusion for Image Semantic Segmentation [J].
Sun, Wei ;
Li, Yuan ;
Ye, Qixiang ;
Jiao, Jianbin ;
Zhou, Yanzhao .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) :1287-1302
[22]   Efficient Depth Fusion Transformer for Aerial Image Semantic Segmentation [J].
Yan, Li ;
Huang, Jianming ;
Xie, Hong ;
Wei, Pengcheng ;
Gao, Zhao .
REMOTE SENSING, 2022, 14 (05)
[23]   Practical Depth Estimation with Image Segmentation and Serial U-Nets [J].
Cantrell, Kyle J. ;
Miller, Craig D. ;
Morato, Carlos W. .
PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON VEHICLE TECHNOLOGY AND INTELLIGENT TRANSPORT SYSTEMS (VEHITS), 2020, :406-414
[24]   Transformer framework for depth-assisted UDA semantic segmentation [J].
Song, Yunna ;
Shi, Jinlong ;
Zou, Danping ;
Liu, Caisheng ;
Bai, Suqin ;
Shu, Xin ;
Qian, Qian ;
Xu, Dan ;
Yuan, Yu ;
Sun, Yunhan .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
[25]   Semantic Reconstruction based on RGB Image and Sparse Depth [J].
Cai, Yu ;
Ding, Yinzhang ;
Li, Dongxiao ;
Zhang, Ming .
TWELFTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2020), 2021, 11720
[26]   Semantic image segmentation via dynamic curriculum learning [J].
Zhang, Xiang ;
Zhao, Wanqing ;
Wang, Chenji ;
Luo, Hangzai ;
Zhong, Sheng ;
Tang, Lei ;
Peng, Jinye ;
Fan, Jianping .
APPLIED INTELLIGENCE, 2025, 55 (12)
[27]   Semantic image segmentation network based on deep learning [J].
Chen, Bo ;
Zhang, Jiahao ;
Zhou, Jianbang ;
Chen, Zhong ;
Yang, Tian ;
Zhang, Yanna .
MIPPR 2019: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2020, 11429
[28]   Medical image semantic segmentation based on deep learning [J].
Jiang, Feng ;
Grigorev, Aleksei ;
Rho, Seungmin ;
Tian, Zhihong ;
Fu, YunSheng ;
Jifara, Worku ;
Adil, Khan ;
Liu, Shaohui .
NEURAL COMPUTING & APPLICATIONS, 2018, 29 (05) :1257-1265
[29]   SOSD-Net: Joint semantic object segmentation and depth estimation from monocular images [J].
He, Lei ;
Lu, Jiwen ;
Wang, Guanghui ;
Song, Shiyu ;
Zhou, Jie .
NEUROCOMPUTING, 2021, 440 (440) :251-263
[30]   Depth Estimation Based on Semantic Guidance for Light Field Image [J].
Deng Huiping ;
Sheng Zhichao ;
Xiang Sen ;
Wu Jing .
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (08) :2940-2948