Multimodal Deep Learning in Semantic Image Segmentation: A Review

被引:4
|
作者
Raman, Vishal [1 ]
Kumari, Madhu [1 ]
机构
[1] NIT Hamirpur, Hamirpur, Himachal Prades, India
关键词
Multimodal learning; semantic image segmentation; deep learning; NETWORKS;
D O I
10.1145/3291064.3291067
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, there has been a lot of research in the area of semantic image segmentation, which involves breaking down an image into its discrete components, such that humans can give meaning to its contents. From the humble beginnings of image search using human-provided captions, content-based image retrieval has come a long way. Yet, areas of research and improvement are far from diminishing. In this paper we will take a look at how multi-modal approaches to semantic image segmentation are setting the new standard in image search and retrieval.
引用
收藏
页码:7 / 11
页数:5
相关论文
共 50 条
  • [41] How deep learning is empowering semantic segmentation Traditional and deep learning techniques for semantic segmentation: A comparison
    Sehar, Uroosa
    Naseem, Muhammad Luqman
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (21) : 30519 - 30544
  • [42] Exploration of Deep Learning-based Multimodal Fusion for Semantic Road Scene Segmentation
    Zhang, Yifei
    Morel, Olivier
    Blanchon, Marc
    Seulin, Ralph
    Rastgoo, Mojdeh
    Sidibe, Desire
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 336 - 343
  • [43] A Systematic Literature Review on Machine Learning and Deep Learning Methods for Semantic Segmentation
    Sohail, Ali
    Nawaz, Naeem A.
    Shah, Asghar Ali
    Rasheed, Saim
    Ilyas, Sheeba
    Ehsan, Muhammad Khurram
    IEEE Access, 2022, 10 : 134557 - 134570
  • [44] A Systematic Literature Review on Machine Learning and Deep Learning Methods for Semantic Segmentation
    Sohail, Ali
    Nawaz, Naeem A. A.
    Shah, Asghar Ali
    Rasheed, Saim
    Ilyas, Sheeba
    Ehsan, Muhammad Khurram
    IEEE ACCESS, 2022, 10 : 134557 - 134570
  • [45] Multimodal image semantic segmentation based on attention mechanism
    Zhang Ji-you
    Zhang Rong-fen
    Liu Yu-hong
    Yuan Wen-hao
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (07) : 975 - 984
  • [46] A SEGMENTATION BASED ROBUST DEEP LEARNING FRAMEWORK FOR MULTIMODAL RETINAL IMAGE REGISTRATION
    Wang, Yiqian
    Zhang, Junkang
    An, Cheolhong
    Cavichini, Melina
    Jhingan, Mahima
    Amador-Patarroyo, Manuel J.
    Long, Christopher P.
    Bartsch, Dirk-Uwe G.
    Freeman, William R.
    Nguyen, Truong Q.
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1369 - 1373
  • [47] Deep Multimodal Fusion Network for Semantic Segmentation Using Remote Sensing Image and LiDAR Data
    Sun, Yangjie
    Fu, Zhongliang
    Sun, Chuanxia
    Hu, Yinglei
    Zhang, Shengyuan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [48] Review of Real-Time Semantic Segmentation Algorithms for Deep Learning
    He, Jiafeng
    Chen, Hongwei
    Luo, Dehan
    Computer Engineering and Applications, 2023, 59 (08) : 13 - 27
  • [49] A Review of Deep Learning-Based Semantic Segmentation for Point Cloud
    Zhang, Jiaying
    Zhao, Xiaoli
    Chen, Zheng
    Lu, Zhejun
    IEEE ACCESS, 2019, 7 : 179118 - 179133
  • [50] A review of deep learning methods for semantic segmentation of remote sensing imagery
    Yuan, Xiaohui
    Shi, Jianfang
    Gu, Lichuan
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 169