Multi-modal data clustering using deep learning: A systematic review

被引:0
|
作者
Raya, Sura [1 ]
Orabi, Mariam [1 ]
Afyouni, Imad [1 ]
Al Aghbari, Zaher [1 ]
机构
[1] Univ Sharjah, Coll Comp & Informat, Sharjah, U Arab Emirates
关键词
Multi-modal data; Clustering algorithms; Deep learning; Review article; FRAMEWORK; INFORMATION; TRENDS;
D O I
10.1016/j.neucom.2024.128348
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-modal clustering represents a formidable challenge in the domain of unsupervised learning. The objective of multi-modal clustering is to categorize data collected from diverse modalities, such as audio, visual, and textual sources, into distinct clusters. These clustering techniques operate by extracting shared features across modalities in an unsupervised manner, where the identified common features exhibit high correlations within real-world objects. Recognizing the importance of perceiving the correlated nature of these features is vital for enhancing clustering accuracy in multi-modal settings. This survey explores Deep Learning (DL) techniques applied to multi-modal clustering, encompassing methodologies such as Convolutional Neural Networks (CNN), Autoencoders (AE), Recurrent Neural Networks (RNN), and Graph Convolutional Networks (GCN). Notably, this survey represents the first attempt to investigate DL techniques specifically for multi-modal clustering. The survey presents a novel taxonomy for DL-based multi-modal clustering, conducts a comparative analysis of various multi-modal clustering approaches, and deliberates on the datasets employed in the evaluation process. Additionally, the survey identifies research gaps within the realm of multi-modal clustering, offering insights into potential future avenues for research.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Deep Object Tracking with Multi-modal Data
    Zhang, Xuezhi
    Yuan, Yuan
    Lu, Xiaoqiang
    2016 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (CITS), 2016, : 161 - 165
  • [22] Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach
    Rudovic, Ognjen
    Zhang, Meiru
    Schuller, Bjorn
    Picard, Rosalind W.
    ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 6 - 15
  • [23] Deep reinforcement learning for financial trading using multi-modal features
    Avramelou, Loukia
    Nousi, Paraskevi
    Passalis, Nikolaos
    Tefas, Anastasios
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [24] Multi-modal body part segmentation of infants using deep learning
    Voss, Florian
    Brechmann, Noah
    Lyra, Simon
    Rixen, Joeran
    Leonhardt, Steffen
    Antink, Christoph Hoog
    BIOMEDICAL ENGINEERING ONLINE, 2023, 22 (01)
  • [25] Combining Multi-Modal Statistics for Welfare Prediction Using Deep Learning
    Sharma, Pulkit
    Manandhar, Achut
    Thomson, Patrick
    Katuva, Jacob
    Hope, Robert
    Clifton, David A.
    SUSTAINABILITY, 2019, 11 (22)
  • [26] Multi-modal body part segmentation of infants using deep learning
    Florian Voss
    Noah Brechmann
    Simon Lyra
    Jöran Rixen
    Steffen Leonhardt
    Christoph Hoog Antink
    BioMedical Engineering OnLine, 22
  • [27] Direct Multi-Modal Inversion of Geophysical Logs Using Deep Learning
    Alyaev, Sergey
    Elsheikh, Ahmed H.
    EARTH AND SPACE SCIENCE, 2022, 9 (09)
  • [28] Multi-modal Food Recommendation Using Clustering and Self-supervised Learning
    Zhang, Yixin
    Zhou, Xin
    Meng, Qianwen
    Zhu, Fanglin
    Xu, Yonghui
    Shen, Zhiqi
    Cui, Lizhen
    PRICAI 2024: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2025, 15281 : 269 - 281
  • [29] Multi-modal deep learning from imaging genomic data for schizophrenia classification
    Kanyal, Ayush
    Mazumder, Badhan
    Calhoun, Vince D.
    Preda, Adrian
    Turner, Jessica
    Ford, Judith
    Ye, Dong Hye
    FRONTIERS IN PSYCHIATRY, 2024, 15
  • [30] Cardiovascular disease detection based on deep learning and multi-modal data fusion
    Zhu, Jiayuan
    Liu, Hui
    Liu, Xiaowei
    Chen, Chao
    Shu, Minglei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 99