Multi-modal Representation Learning for Social Post Location Inference

被引:0
|
作者
Dai, RuiTing [1 ]
Luo, Jiayi [1 ]
Luo, Xucheng [1 ]
Mo, Lisi [1 ]
Ma, Wanlun [2 ]
Zhou, Fan [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu 610054, Peoples R China
[2] Swinburne Univ Technol, Melbourne, Vic, Australia
来源
ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS | 2023年
关键词
Social geographic location; multi-modal social post dataset; multi-modal representation learning; multi-head attention mechanism; PREDICTION;
D O I
10.1109/ICC45041.2023.10279649
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Inferring geographic locations via social posts is essential for many practical location-based applications such as product marketing, point-of-interest recommendation, and infector tracking for COVID-19. Unlike image-based location retrieval or social-post text embedding-based location inference, the combined effect of multi-modal information (i.e., post images, text, and hashtags) for social post positioning receives less attention. In this work, we collect real datasets of social posts with images, texts, and hashtags from Instagram and propose a novel Multi-modal Representation Learning Framework (MRLF) capable of fusing different modalities of social posts for location inference. MRLF integrates a multi-head attention mechanism to enhance location-salient information extraction while significantly improving location inference compared with single domain-based methods. To overcome the noisy user-generated textual content, we introduce a novel attention-based character-aware module that considers the relative dependencies between characters of social post texts and hashtags for flexible multimodel information fusion. The experimental results show that MRLF can make accurate location predictions and open a new door to understanding the multi-modal data of social posts for online inference tasks.
引用
收藏
页码:6331 / 6336
页数:6
相关论文
共 50 条
  • [21] Multi-modal multi-step wind power forecasting based on stacking deep learning model
    Xing, Zhikai
    He, Yigang
    RENEWABLE ENERGY, 2023, 215
  • [22] Multi-modal classification of neurodegenerative disease by progressive graph-based transductive learning
    Wang, Zhengxia
    Zhu, Xiaofeng
    Adeli, Ehsan
    Zhu, Yingying
    Nie, Feiping
    Munsell, Brent
    Wu, Guorong
    MEDICAL IMAGE ANALYSIS, 2017, 39 : 218 - 230
  • [23] Multi-Modal Deep Learning Diagnosis of Parkinson's Disease-A Systematic Review
    Skaramagkas, Vasileios
    Pentari, Anastasia
    Kefalopoulou, Zinovia
    Tsiknakis, Manolis
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 2399 - 2423
  • [24] Advances and prospects of multi-modal ophthalmic artificial intelligence based on deep learning: a review
    Wang, Shaopan
    He, Xin
    Jian, Zhongquan
    Li, Jie
    Xu, Changsheng
    Chen, Yuguang
    Liu, Yuwen
    Chen, Han
    Huang, Caihong
    Hu, Jiaoyue
    Liu, Zuguo
    EYE AND VISION, 2024, 11 (01)
  • [25] Machine Learning of Multi-Modal Tumor Imaging Reveals Trajectories of Response to Precision Treatment
    Mansouri, Nesrin
    Balvay, Daniel
    Zenteno, Omar
    Facchin, Caterina
    Yoganathan, Thulaciga
    Viel, Thomas
    Herraiz, Joaquin Lopez
    Tavitian, Bertrand
    Perez-Liva, Mailyn
    CANCERS, 2023, 15 (06)
  • [26] Multi-modal deep learning enables efficient and accurate annotation of enzymatic active sites
    Wang, Xiaorui
    Yin, Xiaodan
    Jiang, Dejun
    Zhao, Huifeng
    Wu, Zhenxing
    Zhang, Odin
    Wang, Jike
    Li, Yuquan
    Deng, Yafeng
    Liu, Huanxiang
    Luo, Pei
    Han, Yuqiang
    Hou, Tingjun
    Yao, Xiaojun
    Hsieh, Chang-Yu
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [27] Multi-modal discriminative dictionary learning for Alzheimer's disease and mild cognitive impairment
    Li, Qing
    Wu, Xia
    Xu, Lele
    Chen, Kewei
    Yao, Li
    Li, Rui
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2017, 150 : 1 - 8
  • [28] A joint multi-modal learning method for early-stage knee osteoarthritis disease classification
    Liu, Liangliang
    Chang, Jing
    Zhang, Pei
    Ma, Qingzhi
    Zhang, Hui
    Sun, Tong
    Qiao, Hongbo
    HELIYON, 2023, 9 (04)
  • [29] A multi-modal deep learning solution for precise pneumonia diagnosis: the PneumoFusion-Net model
    Wang, Yujie
    Liu, Can
    Fan, Yinghan
    Niu, Chenyue
    Huang, Wanyun
    Pan, Yixuan
    Li, Jingze
    Wang, Yilin
    Li, Jun
    FRONTIERS IN PHYSIOLOGY, 2025, 16
  • [30] Multi-Modal Predictors of Cannabis Use Initiation in Adolescents
    Garavan, Hugh
    Spechler, Phil
    BIOLOGICAL PSYCHIATRY, 2018, 83 (09) : S30 - S30