Learning to Switch off, Switch on, and Integrate Modalities in Large Pre-trained Transformers

被引:0
|
作者
Duseja, Tejas [1 ]
Annervaz, K. M. [1 ]
Duggani, Jeevithiesh [1 ]
Zacharia, Shyam [2 ]
Free, Michael [3 ]
Dukkipati, Ambedkar [1 ]
机构
[1] Indian Inst Sci, Bengaluru, India
[2] British Telcom, Bengaluru, India
[3] British Telcom, London, England
来源
2024 IEEE 7TH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL, MIPR 2024 | 2024年
关键词
Multi-modal emotion recognition; sentiment analysis; pre-trained models;
D O I
10.1109/MIPR62202.2024.00070
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer models that revolutionized foundation models are ubiquitous nowadays. Hence, there has been a surge in pre-trained transformers that can be fine-tuned to perform different downstream tasks. Most pre-trained transformers are trained only on a single modality, and there is no direct way to fine-tune them in multiple modalities. To tackle this issue, in this paper, we propose a general-purpose gate, SSIM (Switch off, Switch on, and Integrate Modalities), by which one can integrate other modalities into large pre-trained language transformers. The proposed SSIM gate helps to obtain the unified representation by soft-switching between multi-modal interactions. To evaluate our approach, we have established benchmarks using pre-trained language transformers like BERT, XLNet, and T5 on multi-modal tasks such as Sentiment and Emotion analysis (CMU-MOSI, CMU-MOSEI), Emotion Recognition in Conversations (IEMOCAP, MELD), and Multimodal Intent Recognition (MIntRec), achieving close to State-of-the-art results.
引用
收藏
页码:403 / 409
页数:7
相关论文
共 33 条
  • [21] Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need
    Zhou, Da-Wei
    Cai, Zi-Wen
    Ye, Han-Jia
    Zhan, De-Chuan
    Liu, Ziwei
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (03) : 1012 - 1032
  • [22] Classification of Rice Leaf Diseases using CNN-based pre-trained models and transfer learning
    Mavaddat, Marjan
    Naderan, Marjan
    Alavi, Seyyed Enayatallah
    2023 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS, IPRIA, 2023,
  • [23] Fault diagnosis of air compressors using transfer learning: A comparative study of pre-trained networks and hyperparameter optimization
    Srivatsan, B.
    Venkatesh, S. Naveen
    Aravinth, S.
    Sugumaran, V
    Dhanraj, Joshuva Arockia
    Solomon, Jenoris Muthiya
    Vaidhyanathan, R. Muthu
    JOURNAL OF LOW FREQUENCY NOISE VIBRATION AND ACTIVE CONTROL, 2024, 43 (04) : 1877 - 1894
  • [24] Detection of driver distraction in the Australian naturalistic driving study videos using pre-trained models and transfer learning
    Elhenawy, Mohammed
    Masoud, Mahmoud
    Haworth, Narelle
    Young, Kristie
    Rakotonirainy, Andry
    Grzebieta, Raphael
    Williamson, Ann
    TRANSPORTATION RESEARCH PART F-TRAFFIC PSYCHOLOGY AND BEHAVIOUR, 2023, 97 : 31 - 43
  • [25] Domain knowledge-infused pre-trained deep learning models for efficient white blood cell classification
    P. Jeneessha
    Vinoth Kumar Balasubramanian
    Scientific Reports, 15 (1)
  • [26] Pre-trained Deep Learning Models for Chest X-Rays' Classification: Views and Age-Groups
    Farhat, Hanan
    Jabbour, Joey
    Sakr, Georges E.
    Kilany, Rima
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, INTELLISYS 2023, 2024, 823 : 71 - 82
  • [27] Hybridization of Deep Learning Pre-Trained Models with Machine Learning Classifiers and Fuzzy Min-Max Neural Network for Cervical Cancer Diagnosis
    Kalbhor, Madhura
    Shinde, Swati
    Popescu, Daniela Elena
    Hemanth, D. Jude
    DIAGNOSTICS, 2023, 13 (07)
  • [28] Weakly Supervised Deep Learning for Arabic Tweet Sentiment Analysis on Education Reforms: Leveraging Pre-Trained Models and LLMs With Snorkel
    Alotaibi, Alanoud
    Nadeem, Farrukh
    Hamdy, Mohamed
    IEEE ACCESS, 2025, 13 : 30523 - 30542
  • [29] Pre-Trained Deep Neural Network-Based Features Selection Supported Machine Learning for Rice Leaf Disease Classification
    Aggarwal, Meenakshi
    Khullar, Vikas
    Goyal, Nitin
    Singh, Aman
    Tolba, Amr
    Thompson, Ernesto Bautista
    Kumar, Sushil
    AGRICULTURE-BASEL, 2023, 13 (05):
  • [30] Unlocking language barriers: Assessing pre-trained large language models across multilingual tasks and unveiling the black box with Explainable Artificial Intelligence
    Kastrati, Muhamet
    Imran, Ali Shariq
    Hashmi, Ehtesham
    Kastrati, Zenun
    Daudpota, Sher Muhammad
    Biba, Marenglen
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 149