All Together Now! The Benefits of Adaptively Fusing Pre-trained Deep Representations

Cited by: 1
Authors
Resheff, Yehezkel [1 ]
Lieder, Itay [2 ]
Hope, Tom [2 ]
Affiliations
[1] Intuit Tech Futures, Petah Tiqwa, Israel
[2] Intel Adv Analyt, Haifa, Israel
Source
ICPRAM: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS | 2019
Keywords
Deep Learning; Fusion
DOI
10.5220/0007367301350144
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory]
Discipline Codes
081104; 0812; 0835; 1405
Abstract
Pre-trained deep neural networks, powerful models trained on large datasets, have become a popular tool in computer vision for transfer learning. However, the standard approach of using a single network potentially misses out on valuable information contained in other readily available models. In this work, we study the Mixture of Experts (MoE) approach for adaptively fusing multiple pre-trained models for each individual input image. In particular, we explore how far we can get by combining diverse pre-trained representations in a customized way that maximizes their potential in a lightweight framework. Our approach is motivated by an empirical study of the predictions made by popular pre-trained nets across various datasets, finding that both performance and agreement between models vary across datasets. We further propose a miniature CNN gating mechanism operating on a thumbnail version of the input image, and show this is enough to guide a good fusion. Finally, we explore a multi-modal blend of visual and natural-language representations, using a label-space embedding to inject pre-trained word-vectors. Across multiple datasets, we demonstrate that an adaptive fusion of pre-trained models can obtain favorable results.
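To make the fusion scheme concrete, the following is a minimal PyTorch-style sketch of the idea the abstract describes: several frozen pre-trained backbones act as experts, a miniature CNN gate computes per-image softmax weights from a downsampled thumbnail of the input, and the weighted combination of projected features feeds a lightweight classifier head. The choice of ResNet-18 and MobileNetV2 as experts, the gate architecture, and all dimensions are illustrative assumptions (requiring torchvision >= 0.13), not the authors' exact configuration.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F
    from torchvision import models

    class ThumbnailGate(nn.Module):
        """Miniature CNN that predicts mixture weights over the experts
        from a low-resolution thumbnail of the input image."""
        def __init__(self, num_experts, thumb_size=32):
            super().__init__()
            self.thumb_size = thumb_size
            self.net = nn.Sequential(
                nn.Conv2d(3, 16, kernel_size=3, padding=1), nn.ReLU(),
                nn.MaxPool2d(2),
                nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                nn.Linear(32, num_experts),
            )

        def forward(self, x):
            # Downsample the full image to a thumbnail before gating.
            thumb = F.interpolate(x, size=(self.thumb_size, self.thumb_size),
                                  mode="bilinear", align_corners=False)
            return F.softmax(self.net(thumb), dim=-1)   # (batch, num_experts)

    class AdaptiveFusion(nn.Module):
        """Per-image convex combination of frozen pre-trained features,
        followed by a lightweight classification head."""
        def __init__(self, num_classes, shared_dim=256):
            super().__init__()
            # Two frozen "experts"; any set of pre-trained nets would do.
            resnet = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
            resnet.fc = nn.Identity()            # exposes 512-d features
            mobile = models.mobilenet_v2(weights=models.MobileNet_V2_Weights.DEFAULT)
            mobile.classifier = nn.Identity()    # exposes 1280-d features
            self.experts = nn.ModuleList([resnet, mobile])
            for p in self.experts.parameters():
                p.requires_grad_(False)          # only gate/projections/head train
            # Project each expert into a shared space so they can be mixed.
            self.projections = nn.ModuleList(
                [nn.Linear(512, shared_dim), nn.Linear(1280, shared_dim)])
            self.gate = ThumbnailGate(num_experts=len(self.experts))
            self.head = nn.Linear(shared_dim, num_classes)

        def forward(self, x):
            weights = self.gate(x)               # (batch, num_experts)
            feats = torch.stack([proj(exp(x)) for exp, proj in
                                 zip(self.experts, self.projections)],
                                dim=1)           # (batch, experts, shared_dim)
            fused = (weights.unsqueeze(-1) * feats).sum(dim=1)
            return self.head(fused)              # (batch, num_classes)

    model = AdaptiveFusion(num_classes=10).eval()
    with torch.no_grad():
        logits = model(torch.randn(4, 3, 224, 224))   # -> shape (4, 10)

Note how cheap the gating path is: the gate never sees the full-resolution image, so its cost is negligible next to the expert backbones, matching the paper's claim that a thumbnail suffices to guide a good fusion.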
Pages: 135-144 (10 pages)
Related Papers
50 records in total
  • [1] Deep Fusing Pre-trained Models into Neural Machine Translation
    Weng, Rongxiang
    Yu, Heng
    Luo, Weihua
    Zhang, Min
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11468 - 11476
  • [2] Diffused Redundancy in Pre-trained Representations
    Nanda, Vedant
    Speicher, Till
    Dickerson, John P.
    Gummadi, Krishna P.
    Feizi, Soheil
    Weller, Adrian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [3] Pre-trained Affective Word Representations
    Chawla, Kushal
    Khosla, Sopan
    Chhaya, Niyati
    Jaidka, Kokil
    2019 8TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2019
  • [4] Learning to Select Pre-trained Deep Representations with Bayesian Evidence Framework
    Kim, Yong-Deok
    Jang, Taewoong
    Han, Bohyung
    Choi, Seungjin
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5318 - 5326
  • [5] On the Language Neutrality of Pre-trained Multilingual Representations
    Libovicky, Jindrich
    Rosa, Rudolf
    Fraser, Alexander
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1663 - 1674
  • [6] Imparting Fairness to Pre-Trained Biased Representations
    Sadeghi, Bashir
    Boddeti, Vishnu Naresh
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 75 - 82
  • [7] Pre-trained molecular representations enable antimicrobial discovery
    Olayo-Alarcon, Roberto
    Amstalden, Martin K.
    Zannoni, Annamaria
    Bajramovic, Medina
    Sharma, Cynthia M.
    Brochado, Ana Rita
    Rezaei, Mina
    Müller, Christian L.
    NATURE COMMUNICATIONS, 2025, 16 (1)
  • [8] Pre-trained Language Model Representations for Language Generation
    Edunov, Sergey
    Baevski, Alexei
    Auli, Michael
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 4052 - 4059
  • [9] Assessing Multilingual Fairness in Pre-trained Multimodal Representations
    Wang, Jialu
    Liu, Yang
    Wang, Xin Eric
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 2681 - 2695
  • [10] Inverse Problems Leveraging Pre-trained Contrastive Representations
    Ravula, Sriram
    Smyrnis, Georgios
    Jordan, Matt
    Dimakis, Alexandros G.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34