Latent Semantic and Disentangled Attention

Cited by: 0

Authors:

Chien, Jen-Tzung [1]

Huang, Yu-Han [1]

Affiliations:

[1] Natl Yang Ming Chiao Tung Univ, Inst Elect & Comp Engn, Hsinchu 30010, Taiwan

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2024 / Vol. 46 / No. 12

Keywords:

Transformers; Semantics; Bayes methods; Magnetic heads; Head; Decoding; Feature extraction; Sequential learning; Bayesian learning; disentangled representation; mask attention; transformer;

DOI:

10.1109/TPAMI.2024.3432631

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Sequential learning with transformers has achieved state-of-the-art performance in natural language tasks and many others. The key to this success is multi-head self-attention, which encodes and gathers features from the individual tokens of an input sequence. Mapping or decoding is then performed via cross-attention to produce an output sequence. This attention framework has three weaknesses. First, because attention mixes the features of different tokens in the input and output sequences, the sequence representation is likely to contain redundant information. Second, the patterns of attention weights tend to be similar across heads, so the model capacity is bounded. Third, the robustness of the encoder-decoder network against model uncertainty is disregarded. To address these weaknesses, this paper presents a Bayesian semantic and disentangled mask attention that learns latent disentanglement in multi-head attention, where the redundant features in the transformer are compensated with latent topic information. The attention weights are filtered by a mask that is optimized through semantic clustering. This attention mechanism is implemented according to Bayesian learning for clustered disentanglement. Experiments on machine translation and speech recognition show the merit of Bayesian clustered disentanglement for mask attention.


Pages: 10047 - 10059

Page count: 13

50 items in total

[1] Semantic uncertainty intervals for disentangled latent spaces
Sankaranarayanan, Swami
Angelopoulos, Anastasios N.
Bates, Stephen
Romano, Yaniv
Isola, Phillip
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[2] A Compact and Semantic Latent Space for Disentangled and Controllable Image Editing
Lesne, Gwilherm
Gousseau, Yann
Ladjal, Said
Newson, Alasdair
20TH ACM SIGGRAPH EUROPEAN CONFERENCE ON VISUAL MEDIA PRODUCTION, CVMP 2023, 2023,
[3] Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations
Michieli, Umberto
Zanuttigh, Pietro
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1114 - 1124
[4] β-CLVAE: a semantic disentangled generative model
Cheng, Keyang
Meng, Chunyun
Ma, Guojian
Zhan, Yongzhao
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 8517 - 8532
[5] Semantic Segmentation of Aerial Imagery via Split-Attention Networks with Disentangled Nonlocal and Edge Supervision
Zhang, Cheng
Jiang, Wanshou
Zhao, Qing
REMOTE SENSING, 2021, 13 (06)
[6] Treatment Effect Estimation with Disentangled Latent Factors
Zhang, Weijia
Liu, Lin
Li, Jiuyong
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10923 - 10930
[7] Learning Debiased and Disentangled Representations for Semantic Segmentation
Chu, Sanghyeok
Kim, Dongwan
Han, Bohyung
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[8] Features Disentangled Semantic Broadcast Communication Networks
Ma, Shuai
Zhang, Zhi
Wu, Youlong
Li, Hang
Shi, Guangming
Gao, Dahua
Shi, Yuanming
Li, Shiyin
Al-Dhahir, Naofal
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (06) : 6580 - 6594
[9] Learning Disentangled Semantic Representation for Domain Adaptation
Cai, Ruichu
Li, Zijian
Wei, Pengfei
Qiao, Jie
Zhang, Kun
Hao, Zhifeng
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2060 - 2066
[10] Applying Visual Attention Computational Model and Latent Semantic Indexing to Image Retrieval
Liu, Wei
Xu, Weidong
Li, Lihua
Wang, Weiwei
ICIEA: 2009 4TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-6, 2009, : 2658 - 2662
