Latent Semantic and Disentangled Attention

被引:0
|
作者
Chien, Jen-Tzung [1 ]
Huang, Yu-Han [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Inst Elect & Comp Engn, Hsinchu 30010, Taiwan
关键词
Transformers; Semantics; Bayes methods; Magnetic heads; Head; Decoding; Feature extraction; Sequential learning; Bayesian learning; disentangled representation; mask attention; transformer;
D O I
10.1109/TPAMI.2024.3432631
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequential learning using transformer has achieved state-of-the-art performance in natural language tasks and many others. The key to this success is the multi-head self attention which encodes and gathers the features from individual tokens of an input sequence. The mapping or decoding is performed to produce an output sequence via cross attention. There are threefold weaknesses by using such an attention framework. First, since the attention would mix up the features of different tokens in input and output sequences, it is likely that redundant information exists in sequence data representation. Second, the patterns of attention weights among different heads tend to be similar. The model capacity is bounded. Third, the robustness in an encoder-decoder network against the model uncertainty is disregarded. To handle these weaknesses, this paper presents a Bayesian semantic and disentangled mask attention to learn latent disentanglement in multi-head attention where the redundant features in transformer are compensated with the latent topic information. The attention weights are filtered by a mask which is optimized through semantic clustering. This attention mechanism is implemented according to Bayesian learning for clustered disentanglement. The experiments on machine translation and speech recognition show the merit of Bayesian clustered disentanglement for mask attention.
引用
收藏
页码:10047 / 10059
页数:13
相关论文
共 50 条
  • [1] Semantic uncertainty intervals for disentangled latent spaces
    Sankaranarayanan, Swami
    Angelopoulos, Anastasios N.
    Bates, Stephen
    Romano, Yaniv
    Isola, Phillip
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [2] A Compact and Semantic Latent Space for Disentangled and Controllable Image Editing
    Lesne, Gwilherm
    Gousseau, Yann
    Ladjal, Said
    Newson, Alasdair
    20TH ACM SIGGRAPH EUROPEAN CONFERENCE ON VISUAL MEDIA PRODUCTION, CVMP 2023, 2023,
  • [3] Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations
    Michieli, Umberto
    Zanuttigh, Pietro
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1114 - 1124
  • [4] β-CLVAE: a semantic disentangled generative model
    Cheng, Keyang
    Meng, Chunyun
    Ma, Guojian
    Zhan, Yongzhao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 8517 - 8532
  • [5] Semantic Segmentation of Aerial Imagery via Split-Attention Networks with Disentangled Nonlocal and Edge Supervision
    Zhang, Cheng
    Jiang, Wanshou
    Zhao, Qing
    REMOTE SENSING, 2021, 13 (06)
  • [6] Treatment Effect Estimation with Disentangled Latent Factors
    Zhang, Weijia
    Liu, Lin
    Li, Jiuyong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10923 - 10930
  • [7] Learning Debiased and Disentangled Representations for Semantic Segmentation
    Chu, Sanghyeok
    Kim, Dongwan
    Han, Bohyung
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [8] Features Disentangled Semantic Broadcast Communication Networks
    Ma, Shuai
    Zhang, Zhi
    Wu, Youlong
    Li, Hang
    Shi, Guangming
    Gao, Dahua
    Shi, Yuanming
    Li, Shiyin
    Al-Dhahir, Naofal
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (06) : 6580 - 6594
  • [9] Learning Disentangled Semantic Representation for Domain Adaptation
    Cai, Ruichu
    Li, Zijian
    Wei, Pengfei
    Qiao, Jie
    Zhang, Kun
    Hao, Zhifeng
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2060 - 2066
  • [10] Applying Visual Attention Computational Model and Latent Semantic Indexing to Image Retrieval
    Liu, Wei
    Xu, Weidong
    Li, Lihua
    Wang, Weiwei
    ICIEA: 2009 4TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-6, 2009, : 2658 - 2662