HBert: A Long Text Processing Method Based on BERT and Hierarchical Attention Mechanisms

Cited by: 3
Authors
Lv, Xueqiang [1]
Liu, Zhaonan [1]
Zhao, Ying [1]
Xu, Ge [2]
You, Xindong [1]
Affiliations
[1] Beijing Information Science & Technology University, Beijing, People's Republic of China
[2] Minjiang University, Fuzhou, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
BERT; Hierarchical Attention; Long Text Processing;
DOI
10.4018/IJSWIS.322769
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the emergence of large-scale pre-trained models based on the transformer, performance on all natural language processing tasks has been pushed to a new level. However, because of the high computational complexity of the transformer's self-attention mechanism, these models handle long text poorly. To address this problem, a long text processing method named HBert, based on BERT and a hierarchical attention neural network, is proposed. First, the long text is divided into multiple sentences, whose vectors are obtained through a word encoder composed of BERT and a word attention layer. The article vector is then obtained through a sentence encoder composed of a transformer and a sentence attention layer, and is used to complete the downstream tasks. The experimental results show that the proposed HBert method achieves good results in text classification and QA tasks: the F1 value is 95.7% in longer text classification tasks and 75.2% in QA tasks, both better than the state-of-the-art model Longformer.
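
The two-level pooling described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: a hash-based `toy_token_vector` stands in for BERT, the transformer sentence encoder is omitted, and attention is reduced to a softmax over dot products with a fixed query vector. All function names, the dimension `DIM`, and the query vectors are assumptions made for illustration only.

```python
import math
import re
import zlib

DIM = 8  # toy embedding size (assumption; chosen only to keep the sketch small)


def toy_token_vector(token, dim=DIM):
    """Deterministic stand-in for a contextual encoder such as BERT (hypothetical)."""
    seed = zlib.crc32(token.lower().encode("utf-8"))
    return [math.sin(seed * (i + 1) * 1e-3) for i in range(dim)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(vectors, query):
    """Weighted average of vectors, with weights softmax(dot(v, query)).

    Simplified attention: a learned layer would normally transform each
    vector before scoring; here the query is a fixed list of floats.
    """
    scores = [sum(a * b for a, b in zip(v, query)) for v in vectors]
    weights = softmax(scores)
    dim = len(vectors[0])
    pooled = [sum(w * v[i] for w, v in zip(weights, vectors)) for i in range(dim)]
    return pooled, weights


def split_sentences(text):
    """Naive sentence splitter on ., ! and ? followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def encode_document(text, word_query, sentence_query):
    """Word-level pooling per sentence, then sentence-level pooling."""
    sentence_vectors = []
    for sentence in split_sentences(text):
        tokens = re.findall(r"\w+", sentence)
        if not tokens:
            continue
        token_vectors = [toy_token_vector(t) for t in tokens]
        sentence_vector, _ = attention_pool(token_vectors, word_query)
        sentence_vectors.append(sentence_vector)
    if not sentence_vectors:
        raise ValueError("text contains no tokens")
    return attention_pool(sentence_vectors, sentence_query)


if __name__ == "__main__":
    text = "Long inputs are costly. Split them into sentences! Then pool twice."
    doc_vector, sentence_weights = encode_document(text, [0.1] * DIM, [0.2] * DIM)
    print(len(doc_vector), sentence_weights)
```

Because each sentence is encoded separately, the word-level encoder only ever sees one sentence at a time, which is the property the abstract relies on to avoid running self-attention over the full long text.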
Pages: 14