Transformer-based Hierarchical Encoder for Document Classification

Cited by: 1
Authors
Sakhrani, Harsh [1 ]
Parekh, Saloni [1 ]
Ratadiya, Pratik [2 ]
Affiliations
[1] Pune Inst Comp Technol, Pune, Maharashtra, India
[2] vCreaTek Consulting Serv Pvt Ltd, Pune, Maharashtra, India
Keywords
Transformer; Self-attention; Document Classification
DOI
10.1109/ICDMW53433.2021.00109
Chinese Library Classification (CLC)
TP [Automation Technology, Computer Technology]
Discipline Code
0812
Abstract
Document Classification has a wide range of applications in domains such as Ontology Mapping, Sentiment Analysis, Topic Categorization and Document Clustering, to name a few. Unlike Text Classification, Document Classification works with longer sequences that typically contain multiple paragraphs. Previous approaches to this task have achieved promising results, but have often relied on complex recurrence mechanisms that are computationally expensive and slow. Recently, self-attention based models like Transformers and BERT have achieved state-of-the-art performance on several Natural Language Understanding (NLU) tasks, but owing to the quadratic computational complexity of the self-attention mechanism with respect to input sequence length, these approaches are generally restricted to shorter text sequences. In this paper, we address this issue by proposing a new Transformer-based Hierarchical Encoder for the Document Classification task. The hierarchical framework we adopt extends the self-attention mechanism to long-form text modelling while considerably reducing the computational complexity. We use a Bidirectional Transformer Encoder (BTE) at the sentence level to generate a fixed-size sentence embedding for each sentence in the document. A document-level Transformer Encoder then models the global document context and learns inter-sentence dependencies. We also carry out experiments with the BTE in a feature-extraction and a fine-tuning setup, allowing us to evaluate the trade-off between computational cost and accuracy. Furthermore, we conduct ablation experiments and evaluate the impact of different pre-training strategies on overall performance. Experimental results demonstrate that our proposed model achieves state-of-the-art performance on two standard benchmark datasets.
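The two-level architecture described in the abstract can be made concrete with a short sketch. The PyTorch code below is an illustrative assumption, not the authors' implementation: a randomly initialized nn.TransformerEncoder stands in for the pre-trained Bidirectional Transformer Encoder (BTE) at the sentence level, mean-pooling is one plausible way to obtain the fixed-size sentence embeddings, and all class and parameter names (HierarchicalDocEncoder, d_model, max_sent_len, etc.) are hypothetical. The complexity benefit follows from the hierarchy: splitting a document of n tokens into s sentences of roughly k tokens each reduces the self-attention cost from O(n^2) to about O(s*k^2 + s^2).

```python
import torch
import torch.nn as nn

class HierarchicalDocEncoder(nn.Module):
    """Sketch of a two-level encoder: sentence-level Transformer ->
    fixed-size sentence embeddings -> document-level Transformer -> logits."""

    def __init__(self, vocab_size, d_model=256, nhead=8, num_layers=2,
                 num_classes=5, max_sent_len=64, max_doc_len=32):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, d_model, padding_idx=0)
        # Learned positional embeddings for token and sentence positions.
        self.tok_pos = nn.Parameter(torch.zeros(1, max_sent_len, d_model))
        self.sent_pos = nn.Parameter(torch.zeros(1, max_doc_len, d_model))
        def make_encoder():
            layer = nn.TransformerEncoderLayer(d_model, nhead, 4 * d_model,
                                               batch_first=True)
            return nn.TransformerEncoder(layer, num_layers)
        self.sent_encoder = make_encoder()  # stand-in for the pre-trained BTE
        self.doc_encoder = make_encoder()   # learns inter-sentence dependencies
        self.classifier = nn.Linear(d_model, num_classes)

    def forward(self, token_ids):
        # token_ids: (batch, sentences, tokens); 0 is padding.
        # Assumes every sentence contains at least one non-pad token.
        b, s, t = token_ids.shape
        flat = token_ids.view(b * s, t)
        x = self.tok_emb(flat) + self.tok_pos[:, :t]
        pad = flat.eq(0)                                   # True at padding
        x = self.sent_encoder(x, src_key_padding_mask=pad)
        # Mean-pool non-pad tokens into one fixed-size embedding per sentence.
        keep = (~pad).unsqueeze(-1).float()
        sent_emb = (x * keep).sum(1) / keep.sum(1).clamp(min=1)
        doc_in = sent_emb.view(b, s, -1) + self.sent_pos[:, :s]
        doc = self.doc_encoder(doc_in)          # attention across sentences
        return self.classifier(doc.mean(dim=1)) # (batch, num_classes)

# Shape check: 4 documents, 16 sentences each, 64 tokens per sentence.
model = HierarchicalDocEncoder(vocab_size=30000)
logits = model(torch.randint(1, 30000, (4, 16, 64)))
print(logits.shape)  # torch.Size([4, 5])
```

In the feature-extraction setup the abstract mentions, the sentence encoder's weights would presumably be frozen (e.g. via requires_grad_(False)) so that only the document encoder and classifier are trained, whereas the fine-tuning setup would update everything; that difference is the computation/accuracy trade-off the authors evaluate.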
Pages: 852 - 858
Page count: 7
Related Papers
50 records in total (10 shown below)
  • [1] A hierarchical transformer-based network for multivariate time series classification
    Tang, Yingxia
    Wei, Yanxuan
    Li, Teng
    Zheng, Xiangwei
    Ji, Cun
    INFORMATION SYSTEMS, 2025, 132
  • [2] Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document Matching
    Yang, Liu
    Zhang, Mingyang
    Li, Cheng
    Bendersky, Michael
    Najork, Marc
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1725 - 1734
  • [3] Transformer-based factorized encoder for classification of pneumoconiosis on 3D CT images
    Huang, Yingying
    Si, Yang
    Hu, Bingliang
    Zhang, Yan
    Wu, Shuang
    Wu, Dongsheng
    Wang, Quan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
  • [4] Evaluation of Transformer-Based Encoder on Conditional Graph Generation
    Abeywickrama, Thamila E. H.
    Tsugawa, Sho
    Manada, Akiko
    Watabe, Kohei
    2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 1526 - 1527
  • [5] TRANSFORMER-BASED APPROACH FOR DOCUMENT LAYOUT UNDERSTANDING
    Yang, Huichen
    Hsu, William
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 4043 - 4047
  • [6] Medical image super-resolution via transformer-based hierarchical encoder-decoder network
    Sun, Jianhao
    Zeng, Xiangqin
    Lei, Xiang
    Gao, Mingliang
    Li, Qilei
    Zhang, Housheng
    Ba, Fengli
    NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2024, 13 (01)
  • [7] Transformer-based Bug/Feature Classification
    Ozturk, Ceyhun E.
    Yilmaz, Eyup Halit
    Koksal, Omer
    2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2023
  • [8] EEG Classification with Transformer-Based Models
    Sun, Jiayao
    Xie, Jin
    Zhou, Huihui
    2021 IEEE 3RD GLOBAL CONFERENCE ON LIFE SCIENCES AND TECHNOLOGIES (IEEE LIFETECH 2021), 2021, : 92 - 93
  • [9] Transformer-Based Point Cloud Classification
    Wu, Xianfeng
    Liu, Xinyi
    Wang, Junfei
    Wu, Xianzu
    Lai, Zhongyuan
    Zhou, Jing
    Liu, Xia
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2022, PT I, 2022, 1700 : 218 - 225
  • [10] Hierarchical Transformer-based Query by Multiple Documents
    Huang, Zhiqi
    Naseri, Shahrzad
    Bonab, Hamed
    Sarwar, Sheikh Muhammad
    Allan, James
    PROCEEDINGS OF THE 2023 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2023, 2023, : 105 - 115