Self-supervised Pre-training and Semi-supervised Learning for Extractive Dialog Summarization

Cited: 0
Authors
Zhuang, Yingying [1 ]
Song, Jiecheng [1 ]
Sadagopan, Narayanan [1 ]
Beniwal, Anurag [1 ]
Institutions
[1] Amazon, San Francisco, CA 94107 USA
Source
COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023 | 2023
Keywords
summarization; twitter; dialog; self-supervised pre-training; semi-supervised learning;
DOI
10.1145/3543873.3587680
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Discipline Classification Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Language model pre-training has led to state-of-the-art performance in text summarization. While a variety of pre-trained transformer models are available nowadays, they are mostly trained on documents. In this study we introduce self-supervised pre-training to enhance the BERT model's semantic and structural understanding of dialog texts from social media. We also propose a semi-supervised teacher-student learning framework to address the common issue of limited available labels in summarization datasets. We empirically evaluate our approach on the extractive summarization task with the TWEETSUMM corpus, a recently introduced dialog summarization dataset of Twitter customer care conversations, and demonstrate that our self-supervised pre-training and semi-supervised teacher-student learning are both beneficial in comparison to other pre-trained models. Additionally, we compare pre-training and teacher-student learning in various low data-resource settings and find that pre-training outperforms teacher-student learning, with the differences between the two becoming more pronounced when available labels are scarce.
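The abstract describes a teacher-student pseudo-labeling loop for extractive (sentence-selection) summarization. Below is a minimal sketch of that general scheme, assuming sentence-level binary labels (1 = sentence belongs in the summary) and a small placeholder scorer standing in for the authors' BERT-based encoder; the model, threshold, and toy data are illustrative assumptions, not the paper's implementation.

```python
# Sketch of semi-supervised teacher-student learning for extractive summarization.
# A teacher trained on the small labeled set pseudo-labels unlabeled dialogs,
# and a student is trained on the combined gold + pseudo-labeled data.
import torch
import torch.nn as nn

class SentenceScorer(nn.Module):
    """Stand-in for a BERT-based extractive scorer: maps a sentence embedding
    to the probability that the sentence should be extracted."""
    def __init__(self, dim=128):
        super().__init__()
        self.head = nn.Sequential(nn.Linear(dim, 64), nn.ReLU(), nn.Linear(64, 1))

    def forward(self, sent_embs):               # (num_sentences, dim)
        return torch.sigmoid(self.head(sent_embs)).squeeze(-1)

def train(model, dialogs, labels, epochs=3, lr=1e-3):
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.BCELoss()
    for _ in range(epochs):
        for embs, y in zip(dialogs, labels):
            opt.zero_grad()
            loss_fn(model(embs), y).backward()
            opt.step()

dim = 128
labeled   = [torch.randn(10, dim) for _ in range(8)]            # toy sentence embeddings
gold      = [torch.randint(0, 2, (10,)).float() for _ in range(8)]
unlabeled = [torch.randn(10, dim) for _ in range(32)]

# 1) Train the teacher on the small labeled set.
teacher = SentenceScorer(dim)
train(teacher, labeled, gold)

# 2) Teacher assigns pseudo-labels to the unlabeled dialogs (hypothetical 0.5 cutoff).
with torch.no_grad():
    pseudo = [(teacher(embs) > 0.5).float() for embs in unlabeled]

# 3) Student trains on gold and pseudo-labeled data combined.
student = SentenceScorer(dim)
train(student, labeled + unlabeled, gold + pseudo)
```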
Pages: 1069 - 1076
Page count: 8
Related Papers
50 records in total
  • [1] Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training
    Zhang, Bowen
    Cao, Songjun
    Zhang, Xiaoming
    Zhang, Yike
    Ma, Long
    Shinozaki, Takahiro
    INTERSPEECH 2022, 2022, : 2653 - 2657
  • [2] A debiased self-training framework with graph self-supervised pre-training aided for semi-supervised rumor detection
    Qiao, Yuhan
    Cui, Chaoqun
    Wang, Yiying
    Jia, Caiyan
    NEUROCOMPUTING, 2024, 604
  • [3] Comparing Self-Supervised Pre-Training and Semi-Supervised Training for Speech Recognition in Languages with Weak Language Models
    Lam-Yee-Mui, Lea-Marie
    Yang, Lucas Ondel
    Klejch, Ondrej
    INTERSPEECH 2023, 2023, : 87 - 91
  • [4] Self-Supervised Learning for Contextualized Extractive Summarization
    Wang, Hong
    Wang, Xin
    Xiong, Wenhan
    Yu, Mo
    Guo, Xiaoxiao
    Chang, Shiyu
    Wang, William Yang
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2221 - 2227
  • [5] Self-supervised ECG pre-training
    Liu, Han
    Zhao, Zhenbo
    She, Qiang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 70
  • [6] Dense Contrastive Learning for Self-Supervised Visual Pre-Training
    Wang, Xinlong
    Zhang, Rufeng
    Shen, Chunhua
    Kong, Tao
    Li, Lei
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3023 - 3032
  • [7] Class incremental learning with self-supervised pre-training and prototype learning
    Liu, Wenzhuo
    Wu, Xin-Jian
    Zhu, Fei
    Yu, Ming-Ming
    Wang, Chuang
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2025, 157
  • [8] Self-supervised Pre-training of Text Recognizers
    Kiss, Martin
    Hradis, Michal
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT IV, 2024, 14807 : 218 - 235
  • [9] Self-supervised Pre-training for Mirror Detection
    Lin, Jiaying
    Lau, Rynson W. H.
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 12193 - 12202
  • [10] Self-supervised Pre-training for Nuclei Segmentation
    Haq, Mohammad Minhazul
    Huang, Junzhou
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 303 - 313