Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization

被引:7
|
作者
Zheng, Chujie [1 ]
Zhang, Kunpeng [2 ]
Wang, Harry Jiannan [1 ]
Fan, Ling [3 ,4 ]
Wang, Zhe [4 ]
机构
[1] Univ Delaware, Newark, DE 19716 USA
[2] Univ Maryland, College Pk, MD 20742 USA
[3] Tongji Univ, Shanghai, Peoples R China
[4] Tezign Com, Shanghai, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2021年
关键词
Abstractive Text Summarization; Contrastive Learning; Data Augmentation; Seq2seq;
D O I
10.1109/BigData52589.2021.9671819
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a denoising sequence-to-sequence (seq2seq) autoencoder via contrastive learning for abstractive text summarization. Our model adopts a standard Transformer-based architecture with a multi-layer bi-directional encoder and an auto-regressive decoder. To enhance its denoising ability, we incorporate self-supervised contrastive learning along with various sentence-level document augmentation. These two components, seq2seq autoencoder and contrastive learning, are jointly trained through fine-tuning, w hich i mproves t he performance of text summarization with regard to ROUGE scores and human evaluation. We conduct experiments on two datasets and demonstrate that our model outperforms many existing benchmarks and even achieves comparable performance to the state-of-the-art abstractive systems trained with more complex architecture and extensive computation resources.
引用
收藏
页码:1764 / 1771
页数:8
相关论文
共 50 条
  • [41] Advancing machine learning with OCR2SEQ: an innovative approach to multi-modal data augmentation
    Lowe, Michael
    Prusa, Joseph D.
    Leevy, Joffrey L.
    Khoshgoftaar, Taghi M.
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [42] ACT2G: Attention-based Contrastive Learning for Text-to-Gesture Generation
    Teshima, Hitoshi
    Wake, Naoki
    Thomas, Diego
    Nakashima, Yuta
    Kawasaki, Hiroshi
    Ikeuchi, Katsushi
    PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2023, 6 (03)
  • [43] Contrastive learning based on hierarchical graph of microstructures through directed energy deposition process to establish process-structure-property relationship via autoencoder
    Chen, Chengxi
    Wong, Stanley Jian Liang
    Tan, Eddie Zhi'En
    Li, Hua
    MATERIALS & DESIGN, 2024, 244
  • [44] Enhanced Point Cloud Interpretation via Style Fusion and Contrastive Learning in Advanced 3D Data Analysis
    Zhou, Ruimin
    Own, Chung-Ming
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT I, 2023, 14254 : 344 - 355
  • [45] Template-Free Prompting for Few-Shot Named Entity Recognition via Semantic-Enhanced Contrastive Learning
    He, Kai
    Mao, Rui
    Huang, Yucheng
    Gong, Tieliang
    Li, Chen
    Cambria, Erik
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (12) : 18357 - 18369
  • [46] I Know Your Intent: Graph-enhanced Intent-aware User Device Interaction Prediction via Contrastive Learning
    Xiao, Jingyu
    Zou, Qingsong
    Li, Qing
    Zhao, Dan
    Li, Kang
    Weng, Zixuan
    Li, Ruoyu
    Jiang, Yong
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2023, 7 (03):
  • [47] FreqSpace-NeRF: A fourier-enhanced Neural Radiance Fields method via dual-domain contrastive learning for novel view synthesis
    Yu, Xiaosheng
    Tian, Xiaolei
    Chen, Jubo
    Wang, Ying
    COMPUTERS & GRAPHICS-UK, 2025, 127
  • [48] ACF-R+: An asymmetry-sensitive method for image-text retrieval enhanced by cross-modal fusion and re-ranking based on contrastive learning
    Gong, Ziyu
    Huang, Yihua
    Yu, Chunhua
    Dai, Peng
    Ge, Xing
    Shen, Yiming
    Liu, Yafei
    NEUROCOMPUTING, 2025, 628
  • [49] U2-Former: Nested U-Shaped Transformer for Image Restoration via Multi-View Contrastive Learning
    Feng, Xin
    Ji, Haobo
    Pei, Wenjie
    Li, Jinxing
    Lu, Guangming
    Zhang, David
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 168 - 181
  • [50] CGUN-2A: Deep Graph Convolutional Network via Contrastive Learning for Large-Scale Zero-Shot Image Classification
    Li, Liangwei
    Liu, Lin
    Du, Xiaohui
    Wang, Xiangzhou
    Zhang, Ziruo
    Zhang, Jing
    Zhang, Ping
    Liu, Juanxiu
    SENSORS, 2022, 22 (24)