Representation transfer and data cleaning in multi-views for text simplification

被引:1
|
作者
He, Wei [1 ,2 ]
Farrahi, Katayoun [1 ]
Chen, Bin [3 ]
Peng, Bohua [2 ]
Villavicencio, Aline [2 ]
机构
[1] Univ Southampton, Dept Elect & Comp Sci, Southampton SO17 1BJ, England
[2] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, England
[3] Univ Sheffield, Dept Automatic Control & Syst Engn, Sheffield S1 3JD, England
基金
英国工程与自然科学研究理事会;
关键词
Text simplification; Sentence representation; Pre-trained language model; Data cleaning; Decoding;
D O I
10.1016/j.patrec.2023.11.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Representation transfer is a widely used technique in natural language processing. We propose methods of cleaning the dominant dataset of text simplification (TS) WikiLarge in multi-views to remove errors that impact model training and fine-tuning. The results show that our method can effectively refine the dataset. We propose to take the pre-trained text representations from a similar task (e.g., text summarization) to text simplification to conduct a continue-fine-tuning strategy to improve the performance of pre-trained models on TS. This approach will speed up the training and make the model convergence easier. Besides, we also propose a new decoding strategy for simple text generation. It is able to generate simpler and more comprehensible text with controllable lexical simplicity. The experimental results show that our method can achieve good performance on many evaluation metrics.
引用
收藏
页码:40 / 46
页数:7
相关论文
共 50 条
  • [21] Cloud-Based Data Offloading for Multi-focus and Multi-views Image Fusion in Mobile Applications
    Yiqi Shi
    Jianguo Sun
    Duo Liu
    Liang Kou
    Boquan Li
    Qing Yang
    Liguo Zhang
    Mobile Networks and Applications, 2021, 26 : 830 - 841
  • [22] Overview of an Approach Describing Multi-views/Multi-abstraction Levels Software Architecture
    Kheir, Ahmad
    Naja, Hala
    Oussalah, Mourad
    Tout, Kifah
    ENASE: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING, 2013, : 140 - 148
  • [23] Military Training Game Requirement Description Framework Based on Multi-Views
    Nan, Yang
    Mu Xiaodong
    Zhang Qinghui
    Guo Yi
    Song Wei
    9TH IEEE INTERNATIONAL SYMPOSIUM ON SERVICE-ORIENTED SYSTEM ENGINEERING (SOSE 2015), 2015, : 297 - 301
  • [24] Multi-views reinforced LSTM for video-based action recognition
    Mao, Zhenzhen
    Jiang, Min
    Kong, Jun
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (05)
  • [25] Occlusion-robust bimanual gesture recognition by fusing multi-views
    Poon, Geoffrey
    Kwan, Kin Chung
    Pang, Wai-Man
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (16) : 23469 - 23488
  • [26] Multi-views Action Recognition on Deep Learning and K-SVD
    Wang, Chuanxu
    Hu, Guofeng
    Liu, Yun
    2018 INTERNATIONAL SEMINAR ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (SCSET 2018), 2019, 1176
  • [27] A new registration method for scattered point clouds from multi-views
    Mei, Z., 1600, Asian Network for Scientific Information (12):
  • [28] Occlusion-robust bimanual gesture recognition by fusing multi-views
    Geoffrey Poon
    Kin Chung Kwan
    Wai-Man Pang
    Multimedia Tools and Applications, 2019, 78 : 23469 - 23488
  • [29] Coordinate CNNs and LSTMs to categorize scene images with multi-views and multi-levels of abstraction
    Bai, Shuang
    Tang, Huadong
    An, Shan
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 120 : 298 - 309
  • [30] Structure and motion of 3D moving objects from multi-views
    Mori, TY
    Suzuki, S
    Yasuno, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (12): : 1598 - 1606