Parameter-efficient feature-based transfer for paraphrase identification

Cited by: 1
Authors
Liu, Xiaodong [1]
Rzepka, Rafal [2]
Araki, Kenji [2]
Affiliations
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo, Hokkaido, Japan
[2] Hokkaido Univ, Fac Informat Sci & Technol, Sapporo, Hokkaido, Japan
Keywords
Parameter-efficient feature-based transfer; Paraphrase identification; Natural language inference; Semantic textual similarity; Continual learning
DOI
10.1017/S135132492200050X
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Paraphrase Identification (PI) is the NLP task of determining whether the two sentences in a pair are semantically equivalent, and many types of approaches to it exist. Traditional approaches, mainly unsupervised learning and feature engineering, are computationally inexpensive, but their task performance is now only moderate. To find a method that preserves the low computational cost of traditional approaches while yielding better task performance, we investigate neural network-based transfer learning approaches. We find that this goal can be achieved by using parameters more efficiently in feature-based transfer. To this end, we propose a pre-trained task-specific architecture whose fixed parameters can be shared by multiple classifiers, each adding only a small number of parameters. As a result, the only computational cost involving parameter updates comes from classifier tuning: the features output by the architecture, combined with lexical overlap features, are fed into a single classifier for tuning. Furthermore, the pre-trained task-specific architecture can also be applied to natural language inference and semantic textual similarity tasks. This design consumes few computational and memory resources per task and is also conducive to power-efficient continual learning. The experimental results show that our proposed method is competitive with adapter-BERT (a parameter-efficient fine-tuning approach) on some tasks while using only 16% of the trainable parameters and saving 69-96% of the time for parameter updates.
Pages: 1066-1096
Page count: 31
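Illustrative sketch

The abstract describes feature-based transfer in which a fixed pre-trained architecture produces features, lexical overlap features are appended, and only a small classifier is tuned. The following is a minimal sketch of that general pattern, not the authors' implementation. The `frozen_encoder` function is a hypothetical stand-in (a hashed bag-of-words comparison) for the pre-trained architecture, and the two overlap features, the logistic classifier, and the toy sentence pairs are all assumptions chosen for illustration.

```python
import math
from typing import List

DIM = 8  # size of the hypothetical frozen feature vector


def _embed(sentence: str) -> List[float]:
    """Deterministic hashed bag-of-words vector, L2-normalised."""
    vec = [0.0] * DIM
    for word in sentence.lower().split():
        vec[sum(ord(c) for c in word) % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def frozen_encoder(s1: str, s2: str) -> List[float]:
    """Hypothetical stand-in for the fixed pre-trained architecture.

    It has no trainable parameters, so nothing here is ever updated.
    """
    return [abs(x - y) for x, y in zip(_embed(s1), _embed(s2))]


def lexical_overlap_features(s1: str, s2: str) -> List[float]:
    """Jaccard similarity of the word sets and ratio of their sizes."""
    a, b = set(s1.lower().split()), set(s2.lower().split())
    union = a | b
    jaccard = len(a & b) / len(union) if union else 0.0
    longer = max(len(a), len(b))
    length_ratio = min(len(a), len(b)) / longer if longer else 0.0
    return [jaccard, length_ratio]


def pair_features(s1: str, s2: str) -> List[float]:
    """Concatenate frozen features with lexical overlap features."""
    return frozen_encoder(s1, s2) + lexical_overlap_features(s1, s2)


def _sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


class LogisticClassifier:
    """The only component with trainable parameters (weights and bias)."""

    def __init__(self, n_features: int) -> None:
        self.w = [0.0] * n_features
        self.b = 0.0

    def n_params(self) -> int:
        return len(self.w) + 1

    def predict_proba(self, x: List[float]) -> float:
        return _sigmoid(sum(wi * xi for wi, xi in zip(self.w, x)) + self.b)

    def loss(self, X: List[List[float]], y: List[int]) -> float:
        """Mean binary cross-entropy."""
        eps = 1e-12
        total = 0.0
        for x, t in zip(X, y):
            p = self.predict_proba(x)
            total -= t * math.log(p + eps) + (1 - t) * math.log(1 - p + eps)
        return total / len(X)

    def fit(self, X: List[List[float]], y: List[int],
            lr: float = 0.5, epochs: int = 200) -> None:
        """Plain stochastic gradient descent on the classifier only."""
        for _ in range(epochs):
            for x, t in zip(X, y):
                g = self.predict_proba(x) - t
                self.w = [wi - lr * g * xi for wi, xi in zip(self.w, x)]
                self.b -= lr * g


# Toy sentence pairs (invented for illustration): 1 = paraphrase, 0 = not.
pairs = [
    ("the cat sat on the mat", "the cat sat on the mat today", 1),
    ("he bought a new car", "he purchased a new car", 1),
    ("the cat sat on the mat", "stocks fell sharply on monday", 0),
    ("he bought a new car", "rain is expected tomorrow", 0),
]
X = [pair_features(s1, s2) for s1, s2, _ in pairs]
y = [label for _, _, label in pairs]

clf = LogisticClassifier(n_features=DIM + 2)
loss_before = clf.loss(X, y)
clf.fit(X, y)
loss_after = clf.loss(X, y)
print(f"trainable parameters: {clf.n_params()}")
print(f"loss before: {loss_before:.3f}, after: {loss_after:.3f}")
```

Because the encoder is fixed, its outputs could be computed once and reused by several such classifiers, which is the kind of sharing the abstract refers to. A real system would replace `frozen_encoder` with the actual pre-trained model.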