Induction Networks for Few-Shot Text Classification

Authors
Geng, Ruiying [1, 2]
Li, Binhua [2]
Li, Yongbin [2]
Zhu, Xiaodan [3]
Jian, Ping [1]
Sun, Jian [2]
Affiliations
[1] Beijing Institute of Technology, School of Computer Science and Technology, Beijing, People's Republic of China
[2] Alibaba Group, Beijing, People's Republic of China
[3] Queen's University, ECE, Kingston, ON, Canada
Funding
National Natural Science Foundation of China
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text classification tends to struggle when data is scarce or when it needs to adapt to unseen classes. In such challenging scenarios, recent studies have used meta-learning to simulate the few-shot task, in which new queries are compared to a small support set at the sample-wise level. However, this sample-wise comparison may be severely disturbed by the various expressions in the same class. Therefore, we should be able to learn a general representation of each class in the support set and then compare it to new queries. In this paper, we propose a novel Induction Network to learn such a generalized class-wise representation by leveraging the dynamic routing algorithm in meta-learning. In this way, we find the model is able to induce and generalize better. We evaluate the proposed model on a well-studied sentiment classification dataset (English) and a real-world dialogue intent classification dataset (Chinese). Experimental results show that on both datasets, the proposed model significantly outperforms the existing state-of-the-art approaches, demonstrating the effectiveness of class-wise generalization in few-shot text classification.
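As a rough illustration of the class-wise idea described in the abstract, the sketch below aggregates the support vectors of each class into a single class vector with a few dynamic-routing iterations, then assigns a query to the closest class vector. It is not the authors' implementation: it assumes sentences are already encoded as fixed-length vectors, it omits any learned encoder or transformation, and cosine similarity is an assumed stand-in for the comparison step. All function names are hypothetical.

```python
import math

def squash(v):
    """Shrink a vector so its norm lies in [0, 1) while keeping its direction."""
    sq = sum(x * x for x in v)
    if sq == 0.0:
        return [0.0 for _ in v]
    scale = sq / (1.0 + sq) / math.sqrt(sq)
    return [scale * x for x in v]

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def induce_class_vector(support, iterations=3):
    """Aggregate one class's support vectors into one class vector by routing."""
    squashed = [squash(s) for s in support]
    logits = [0.0] * len(squashed)  # routing logits, one per support sample
    dim = len(support[0])
    class_vec = [0.0] * dim
    for _ in range(iterations):
        weights = softmax(logits)
        mixed = [sum(w * s[d] for w, s in zip(weights, squashed))
                 for d in range(dim)]
        class_vec = squash(mixed)
        # Samples that agree with the current class vector gain routing weight.
        logits = [b + sum(a * c for a, c in zip(s, class_vec))
                  for b, s in zip(logits, squashed)]
    return class_vec

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def classify(query, support_by_class):
    """Compare the query to one induced vector per class (class-wise comparison)."""
    class_vecs = {label: induce_class_vector(vectors)
                  for label, vectors in support_by_class.items()}
    return max(class_vecs, key=lambda label: cosine(query, class_vecs[label]))

# Toy 2-D support set with three samples per class (made-up values).
support = {
    "pos": [[1.0, 0.1], [0.9, 0.2], [0.8, 0.0]],
    "neg": [[-1.0, 0.1], [-0.9, -0.2], [-0.7, 0.1]],
}
predicted = classify([0.5, 0.1], support)
```

The query is compared to one vector per class rather than to every support sample, which is the contrast with sample-wise comparison that the abstract draws.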
Pages: 3904-3913
Page count: 10
Related papers
50 in total
  • [21] Knowledge Guided Metric Learning for Few-Shot Text Classification
    Sui, Dianbo
    Chen, Yubo
    Mao, Binjie
    Qiu, Delai
    Liu, Kang
    Zhao, Jun
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 3266 - 3271
  • [22] Boosting Few-Shot Text Classification via Distribution Estimation
    Liu, Han
    Zhang, Feng
    Zhang, Xiaotong
    Zhao, Siyang
    Ma, Fenglong
    Wu, Xiao-Ming
    Chen, Hongyang
    Yu, Hong
    Zhang, Xianchao
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13219 - 13227
  • [23] Few-shot Text Classification Method Based on Feature Optimization
    Peng, Jing
    Huo, Shuquan
    JOURNAL OF WEB ENGINEERING, 2023, 22 (03): : 497 - 514
  • [24] Few-Shot Classification with Feature Map Reconstruction Networks
    Wertheimer, Davis
    Tang, Luming
    Hariharan, Bharath
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8008 - 8017
  • [25] Powerful embedding networks for few-shot image classification
    Luo, Laigan
    Zhou, Anan
    Yi, Benshun
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (06)
  • [26] Compound Memory Networks for Few-Shot Video Classification
    Zhu, Linchao
    Yang, Yi
    COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 782 - 797
  • [27] A review of few-shot classification
    Lim, Jia Min
    Lim, Kian Ming
    Lee, Chin Poo
    Lim, Jit Yan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 275
  • [28] MetricPrompt: Prompting Model as a Relevance Metric for Few-shot Text Classification
    Dong, Hongyuan
    Zhang, Weinan
    Che, Wanxiang
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 426 - 436
  • [29] Enhance Prototypical Network with Text Descriptions for Few-shot Relation Classification
    Yang, Kaijia
    Zheng, Nantao
    Dai, Xinyu
    He, Liang
    Huang, Shujian
    Chen, Jiajun
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2273 - 2276
  • [30] Noisy Channel Language Model Prompting for Few-Shot Text Classification
    Min, Sewon
    Lewis, Mike
    Hajishirzi, Hannaneh
    Zettlemoyer, Luke
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 5316 - 5330