Few-Shot and Zero-Shot Multi-Label Learning for Structured Label Spaces

Cited by: 0
Authors
Rios, Anthony [1]
Kavuluru, Ramakanth [2]
Affiliations
[1] Univ Kentucky, Dept Comp Sci, Lexington, KY 40506 USA
[2] Univ Kentucky, Div Biomed Informat, Lexington, KY USA
Source
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018) | 2018
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Large multi-label datasets contain labels that occur thousands of times (frequent group), labels that occur only a few times (few-shot group), and labels that never appear in the training dataset (zero-shot group). Multi-label few- and zero-shot label prediction is mostly unexplored on datasets with large label spaces, especially for text classification. In this paper, we perform a fine-grained evaluation to understand how state-of-the-art methods perform on infrequent labels. Furthermore, we develop few- and zero-shot methods for multi-label text classification when there is a known structure over the label space, and evaluate them on two publicly available medical text datasets: MIMIC II and MIMIC III. For few-shot labels we achieve improvements of 6.2% and 4.8% in R@10 for MIMIC II and MIMIC III, respectively, over prior efforts; the corresponding R@10 improvements for zero-shot labels are 17.3% and 19%.
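The label groups and the R@10 metric named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' evaluation code. The function names, the `few_shot_max` threshold, and the choice to rank only the labels inside one group are assumptions made for illustration; the abstract gives no exact cutoff for "a few times".

```python
from collections import Counter


def group_labels(train_label_sets, all_labels, few_shot_max=5):
    """Split labels into frequent / few-shot / zero-shot groups by training count.

    few_shot_max is a hypothetical cutoff, not a value stated in the abstract.
    """
    counts = Counter(label for labels in train_label_sets for label in labels)
    groups = {"frequent": set(), "few_shot": set(), "zero_shot": set()}
    for label in all_labels:
        n = counts[label]
        if n == 0:
            groups["zero_shot"].add(label)   # never seen in training
        elif n <= few_shot_max:
            groups["few_shot"].add(label)    # seen only a few times
        else:
            groups["frequent"].add(label)
    return groups


def recall_at_k(scores, gold, candidates, k=10):
    """Recall@k for one document, ranking only the labels in `candidates`."""
    relevant = gold & candidates
    if not relevant:
        return None  # no gold label from this group; skip the document
    # Sort by descending score; break ties by label name for determinism.
    ranked = sorted(
        candidates,
        key=lambda label: (-scores.get(label, float("-inf")), label),
    )
    top_k = set(ranked[:k])
    return len(relevant & top_k) / len(relevant)
```

Averaging `recall_at_k` over the test documents for which it returns a value gives one R@k figure per label group, which is the kind of per-group comparison the abstract reports.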
Pages: 3132-3142
Page count: 11