Few-Shot and Zero-Shot Multi-Label Learning for Structured Label Spaces

被引:0
作者
Rios, Anthony [1 ]
Kavuluru, Ramakanth [2 ]
机构
[1] Univ Kentucky, Dept Comp Sci, Lexington, KY 40506 USA
[2] Univ Kentucky, Div Biomed Informat, Lexington, KY USA
来源
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018) | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large multi-label datasets contain labels that occur thousands of times (frequent group), those that occur only a few times (few-shot group), and labels that never appear in the training dataset (zero-shot group). Multi-label few- and zero-shot label prediction is mostly unexplored on datasets with large label spaces, especially for text classification. In this paper, we perform a fine-grained evaluation to understand how state-of-the-art methods perform on infrequent labels. Furthermore, we develop few- and zero-shot methods for multilabel text classification when there is a known structure over the label space, and evaluate them on two publicly available medical text datasets: MIMIC II and MIMIC III. For few-shot labels we achieve improvements of 6.2% and 4.8% in R@10 for MIMIC II and MIMIC III, respectively, over prior efforts; the corresponding R@10 improvements for zero-shot labels are 17.3% and 19%.
引用
收藏
页码:3132 / 3142
页数:11
相关论文
共 47 条
  • [11] Defferrard M, 2016, ADV NEUR IN, V29
  • [12] Automated Classification of Free-text Pathology Reports for Registration of Incident Cases of Cancer
    Jouhet, V.
    Defossez, G.
    Burgun, A.
    le Beux, P.
    Levillain, P.
    Ingrand, P.
    Claveau, V.
    [J]. METHODS OF INFORMATION IN MEDICINE, 2012, 51 (03) : 242 - 251
  • [13] A Convolutional Neural Network for Modelling Sentences
    Kalchbrenner, Nal
    Grefenstette, Edward
    Blunsom, Phil
    [J]. PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 655 - 665
  • [14] Kim Y, 2014, IEEE ASME INT C ADV, P1747, DOI 10.1109/AIM.2014.6878336
  • [15] Kingma D. P., P 3 INT C LEARN REPR
  • [16] Koch GR, 2015, SIAMESE NEURAL NETWO
  • [17] Liu P, 2016, 2016 17TH INTERNATIONAL CONFERENCE ON ELECTRONIC PACKAGING TECHNOLOGY (ICEPT), P1480, DOI 10.1109/ICEPT.2016.7583403
  • [18] Marcheggiani D., 2017, EMNLP, P1506, DOI 10.18653/v1/d17-1159
  • [19] COSTA: Co-Occurrence Statistics for Zero-Shot Classification
    Mensink, Thomas
    Gavves, Efstratios
    Snoek, Cees G. M.
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2441 - 2448
  • [20] Mikolov T., 2013, ICLR, P3111