A Unified Approach on Active Learning Dual Supervision

被引:0
|
作者
Chriswanto, Adrian [1 ]
Pao, Hsing-Kuo [1 ]
Leet, Yuh-Jye [2 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[2] Natl Chiao Tung Univ, Dept Appl Math, Hsinchu, Taiwan
关键词
active learning; categorical features; feature labeling; hierarchical clustering;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active Learning (AL) is a machine learning framework that aims to efficiently select a limited labeled data to construct an effective model given huge amount of unlabeled data on the side. Most studies in AL focus on how to select the unlabeled data to be labeled by a human oracle in order to maximize the performance gain of the model with as little labeling effort as possible. In this work, however, we focus not only on how to select appropriate data instances but also how to select informative features, more specifically, categorical features to be labeled by the oracle in a unified manner. The unification means that we select the best choice of item to label where the item can be either a feature or an instance on each iteration given a unified scoring function to make the decision. The method that we propose is by synthesizing new instances that represent a set of features. By utilizing synthesized instances, we can treat this set of features as if they are regular instances. Therefore they could be compared on an equal ground when the model tries to select which instances to be labeled by the oracle. The features used to build the synthesized instances need to be carefully selected so the resulting synthesized instances could improve the model and not introducing any contradicting information. We utilize hierarchical clustering in order to group features that own similar content. This is done first by picking clusters whose label purity are estimated to be high. Then we score a feature based on how common the feature is in the cluster and how related the feature is to the estimated majority label. The top scoring features will then be used to synthesize instances. We demonstrate the effectiveness of the proposed method through a few data sets that consist of only categorical features where the feature labeling makes more sense to labeling oracles. The experiment results show that adopting the unified approach creates clear benefit to model construction, especially in the early stage where we can efficiently obtain an effective model through only a few iterations, compared to the one using only instance labeling for model construction.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] A Unified Active Learning Framework for Biomedical Relation Extraction
    张宏涛
    黄民烈
    朱小燕
    JournalofComputerScience&Technology, 2012, 27 (06) : 1302 - 1313
  • [32] A unified theoretical approach for biological cognition and learning
    Komer, Brent
    Eliasmith, Chris
    CURRENT OPINION IN BEHAVIORAL SCIENCES, 2016, 11 : 14 - 20
  • [33] AN ECOLOGICAL APPROACH TOWARD A UNIFIED THEORY OF LEARNING
    CHARLESWORTH, WR
    BEHAVIORAL AND BRAIN SCIENCES, 1981, 4 (01) : 142 - 143
  • [34] Compact dual ensembles for active learning
    Mandvikar, A
    Liu, H
    Motoda, H
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2004, 3056 : 293 - 297
  • [35] Dual generative adversarial active learning
    Guo, Jifeng
    Pang, Zhiqi
    Bai, Miaoyuan
    Xie, Peijiao
    Chen, Yu
    APPLIED INTELLIGENCE, 2021, 51 (08) : 5953 - 5964
  • [36] Dual generative adversarial active learning
    Jifeng Guo
    Zhiqi Pang
    Miaoyuan Bai
    Peijiao Xie
    Yu Chen
    Applied Intelligence, 2021, 51 : 5953 - 5964
  • [37] PSF: A Unified Patient Similarity Evaluation Framework Through Metric Learning With Weak Supervision
    Wang, Fei
    Sun, Jimeng
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2015, 19 (03) : 1053 - 1060
  • [38] Unified Semantic Parsing with Weak Supervision
    Agrawal, Priyanka
    Jain, Parag
    Dalmia, Ayushi
    Bansal, Abhishek
    Mittal, Ashish
    Sankaranarayanan, Karthik
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4801 - 4810
  • [39] Imitation Learning and Attentional Supervision of Dual-Arm Structured Tasks
    Caccavale, R.
    Saveriano, M.
    Fontanelli, G. A.
    Ficuciello, F.
    Lee, D.
    Finzi, A.
    2017 THE SEVENTH JOINT IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (ICDL-EPIROB), 2017, : 66 - 71
  • [40] A Simple Unified Model for Generic Operation of Dual Active Bridge Converter
    Shah, Suyash Sushilkumar
    Bhattacharya, Subhashish
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2019, 66 (05) : 3486 - 3495