sCOs: Semi-Supervised Co-Selection by a Similarity Preserving Approach

被引:7
作者
Benabdeslem, Khalid [1 ]
Mansouri, Dou El Kefel [2 ]
Makkhongkaew, Raywat [3 ]
机构
[1] Univ Lyon1, LIRIS, CNRS, UMR5205, F-69622 Lyon, France
[2] Ibn Khaldoun Univ, BP P 78 Zaaroura, Tiaret 14000, Algeria
[3] State Railway Thailand SRT, Bangkok 10520, Thailand
关键词
Feature extraction; Task analysis; Semisupervised learning; Data mining; Robustness; Optimization; Supervised learning; Instance selection; feature selection; semi-supervised learning; similarity preserving; optimization; co-selection; INSTANCE SELECTION; CLASSIFIERS;
D O I
10.1109/TKDE.2020.3014262
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we focus on co-selection of instances and features in the semi-supervised learning scenario. In this context, co-selection becomes a more challenging problem as data contain labeled and unlabeled examples sampled from the same population. To carry out such semi-supervised co-selection, we propose a unified framework, called sCOs, which efficiently integrates labeled and unlabeled parts into the co-selection process. The framework is based on introducing both a sparse regularization term and a similarity preserving approach. It evaluates the usefulness of features and instances in order to select the most relevant ones, simultaneously. We propose two efficient algorithms that work for both convex and nonconvex functions. To the best of our knowledge, this paper offers, for the first time ever, a study utilizing nonconvex penalties for the co-selection of semi-supervised learning tasks. Experimental results on some known benchmark datasets are provided for validating sCOs and comparing it with some representative methods in the state-of-the art.
引用
收藏
页码:2899 / 2911
页数:13
相关论文
共 52 条
  • [1] Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling
    Alizadeh, AA
    Eisen, MB
    Davis, RE
    Ma, C
    Lossos, IS
    Rosenwald, A
    Boldrick, JG
    Sabet, H
    Tran, T
    Yu, X
    Powell, JI
    Yang, LM
    Marti, GE
    Moore, T
    Hudson, J
    Lu, LS
    Lewis, DB
    Tibshirani, R
    Sherlock, G
    Chan, WC
    Greiner, TC
    Weisenburger, DD
    Armitage, JO
    Warnke, R
    Levy, R
    Wilson, W
    Grever, MR
    Byrd, JC
    Botstein, D
    Brown, PO
    Staudt, LM
    [J]. NATURE, 2000, 403 (6769) : 503 - 511
  • [2] Allab K, 2011, LECT NOTES ARTIF INT, V6911, P28, DOI 10.1007/978-3-642-23780-5_12
  • [3] [Anonymous], 2000, Pattern Classification, DOI DOI 10.1007/978-3-319-57027-3_4
  • [4] [Anonymous], 2013, P SIAM INT C DAT MIN
  • [5] A review of instance selection methods
    Arturo Olvera-Lopez, J.
    Ariel Carrasco-Ochoa, J.
    Francisco Martinez-Trinidad, J.
    Kittler, Josef
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2010, 34 (02) : 133 - 143
  • [6] Ensemble constrained Laplacian score for efficient and robust semi-supervised feature selection
    Benabdeslem, Khalid
    Elghazel, Haytham
    Hindawi, Mohammed
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 49 (03) : 1161 - 1185
  • [7] Benabdeslem K, 2011, LECT NOTES ARTIF INT, V6911, P204, DOI 10.1007/978-3-642-23780-5_23
  • [8] Distributed optimization and statistical learning via the alternating direction method of multipliers
    Boyd S.
    Parikh N.
    Chu E.
    Peleato B.
    Eckstein J.
    [J]. Foundations and Trends in Machine Learning, 2010, 3 (01): : 1 - 122
  • [9] Chapelle O., 2006, Semi-Supervised Learning, P3
  • [10] Evolutionary feature and instance selection for traffic sign recognition
    Chen, Zong-Yao
    Lin, Wei-Chao
    Ke, Shih-Wen
    Tsai, Chih-Fong
    [J]. COMPUTERS IN INDUSTRY, 2015, 74 : 201 - 211