Enhancing Graph-Based Semisupervised Learning via Knowledge-Aware Data Embedding

Cited by: 8
Authors
Ienco, Dino [1, 2]
Pensa, Ruggero G. [3]
Affiliations
[1] Univ Montpellier, INRAE, TETIS, F-34090 Montpellier, France
[2] LIRMM, F-34090 Montpellier, France
[3] Univ Turin, Dept Comp Sci, I-10149 Turin, Italy
Keywords
Task analysis; Pipelines; Standards; Data models; Semisupervised learning; Training; Decoding; Autoencoders; data embedding; ensembles; semisupervised learning (SSL)
DOI
10.1109/TNNLS.2019.2955565
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Semisupervised learning (SSL) is a family of classification methods conceived to reduce the amount of labeled information required in the training phase. Graph-based methods are among the most popular semisupervised strategies: a nearest-neighbor graph is built so that it captures the manifold of the data, and the labeled information is propagated to target samples along the structure of the manifold. Research in graph-based SSL has mainly focused on two aspects: 1) the construction of the k-nearest neighbors graph and/or 2) the propagation algorithm providing the classification. Unlike the previous literature, this article focuses on the data representation, with the aim of incorporating semisupervision earlier in the process. To this end, we propose an algorithm that learns a new knowledge-aware data embedding via an ensemble of semisupervised autoencoders to enhance graph-based semisupervised classification. Experiments carried out on different classification tasks demonstrate the benefit of our approach.
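The generic graph-based pipeline mentioned in the abstract (build a k-nearest neighbors graph, then propagate the known labels along it) can be illustrated with a minimal sketch. This is not the algorithm proposed in the article, which learns an embedding with an ensemble of semisupervised autoencoders before this stage; it only shows the two standard steps on raw coordinates. The function names `knn_graph` and `propagate_labels`, the toy points, and the simple neighbor-averaging update are all assumptions made for illustration.

```python
import math


def knn_graph(points, k):
    """Build a symmetrized k-nearest-neighbor adjacency list (Euclidean distance)."""
    n = len(points)
    neighbors = [set() for _ in range(n)]
    for i in range(n):
        # Sort all other points by distance to point i.
        dists = sorted(
            (math.dist(points[i], points[j]), j) for j in range(n) if j != i
        )
        for _, j in dists[:k]:
            neighbors[i].add(j)
            neighbors[j].add(i)  # make the edge undirected
    return neighbors


def propagate_labels(neighbors, labels, n_classes, n_iter=50):
    """Iteratively average class scores over neighbors, clamping labeled nodes.

    labels: dict mapping a node index to its known class (hypothetical format).
    Returns one predicted class per node.
    """
    n = len(neighbors)
    scores = [[0.0] * n_classes for _ in range(n)]
    for i, c in labels.items():
        scores[i][c] = 1.0
    for _ in range(n_iter):
        new_scores = []
        for i in range(n):
            if i in labels:
                new_scores.append(scores[i][:])  # labeled nodes keep their label
                continue
            new_scores.append([
                sum(scores[j][c] for j in neighbors[i]) / len(neighbors[i])
                for c in range(n_classes)
            ])
        scores = new_scores
    return [max(range(n_classes), key=lambda c: scores[i][c]) for i in range(n)]


# Toy data (assumed): two well-separated clusters, one labeled point in each.
points = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (5.0, 5.0), (5.1, 5.2), (5.2, 5.1)]
labels = {0: 0, 3: 1}
predicted = propagate_labels(knn_graph(points, k=2), labels, n_classes=2)
print(predicted)
```

In this sketch the quality of the result depends entirely on the distances used to build the graph, which is the stage the article targets by replacing the raw representation with a learned, knowledge-aware embedding.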
Pages: 5014-5020
Number of pages: 7