Exploiting covariate embeddings for classification using Gaussian processes

被引:1
|
作者
Andrade, Daniel [1 ]
Tamura, Akihiro [2 ]
Tsuchida, Masaaki [3 ]
机构
[1] NEC Corp Ltd, Secur Res Labs, Tokyo, Japan
[2] Ehime Univ, Grad Sch Sci & Engn, Matsuyama, Ehime, Japan
[3] DeNA Co Ltd, Tokyo, Japan
关键词
Logistic regression; Auxiliary information of covariates; Gaussian process; Text classification; TEXT CLASSIFICATION;
D O I
10.1016/j.patrec.2018.01.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many logistic regression tasks, auxiliary information about the covariates is available. For example, a user might be able to specify a similarity measure between the covariates, or an embedding (feature vector) for each covariate, which is created from unlabeled data. In particular for text classification, the covariates (words) can be described by word embeddings or similarity measures from lexical resources like WordNet. We propose a new method to use such embeddings of covariates for logistic regression. Our method consists of two main components. The first component is a Gaussian process (GP) with a covariance function that models the correlations between covariates, and returns a noise-free estimate of the covariates. The second component is a logistic regression model that uses these noise-free estimates. One advantage of our model is that the covariance function can be adjusted to the training data using maximum likelihood. Another advantage is that new covariates that never occurred in the training data can be incorporated at test time, while run-time increases only linearly in the number of new covariates. Our experiments demonstrate the usefulness of our method in situations when only small training data is available. (c) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:8 / 14
页数:7
相关论文
共 50 条
  • [1] Active Learning of Linear Embeddings for Gaussian Processes
    Garnett, Roman
    Osborne, Michael A.
    Hennig, Philipp
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2014, : 230 - 239
  • [2] Bayesian regression and classification using mixtures of Gaussian processes
    Shi, JQ
    Murray-Smith, R
    Titterington, DM
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2003, 17 (02) : 149 - 161
  • [3] Automatic Classification of Arrhythmic Beats Using Gaussian Processes
    Skolidis, G.
    Clayton, R. H.
    Sanguinetti, G.
    COMPUTERS IN CARDIOLOGY 2008, VOLS 1 AND 2, 2008, : 921 - 924
  • [4] Using Gaussian processes for human tracking and action classification
    Raskin, Leonid
    Rivlin, Ehud
    Rudzsky, Michael
    ADVANCES IN VISUAL COMPUTING, PT I, 2007, 4841 : 36 - +
  • [5] Human Motion Recognition using Gaussian Processes Classification
    Zhou, Hang
    Wang, Liang
    Suter, David
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3023 - 3026
  • [6] Skew Gaussian processes for classification
    Benavoli, Alessio
    Azzimonti, Dario
    Piga, Dario
    MACHINE LEARNING, 2020, 109 (9-10) : 1877 - 1902
  • [7] Skew Gaussian processes for classification
    Alessio Benavoli
    Dario Azzimonti
    Dario Piga
    Machine Learning, 2020, 109 : 1877 - 1902
  • [8] Bayesian classification with Gaussian processes
    Williams, CKI
    Barber, D
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (12) : 1342 - 1351
  • [9] Classification of Microorganisms via Raman Spectroscopy Using Gaussian Processes
    Kemmler, Michael
    Denzler, Joachim
    Roesch, Petra
    Popp, Juegen
    PATTERN RECOGNITION, 2010, 6376 : 81 - +
  • [10] Variational Mixtures of Gaussian Processes for Classification
    Luo, Chen
    Sun, Shiliang
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4603 - 4609