Exploiting covariate embeddings for classification using Gaussian processes

被引:1
|
作者
Andrade, Daniel [1 ]
Tamura, Akihiro [2 ]
Tsuchida, Masaaki [3 ]
机构
[1] NEC Corp Ltd, Secur Res Labs, Tokyo, Japan
[2] Ehime Univ, Grad Sch Sci & Engn, Matsuyama, Ehime, Japan
[3] DeNA Co Ltd, Tokyo, Japan
关键词
Logistic regression; Auxiliary information of covariates; Gaussian process; Text classification; TEXT CLASSIFICATION;
D O I
10.1016/j.patrec.2018.01.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many logistic regression tasks, auxiliary information about the covariates is available. For example, a user might be able to specify a similarity measure between the covariates, or an embedding (feature vector) for each covariate, which is created from unlabeled data. In particular for text classification, the covariates (words) can be described by word embeddings or similarity measures from lexical resources like WordNet. We propose a new method to use such embeddings of covariates for logistic regression. Our method consists of two main components. The first component is a Gaussian process (GP) with a covariance function that models the correlations between covariates, and returns a noise-free estimate of the covariates. The second component is a logistic regression model that uses these noise-free estimates. One advantage of our model is that the covariance function can be adjusted to the training data using maximum likelihood. Another advantage is that new covariates that never occurred in the training data can be incorporated at test time, while run-time increases only linearly in the number of new covariates. Our experiments demonstrate the usefulness of our method in situations when only small training data is available. (c) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:8 / 14
页数:7
相关论文
共 50 条
  • [11] OPTIMUM CLASSIFICATION OF NON-GAUSSIAN PROCESSES USING NEURAL NETWORKS
    BLACKNELL, D
    WHITE, RG
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1994, 141 (01): : 56 - 66
  • [12] Text classification using embeddings: a survey
    Liliane Soares da Costa
    Italo L. Oliveira
    Renato Fileto
    Knowledge and Information Systems, 2023, 65 : 2761 - 2803
  • [13] Text classification using embeddings: a survey
    da Costa, Liliane Soares
    Oliveira, Italo L.
    Fileto, Renato
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (07) : 2761 - 2803
  • [14] Bayesian learning of orthogonal embeddings for multi-fidelity Gaussian Processes
    Tsilifis, Panagiotis
    Pandita, Piyush
    Ghosh, Sayan
    Andreoli, Valeria
    Vandeputte, Thomas
    Wang, Liping
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2021, 386
  • [15] Nationality Classification Using Name Embeddings
    Ye, Junting
    Han, Shuchu
    Hu, Yifan
    Coskun, Baris
    Liu, Meizhu
    Qin, Hong
    Skiena, Steven
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1897 - 1906
  • [16] Exploiting Differential Flatness for Robust Learning-Based Tracking Control Using Gaussian Processes
    Greeff, Melissa
    Schoellig, Angela P.
    IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (04): : 1121 - 1126
  • [17] Text Classification Using Word Embeddings
    Helaskar, Mukund N.
    Sonawane, Sheetal S.
    2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2019,
  • [18] Exploiting Causality for Improved Prediction of Patient Volumes by Gaussian Processes
    Feng, Guanchao
    Yu, Kezi
    Wang, Yunlong
    Yuan, Yilian
    Djuric, Petar M.
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (07) : 2487 - 2496
  • [19] CONSTRAINED BAYESIAN OPTIMIZATION METHODS USING REGRESSION AND CLASSIFICATION GAUSSIAN PROCESSES AS CONSTRAINTS
    Jetton, Cole
    Li, Chengda
    Hoyle, Christopher
    PROCEEDINGS OF ASME 2023 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2023, VOL 3B, 2023,
  • [20] Fault Detection and Classification for Nonlinear Chemical Processes using Lasso and Gaussian Process
    Du, Yuncheng
    Budman, Hector
    Duever, Thomas A.
    Du, Dongping
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2018, 57 (27) : 8962 - 8977