A Supervised Learning Model for High-Dimensional and Large-Scale Data

被引:25
|
作者
Peng, Chong [1 ]
Cheng, Jie [2 ]
Cheng, Qiang [1 ]
机构
[1] Southern Illinois Univ, Dept Comp Sci, Carbondale, IL 62901 USA
[2] Univ Hawaii, Dept Comp Sci & Engn, Hilo, HI 96720 USA
基金
美国国家科学基金会;
关键词
Discriminative regression; supervised learning; classification; high dimension; large-scale data; NEWTON METHOD; CLASSIFICATION;
D O I
10.1145/2972957
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a new supervised learning model using a discriminative regression approach. This new model estimates a regression vector to represent the similarity between a test example and training examples while seamlessly integrating the class information in the similarity estimation. This distinguishes our model from usual regression models and locally linear embedding approaches, rendering our method suitable for supervised learning problems in high-dimensional settings. Our model is easily extensible to account for nonlinear relationship and applicable to general data, including both high-and low-dimensional data. The objective function of the model is convex, for which two optimization algorithms are provided. These two optimization approaches induce two scalable solvers that are of mathematically provable, linear time complexity. Experimental results verify the effectiveness of the proposed method on various kinds of data. For example, our method shows comparable performance on low-dimensional data and superior performance on high-dimensional data to several widely used classifiers; also, the linear solvers obtain promising performance on large-scale classification.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Supervised Papers Classification on Large-Scale High-Dimensional Data with Apache Spark
    Akritidis, Leonidas
    Bozanis, Panayiotis
    Fevgas, Athanasios
    2018 16TH IEEE INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP, 16TH IEEE INT CONF ON PERVAS INTELLIGENCE AND COMP, 4TH IEEE INT CONF ON BIG DATA INTELLIGENCE AND COMP, 3RD IEEE CYBER SCI AND TECHNOL CONGRESS (DASC/PICOM/DATACOM/CYBERSCITECH), 2018, : 987 - 994
  • [2] Visualizing Large-scale and High-dimensional Data
    Tang, Jian
    Liu, Jingzhou
    Zhang, Ming
    Mei, Qiaozhu
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 287 - 297
  • [3] Asynchronous Distributed ADMM for Learning with Large-Scale and High-Dimensional Sparse Data Set
    Wang, Dongxia
    Lei, Yongmei
    ADVANCED HYBRID INFORMATION PROCESSING, ADHIP 2019, PT II, 2019, 302 : 259 - 274
  • [4] High-dimensional MRI data analysis using a large-scale manifold learning approach
    Loc Tran
    Debrup Banerjee
    Jihong Wang
    Ashok J. Kumar
    Frederic McKenzie
    Yaohang Li
    Jiang Li
    Machine Vision and Applications, 2013, 24 : 995 - 1014
  • [5] High-dimensional MRI data analysis using a large-scale manifold learning approach
    Tran, Loc
    Banerjee, Debrup
    Wang, Jihong
    Kumar, Ashok J.
    McKenzie, Frederic
    Li, Yaohang
    Li, Jiang
    MACHINE VISION AND APPLICATIONS, 2013, 24 (05) : 995 - 1014
  • [6] Fast Low-rank Metric Learning for Large-scale and High-dimensional Data
    Liu, Han
    Han, Zhizhong
    Liu, Yu-Shen
    Gu, Ming
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [7] RECURSIVE REDUCTION NET FOR LARGE-SCALE HIGH-DIMENSIONAL DATA
    Ke, Tsung-Wei
    Liu, Tyng-Luh
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1903 - 1907
  • [8] Feature screening with large-scale and high-dimensional survival data
    Yi, Grace Y.
    He, Wenqing
    Carroll, Raymond. J.
    BIOMETRICS, 2022, 78 (03) : 894 - 907
  • [9] Machine learning of large-scale spatial distributions of wild turkeys with high-dimensional environmental data
    Farrell, Annie
    Wang, Guiming
    Rush, Scott A.
    Martin, James A.
    Belant, Jerrold L.
    Butler, Adam B.
    Godwin, Dave
    ECOLOGY AND EVOLUTION, 2019, 9 (10): : 5938 - 5949
  • [10] Visualizing the Finer Cluster Structure of Large-Scale and High-Dimensional Data
    Liang, Yu
    Chaudhuri, Arin
    Wang, Haoyu
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, 2021, 12817 : 361 - 372