A Supervised Learning Model for High-Dimensional and Large-Scale Data

被引:25
|
作者
Peng, Chong [1 ]
Cheng, Jie [2 ]
Cheng, Qiang [1 ]
机构
[1] Southern Illinois Univ, Dept Comp Sci, Carbondale, IL 62901 USA
[2] Univ Hawaii, Dept Comp Sci & Engn, Hilo, HI 96720 USA
基金
美国国家科学基金会;
关键词
Discriminative regression; supervised learning; classification; high dimension; large-scale data; NEWTON METHOD; CLASSIFICATION;
D O I
10.1145/2972957
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a new supervised learning model using a discriminative regression approach. This new model estimates a regression vector to represent the similarity between a test example and training examples while seamlessly integrating the class information in the similarity estimation. This distinguishes our model from usual regression models and locally linear embedding approaches, rendering our method suitable for supervised learning problems in high-dimensional settings. Our model is easily extensible to account for nonlinear relationship and applicable to general data, including both high-and low-dimensional data. The objective function of the model is convex, for which two optimization algorithms are provided. These two optimization approaches induce two scalable solvers that are of mathematically provable, linear time complexity. Experimental results verify the effectiveness of the proposed method on various kinds of data. For example, our method shows comparable performance on low-dimensional data and superior performance on high-dimensional data to several widely used classifiers; also, the linear solvers obtain promising performance on large-scale classification.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] Semi-supervised classifier ensemble model for high-dimensional data
    Niu, Xufeng
    Ma, Wenping
    INFORMATION SCIENCES, 2023, 643
  • [42] A fast classification strategy for SVM on the large-scale high-dimensional datasets
    I-Jing Li
    Jiunn-Lin Wu
    Chih-Hung Yeh
    Pattern Analysis and Applications, 2018, 21 : 1023 - 1038
  • [43] Large-scale Parallel Simulation of High-dimensional American Option Pricing
    Chang Hong-xu
    Lu Zhong-hua
    Chi Xue-bin
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2012, 6 (01) : 1 - 16
  • [44] Large-scale parallel simulation of high-dimensional american option pricing
    Chang, Hong-Xu
    Lu, Zhong-Hua
    Chi, Xue-Bin
    Journal of Algorithms and Computational Technology, 2012, 6 (01): : 1 - 16
  • [45] A study on high dimensional large-scale data visualization
    Lee, Eun-Kyung
    Hwang, Nayoung
    Lee, Yoondong
    KOREAN JOURNAL OF APPLIED STATISTICS, 2016, 29 (06) : 1061 - 1075
  • [46] Learning high-dimensional data
    Verleysen, M
    LIMITATIONS AND FUTURE TRENDS IN NEURAL COMPUTATION, 2003, 186 : 141 - 162
  • [47] CerebelluMorphic: Large-Scale Neuromorphic Model and Architecture for Supervised Motor Learning
    Yang, Shuangming
    Wang, Jiang
    Zhang, Nan
    Deng, Bin
    Pang, Yanwei
    Azghadi, Mostafa Rahimi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (09) : 4398 - 4412
  • [48] Large-scale supervised similarity learning in networks
    Shiyu Chang
    Guo-Jun Qi
    Yingzhen Yang
    Charu C. Aggarwal
    Jiayu Zhou
    Meng Wang
    Thomas S. Huang
    Knowledge and Information Systems, 2016, 48 : 707 - 740
  • [49] Large-scale supervised similarity learning in networks
    Chang, Shiyu
    Qi, Guo-Jun
    Yang, Yingzhen
    Aggarwal, Charu C.
    Zhou, Jiayu
    Wang, Meng
    Huang, Thomas S.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 48 (03) : 707 - 740
  • [50] Measuring large-scale market responses and forecasting aggregated sales: Regression for sparse high-dimensional data
    Terui, Nobuhiko
    Li, Yinxing
    JOURNAL OF FORECASTING, 2019, 38 (05) : 440 - 458