Probabilistic support vector machines for classification of noise affected data

Cited by: 53
Authors
Li, Han-Xiong [1, 2]
Yang, Jing-Lin [1, 3]
Zhang, Geng [2]
Fan, Bi [1]
Affiliations
[1] City Univ Hong Kong, Dept Syst Eng & Eng Management, Hong Kong, Hong Kong, Peoples R China
[2] Cent S Univ, Sch Mech & Elect Engn, Changsha, Hunan, Peoples R China
[3] So Power Grid Co Ltd, Guangzhou Branch Extra High Voltage Power Transmi, Guangzhou, Guangdong, Peoples R China
Keywords
SVM; Classification; PCA based sampling; Probabilistic distribution; REGRESSION; SYSTEM; BOOTSTRAP; FRAMEWORK; MODEL;
DOI
10.1016/j.ins.2012.09.041
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Support vector machines (SVMs) have gained visibility and been thoroughly studied in the machine learning community. However, their performance is sensitive to noisy data, and they may not be effective when the noise level is high. Because noise makes the separating margin of an SVM a stochastic variable, a probabilistic support vector machine (PSVM) is proposed to capture the probabilistic information of the separating margin and to formulate the decision function in such a noisy environment. First, all data are clustered, and different subsets are then formed by PCA-based sampling; a distributed SVM system is then constructed to estimate the separating margin for each subset. Next, a quadratic optimization problem is solved, using the probabilistic information extracted from the separating margins, to determine the decision function. The confidence of the decision can be estimated using the weighted average of the probabilities of the cluster centers. An artificial dataset and four real-life datasets from the UCI machine learning database are used to demonstrate the effectiveness of the proposed probabilistic SVM. (C) 2012 Elsevier Inc. All rights reserved.
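The idea that noise turns the separating margin into a stochastic variable can be illustrated with a minimal sketch. It is not the authors' PSVM. It assumes plain random subsets in place of the clustering and PCA-based sampling, a Pegasos-style linear SVM through the origin in place of the distributed SVM system, and a simple majority vote in place of the quadratic optimization step. All function names and the toy data are hypothetical.

```python
import math
import random
import statistics


def train_linear_svm(X, y, lam=0.01, epochs=20, rng=None):
    """Pegasos-style sub-gradient descent for a linear SVM through the origin."""
    rng = rng or random.Random(0)
    w = [0.0] * len(X[0])
    order = list(range(len(X)))
    t = 0
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            score = y[i] * sum(wj * xj for wj, xj in zip(w, X[i]))
            w = [(1.0 - eta * lam) * wj for wj in w]  # shrink (L2 regularizer)
            if score < 1.0:  # hinge loss is active: step toward the sample
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
    return w


def margin_distribution(X, y, n_subsets=10, subset_size=40, seed=0):
    """Train one SVM per random subset and record each margin 2 / ||w||.

    Assumption: random subsets stand in for the PCA-based sampling.
    """
    rng = random.Random(seed)
    models, margins = [], []
    for _ in range(n_subsets):
        idx = rng.sample(range(len(X)), subset_size)
        w = train_linear_svm([X[i] for i in idx], [y[i] for i in idx], rng=rng)
        models.append(w)
        margins.append(2.0 / math.sqrt(sum(wj * wj for wj in w)))
    return models, margins


def predict(models, x):
    """Majority vote over the subset models; confidence is the vote share.

    Assumption: voting stands in for the paper's optimization-based decision.
    """
    votes = [1 if sum(wj * xj for wj, xj in zip(w, x)) >= 0 else -1 for w in models]
    label = 1 if sum(votes) >= 0 else -1
    return label, votes.count(label) / len(votes)


def make_toy_data(n_per_class=100, seed=1):
    """Two classes centered at (2, 2) and (-2, -2) with bounded uniform noise."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (1, -1):
        for _ in range(n_per_class):
            X.append([2.0 * label + rng.uniform(-1, 1),
                      2.0 * label + rng.uniform(-1, 1)])
            y.append(label)
    return X, y


X, y = make_toy_data()
models, margins = margin_distribution(X, y)
print("mean margin:", statistics.mean(margins))
print("std of margin:", statistics.stdev(margins))
print("prediction for (3, 3):", predict(models, [3.0, 3.0]))
```

The spread of `margins` across subsets gives a rough empirical picture of the margin as a random quantity; the method in the abstract goes further by feeding that probabilistic information into the optimization that determines the decision function.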
Pages: 60-71
Number of pages: 12
Related papers
50 in total
  • [1] Multiclass Probabilistic Classification for Support Vector Machines
    Bae, Ji-Sang
    Kim, Jong-Ok
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (06): 1251-1255
  • [2] Support vector machines for classification of hyperspectral data
    Gualtieri, JA
    Chettri, S
    IGARSS 2000: IEEE 2000 INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOL I - VI, PROCEEDINGS, 2000: 813-815
  • [3] Probabilistic Classification Vector Machines
    Chen, Huanhuan
    Tino, Peter
    Yao, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (06): 901-914
  • [4] Fusion of support vector machines for classification of multisensor data
    Waske, Bjoern
    Benediktsson, Jon Atli
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2007, 45 (12): 3858-3866
  • [5] Data mining with parallel support vector machines for classification
    Eitrich, Tatjana
    Lang, Bruno
    ADVANCES IN INFORMATION SYSTEMS, PROCEEDINGS, 2006, 4243: 197-206
  • [6] Classification of fuzzy data based on the support vector machines
    Forghani, Yahya
    Yazdi, Hadi Sadoghi
    Effati, Sohrab
    EXPERT SYSTEMS, 2013, 30 (05): 403-417
  • [7] Classification of electronic nose data with support vector machines
    Pardo, M
    Sberveglieri, G
    SENSORS AND ACTUATORS B-CHEMICAL, 2005, 107 (02): 730-737
  • [8] Probabilistic Classification Vector Machines for Multiclass Classification
    Qian, Xusheng
    Huang, He
    Hu, Jisu
    Zhou, Zhiyong
    Geng, Chen
    Dai, Yakang
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019: 1028-1032
  • [9] Probabilistic methods for Support Vector Machines
    Sollich, P
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12: 349-355
  • [10] The Use of Multiclass Support Vector Machines and Probabilistic Neural Networks for Signal Classification and Noise Detection in PLC/OFDM Channels
    Baroud, Dalal H.
    Hasan, Ali N.
    Shongwe, T.
    PROCEEDINGS OF THE 2020 30TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2020: 41-46