Graph Classification with Imbalanced Data Sets

被引:0
|
作者
Xiao, Gang-Song [1 ]
Chen, Xiao-Yun [1 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350002, Peoples R China
来源
2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR) | 2011年
关键词
Graph mining; Graph classification; Class imbalance; Cost-sensitive learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many graph classification methods have been proposed in recent years. These graph classification methods can perform well with balanced graph data sets, but perform poorly with imbalanced graph data sets. In this paper, we propose a new graph classification method based on cost sensitivity to deal with imbalance. First, we introduce a misclassification cost-matrix, and select the weighted subgraph based on the least misclassification cost as the attribute of graph. Then we build up a decision stump classifier and ensemble learning, finally obtain classify critical function to classify a new graph. Especially we prove that the supergraph of a weighted subgraph has an upper bound. And we can use the upper bound of supergraph to reduce the number of candidate subgraphs, so our method can be very efficient. Moreover we compare our method with other graph classification methods through experiment on imbalanced graph date sets.
引用
收藏
页码:57 / 61
页数:5
相关论文
共 50 条
  • [1] The Text Classification for Imbalanced Data Sets
    Li, Yanling
    Zhu, Yehang
    Yang, Ping
    ISISE 2008: INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING, VOL 2, 2008, : 778 - +
  • [2] Classification with local clustering in imbalanced data sets
    Ji, Hua
    Zhang, Huaxiang
    ADVANCED RESEARCH ON INFORMATION SCIENCE, AUTOMATION AND MATERIAL SYSTEM, PTS 1-6, 2011, 219-220 : 151 - 155
  • [3] Evaluation Measures of the Classification Performance of Imbalanced Data Sets
    Gu, Qiong
    Zhu, Li
    Cai, Zhihua
    COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, 2009, 51 : 461 - +
  • [4] An Improved Algorithm for SVMs Classification of Imbalanced Data Sets
    Castro, Cristiano Leite
    Carvalho, Mateus Araujo
    Braga, Antonio Padua
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, PROCEEDINGS, 2009, 43 : 108 - 118
  • [5] Classification of imbalanced marketing data with balanced random sets
    Nikulin, Vladimir
    McLachlan, Geoffrey J.
    Journal of Machine Learning Research, 2009, 7 : 89 - 100
  • [6] Improving SVM Classification on Imbalanced Data Sets in Distance Spaces
    Koeknar-Tezel, Suzan
    Latecki, Longin Jan
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 259 - +
  • [7] An experimental comparison of classification algorithms for imbalanced credit scoring data sets
    Brown, Iain
    Mues, Christophe
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (03) : 3446 - 3453
  • [8] An Effective Over-sampling Method for Imbalanced Data Sets Classification
    Zhai Yun
    Ma Nan
    Ruan Da
    An Bing
    CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (03): : 489 - 494
  • [9] Classification on Imbalanced Data Sets, Taking Advantage of Errors to Improve Performance
    Lopez-Chau, Asdrubal
    Garcia-Lamont, Farid
    Cervantes, Jair
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2015, PT III, 2015, 9227 : 72 - 78
  • [10] SVM classification for imbalanced data sets using a multiobjective optimization framework
    Askan, Aysegul
    Sayin, Serpil
    ANNALS OF OPERATIONS RESEARCH, 2014, 216 (01) : 191 - 203