Large-Scale Learning with Structural Kernels for Class-Imbalanced Datasets

被引:0
|
作者
Severyn, Aliaksei [1 ]
Moschitti, Alessandro [1 ]
机构
[1] Univ Trento, Dept Comp Sci & Engn, I-38123 Povo, TN, Italy
来源
ETERNAL SYSTEMS | 2012年 / 255卷
关键词
Machine Learning; Kernel Methods; Structural Kernels; Support Vector Machine; Natural Language Processing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Much of the success in machine learning can be attributed to the ability of learning methods to adequately represent, extract, and exploit inherent structure present in the data under interest. Kernel methods represent a rich family of techniques that harvest on this principle. Domain-specific kernels are able to exploit rich structural information present in the input data to deliver state of the art results in many application areas, e.g. natural language processing (NLP), bio-informatics, computer vision and many others. The use of kernels to capture relationships in the input data has made Support Vector Machine (SVM) algorithm the state of the art tool in many application areas. Nevertheless, kernel learning remains a computationally expensive process. The contribution of this paper is to make learning with structural kernels, e.g. tree kernels, more applicable to real-world large-scale tasks. More specifically, we propose two important enhancements of the approximate cutting plane algorithm to train Support Vector Machines with structural kernels: (i) a new sampling strategy to handle class-imbalanced problem; and (ii) a parallel implementation, which makes the training scale almost linearly with the number of CPUs. We also show that theoretical convergence bounds are preserved for the improved algorithm. The experimental evaluations demonstrate the soundness of our approach and the possibility to carry out large-scale learning with structural kernels.
引用
收藏
页码:34 / 41
页数:8
相关论文
共 50 条
  • [21] Class-Imbalanced Deep Learning via a Class-Balanced Ensemble
    Chen, Zhi
    Duan, Jiang
    Kang, Li
    Qiu, Guoping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5626 - 5640
  • [22] Classification method for failure modes of RC columns based on class-imbalanced datasets
    Yu, Bo
    Xie, Longlong
    Yu, Zecheng
    Cheng, Hao
    STRUCTURES, 2023, 48 : 694 - 705
  • [23] ALLIE: Active Learning on Large-scale Imbalanced Graphs
    Cui, Limeng
    Tang, Xianfeng
    Katariya, Sumeet
    Rao, Nikhil
    Agrawal, Pallav
    Subbian, Karthik
    Lee, Dongwon
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 690 - 698
  • [24] A Distributed Instance-weighted SVM Algorithm on Large-scale Imbalanced Datasets
    Wang, Xiaoguang
    Liu, Xuan
    Matwin, Stan
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014,
  • [25] A semi-supervised resampling method for class-imbalanced learning
    Jiang, Zhen
    Zhao, Lingyun
    Lu, Yu
    Zhan, Yongzhao
    Mao, Qirong
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 221
  • [26] A boosted co-training method for class-imbalanced learning
    Jiang, Zhen
    Zhao, Lingyun
    Zhan, Yongzhao
    EXPERT SYSTEMS, 2023, 40 (09)
  • [27] Class-Imbalanced Semi-Supervised Learning with Adaptive Thresholding
    Guo, Lan-Zhe
    Li, Yu-Feng
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [28] Learning from class-imbalanced data in wireless sensor networks
    Radivojac, P
    Korad, U
    Sivalingam, KM
    Obradovic, Z
    2003 IEEE 58TH VEHICULAR TECHNOLOGY CONFERENCE, VOLS1-5, PROCEEDINGS, 2003, : 3030 - 3034
  • [29] Learning from class-imbalanced data: Review of methods and applications
    Guo Haixiang
    Li Yijing
    Shang, Jennifer
    Gu Mingyun
    Huang Yuanyue
    Bing, Gong
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 73 : 220 - 239
  • [30] Weight Decision Algorithm for Oversampling Technique on Class-Imbalanced Learning
    Kang, Young-Il
    Won, Sangchul
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 182 - 186