Online MEM Based Binary Classification Algorithm for China Mobile Imbalanced Dataset

被引:0
|
作者
Lin, Shuqing [1 ,2 ]
Yin, Feng [2 ,3 ]
Lin, Zhidi [1 ,2 ]
Lin, Yanbin [1 ,2 ]
Cui, Shuguang [2 ,3 ]
Li, Teng [4 ]
Yu, Fengli [4 ]
Yu, Wei [4 ]
Hong, Xuemin [1 ]
Shi, Jianghong [1 ]
Luo, Zhi-Quan [2 ,3 ]
机构
[1] Xiamen Univ, Sch Informat Sci & Engn, Xiamen, Peoples R China
[2] SRIBD, Shenzhen, Peoples R China
[3] Chinese Univ Hong Kong Shenzhen, Shenzhen, Peoples R China
[4] China Mobile Informat Technol Co Ltd Shenzhen, Shenzhen, Peoples R China
来源
2018 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC) | 2018年
关键词
Binary Classification; China Mobile; Imbalanced Dataset; MEM Algorithm; Online Algorithm; DATA SETS; SMOTE;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Driven by a plethora of real machine learning applications, there have been many attempts at improving the performance of a classifier applied to imbalanced dataset. In this paper we propose a maximum entropy machine (MEM) based hybrid algorithm to handle binary classification problems with high imbalance ratios and large numbers of features in the datasets. At the training stage, we combine an efficient MEM algorithm with the SMOTE algorithm to build a classifier in a batch manner. At the application stage, the different-cost strategy is incorporated into the MEM algorithm to handle the imbalance learning problem in an online manner. Experiments are conducted based on various real datasets (including one China Mobile dataset and several other standard test datasets) with different imbalance ratios and different numbers of features. The results show that the proposed algorithm outperforms the state-of-the-art algorithms significantly in terms of robustness and overall classification performance.
引用
收藏
页码:68 / 73
页数:6
相关论文
共 50 条
  • [1] Bronze Inscriptions Classification Algorithm On Imbalanced Dataset
    He, Jiayuan
    Zhu, Qingting
    Chen, Youguang
    Nie, Fan
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1715 - 1718
  • [2] A Classification Algorithm Based on Ensemble Feature Selections for Imbalanced-Class Dataset
    Yin, Hua
    Gai, Keke
    Wang, Zhijian
    2016 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY), IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING (HPSC), AND IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2016, : 245 - 249
  • [3] Downsampling for Binary Classification with a Highly Imbalanced Dataset Using Active Learning
    Lee, Wonjae
    Seo, Kangwon
    BIG DATA RESEARCH, 2022, 28
  • [4] An Improved Random Forest Algorithm for classification in an imbalanced dataset.
    Jose, Christy
    Gopakumar, G.
    2019 URSI ASIA-PACIFIC RADIO SCIENCE CONFERENCE (AP-RASC), 2019,
  • [5] Optimizing Kernel Transformations to Handle Binary Class Imbalanced Dataset Classification
    Patel, Vaibhavi
    Bhavsar, Hetal
    APPLIED ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [6] Classification of imbalanced dataset based on random walk model
    Wang Y.
    Qiao P.
    Sun G.
    Fan K.
    Zeng X.
    Revue d'Intelligence Artificielle, 2019, 33 (02): : 89 - 95
  • [7] Improving Emotion Classification in Imbalanced YouTube Dataset Using SMOTE Algorithm
    Sarakit, Phakhawat
    Theeramunkong, Thanaruk
    Haruechaiyasak, Choochart
    2015 2ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS: CONCEPTS, THEORY AND APPLICATIONS ICAICTA, 2015,
  • [8] An overlapping minimization-based over-sampling algorithm for binary imbalanced classification
    Lu, Xuan
    Ye, Xuan
    Cheng, Yingchao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [9] Classification For Imbalanced Dataset Based on Biased Empirical Feature Mapping
    Yang Zhiming
    Yang Yu
    Gang Wang
    2012 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC), 2012, : 1645 - 1649
  • [10] An Imbalanced Data Classification Algorithm Based on Boosting
    Li Qiu-Jie
    Mao Yao-Bin
    Wang Zhi-Quan
    2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 3053 - 3057