Machine Learning Techniques for SIM Box Fraud Detection

被引:0
作者
Kashir, Mhair [1 ]
Bashir, Sajid [2 ]
机构
[1] Iqra Univ, Dept Comp, Islamabad, Pakistan
[2] Natl Univ Technol NUTECH, Dept Comp Engn Technol, Islamabad, Pakistan
来源
2019 INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGIES (COMTECH) | 2019年
关键词
Gray Traffic; Interconnect Bypass; Machine Learning; Neural Network; SIM Box Fraud; Support Vector Machine; Telecommunication;
D O I
10.1109/comtech.2019.8737828
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In today's competitive environment, telecommunication operators and service providers need to generate revenue by designing and delivering innovative services to the subscribers. At the same time, a prime consideration is to minimize the cost and prevent the revenue leakages. In this context, the industry faces numerous challenges and different types of frauds. There is a continuous effort to tackle this problem by improving the implementation methodology and the network protocols. However, in general, fraud detection is difficult and currently addressed in a proactive manner. Fraudulent callers exploit the weaknesses of specific protocol level solutions and avoid detection of the gray traffic. Our research work is inspired by classification algorithms used in machine learning and employed in different fields of science and engineering e.g. images processing, speech recognition, spam email detection etc.. We have applied these machine learning techniques (MLTs) for the classification of normal and fraudulent subscriber (SIM Box). We have used the call detail records (CDRs) of normal and fraudulent subscriber as an input to identify the important attributes; 25 for each customers. These attribute are used for classification of the normal and fraudulent subscribers using Neural Network (NN) and Support Vector Machine (SVM). A comparative performance analysis of both techniques is also presented using various evaluation parameters. SVM using the kernel (Polynomial, Radial, and Sigmoid) show best performance with an accuracy of 99.24 %. SVM Linear kernel show the worst performance with accuracy of 95.18 % and 0.19 regression. In case of NN, Bayesian Regularization and Resilient Back-Propagation algorithms show best and worst performance with an accuracy of 99.87 % and 99.53% respectively.
引用
收藏
页码:4 / 8
页数:5
相关论文
共 11 条
[1]  
[Anonymous], 2018, DAWN NEWS
[2]  
[Anonymous], 2013, The Nation
[3]  
Burge P, 1997, C PUBLICATION, V437
[4]  
Communication Fraud Control Association, 2017 GLOB FRAUD LOSS
[5]   Learning Sequential Behavior Representations for Fraud Detection [J].
Guo, Jia ;
Liu, Guannan ;
Zuo, Yuan ;
Wu, Junjie .
2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, :127-136
[6]  
Liu X., 2018, LECT NOTES COMPUTER
[7]  
Olszewski D., 2011, J KNOWLEDGE BASED SY
[8]  
Wu S., 2007, COMMUNICATIONS IIMA
[9]   Fraud detection in telecommunication: A rough fuzzy set based approach [J].
Xu, Wei ;
Pang, Ye ;
Ma, Jian ;
Wang, Shou-Yang ;
Hao, Gang ;
Zeng, Shuo ;
Qian, Yu-Hua .
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, :1249-+
[10]  
Zhou Chunlai, 2018 IEEE 8 ANN COMP