Stock Price Manipulation Detection Using Empirical Mode Decomposition Based Kernel Density Estimation Clustering Method

被引:11
作者
Abbas, Baqar [1 ]
Belatreche, Ammar [1 ]
Bouridane, Ahmed [1 ]
机构
[1] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne, Tyne & Wear, England
来源
INTELLIGENT SYSTEMS AND APPLICATIONS, INTELLISYS, VOL 2 | 2019年 / 869卷
关键词
Anomaly detection; Stock price manipulation; Empirical mode decomposition; Kernel density estimation; Clustering; K-means; Principal component analysis; Dirichlet process Gaussian mixture model;
D O I
10.1007/978-3-030-01057-7_63
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stock market manipulation means illegitimate or illegal activities trying to influence the prices of stocks, hence diluting the legal definition of trading stocks. In this research, a model for detecting Stock price manipulation is presented for anomalies like Pump & Dump, Quote stuffing, Gouging or Spoof Trading. The model is presented on level 1-tick data which contains highly volatile time series and a high trading frequency making the detection more challenging. In literature, very less number of studies based on unsupervised learning for Stock market manipulation has been carried out. In addition, the existing studies focused only on specific anomalies and were not generalized enough to capture other anomalies. The research model used in this work uses unsupervised learning where the input data is decomposed using Empirical Mode Decomposition followed by Kernel Density Estimation based clustering technique for anomaly detection. One of the key advantages of this technique is receiver operating characteristic (ROC) curve, which is better than the currently available techniques and provides a maximum area under curve (AUC) equal to 0.96. The results in this work are also compared with existing benchmark approaches like K-means, Principal Component Analysis (PCA) based anomaly detection and Dirichlet process Gaussian Mixture Model (DPGMM) based anomaly detection, and a maximum improvement of 84% is obtained.
引用
收藏
页码:851 / 866
页数:16
相关论文
共 32 条
  • [21] Li S.Z., 2009, Encyclopedia of Biometrics, P953
  • [22] Lin X., 2012, TECHNICAL REPORT
  • [23] LOBSTER project Atlanta GA USA, 2012, LIM ORD BOOK SYST
  • [24] Empirical Mode Decomposition-Based Time-Frequency Analysis of Multivariate Signals
    Mandic, Danilo P.
    Rehman, Naveed Ur
    Wu, Zhaohua
    Huang, Norden E.
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2013, 30 (06) : 74 - 86
  • [25] Markov chain sampling methods for Dirichlet process mixture
    Neal, RM
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2000, 9 (02) : 249 - 265
  • [26] Trade-based manipulation: Beyond the prosecuted cases
    Neupane, Suman
    Rhee, S. Ghon
    Vithanage, Kulunu
    Veeraraghavan, Madhu
    [J]. JOURNAL OF CORPORATE FINANCE, 2017, 42 : 115 - 130
  • [27] Collusion set detection using graph clustering
    Palshikar, Girish Keshav
    Apte, Manoj M.
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2008, 16 (02) : 135 - 164
  • [28] Santos S.R., 2016, J APPL STAT, V44, P1
  • [29] Shyu ML, 2003, 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, P353
  • [30] Silverman BW, 2018, DENSITY ESTIMATION S, DOI [10.1201/9781315140919, DOI 10.1201/9781315140919]