Detecting stock market manipulation via machine learning: Evidence from China Securities Regulatory Commission punishment cases

被引:27
作者
Liu, Qingbai [1 ]
Wang, Chuanjie [2 ]
Zhang, Ping [3 ]
Zheng, Kaixin [2 ]
机构
[1] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China
[2] Fudan Univ, Sch Econ, Shanghai, Peoples R China
[3] Fudan Univ, Sch Journalism, 400 Guoding Rd, Shanghai, Peoples R China
关键词
Market manipulation; Machine learning; Support vector machine; Sentiment indicator; Borderline SMOTE; IMBALANCED DATA SETS; PRICE MANIPULATION; INVESTOR SENTIMENT; ASYMMETRIC INFORMATION; BINARY CLASSIFIERS; ANALYST COVERAGE; CLASSIFICATION; SVM; VOLATILITY; PREDICTION;
D O I
10.1016/j.irfa.2021.101887
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
In this paper, we apply machine-learning techniques to construct detecting models of stock market manipulation. By combining manually collected China Securities Regulatory Commission punishment cases from 2014 to 2016 with financial information of listed companies, we construct a training set and a test set to compare the detecting ability of support vector machine (SVM) and logistic model. Considering imbalanced data, we further incorporate Borderline Synthetic Minority Oversampling Technique (Borderline SMOTE) to oversample minority class and then find that Borderline SMOTE-SVM performs better than SVM and benchmark model in detecting manipulation. To enhance detecting performance of the models, we innovatively introduce market sentiment indicators which are extracted from analyst rating reports, financial news, and Guba comments into our indicators set. The results indicate that the new indicators generate significant marginal increment to the model accuracy.
引用
收藏
页数:11
相关论文
共 51 条
[1]   Stock market manipulations [J].
Aggarwal, Rajesh K. ;
Wu, Guojun .
JOURNAL OF BUSINESS, 2006, 79 (04) :1915-1953
[2]   A Worldwide Examination of Exchange Market Quality: Greater Integrity Increases Market Efficiency [J].
Aitken, Michael J. ;
Harris, Frederick H. de B. ;
Ji, Shan .
JOURNAL OF BUSINESS ETHICS, 2015, 132 (01) :147-170
[3]  
Ali A., 2013, INT J ADV SOFT COMPU, V5
[4]   STOCK-PRICE MANIPULATION [J].
ALLEN, F ;
GALE, D .
REVIEW OF FINANCIAL STUDIES, 1992, 5 (03) :503-529
[5]   Investor sentiment and the cross-section of stock returns [J].
Baker, Malcolm ;
Wurgler, Jeffrey .
JOURNAL OF FINANCE, 2006, 61 (04) :1645-1680
[6]   Support vector machines for credit scoring and discovery of significant features [J].
Bellotti, Tony ;
Crook, Jonathan .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :3302-3308
[7]  
Bianchi D, 2021, REV FINANC STUD, V34, P1046, DOI 10.1093/rfs/hhaa062
[8]   A comprehensive survey on support vector machine classification: Applications, challenges and trends [J].
Cervantes, Jair ;
Garcia-Lamont, Farid ;
Rodriguez-Mazahua, Lisbeth ;
Lopez, Asdrubal .
NEUROCOMPUTING, 2020, 408 :189-215
[9]   Data selection based on decision tree for SVM classification on large data sets [J].
Cervantes, Jair ;
Garcia Lamont, Farid ;
Lopez-Chau, Asdrubal ;
Rodriguez Mazahua, Lisbeth ;
Sergio Ruiz, J. .
APPLIED SOFT COMPUTING, 2015, 37 :787-798
[10]   Analyst coverage and financing decisions [J].
Chang, Xin ;
Dasgupta, Sudipto ;
Hilary, Gilles .
JOURNAL OF FINANCE, 2006, 61 (06) :3009-3048