An Integrated Cluster Detection, Optimization, and Interpretation Approach for Financial Data

被引:160
作者
Li, Tie [1 ]
Kou, Gang [2 ]
Peng, Yi [1 ]
Yu, Philip S. [3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Management & Econ, Chengdu 611731, Peoples R China
[2] Southwestern Univ Finance & Econ, Sch Business Adm, Chengdu 610074, Peoples R China
[3] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Clustering algorithms; Data models; Correlation; Shape; Optimization; Laplace equations; Feature extraction; Clustering methods; data mining; financial management; spectral analysis; VALIDATION; ALGORITHM;
D O I
10.1109/TCYB.2021.3109066
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In many financial applications, such as fraud detection, reject inference, and credit evaluation, detecting clusters automatically is critical because it helps to understand the subpatterns of the data that can be used to infer user's behaviors and identify potential risks. Due to the complexity of human behaviors and changing social environments, the distributions of financial data are usually complex and it is challenging to find clusters and give reasonable interpretations. The goal of this study is to develop an integrated approach to detect clusters in financial data, and optimize the scope of the clusters such that the clusters can be easily interpreted. Specifically, we first proposed a new cluster quality evaluation criterion, which is free from large-scale computation and can guide base clustering algorithms such as k-Means to detect hyperellipsoidal clusters adaptively. Then, we designed a new solver for a revised support vector data description model, which efficiently refines the centroids and scopes of the detected clusters to make the clusters tighter such that the data in the clusters share greater similarities, and thus, the clusters can be easily interpreted with eigenvectors. Using ten financial datasets, the experiments showed that the proposed algorithm can efficiently find reasonable number of clusters. The proposed approach is suitable for large-scale financial datasets whose features are meaningful, and also applicable to financial mining tasks, such as data distribution interpretation and anomaly detection.
引用
收藏
页码:13848 / 13861
页数:14
相关论文
共 50 条
  • [21] Mixture-based Cluster Detection in Driving-Related Data
    Nagy, Ivan
    Suzdaleva, Evgenia
    Pecherkova, Pavla
    Urbaniec, Krzysztof
    2015 SMART CITIES SYMPOSIUM PRAGUE (SCSP), 2015,
  • [22] An optimization approach to epistasis detection
    Wang, Lizhi
    Mehr, Maryam Nikouei
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 274 (03) : 1069 - 1076
  • [23] An integrated approach for aircraft turbofan engine fault detection based on data mining techniques
    Gharoun, Hassan
    Keramati, Abbas
    Nasiri, Mohammad Mahdi
    Azadeh, Ali
    EXPERT SYSTEMS, 2019, 36 (02)
  • [24] A Nonparametric Outlier Detection Method for Financial Data
    Qu Ji-lin
    Qin Wen
    Sai Ying
    Feng Yu-mei
    2009 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING (16TH), VOLS I AND II, CONFERENCE PROCEEDINGS, 2009, : 1442 - +
  • [25] A Link-Based Cluster Ensemble Approach for Categorical Data Clustering
    Iam-On, Natthakan
    Boongoen, Tossapon
    Garrett, Simon
    Price, Chris
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (03) : 413 - 425
  • [26] An optimization approach to financial and operational viabilities of spare aircraft
    Bazargan, Massoud
    Orhan, Ilkay
    AIRCRAFT ENGINEERING AND AEROSPACE TECHNOLOGY, 2023, 95 (08) : 1237 - 1246
  • [27] Research on Optimization of Financial Management in Big Data Era
    Wang Hongni
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON SOCIAL SCIENCE AND TECHNOLOGY EDUCATION (ICSSTE 2015), 2015, 18 : 311 - 314
  • [28] An Integrated Approach to Geovisualize Epidemiological Data
    Meddah, Fatiha Guerroudji
    Ayouani, Yousra
    Meddah, Ishak H. A.
    INTERNATIONAL JOURNAL OF APPLIED GEOSPATIAL RESEARCH, 2022, 13 (01)
  • [29] Teaching public financial management: An integrated approach to a critical subject
    Thom, Michael
    TEACHING PUBLIC ADMINISTRATION, 2019, 37 (01) : 92 - 106
  • [30] An integrated approach for stock evaluation and portfolio optimization
    Kiris, Safak
    Ustun, Ozden
    OPTIMIZATION, 2012, 61 (04) : 423 - 441