A Gradient-Based Clustering for Multi-Database Mining

被引:3
|
作者
Miloudi, Salim [1 ]
Wang, Yulin [1 ]
Ding, Wenjia [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
关键词
Databases; Itemsets; Clustering algorithms; Data models; Prototypes; Computer science; Computational modeling; Multi-database mining; graph clustering; dual gradient descent; quasi-convex optimization; similarity measure; HIGH-FREQUENCY RULES; INTERESTING PATTERNS; ITEM RECOMMENDATION; ALGORITHMS; CLASSIFICATION;
D O I
10.1109/ACCESS.2021.3050404
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multinational corporations have multiple databases distributed throughout their branches, which store millions of transactions per day. For business applications, identifying disjoint clusters of similar and relevant databases contributes to learning the common buying patterns among customers and also increases the profits by targeting potential clients in the future. This process is called clustering, which is an important unsupervised technique for big data mining. In this article, we present an effective approach to search for the optimal clustering of multiple transaction databases in a weighted undirected similarity graph. To assess the clustering quality, we use dual gradient descent to minimize a constrained quasi-convex loss function whose parameters will determine the edges needed to form the optimal database clusters in the graph. Therefore, finding the global minimum is guaranteed in a finite and short time compared with the existing non-convex objectives where all possible candidate clusterings are generated to find the ideal clustering. Moreover, our algorithm does not require specifying the number of clusters a priori and uses a disjoint-set forest data structure to maintain and keep track of the clusters as they are updated. Through a series of experiments on public data samples and precomputed similarity matrices, we show that our algorithm is more accurate and faster in practice than the existing clustering algorithms for multi-database mining.
引用
收藏
页码:11144 / 11172
页数:29
相关论文
共 50 条
  • [21] Gradient-based iteration for a class of matrix equations
    Zhang, Huamin
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 1201 - 1205
  • [22] CLSPN is a potential biomarker associated with poor prognosis in low-grade gliomas based on a multi-database analysis
    Jia, Yulong
    Cheng, Xingbo
    Liang, Wenjia
    Lin, Shaochong
    Li, Pengxu
    Yan, Zhaoyue
    Zhang, Meng
    Ma, Wen
    Hu, Chenchen
    Wang, Baoya
    Liu, Zhendong
    CURRENT RESEARCH IN TRANSLATIONAL MEDICINE, 2022, 70 (04)
  • [23] Gradient-based contour encoding for character recognition
    Srikantan, G
    Lam, SW
    Srihari, SN
    PATTERN RECOGNITION, 1996, 29 (07) : 1147 - 1160
  • [24] GRADIENT-BASED SPARSE APPROXIMATION FOR COMPUTED TOMOGRAPHY
    Sakhaee, Elham
    Arreola, Manuel
    Entezari, Alireza
    2015 IEEE 12TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2015, : 1608 - 1611
  • [25] Conjugate gradient-based active noise cancellation
    de Souza, Jose Victor Goncalez
    Andrade, Fabio Augusto de Alcantara
    Pinto, Milena Faria
    Kar, Asutosh
    de Barros, Ana Lucia Ferreira
    Haddad, Diego Barreto
    ELECTRONICS LETTERS, 2024, 60 (12)
  • [26] Explainability of Speech Recognition Transformers via Gradient-Based Attention Visualization
    Sun, Tianli
    Chen, Haonan
    Hu, Guosheng
    He, Lianghua
    Zhao, Cairong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1395 - 1406
  • [27] A gradient-based approach for discrete optimum design
    Li, Yanyan
    Tan, Tao
    Li, Xingsi
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2010, 41 (06) : 881 - 892
  • [28] Gradient-Based Multi-Objective Feature Selection for Gait Mode Recognition of Transfemoral Amputees
    Khademi, Gholamreza
    Mohammadi, Hanieh
    Simon, Dan
    SENSORS, 2019, 19 (02):
  • [29] An efficient binary Gradient-based optimizer for feature selection
    Jiang, Yugui
    Luo, Qifang
    Wei, Yuanfei
    Abualigah, Laith
    Zhou, Yongquan
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (04) : 3813 - 3854
  • [30] XGrad: Boosting Gradient-Based Optimizers With Weight Prediction
    Guan, Lei
    Li, Dongsheng
    Shi, Yanqi
    Meng, Jian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (10) : 6731 - 6747