A Gradient-Based Clustering for Multi-Database Mining

被引:3
|
作者
Miloudi, Salim [1 ]
Wang, Yulin [1 ]
Ding, Wenjia [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
关键词
Databases; Itemsets; Clustering algorithms; Data models; Prototypes; Computer science; Computational modeling; Multi-database mining; graph clustering; dual gradient descent; quasi-convex optimization; similarity measure; HIGH-FREQUENCY RULES; INTERESTING PATTERNS; ITEM RECOMMENDATION; ALGORITHMS; CLASSIFICATION;
D O I
10.1109/ACCESS.2021.3050404
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multinational corporations have multiple databases distributed throughout their branches, which store millions of transactions per day. For business applications, identifying disjoint clusters of similar and relevant databases contributes to learning the common buying patterns among customers and also increases the profits by targeting potential clients in the future. This process is called clustering, which is an important unsupervised technique for big data mining. In this article, we present an effective approach to search for the optimal clustering of multiple transaction databases in a weighted undirected similarity graph. To assess the clustering quality, we use dual gradient descent to minimize a constrained quasi-convex loss function whose parameters will determine the edges needed to form the optimal database clusters in the graph. Therefore, finding the global minimum is guaranteed in a finite and short time compared with the existing non-convex objectives where all possible candidate clusterings are generated to find the ideal clustering. Moreover, our algorithm does not require specifying the number of clusters a priori and uses a disjoint-set forest data structure to maintain and keep track of the clusters as they are updated. Through a series of experiments on public data samples and precomputed similarity matrices, we show that our algorithm is more accurate and faster in practice than the existing clustering algorithms for multi-database mining.
引用
收藏
页码:11144 / 11172
页数:29
相关论文
共 50 条
  • [41] Anisotropic gradient-based filtering for object segmentation in medical images
    Joao, Ana
    Gambaruto, Alberto
    Sequeira, Adelia
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2020, 8 (06) : 621 - 630
  • [42] A Gradient-Based Approach for Breast DCE-MRI Analysis
    Losurdo, L.
    Basile, T. M. A.
    Fanizzi, A.
    Bellotti, R.
    Bottigli, U.
    Carbonara, R.
    Dentamaro, R.
    Diacono, D.
    Didonna, V
    Lombardi, A.
    Giotta, F.
    Guaragnella, C.
    Mangia, A.
    Massafra, R.
    Tamborra, P.
    Tangaro, S.
    La Forgia, D.
    BIOMED RESEARCH INTERNATIONAL, 2018, 2018
  • [43] Orthogonal Gradient-Based Binary Image Representation for Vehicle Detection
    Czapla, Zbigniew
    COMPUTER VISION AND GRAPHICS, ICCVG 2016, 2016, 9972 : 453 - 461
  • [44] Gradient-Based Assessment of Habitat Quality for Spectral Ecosystem Monitoring
    Neumann, Carsten
    Weiss, Gabriele
    Schmidtlein, Sebastian
    Itzerott, Sibylle
    Lausch, Angela
    Doktor, Daniel
    Brell, Maximilian
    REMOTE SENSING, 2015, 7 (03) : 2871 - 2898
  • [45] Nonlinear gradient-based feature selection for precise prediction of diseases
    Kabir, Sadaf
    Farrokhvar, Leily
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2022, 14 (03) : 248 - 268
  • [46] Adapting lacunarity techniques for gradient-based analyses of landscape surfaces
    Hoechstetter, Sebastian
    Walz, Ulrich
    Nguyen Xuan Thinh
    ECOLOGICAL COMPLEXITY, 2011, 8 (03) : 229 - 238
  • [47] Gradient-Based Spectral Embeddings of Random Dot Product Graphs
    Fiori, Marcelo
    Marenco, Bernardo
    Larroca, Federico
    Bermolen, Paola
    Mateos, Gonzalo
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2024, 10 : 1 - 16
  • [48] Gradient-Based Markov Chain Monte Carlo for MIMO Detection
    Zhou, Xingyu
    Liang, Le
    Zhang, Jing
    Wen, Chao-Kai
    Jin, Shi
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (07) : 7566 - 7581
  • [49] Fingerprint liveness detection using gradient-based texture features
    Xia, Zhihua
    Lv, Rui
    Zhu, Yafeng
    Ji, Peng
    Sun, Huiyu
    Shi, Yun-Qing
    SIGNAL IMAGE AND VIDEO PROCESSING, 2017, 11 (02) : 381 - 388
  • [50] ON THE GRADIENT-BASED ALGORITHM FOR MATRIX FACTORIZATION APPLIED TO DIMENSIONALITY REDUCTION
    Nikulin, Vladimir
    McLachlan, Geoffrey J.
    BIONFORMATICS 2010: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON BIOINFORMATICS, 2010, : 147 - 152