Incremental clustering based on Wasserstein distance between histogram models

被引:0
|
作者
Qian, Xiaotong [1 ]
Cabanes, Guenael [2 ]
Rastin, Parisa [2 ]
Guidani, Mohamed Alae [3 ]
Marrakchi, Ghassen [4 ]
Clausel, Marianne [2 ]
Grozavu, Nistor [1 ]
机构
[1] CY Cergy Paris Univ, ETIS, UMR 8051, F-95000 Cergy, France
[2] Univ Lorraine, LORIA, UMR 7503, F-54500 Vandoeuvr Les Nancy, France
[3] Ecole Natl Super Mines, Campus Artem, F-54042 Nancy, France
[4] Univ Sorbonne Paris Nord, LIPN, UMR 7030, F-93430 Villetaneuse, France
关键词
Unsupervised learning; Static and dynamic clustering; Large datasets; Data streams; Sliding windows; Histogram models; Wasserstein distance; STREAMING-DATA; CLASSIFIER; ALGORITHMS;
D O I
10.1016/j.patcog.2025.111414
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, we present an innovative clustering framework designed for large datasets and real-time data streams which uses a sliding window and histogram model to address the challenge of memory congestion while reducing computational complexity and improving cluster quality for both static and dynamic clustering. The framework provides a simple way to characterize the probability distribution of cluster distributions through histogram models, regardless of their distribution type. This advantage allows for efficient use with various conventional clustering algorithms. To facilitate effective clustering across windows, we use a statistical measure that allows the comparison and merging of different clusters based on the calculation of the Wasserstein distance between histograms.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Persistent homology based Wasserstein distance for graph networks
    Babu, Archana
    John, Sunil Jacob
    HACETTEPE JOURNAL OF MATHEMATICS AND STATISTICS, 2025, 54 (01): : 90 - 114
  • [22] Online machine learning algorithms based on Wasserstein distance
    Li Z.
    Zhang Z.-H.
    Zhongguo Kexue Jishu Kexue/Scientia Sinica Technologica, 2023, 53 (07): : 1031 - 1042
  • [23] Least Wasserstein distance between disjoint shapes with perimeter regularization
    Novack, Michael
    Topaloglu, Ihsan
    Venkatraman, Raghavendra
    JOURNAL OF FUNCTIONAL ANALYSIS, 2023, 284 (01)
  • [24] Wasserstein-Distance-Based Gaussian Mixture Reduction
    Assa, Akbar
    Plataniotis, Konstantinos N.
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (10) : 1465 - 1469
  • [25] Wasserstein and total variation distance between marginals of Levy processes
    Mariucci, Ester
    Reiss, Markus
    ELECTRONIC JOURNAL OF STATISTICS, 2018, 12 (02): : 2482 - 2514
  • [26] Evolution of the Wasserstein distance between the marginals of two Markov processes
    Alfonsi, Aurelien
    Corbetta, Jacopo
    Jourdain, Benjamin
    BERNOULLI, 2018, 24 (4A) : 2461 - 2498
  • [27] Wasserstein distance-based fuzzy C-means clustering in Riemannian manifold feature space for image segmentation
    Wu, Chengmao
    Zheng, Jia
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [28] A Novel Graph Kernel Based on the Wasserstein Distance and Spectral Signatures
    Liu, Yantao
    Rossi, Luca
    Torsello, Andrea
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2022, 2022, 13813 : 122 - 131
  • [29] Computing Wasserstein-p Distance Between Images with Linear Cost
    Chen, Yidong
    Li, Chen
    Lu, Zhonghua
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 509 - 518
  • [30] ON A WASSERSTEIN-TYPE DISTANCE BETWEEN SOLUTIONS TO STOCHASTIC DIFFERENTIAL EQUATIONS
    Bion-Nadal, Jocelyne
    Talay, Denis
    ANNALS OF APPLIED PROBABILITY, 2019, 29 (03): : 1609 - 1639