Statistical Technique in Clustering Problems

Authors
Nikolaeva O.V. [1]
Affiliations
[1] Keldysh Institute of Applied Mathematics, Russian Academy of Sciences, Moscow
Keywords
clustering; multispectral imaging; statistical techniques
DOI
10.1134/S2070048223030134
Abstract
The problem of evaluating and improving the quality of clustering of multispectral data is considered. A method for calculating the distance between clusters is developed. The vectors of each cluster are treated as realizations of a random vector. Sampling distribution functions (SDFs) are found, and the errors of approximating the unknown exact distribution functions by the SDFs are obtained. The distance between two clusters is defined as the distance between their SDFs. Criteria for indiscernible, overlapping, and disjoint clusters are defined. A technique for improving clustering is proposed in which indiscernible (or indiscernible and overlapping) clusters are merged. Numerical experiments on simulated data show that the technique can decompose the data into the initial groups of vectors. Numerical experiments on real data use multispectral images from the HYPERION sensor, obtained above the ocean under a clear sky and under broken clouds; they show that the technique can distinguish clouds and their shadows in the images. © 2023, Pleiades Publishing, Ltd.
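The merging idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the distance formula, the error estimate, or the exact criteria, so everything specific below is an assumption made for illustration: the distance between two SDFs is taken as the largest gap between per-coordinate empirical distribution functions (a Kolmogorov-Smirnov-style distance), the approximation error is taken from the Dvoretzky-Kiefer-Wolfowitz (DKW) inequality, and two clusters are called indiscernible when their distance does not exceed the sum of their errors. The function names are hypothetical and are not taken from the paper.

```python
import numpy as np


def ecdf_distance(a, b):
    """Largest gap between the empirical CDFs of two 1-D samples."""
    grid = np.sort(np.concatenate([a, b]))
    fa = np.searchsorted(np.sort(a), grid, side="right") / len(a)
    fb = np.searchsorted(np.sort(b), grid, side="right") / len(b)
    return float(np.max(np.abs(fa - fb)))


def dkw_error(n, alpha=0.05):
    """DKW bound on sup|ECDF - CDF| that holds with probability 1 - alpha."""
    return float(np.sqrt(np.log(2.0 / alpha) / (2.0 * n)))


def cluster_distance(x, y):
    """Assumed multivariate extension: worst ECDF gap over the coordinates."""
    return max(ecdf_distance(x[:, j], y[:, j]) for j in range(x.shape[1]))


def merge_indiscernible(data, labels, alpha=0.05):
    """Merge cluster pairs whose distance is within the summed errors.

    Assumed criterion (not from the paper): clusters p and q are
    indiscernible if distance(p, q) <= error(p) + error(q).
    """
    labels = np.asarray(labels).copy()
    merged = True
    while merged:
        merged = False
        ids = np.unique(labels)
        for i, p in enumerate(ids):
            for q in ids[i + 1:]:
                x, y = data[labels == p], data[labels == q]
                tol = dkw_error(len(x), alpha) + dkw_error(len(y), alpha)
                if cluster_distance(x, y) <= tol:
                    labels[labels == q] = p  # absorb q into p, then rescan
                    merged = True
                    break
            if merged:
                break
    return labels
```

Restarting the scan after every merge keeps the sketch simple: the merged cluster has a new sample size, so its error bound and its distances to the remaining clusters are recomputed from scratch. Overlapping and disjoint clusters would need a second threshold, which the abstract does not specify, so it is left out here.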
Pages: 445-453
Page count: 8