Hierarchical nearest neighbor descent, in-tree, and clustering

被引:1
作者
Qiu, Teng [1 ]
Li, Yongjie [1 ]
机构
[1] Univ Elect Sci & Technol China, Key Lab Neuroinformat, Radiat Oncol Key Lab Sichuan Prov, Minist Educ, Chengdu 610054, Peoples R China
关键词
Clustering; In-tree; Hierarchical nearest neighbor descent; Mass cytometry; MASS CYTOMETRY; SEARCH; CELLS;
D O I
10.1016/j.patcog.2023.109300
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, we have proposed a physically-inspired graph-theoretical method, called the Nearest Descent (ND), which is capable of organizing a dataset into an in-tree graph structure. Due to some beautiful and effective features, the constructed in-tree proves well-suited for data clustering. Although there exist some undesired edges (i.e., the inter-cluster edges) in this in-tree, those edges are usually very distin-guishable, in sharp contrast to the cases in the famous Minimal Spanning Tree (MST). Here, we propose another graph-theoretical method, called the Hierarchical Nearest Neighbor Descent (HNND). Like ND, HNND also organizes a dataset into an in-tree, but in a more efficient way. Consequently, HNND-based clustering (HNND-C) is more efficient than ND-based clustering (ND-C) as well. This is well proved by the experimental results on five high-dimensional and large-size mass cytometry datasets. The experimental results also show that HNND-C achieves overall better performance than some state-of-the-art clustering methods.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 37 条
[1]   To cluster, or not to cluster: An analysis of clusterability methods [J].
Adolfsson, Andreas ;
Ackerman, Margareta ;
Brownstein, Naomi C. .
PATTERN RECOGNITION, 2019, 88 :13-26
[2]   Mass Cytometry: Technique for Real Time Single Cell Multitarget Immunoassay Based on Inductively Coupled Plasma Time-of-Flight Mass Spectrometry [J].
Bandura, Dmitry R. ;
Baranov, Vladimir I. ;
Ornatsky, Olga I. ;
Antonov, Alexei ;
Kinach, Robert ;
Lou, Xudong ;
Pavlov, Serguei ;
Vorobiev, Sergey ;
Dick, John E. ;
Tanner, Scott D. .
ANALYTICAL CHEMISTRY, 2009, 81 (16) :6813-6822
[3]  
Bateni M, 2017, ADV NEUR IN, V30
[4]   Single-Cell Mass Cytometry of Differential Immune and Drug Responses Across a Human Hematopoietic Continuum [J].
Bendall, Sean C. ;
Simonds, Erin F. ;
Qiu, Peng ;
Amir, El-ad D. ;
Krutzik, Peter O. ;
Finck, Rachel ;
Bruggner, Robert V. ;
Melamed, Rachel ;
Trejo, Angelica ;
Ornatsky, Olga I. ;
Balderas, Robert S. ;
Plevritis, Sylvia K. ;
Sachs, Karen ;
Pe'er, Dana ;
Tanner, Scott D. ;
Nolan, Garry P. .
SCIENCE, 2011, 332 (6030) :687-696
[5]   Hierarchical Density Estimates for Data Clustering, Visualization, and Outlier Detection [J].
Campello, Ricardo J. G. B. ;
Moulavi, Davoud ;
Zimek, Arthur ;
Sander, Joerg .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2015, 10 (01)
[6]   Minimum curvilinearity to enhance topological prediction of protein interactions by network embedding [J].
Cannistraci, Carlo Vittorio ;
Alanis-Lobato, Gregorio ;
Ravasi, Timothy .
BIOINFORMATICS, 2013, 29 (13) :199-209
[7]   Fast and Accurate Hierarchical Clustering Based on Growing Multilayer Topology Training [J].
Cheung, Yiu-ming ;
Zhang, Yiqun .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (03) :876-890
[8]   Unraveling cell populations in tumors by single-cell mass cytometry [J].
Di Palma, Serena ;
Bodenmiller, Bernd .
CURRENT OPINION IN BIOTECHNOLOGY, 2015, 31 :122-129
[9]   densityCut: an efficient and versatile topological approach for automatic clustering of biological data [J].
Ding, Jiarui ;
Shah, Sohrab ;
Condon, Anne .
BIOINFORMATICS, 2016, 32 (17) :2567-2576
[10]   Fast Approximate Nearest Neighbor Search With The Navigating Spreading-out Graph [J].
Fu, Cong ;
Xiang, Chao ;
Wang, Changxu ;
Cai, Deng .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (05) :461-474