Supervised convex clustering

被引:1
|
作者
Wang, Minjie [1 ,7 ]
Yao, Tianyi [2 ]
Allen, Genevera I. [3 ,4 ,5 ,6 ]
机构
[1] Univ Minnesota, Sch Stat, Minneapolis, MN USA
[2] Rice Univ, Dept Stat, Houston, TX USA
[3] Rice Univ, Dept Elect & Comp Engn, Houston, TX USA
[4] Rice Univ, Dept Stat, Houston, TX USA
[5] Rice Univ, Dept Comp Sci, Houston, TX USA
[6] Baylor Coll Med, Jan & Dan Duncan Neurol Res Inst, Houston, TX USA
[7] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
convex clustering; exponential family; generalized linear model deviance; interpretable clustering; supervised clustering; SELECTION; NUMBER; ADMM;
D O I
10.1111/biom.13860
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clustering has long been a popular unsupervised learning approach to identify groups of similar objects and discover patterns from unlabeled data in many applications. Yet, coming up with meaningful interpretations of the estimated clusters has often been challenging precisely due to their unsupervised nature. Meanwhile, in many real-world scenarios, there are some noisy supervising auxiliary variables, for instance, subjective diagnostic opinions, that are related to the observed heterogeneity of the unlabeled data. By leveraging information from both supervising auxiliary variables and unlabeled data, we seek to uncover more scientifically interpretable group structures that may be hidden by completely unsupervised analyses. In this work, we propose and develop a new statistical pattern discovery method named supervised convex clustering (SCC) that borrows strength from both information sources and guides towards finding more interpretable patterns via a joint convex fusion penalty. We develop several extensions of SCC to integrate different types of supervising auxiliary variables, to adjust for additional covariates, and to find biclusters. We demonstrate the practical advantages of SCC through simulations and a case study on Alzheimer's disease genomics. Specifically, we discover new candidate genes as well as new subtypes of Alzheimer's disease that can potentially lead to better understanding of the underlying genetic mechanisms responsible for the observed heterogeneity of cognitive decline in older adults.
引用
收藏
页码:3846 / 3858
页数:13
相关论文
共 50 条
  • [21] Information-incorporated sparse convex clustering for disease subtyping
    Zhang, Xiaoyu
    Liu, Ching-Ti
    BIOINFORMATICS, 2023, 39 (07)
  • [22] A dual reformulation and solution framework for regularized convex clustering problems
    Pi, J.
    Wang, Honggang
    Pardalos, Panos M.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 290 (03) : 844 - 856
  • [23] The Research and Application of Supervised Clustering MOMA
    Wang Diangang
    Peng Xiaoqiangn
    Li Fan
    Li Zhuo
    Luo Na
    2012 7TH INTERNATIONAL CONFERENCE ON COMPUTING AND CONVERGENCE TECHNOLOGY (ICCCT2012), 2012, : 902 - 906
  • [24] Supervised clustering for TSPO PET imaging
    Julia Schubert
    Matteo Tonietto
    Federico Turkheimer
    Paolo Zanotti-Fregonara
    Mattia Veronese
    European Journal of Nuclear Medicine and Molecular Imaging, 2021, 49 : 257 - 268
  • [25] Supervised clustering for TSPO PET imaging
    Schubert, Julia
    Tonietto, Matteo
    Turkheimer, Federico
    Zanotti-Fregonara, Paolo
    Veronese, Mattia
    EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2021, 49 (01) : 257 - 268
  • [26] Regularized boxplot via convex clustering
    Choi, Hosik
    Poythress, J. C.
    Park, Cheolwoo
    Jeon, Jong-June
    Park, Changyi
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2019, 89 (07) : 1227 - 1247
  • [27] Supervised hierarchical clustering using CART
    Hancock, TP
    Coomans, DH
    Everingham, YL
    MODSIM 2003: INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION, VOLS 1-4: VOL 1: NATURAL SYSTEMS, PT 1; VOL 2: NATURAL SYSTEMS, PT 2; VOL 3: SOCIO-ECONOMIC SYSTEMS; VOL 4: GENERAL SYSTEMS, 2003, : 1880 - 1885
  • [28] DYNAMIC-PROGRAMMING AND CONVEX CLUSTERING
    BATAGELJ, V
    KORENJAKCERNE, S
    KLAVZAR, S
    ALGORITHMICA, 1994, 11 (02) : 93 - 103
  • [29] An Improved Robust Sparse Convex Clustering
    Ma, Jinyao
    Zhang, Haibin
    Yang, Shanshan
    Jiang, Jiaojiao
    Li, Gaidi
    TSINGHUA SCIENCE AND TECHNOLOGY, 2023, 28 (06): : 989 - 998
  • [30] Convex clustering method for compositional data via sparse group lasso
    Wang, Xiaokang
    Wang, Huiwen
    Wang, Shanshan
    Yuan, Jidong
    NEUROCOMPUTING, 2021, 425 : 23 - 36