Identifying homogeneous subgroups of patients and important features: a topological machine learning approach

被引:1
作者
Carr, Ewan [1 ]
Carriere, Mathieu [2 ]
Michel, Bertrand [3 ]
Chazal, Frederic [4 ]
Iniesta, Raquel [1 ]
机构
[1] Kings Coll London, Inst Psychiat Psychol & Neurosci, Dept Biostat & Hlth Informat, London, England
[2] Inria Sophia Antipolis, DataShape Team, Biot, France
[3] Ecole Cent Nantes, LMJL, UMR CNRS 6629, Nantes, France
[4] Inria Saclay, Alan Turing Bldg, Palaiseau, France
关键词
Topological data analysis; Clustering; Machine learning;
D O I
10.1186/s12859-021-04360-9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background This paper exploits recent developments in topological data analysis to present a pipeline for clustering based on Mapper, an algorithm that reduces complex data into a one-dimensional graph. Results We present a pipeline to identify and summarise clusters based on statistically significant topological features from a point cloud using Mapper. Conclusions Key strengths of this pipeline include the integration of prior knowledge to inform the clustering process and the selection of optimal clusters; the use of the bootstrap to restrict the search to robust topological features; the use of machine learning to inspect clusters; and the ability to incorporate mixed data types. Our pipeline can be downloaded under the GNU GPLv3 license at .
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Identifying Fallers Based on Functional Parameters: A Machine Learning Approach
    Fahimi, F.
    Taylor, W. R.
    Dietzel, R.
    Armbrecht, G.
    Singh, N. B.
    [J]. 2021 IEEE ASIA-PACIFIC CONFERENCE ON COMPUTER SCIENCE AND DATA ENGINEERING (CSDE), 2021,
  • [22] Identifying NAT Devices to Detect Shadow IT: A Machine Learning Approach
    Nassar, Reem
    Elhajj, Imad
    Kayssi, Ayman
    Salam, Samer
    [J]. 2021 IEEE/ACS 18TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2021,
  • [23] Identifying new earnings management components: a machine learning approach
    Almasarwah, Adel
    Aram, Khalid Y.
    Alhaj-Yaseen, Yaseen S.
    [J]. ACCOUNTING RESEARCH JOURNAL, 2024, 37 (04) : 418 - 435
  • [24] Identifying equity constrained subgroups using machine learning, decision modelling and optimal policy algorithms
    Glynn, David
    Hatamyar, Julia
    Giardina, John
    Pandya, Ankur
    Kreif, Noemi
    [J]. MEDICAL DECISION MAKING, 2024, 44 (02) : NP193 - NP196
  • [25] Identifying Facial Features and Predicting Patients of Acromegaly Using Three-Dimensional Imaging Techniques and Machine Learning
    Meng, Tian
    Guo, Xiaopeng
    Lian, Wei
    Deng, Kan
    Gao, Lu
    Wang, Zihao
    Huang, Jiuzuo
    Wang, Xiaojun
    Long, Xiao
    Xing, Bing
    [J]. FRONTIERS IN ENDOCRINOLOGY, 2020, 11
  • [26] Identifying Predictive Features in Drug Response Using Machine Learning: Opportunities and Challenges
    Vidyasagar, Mathukumalli
    [J]. ANNUAL REVIEW OF PHARMACOLOGY AND TOXICOLOGY, VOL 55, 2015, 55 : 15 - 34
  • [27] Machine Learning Consensus Clustering Approach for Hospitalized Patients with Dysmagnesemia
    Thongprayoon, Charat
    Sy-Go, Janina Paula T.
    Nissaisorakarn, Voravech
    Dumancas, Carissa Y.
    Keddis, Mira T.
    Kattah, Andrea G.
    Pattharanitima, Pattharawin
    Vallabhajosyula, Saraschandra
    Mao, Michael A.
    Qureshi, Fawad
    Garovic, Vesna D.
    Dillon, John J.
    Erickson, Stephen B.
    Cheungpasitporn, Wisit
    [J]. DIAGNOSTICS, 2021, 11 (11)
  • [28] Predicting diabetes in adults: identifying important features in unbalanced data over a 5-year cohort study using machine learning algorithm
    Moghaddam, Maryam Talebi
    Jahani, Yones
    Arefzadeh, Zahra
    Dehghan, Azizallah
    Khaleghi, Mohsen
    Sharafi, Mehdi
    Nikfar, Ghasem
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2024, 24 (01)
  • [29] Important Correlates of Purpose in Life Identified Through a Machine Learning Approach
    Mei, Zhen
    Lori, Adriana
    Vattathil, Selina M.
    Boyle, Patricia A.
    Bradley, Bekh
    Li, Peng
    Bennett, David A.
    Wingo, Thomas S.
    Wingo, Aliza P.
    [J]. AMERICAN JOURNAL OF GERIATRIC PSYCHIATRY, 2021, 29 (05) : 488 - 498
  • [30] Identifying Patients with Coronary Microvascular Dysfunction using Machine Learning
    Fodeh, Samah
    Li, Taihua
    Jarad, Haya
    Safdar, Basmah
    [J]. 2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 715 - 721