Clustering and machine learning framework for medical time series classification

被引:1
作者
Ruiperez-Campillo, Samuel [1 ]
Reiss, Michael [2 ,3 ]
Ramirez, Elisa [4 ]
Cebrian, Antonio [4 ]
Millet, Jose [4 ]
Castells, Francisco [4 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
[2] Univ Calif San Diego, Dept Bioengn, San Diego, CA USA
[3] Yale Sch Med, Dept Internal Med, Sect Cardiovasc Med, New Haven, CT USA
[4] Univ Politecn Valencia, ITACA Inst, Valencia, Spain
关键词
AI in medicine; Machine learning unsupervised machine; learning; Hilbert vector spaces; Reproducibility; Hierarchical clustering; Medical time-series processing;
D O I
10.1016/j.bbe.2024.07.005
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Background and motivation: The application of artificial intelligence in medical research, particularly unsupervised learning techniques, has shown promising potential. Medical time series data poses a unique challenge for analysis due to its complexity. Existing unsupervised learning methods often fail to effectively classify these variations, highlighting a gap in current approaches. We introduce a methodological clustering classification framework designed to accurately handle such data, aiming for improved classification tasks in biomedical signals. Methods: To address these challenges, we introduce a novel approach for the analysis and classification of medical time series data. Our method integrates agglomerative hierarchical clustering with Hilbert vector space representations of medical signals and biological sequences. We rigorously define the mathematical principles and conduct evaluations using simulations of cardiac signals, real-world neural signal datasets, open-source protein sequences, and the MNIST dataset for illustrative purposes. Results: The proposed method exhibited a 96% success rate in classifying protein sequences by function and effectively identifying families within a large protein set. In cardiac signal analysis, it retained 0.996 variance in a condensed 6-dimensional space, accurately classifying 87.4% of simulated atrial flutter groups and 99.91% of main groups when excluding conduction direction. For neural signals, it demonstrated near-perfect tracking accuracy of neural activity in mouse brain recordings, as confirmed by expert evaluations. Conclusion: Our proposed method offers a novel, translational approach for the treatment and classification of medical and biological time series, addressing some of the prevalent challenges in the field and paving the way for more reliable and effective biomedical signal analysis.
引用
收藏
页码:521 / 533
页数:13
相关论文
共 56 条
[1]  
Alberts B., 2007, Molecular biology of the cell, V5th
[2]  
Bateman A, 2002, NUCLEIC ACIDS RES, V30, P276, DOI [10.1093/nar/gkr1065, 10.1093/nar/gkp985, 10.1093/nar/gkh121]
[3]   Challenges to the Reproducibility of Machine Learning Models in Health Care [J].
Beam, Andrew L. ;
Manrai, Arjun K. ;
Ghassemi, Marzyeh .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2020, 323 (04) :305-306
[4]   ATRIAL-FLUTTER, ATRIAL-FIBRILLATION, AND OTHER PRIMARY ATRIAL TACHYCARDIAS [J].
BENDITT, DG ;
BENSON, DW ;
DUNNIGAN, A ;
GORNICK, CC ;
ANDERSON, RW .
MEDICAL CLINICS OF NORTH AMERICA, 1984, 68 (04) :895-918
[5]  
Bhattacharya B.K., 1985, Computational geometry, V2, P43
[6]   Using deep learning to annotate the protein universe [J].
Bileschi, Maxwell L. ;
Belanger, David ;
Bryant, Drew ;
Sanderson, Theo ;
Carter, Brandon ;
Sculley, D. ;
Bateman, Alex ;
DePristo, Mark A. ;
Colwell, Lucy J. .
NATURE BIOTECHNOLOGY, 2022, 40 (06) :932-+
[7]   The InterPro protein families and domains database: 20 years on [J].
Blum, Matthias ;
Chang, Hsin-Yu ;
Chuguransky, Sara ;
Grego, Tiago ;
Kandasaamy, Swaathi ;
Mitchell, Alex ;
Nuka, Gift ;
Paysan-Lafosse, Typhaine ;
Qureshi, Matloob ;
Raj, Shriya ;
Richardson, Lorna ;
Salazar, Gustavo A. ;
Williams, Lowri ;
Bork, Peer ;
Bridge, Alan ;
Gough, Julian ;
Haft, Daniel H. ;
Letunic, Ivica ;
Marchler-Bauer, Aron ;
Mi, Huaiyu ;
Natale, Darren A. ;
Necci, Marco ;
Orengo, Christine A. ;
Pandurangan, Arun P. ;
Rivoire, Catherine ;
Sigrist, Christian J. A. ;
Sillitoe, Ian ;
Thanki, Narmada ;
Thomas, Paul D. ;
Tosatto, Silvio C. E. ;
Wu, Cathy H. ;
Bateman, Alex ;
Finn, Robert D. .
NUCLEIC ACIDS RESEARCH, 2021, 49 (D1) :D344-D354
[8]   Electrocardiology of atrial fibrillation - Current knowledge and future challenges [J].
Bollmann, Andreas ;
Lombardi, Federico .
IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2006, 25 (06) :15-23
[9]  
Calinski T., 1974, COMMUN STAT, V3, P1, DOI [10.1080/03610927408827101, DOI 10.1080/03610927408827101]
[10]   Machine learning-based phenogrouping in heart failure to identify responders to cardiac resynchronization therapy [J].
Cikes, Maja ;
Sanchez-Martinez, Sergio ;
Claggett, Brian ;
Duchateau, Nicolas ;
Piella, Gemma ;
Butakoff, Constantine ;
Pouleur, Anne Catherine ;
Knappe, Dorit ;
Biering-Sorensen, Tor ;
Kutyifa, Valentina ;
Moss, Arthur ;
Stein, Kenneth ;
Solomon, Scott D. ;
Bijnens, Bart .
EUROPEAN JOURNAL OF HEART FAILURE, 2019, 21 (01) :74-85