Predicting Heart Cell Types by Using Transcriptome Profiles and a Machine Learning Method

被引:45
作者
Ding, Shijian [1 ]
Wang, Deling [2 ]
Zhou, Xianchao [3 ]
Chen, Lei [4 ]
Feng, Kaiyan [5 ]
Xu, Xianling [6 ]
Huang, Tao [7 ,8 ]
Li, Zhandong [9 ]
Cai, Yudong [1 ]
机构
[1] Shanghai Univ, Sch Life Sci, Shanghai 200444, Peoples R China
[2] Sun Yat Sen Univ, Collaborat Innovat Ctr Canc Med, Dept Med Imaging, State Key Lab Oncol South China,Canc Ctr, Guangzhou 510060, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Publ Hlth, Sch Med, Ctr Single Cell Omics, Shanghai 200025, Peoples R China
[4] Shanghai Maritime Univ, Coll Informat Engn, Shanghai 201306, Peoples R China
[5] Guangdong AIB Polytech Coll, Dept Comp Sci, Guangzhou 510507, Peoples R China
[6] Guangdong AIB Polytech Coll, Guangzhou 510507, Peoples R China
[7] Univ Chinese Acad Sci, Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr,CAS Key Lab Computat Biol, Shanghai 200031, Peoples R China
[8] Univ Chinese Acad Sci, Chinese Acad Sci, Shanghai Inst Nutr & Hlth, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai 200031, Peoples R China
[9] Jilin Engn Normal Univ, Coll Food Engn, Changchun 130052, Peoples R China
来源
LIFE-BASEL | 2022年 / 12卷 / 02期
基金
国家重点研发计划;
关键词
heart cell; single-cell profiles; machine learning method; biomarker; decision rule; NATRIURETIC PEPTIDES; MAMMALIAN HEART; ATRIAL; MUTATIONS; PROTEINS; GENE; TTN;
D O I
10.3390/life12020228
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The heart is an essential organ in the human body. It contains various types of cells, such as cardiomyocytes, mesothelial cells, endothelial cells, and fibroblasts. The interactions between these cells determine the vital functions of the heart. Therefore, identifying the different cell types and revealing the expression rules in these cell types are crucial. In this study, multiple machine learning methods were used to analyze the heart single-cell profiles with 11 different heart cell types. The single-cell profiles were first analyzed via light gradient boosting machine method to evaluate the importance of gene features on the profiling dataset, and a ranking feature list was produced. This feature list was then brought into the incremental feature selection method to identify the best features and build the optimal classifiers. The results suggested that the best decision tree (DT) and random forest classification models achieved the highest weighted F1 scores of 0.957 and 0.981, respectively. The selected features, such as NPPA, LAMA2, DLC1, and the classification rules extracted from the optimal DT classifier played a crucial role in cardiac structure and function in recent research and enrichment analysis. In particular, some lncRNAs (LINC02019, NEAT1) were found to be quite important for the recognition of different cardiac cell types. In summary, these findings provide a solid academic foundation for the development of molecular diagnostics and biomarker discovery for cardiac diseases.
引用
收藏
页数:17
相关论文
共 77 条
[1]  
[Anonymous], 2021, IEEE Trans. Broadcast.
[2]   Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage [J].
Aran, Dvir ;
Looney, Agnieszka P. ;
Liu, Leqian ;
Wu, Esther ;
Fong, Valerie ;
Hsu, Austin ;
Chak, Suzanna ;
Naikawadi, Ram P. ;
Wolters, Paul J. ;
Abate, Adam R. ;
Butte, Atul J. ;
Bhattacharya, Mallar .
NATURE IMMUNOLOGY, 2019, 20 (02) :163-+
[3]   'Ryanopathy': causes and manifestations of RyR2 dysfunction in heart failure [J].
Belevych, Andriy E. ;
Radwanski, Przemyslaw B. ;
Carnes, Cynthia A. ;
Gyoerke, Sandor .
CARDIOVASCULAR RESEARCH, 2013, 98 (02) :240-247
[4]   Cardiac natriuretic peptides act via p38 MAPK to induce the brown fat thermogenic program in mouse and human adipocytes [J].
Bordicchia, Marica ;
Liu, Dianxin ;
Amri, Ez-Zoubir ;
Ailhaud, Gerard ;
Dessi-Fulgheri, Paolo ;
Zhang, Chaoying ;
Takahashi, Nobuyuki ;
Sarzani, Riccardo ;
Collins, Sheila .
JOURNAL OF CLINICAL INVESTIGATION, 2012, 122 (03) :1022-1036
[5]   Cardiac endothelial-myocardial signaling: Its role in cardiac growth, contractile performance, and rhythmicity [J].
Brutsaert, DL .
PHYSIOLOGICAL REVIEWS, 2003, 83 (01) :59-115
[6]   Transcriptomic Analyses of Adipocyte Differentiation From Human Mesenchymal Stromal-Cells (MSC) [J].
Casado-Diaz, Antonio ;
Anter, Jaouad ;
Mueller, Soeren ;
Winter, Peter ;
Manuel Quesada-Gomez, Jose ;
Dorado, Gabriel .
JOURNAL OF CELLULAR PHYSIOLOGY, 2017, 232 (04) :771-784
[7]   A Rising Titan: TTN Review and Mutation Update [J].
Chauveau, Claire ;
Rowell, John ;
Ferreiro, Ana .
HUMAN MUTATION, 2014, 35 (09) :1046-1059
[8]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[9]   ELAVL1 is transcriptionally activated by FOXC1 and promotes ferroptosis in myocardial ischemia/reperfusion injury by regulating autophagy [J].
Chen, Hui-Yong ;
Xiao, Ze-Zhou ;
Ling, Xiao ;
Xu, Rong-Ning ;
Zhu, Peng ;
Zheng, Shao-Yi .
MOLECULAR MEDICINE, 2021, 27 (01)
[10]   Endothelial cells are progenitors of cardiac pericytes and vascular smooth muscle cells [J].
Chen, Qi ;
Zhang, Hui ;
Liu, Yang ;
Adams, Susanne ;
Eilken, Hanna ;
Stehling, Martin ;
Corada, Monica ;
Dejana, Elisabetta ;
Zhou, Bin ;
Adams, Ralf H. .
NATURE COMMUNICATIONS, 2016, 7