A set-theoretic definition of cell types with an algebraic structure on gene regulatory networks and application in annotation of RNA-seq data

Cited by: 1
Authors
Okano, Yuji [1]
Kase, Yoshitaka [1]
Okano, Hideyuki [1]
Affiliations
[1] Keio Univ, Dept Physiol, Sch Med, 35 Shinanomachi,Shinjuku Ku, Tokyo 1608582, Japan
Keywords
annotation; cell type; cellular state; mathematical model; scRNA-seq; set theory; transcriptome
DOI
10.1016/j.stemcr.2022.10.015
Chinese Library Classification (CLC) number
Q813 [Cell engineering]
Abstract
The emergence of single-cell RNA sequencing (RNA-seq) has radically changed the observation of cellular diversity. Although annotations of RNA-seq data require preserved properties among cells of an identity, annotations using conventional methods have not been able to capture universal characters of a cell type. Analysis of expression levels cannot be accurately annotated for cells because differences in transcription do not necessarily explain biological characteristics in terms of cellular functions and because the data themselves do not inform about the correct mapping between cell types and genes. Hence, in this study, we developed a new representation of cellular identities that can be compared over different datasets while preserving nontrivial biological semantics. To generalize the notion of cell types, we developed a new framework to manage cellular identities in terms of set theory. We provided further insights into cells by installing mathematical descriptions of cell biology. We also performed experiments that could correspond to practical applications in annotations of RNA-seq data.
Pages: 113-130
Number of pages: 18