BIOTIC: a Bayesian framework to integrate single-cell multi-omics for transcription factor activity inference and improve identity characterization of cells

Citations: 0
Authors
Cao, Lan [1]
Zhang, Wenhao [1]
Yang, Fan [1, 2, 3]
Chen, Shengquan [4]
Huang, Xiaobing [5]
Zeng, Feng [1, 2, 3]
Wang, Ying [1, 2, 3, 6]
Affiliations
[1] Xiamen Univ, Dept Automat, Xiangan South Rd, Xiamen 361102, Fujian, Peoples R China
[2] Xiamen Univ, Natl Inst Data Sci Hlth & Med, Xiangan South Rd, Xiamen 361102, Fujian, Peoples R China
[3] Xiamen Univ, Xiamen Key Lab Big Data Intelligent Anal & Decis, Xiangan South Rd, Xiamen 361102, Fujian, Peoples R China
[4] Nankai Univ, Sch Math Sci & LPMC, Weijing Rd, Tianjin 300071, Peoples R China
[5] Fujian Med Univ, Dept Med Oncol, Fuzhou Hosp 1, Chating Rd, Fuzhou 350000, Fujian, Peoples R China
[6] Xiamen Univ, State Key Lab Mariculture Breeding, Xiangan South Rd, Xiamen 361102, Fujian, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
single-cell; multi-omics; transcriptional factor; variational inference; TF activity; gene regulatory; CHROMATIN; MECHANISMS; EVOLUTION; RNA;
DOI
10.1093/bib/bbaf013
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Understanding cell destiny requires unraveling the intricate mechanisms of gene regulation, in which transcription factors (TFs) play a pivotal role. However, the actual contribution of a TF, that is, its TF activity, is determined not only by TF expression but also by the accessibility of the corresponding chromatin regions. We therefore introduce BIOTIC, a Bayesian model with a well-established gene regulation structure that uses single-cell multi-omics data to model gene expression under the control of regulatory elements, and thereby defines the regulatory activity of TFs through variational inference. We demonstrate that the TF activity inferred by BIOTIC can serve as a characterization of cell identity and outperforms baseline methods on cell typing, cell development tracking, and batch effect correction. Additionally, BIOTIC trained on multi-omics data can be applied flexibly when only single-cell transcriptome sequencing is available: it infers TF activity and annotates cell type by mapping each query cell into the reference TF activity space, an emerging application of cell atlases. The structure of BIOTIC can be adapted to include additional biological factors, allowing more flexible and comprehensive gene regulation analysis. BIOTIC introduces a pioneering biological-mechanism-driven framework for inferring TF activity and elucidating cell identity states at the gene regulatory level, paving the way for a deeper understanding of the complex interplay between TFs and gene expression in living systems.
Pages: 10