Persistent spectral simplicial complex-based machine learning for chromosomal structural analysis in cellular differentiation

Citations: 8
Authors
Gong, Weikang [1, 2]
Wee, JunJie [2 ]
Wu, Min-Chun [2 ]
Sun, Xiaohan [1 ]
Li, Chunhua [1 ]
Xia, Kelin [2 ]
Affiliations
[1] Beijing Univ Technol, Fac Environm & Life Sci, Beijing 100124, Peoples R China
[2] Nanyang Technol Univ, Sch Phys & Math Sci, Div Math Sci, Singapore 637371, Singapore
Funding
National Natural Science Foundation of China
Keywords
Hi-C data; Hodge Laplacian; persistent spectral simplicial complex; chromosomal featurization; machine learning; ELASTIC NETWORK MODEL; 3D GENOME; DYNAMICS; DOMAINS;
DOI
10.1093/bib/bbac168
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
The three-dimensional (3D) chromosomal structure plays an essential role in all DNA-templated processes, including gene transcription, DNA replication and other cellular processes. Although chromosome conformation capture (3C) methods such as Hi-C can generate chromosomal contact data that characterize genome-wide chromosomal structural properties, an understanding of the 3D nature of the genome based on Hi-C data is still lacking. Here, we propose a persistent spectral simplicial complex (PerSpectSC) model to describe Hi-C data for the first time. Specifically, a filtration process is introduced to generate a series of nested simplicial complexes at different scales. For each of these simplicial complexes, spectral information can be calculated from the corresponding Hodge Laplacian matrix. The PerSpectSC model describes the persistence and variation of the spectral information of the nested simplicial complexes during the filtration process. Unlike previous models, our PerSpectSC-based features provide a quantitative global-scale characterization of chromosome structures and topology. Our descriptors can successfully classify cell types and cellular differentiation stages for all 24 types of chromosomes simultaneously. In particular, the persistent minimum best characterizes cell types, and the Dim(1) persistent multiplicity best characterizes cellular differentiation. These results demonstrate the great potential of our PerSpectSC-based models in polymeric data analysis.
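The filtration-plus-spectrum idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it rests on several assumptions: the input is a pairwise distance matrix (how Hi-C contact frequencies would be converted to distances is not specified here), the complexes are built Vietoris-Rips style up to triangles, "persistent multiplicity" is read as the number of zero eigenvalues of the Hodge Laplacian and "persistent minimum" as its smallest non-zero eigenvalue, and all function names are hypothetical.

```python
import itertools
import numpy as np


def rips_complex(dist, eps):
    """Edges and triangles of a Rips-style complex at scale eps (assumed construction)."""
    n = dist.shape[0]
    edges = [(i, j) for i, j in itertools.combinations(range(n), 2)
             if dist[i, j] <= eps]
    edge_set = set(edges)
    triangles = [(i, j, k) for i, j, k in itertools.combinations(range(n), 3)
                 if (i, j) in edge_set and (i, k) in edge_set and (j, k) in edge_set]
    return edges, triangles


def boundary_matrices(n, edges, triangles):
    """Oriented boundary matrices B1 (edges -> vertices) and B2 (triangles -> edges)."""
    B1 = np.zeros((n, len(edges)))
    for c, (i, j) in enumerate(edges):
        B1[i, c], B1[j, c] = -1.0, 1.0
    edge_index = {e: c for c, e in enumerate(edges)}
    B2 = np.zeros((len(edges), len(triangles)))
    for c, (i, j, k) in enumerate(triangles):
        # boundary of [i, j, k] = [j, k] - [i, k] + [i, j]
        B2[edge_index[(j, k)], c] = 1.0
        B2[edge_index[(i, k)], c] = -1.0
        B2[edge_index[(i, j)], c] = 1.0
    return B1, B2


def spectral_summary(eigs, tol=1e-8):
    """(count of zero eigenvalues, smallest non-zero eigenvalue or 0.0 if none)."""
    nonzero = eigs[eigs >= tol]
    return int(np.sum(eigs < tol)), (float(nonzero.min()) if nonzero.size else 0.0)


def perspect_features(dist, thresholds):
    """One row per filtration value:
    (eps, dim0 multiplicity, dim0 minimum, dim1 multiplicity, dim1 minimum)."""
    n = dist.shape[0]
    rows = []
    for eps in thresholds:
        edges, triangles = rips_complex(dist, eps)
        B1, B2 = boundary_matrices(n, edges, triangles)
        L0 = B1 @ B1.T                    # dimension-0 Hodge Laplacian (graph Laplacian)
        L1 = B1.T @ B1 + B2 @ B2.T        # dimension-1 Hodge Laplacian
        e0 = np.linalg.eigvalsh(L0)
        e1 = np.linalg.eigvalsh(L1) if edges else np.array([])
        rows.append((eps, *spectral_summary(e0), *spectral_summary(e1)))
    return rows


if __name__ == "__main__":
    # Toy input: four points on a unit square, standing in for chromosomal loci.
    pts = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0]])
    dist = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
    for row in perspect_features(dist, [0.5, 1.0, 1.5]):
        print(row)
```

In this toy case the zero-eigenvalue counts track the connected components (dimension 0) and loops (dimension 1) as the scale grows, while the smallest non-zero eigenvalue carries additional geometric information. A feature vector built from such rows across many filtration values could then be passed to a standard classifier.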
Pages: 11