Persistent spectral simplicial complex-based machine learning for chromosomal structural analysis in cellular differentiation

被引:7
作者
Gong, Weikang [1 ,2 ]
Wee, JunJie [2 ]
Wu, Min-Chun [2 ]
Sun, Xiaohan [1 ]
Li, Chunhua [1 ]
Xia, Kelin [2 ]
机构
[1] Beijing Univ Technol, Fac Environm & Life Sci, Beijing 100124, Peoples R China
[2] Nanyang Technol Univ, Sch Phys & Math Sci, Div Math Sci, Singapore 637371, Singapore
基金
中国国家自然科学基金;
关键词
Hi-C data; Hodge Laplacian; persistent spectral simplicial complex; chromosomal featurization; machine learning; ELASTIC NETWORK MODEL; 3D GENOME; DYNAMICS; DOMAINS;
D O I
10.1093/bib/bbac168
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The three-dimensional (3D) chromosomal structure plays an essential role in all DNA-templated processes, including gene transcription, DNA replication and other cellular processes. Although developing chromosome conformation capture (3C) methods, such as Hi-C, which can generate chromosomal contact data characterized genome-wide chromosomal structural properties, understanding 3D genomic nature-based on Hi-C data remains lacking. Here, we propose a persistent spectral simplicial complex (PerSpectSC) model to describe Hi-C data for the first time. Specifically, a filtration process is introduced to generate a series of nested simplicial complexes at different scales. For each of these simplicial complexes, its spectral information can be calculated from the corresponding Hodge Laplacian matrix. PerSpectSC model describes the persistence and variation of the spectral information of the nested simplicial complexes during the filtration process. Different from all previous models, our PerSpectSC-based features provide a quantitative global-scale characterization of chromosome structures and topology. Our descriptors can successfully classify cell types and also cellular differentiation stages for all the 24 types of chromosomes simultaneously. In particular, persistent minimum best characterizes cell types and Dim (1) persistent multiplicity best characterizes cellular differentiation. These results demonstrate the great potential of our PerSpectSC-based models in polymeric data analysis.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Machine learning-based Radiomics analysis for differentiation degree and lymphatic node metastasis of extrahepatic cholangiocarcinoma
    Yong Tang
    Chun Mei Yang
    Song Su
    Wei Jia Wang
    Li Ping Fan
    Jian Shu
    BMC Cancer, 21
  • [22] Machine Learning Decision Tree Models for Differentiation of Posterior Fossa Tumors Using Diffusion Histogram Analysis and Structural MRI Findings
    Payabvash, Seyedmehdi
    Aboian, Mariam
    Tihan, Tarik
    Cha, Soonmee
    FRONTIERS IN ONCOLOGY, 2020, 10
  • [23] Prediction and mechanism analysis of octanol-air partition coefficient for persistent organic pollutants based on machine learning models
    Xu, Zhenpeng
    Zhao, Hongxia
    Wang, Jinyang
    Li, Xintong
    Li, Zhansheng
    Zhang, Xiaonuo
    Ou, Yiwen
    JOURNAL OF ENVIRONMENTAL CHEMICAL ENGINEERING, 2025, 13 (02):
  • [24] Raman Spectra-based Structural Classification Analysis of Flavones, Flavonols, and Isoflavones Using Machine Learning
    Peng, Yangyao
    Li, Li
    Yang, Yuhang
    Zhang, Dongjie
    Bao, Deyu
    Li, Xiujun
    Hu, Xiaojia
    Zeng, Qi
    Li, Xiao
    Zhang, Zhen
    Chen, Xueli
    CURRENT ANALYTICAL CHEMISTRY, 2024,
  • [25] Gender prediction based on University students’ complex thinking competency: An analysis from machine learning approaches
    Gerardo Ibarra-Vazquez
    María Soledad Ramí­rez-Montoya
    Hugo Terashima
    Education and Information Technologies, 2024, 29 : 2721 - 2739
  • [26] Gender prediction based on University students' complex thinking competency: An analysis from machine learning approaches
    Ibarra-Vazquez, Gerardo
    Rami-rez-Montoya, Maria Soledad
    Terashima, Hugo
    EDUCATION AND INFORMATION TECHNOLOGIES, 2024, 29 (03) : 2721 - 2739
  • [27] Problem-independent machine learning-enhanced structural topology optimization of complex design domains based on isoparametric elements
    Zhang, Linfeng
    Huang, Mengcheng
    Liu, Chang
    Du, Zongliang
    Cui, Tianchen
    Guo, Xu
    EXTREME MECHANICS LETTERS, 2024, 72
  • [28] Machine Learning and Geo-Based Multi-Criteria Decision Support Systems in Analysis of Complex Problems
    Pirouz, Behrouz
    Ferrante, Aldo Pedro
    Pirouz, Behzad
    Piro, Patrizia
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (06)
  • [29] Comparative Analysis of Machine Learning Models for Predicting the Mechanical Behavior of Bio-Based Cellular Composite Sandwich Structures
    Dashtgoli, Danial Sheini
    Taghizadeh, Seyedahmad
    Macconi, Lorenzo
    Concli, Franco
    MATERIALS, 2024, 17 (14)
  • [30] Differentiation of closely-related species within Acinetobacter baumannii-calcoaceticus complex via Raman spectroscopy: a comparative machine learning analysis
    Xue-Song Xiong
    Lin-Fei Yao
    Yan-Fei Luo
    Quan Yuan
    Yu-Ting Si
    Jie Chen
    Xin-Ru Wen
    Jia-Wei Tang
    Su-Ling Liu
    Liang Wang
    World Journal of Microbiology and Biotechnology, 2024, 40