Persistent spectral simplicial complex-based machine learning for chromosomal structural analysis in cellular differentiation

被引:8
作者
Gong, Weikang [1 ,2 ]
Wee, JunJie [2 ]
Wu, Min-Chun [2 ]
Sun, Xiaohan [1 ]
Li, Chunhua [1 ]
Xia, Kelin [2 ]
机构
[1] Beijing Univ Technol, Fac Environm & Life Sci, Beijing 100124, Peoples R China
[2] Nanyang Technol Univ, Sch Phys & Math Sci, Div Math Sci, Singapore 637371, Singapore
基金
中国国家自然科学基金;
关键词
Hi-C data; Hodge Laplacian; persistent spectral simplicial complex; chromosomal featurization; machine learning; ELASTIC NETWORK MODEL; 3D GENOME; DYNAMICS; DOMAINS;
D O I
10.1093/bib/bbac168
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The three-dimensional (3D) chromosomal structure plays an essential role in all DNA-templated processes, including gene transcription, DNA replication and other cellular processes. Although developing chromosome conformation capture (3C) methods, such as Hi-C, which can generate chromosomal contact data characterized genome-wide chromosomal structural properties, understanding 3D genomic nature-based on Hi-C data remains lacking. Here, we propose a persistent spectral simplicial complex (PerSpectSC) model to describe Hi-C data for the first time. Specifically, a filtration process is introduced to generate a series of nested simplicial complexes at different scales. For each of these simplicial complexes, its spectral information can be calculated from the corresponding Hodge Laplacian matrix. PerSpectSC model describes the persistence and variation of the spectral information of the nested simplicial complexes during the filtration process. Different from all previous models, our PerSpectSC-based features provide a quantitative global-scale characterization of chromosome structures and topology. Our descriptors can successfully classify cell types and also cellular differentiation stages for all the 24 types of chromosomes simultaneously. In particular, persistent minimum best characterizes cell types and Dim (1) persistent multiplicity best characterizes cellular differentiation. These results demonstrate the great potential of our PerSpectSC-based models in polymeric data analysis.
引用
收藏
页数:11
相关论文
共 50 条
[41]   Training machine learning models based on the structural formula for the enthalpy of vaporization and sublimation and a thorough analysis of Trouton's rules [J].
Wahler, Sabrina ;
Chung, Peter ;
Klapoetke, Thomas M. .
JOURNAL OF ENERGETIC MATERIALS, 2025, 43 (02) :199-211
[42]   Utilizing Spectral, Structural and Textural Features for Estimating Oat Above-Ground Biomass Using UAV-Based Multispectral Data and Machine Learning [J].
Dhakal, Rakshya ;
Maimaitijiang, Maitiniyazi ;
Chang, Jiyul ;
Caffe, Melanie .
SENSORS, 2023, 23 (24)
[43]   A screening method for predicting left ventricular dysfunction based on spectral analysis of a single-channel electrocardiogram using machine learning algorithms [J].
Kuznetsova, Natalia ;
Sagirova, Zhanna ;
Suvorov, Aleksandr ;
Dhif, Ines ;
Gognieva, Daria ;
Afina, Bestavashvili ;
Poltavskaya, Maria ;
Sedov, Vsevolod ;
Chomakhidze, Petr ;
Kopylov, Philippe .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
[44]   Machine Learning-Based Ultrasound Texture Analysis in Differentiation of Benign Phyllodes Tumors from Borderline-Malignant Phyllodes Tumors [J].
Basara Akin, Isil ;
Ozgul, Hakan Abdullah ;
Altay, Canan ;
Guray Durak, Merih ;
Aksoy, Suleyman Ozkan ;
Sevinc, Ali Ibrahim ;
Secil, Mustafa ;
Gulmez, Hakan ;
Balci, Pinar .
ULTRASCHALL IN DER MEDIZIN, 2023, 44 (03) :318-326
[45]   Machine learning based damage state identification: A novel perspective on fragility analysis for nuclear power plants considering structural uncertainties [J].
Zheng, Zhi ;
Wang, Yong ;
Pan, Xiaolan ;
Ji, Duofa .
EARTHQUAKE ENGINEERING AND ENGINEERING VIBRATION, 2025, 24 (01) :201-222
[46]   A semiempirical and machine learning approach for fragment-based structural analysis of non-hydroxamate HDAC3 inhibitors [J].
Amin, Sk. Abdul ;
Sessa, Lucia ;
Tarafdar, Rajdip ;
Gayen, Shovanlal ;
Piotto, Stefano .
BIOPHYSICAL CHEMISTRY, 2025, 320
[47]   Integrated Analysis of Ferroptosis-and Cellular Senescence-Related Biomarkers in Atherosclerosis Based on Machine Learning and Single-Cell Sequencing Data [J].
Qi, Xiang ;
Cao, Shan ;
Chen, Jian ;
Yin, XiaoLei .
JOURNAL OF INFLAMMATION RESEARCH, 2025, 18 :9283-9305
[48]   Spatial Differentiation Characteristics of Rural Areas Based on Machine Learning and GIS Statistical Analysis-A Case Study of Yongtai County, Fuzhou City [J].
Wang, Ziyuan .
SUSTAINABILITY, 2023, 15 (05)
[49]   A machine learning-based structural load estimation model for shear-critical RC beams and slabs using multifractal analysis [J].
Osei, Jack Banahene ;
Adom-Asamoah, Mark ;
Twumasi, Jones Owusu ;
Andras, Peter ;
Zhang, Hexin .
CONSTRUCTION AND BUILDING MATERIALS, 2023, 394
[50]   Designing of symmetric and asymmetric small molecule acceptors for organic solar cells: A farmwork based on Machine learning, virtual screening and structural analysis [J].
Mubashir, Tayyaba ;
Tahir, Mudassir Hussain ;
Mahmoud, M. H. H. ;
Shafiq, Zunaira ;
Ashraf, Mohsin ;
El Azab, Islam H. ;
El-Bahy, Zeinhom M. ;
Janjua, Muhammad Ramzan Saeed Ashraf .
JOURNAL OF PHOTOCHEMISTRY AND PHOTOBIOLOGY A-CHEMISTRY, 2023, 444