Unveiling human origins of replication using deep learning: accurate prediction and comprehensive analysis

Cited by: 3
Authors
Yin, Zhen-Ning [1]
Lai, Fei-Liao [1]
Gao, Feng [1,2,3]
Affiliations
[1] Tianjin Univ, Dept Phys, Sch Sci, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Frontiers Sci Ctr Synthet Biol, Tianjin 300072, Peoples R China
[3] Tianjin Univ, Minist Educ, Key Lab Syst Bioengn, Tianjin 300072, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
human genome; origin of replication; deep learning; Z-curve method; DNA-REPLICATION; INITIATION; DATABASE; IDENTIFICATION; SEQUENCES; CANCER;
DOI
10.1093/bib/bbad432
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Accurate identification of replication origins (ORIs) is crucial for a comprehensive investigation into the progression of human cell growth and cancer therapy. Here, we propose Ori-FinderH, a computational approach that efficiently and precisely predicts human ORIs of various lengths by combining the Z-curve method with a deep learning approach. Compared with existing methods, Ori-FinderH exhibits superior performance, achieving an area under the receiver operating characteristic curve (AUC) of 0.9616 for the K562 cell line in 10-fold cross-validation. In addition, we established a cross-cell-line predictive model, which yielded a further improved AUC of 0.9706. The model was subsequently employed as a fitness function for a genetic algorithm that generates artificial ORIs. Sequence analysis with iORI-Euk revealed that the vast majority of the generated sequences, 98% or more, contain at least one ORI for each of three cell lines (HeLa, MCF7 and K562). This approach could provide more efficient, accurate and comprehensive information for experimental investigation, thereby further advancing the field.
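For readers unfamiliar with the Z-curve method named in the abstract, the following is a minimal Python sketch of the standard cumulative Z-curve transform, which maps a DNA sequence to three coordinate series. The abstract does not state which Z-curve variant or feature encoding Ori-FinderH uses, so this is illustrative only; the function name `z_curve` and the handling of ambiguous bases (treated as a zero step) are assumptions made for this example.

```python
def z_curve(seq):
    """Return the cumulative Z-curve points (x_n, y_n, z_n) of a DNA sequence.

    x_n = (A_n + G_n) - (C_n + T_n)   purine vs. pyrimidine
    y_n = (A_n + C_n) - (G_n + T_n)   amino vs. keto
    z_n = (A_n + T_n) - (C_n + G_n)   weak vs. strong hydrogen bonding
    where A_n, C_n, G_n, T_n count each base in the first n positions.
    """
    # Step contributed by each base to (x, y, z).
    steps = {
        "A": (1, 1, 1),
        "C": (-1, 1, -1),
        "G": (1, -1, -1),
        "T": (-1, -1, 1),
    }
    x = y = z = 0
    points = []
    for base in seq.upper():
        # Assumption: bases other than A/C/G/T (e.g. N) add a zero step.
        dx, dy, dz = steps.get(base, (0, 0, 0))
        x, y, z = x + dx, y + dy, z + dz
        points.append((x, y, z))
    return points


if __name__ == "__main__":
    for point in z_curve("ACGT"):
        print(point)
```

Each returned tuple is the running total after one more base, so the list has the same length as the input sequence. How such coordinates are turned into model inputs is not described in this record.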
Pages: 9
Related papers
50 in total
  • [1] Accurate Prediction of Human Essential Proteins Using Ensemble Deep Learning
    Li, Yiming
    Zeng, Min
    Wu, Yifan
    Li, Yaohang
    Li, Min
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3263 - 3271
  • [2] Comprehensive Analysis of Computational Models for Prediction of Anticancer Peptides Using Machine Learning and Deep Learning
    Ali, Farman
    Ibrahim, Nouf
    Alsini, Raed
    Masmoudi, Atef
    Alghamdi, Wajdi
    Alkhalifah, Tamim
    Alturise, Fahad
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2025,
  • [3] Highly accurate prediction of specific activity using deep learning
    Sheinfeld, Mati
    Levinson, Samuel
    Orion, Itzhak
    APPLIED RADIATION AND ISOTOPES, 2017, 130 : 115 - 120
  • [4] Accurate prediction of chromatin conformation status using deep learning
    Uryu, Hidetaka
    Hata, Kenichiro
    CANCER SCIENCE, 2018, 109 : 845 - 845
  • [5] Ori-Finder 2022: A Comprehensive Web Server for Prediction and Analysis of Bacterial Replication Origins
    Mei-Jing Dong
    Hao Luo
    Feng Gao
    Genomics, Proteomics & Bioinformatics, 2022, 20 (06) : 1207 - 1213
  • [6] Ori-Finder 2022: A Comprehensive Web Server for Prediction and Analysis of Bacterial Replication Origins
    Dong, Mei-Jing
    Luo, Hao
    Gao, Feng
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2022, 20 (06) : 1207 - 1213
  • [7] Accurate prediction of somatic variants using deep learning model.
    Zhang, Peng
    Wang, Kai
    Yao, Ming
    Wang, Aodi
    Chen, Lijuan
    Liu, Angen
    Shi, Xiaoliang
    Zhang, Shiyue
    JOURNAL OF CLINICAL ONCOLOGY, 2020, 38 (15)
  • [8] Comprehensive Analysis of Replication Origins in Saccharomyces cerevisiae Genomes
    Wang, Dan
    Gao, Feng
    FRONTIERS IN MICROBIOLOGY, 2019, 10
  • [9] A Comprehensive Survey on Event Analysis Using Deep Learning
    Varshney, Abhilasha
    Lamba, Sonia
    Garg, Puneet
    Proceedings - 2022 5th International Conference on Computational Intelligence and Communication Technologies, CCICT 2022, 2022, : 146 - 150
  • [10] A Comprehensive Review of Healthcare Prediction using Data Science with Deep Learning
    Thandu, Asha Latha
    Gera, Pradeepini
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (12) : 657 - 669