PCGIMA: developing the web server for human position-defined CpG islands methylation analysis

被引:0
作者
Xiao, Ming [1 ,2 ]
Xiao, Yi [1 ]
Yu, Jun [3 ,4 ]
Zhang, Le [1 ,5 ,6 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu, Peoples R China
[2] Tianfu Engn Oriented Numer Simulat & Software Inno, Chengdu, Peoples R China
[3] Chinese Acad Sci, CAS Key Lab Genome Sci & Informat, Beijing Inst Genom, Beijing, Peoples R China
[4] Univ Chinese Acad Sci, Beijing, Peoples R China
[5] Univ Chinese Acad Sci, Chinese Acad Sci, Key Lab Syst Biol, Hangzhou Inst Adv Study, Hangzhou, Peoples R China
[6] Univ Chinese Acad Sci, Hangzhou Inst Adv Study, Key Lab Syst Hlth Sci Zhejiang Prov, Hangzhou, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
position-defined CGIs; DNA methylation; genome annotation; high performance computing; genome analysis; DNA METHYLATION; GENOME; DATABASE; SITES;
D O I
10.3389/fgene.2024.1367731
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Introduction: CpG island (CGI) methylation is one of the key epigenomic mechanisms for gene expression regulation and chromosomal integrity. However, classical CGI prediction methods are neither easy to locate those short and position-sensitive CGIs (CpG islets), nor investigate genetic and expression pattern for CGIs under different CpG position- and interval- sensitive parameters in a genome-wide perspective. Therefore, it is urgent for us to develop such a bioinformatic algorithm that not only can locate CpG islets, but also provide CGI methylation site annotation and functional analysis to investigate the regulatory mechanisms for CGI methylation.Methods: This study develops Human position-defined CGI prediction method to locate CpG islets using high performance computing, and then builds up a novel human genome annotation and analysis method to investigate the connections among CGI, gene expression and methylation. Finally, we integrate these functions into PCGIMA to provide relevant online computing and visualization service.Results: The main results include: (1) Human position-defined CGI prediction method is more efficient to predict position-defined CGIs with multiple consecutive (d) values and locate more potential short CGIs than previous CGI prediction methods. (2) Our annotation and analysis method not only can investigate the connections between position-defined CGI methylation and gene expression specificity from a genome-wide perspective, but also can analysis the potential association of position-defined CGIs with gene functions. (3) PCGIMA (http://www.combio-lezhang.online/pcgima/home.html) provides an easy-to-use analysis and visualization platform for human CGI prediction and methylation.Discussion: This study not only develops Human position-defined CGI prediction method to locate short and position-sensitive CGIs (CpG islets) using high performance computing to construct MR-CpGCluster algorithm, but also a novel human genome annotation and analysis method to investigate the connections among CGI, gene expression and methylation. Finally, we integrate them into PCGIMA for online computing and visualization.
引用
收藏
页数:9
相关论文
共 51 条
  • [21] EWASdb: epigenome-wide association study database
    Liu, Di
    Zhao, Linna
    Wang, Zhaoyang
    Zhou, Xu
    Fan, Xiuzhao
    Li, Yong
    Xu, Jing
    Hu, Simeng
    Niu, Miaomiao
    Song, Xiuling
    Li, Ying
    Zuo, Lijiao
    Lei, Changgui
    Zhang, Meng
    Tang, Guoping
    Huang, Min
    Zhang, Nan
    Duan, Lian
    Lv, Hongchao
    Zhang, Mingming
    Li, Jin
    Xu, Liangde
    Kong, Fanwu
    Feng, Rennan
    Jiang, Yongshuai
    [J]. NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) : D989 - D993
  • [22] A Brief Review of Artificial Intelligence Applications and Algorithms for Psychiatric Disorders
    Liu, Guang-Di
    Li, Yu-Chen
    Zhang, Wei
    Zhang, Le
    [J]. ENGINEERING, 2020, 6 (04) : 462 - 467
  • [23] Developing an Embedding, Koopman and Autoencoder Technologies-Based Multi-Omics Time Series Predictive Model (EKATP) for Systems Biology research
    Liu, Suran
    You, Yujie
    Tong, Zhaoqi
    Zhang, Le
    [J]. FRONTIERS IN GENETICS, 2021, 12
  • [24] Mitotic inheritance of DNA methylation: more than just copy and paste
    Ming, Xuan
    Zhu, Bing
    Li, Yingfeng
    [J]. JOURNAL OF GENETICS AND GENOMICS, 2021, 48 (01) : 1 - 13
  • [25] CpGProD: identifying CpG islands associated with transcription start sites in large genomic mammalian sequences
    Ponger, L
    Mouchiroud, D
    [J]. BIOINFORMATICS, 2002, 18 (04) : 631 - 633
  • [26] NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins
    Pruitt, KD
    Tatusova, T
    Maglott, DR
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 : D501 - D504
  • [27] ENCODE whole-genome data in the UCSC genome browser (2011 update)
    Raney, Brian J.
    Cline, Melissa S.
    Rosenbloom, Kate R.
    Dreszer, Timothy R.
    Learned, Katrina
    Barber, Galt P.
    Meyer, Laurence R.
    Sloan, Cricket A.
    Malladi, Venkat S.
    Roskin, Krishna M.
    Suh, Bernard B.
    Hinrichs, Angie S.
    Clawson, Hiram
    Zweig, Ann S.
    Kirkup, Vanessa
    Fujita, Pauline A.
    Rhead, Brooke
    Smith, Kayla E.
    Pohl, Andy
    Kuhn, Robert M.
    Karolchik, Donna
    Haussler, David
    Kent, W. James
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : D871 - D875
  • [28] Stability and flexibility of epigenetic gene regulation in mammalian development
    Reik, Wolf
    [J]. NATURE, 2007, 447 (7143) : 425 - 432
  • [29] CloudBurst: highly sensitive read mapping with MapReduce
    Schatz, Michael C.
    [J]. BIOINFORMATICS, 2009, 25 (11) : 1363 - 1369
  • [30] Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly
    Schneider, Valerie A.
    Graves-Lindsay, Tina
    Howe, Kerstin
    Bouk, Nathan
    Chen, Hsiu-Chuan
    Kitts, Paul A.
    Murphy, Terence D.
    Pruitt, Kim D.
    Thibaud-Nissen, Francoise
    Albracht, Derek
    Fulton, Robert S.
    Kremitzki, Milinn
    Magrini, Vincent
    Markovic, Chris
    McGrath, Sean
    Steinberg, Karyn Meltz
    Auger, Kate
    Chow, William
    Collins, Joanna
    Harden, Glenn
    Hubbard, Timothy
    Pelan, Sarah
    Simpson, Jared T.
    Threadgold, Glen
    Torrance, James
    Wood, Jonathan M.
    Clarke, Laura
    Koren, Sergey
    Boitano, Matthew
    Peluso, Paul
    Li, Heng
    Chin, Chen-Shan
    Phillippy, Adam M.
    Durbin, Richard
    Wilson, Richard K.
    Flicek, Paul
    Eichler, Evan E.
    Church, Deanna M.
    [J]. GENOME RESEARCH, 2017, 27 (05) : 849 - 864