Network analysis of pseudogene-gene relationships: from pseudogene evolution to their functional potentials

被引:0
作者
Johnson, Travis S. [1 ]
Li, Sihong [2 ]
Kho, Jonathan R. [3 ]
Huang, Kun [4 ]
Zhang, Yan [5 ]
机构
[1] Ohio State Univ, Dept Biomed Informat, 5000 HITS,410 W 10th St, Indianapolis, IN 46202 USA
[2] Ohio State Univ, Dept Biomed Informat, 250 Lincoln Tower,1800 Cannon Dr, Columbus, OH 43210 USA
[3] Georgia Inst Technol, Dept Computat Sci & Engn, Klaus Adv Comp Bldg,266 Ferst Dr, Atlanta, GA 30332 USA
[4] Indiana Univ, Dept Hematol Oncol, 335 Regenstrief Inst,1101 W 10th St, Indianapolis, IN 46202 USA
[5] Ohio State Univ, Dept Biomed Informat, 310-B Lincoln Tower,1800 Cannon Dr, Columbus, OH 43210 USA
来源
PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018 (PSB) | 2018年
关键词
Pseudogene-gene (PGG) relationship; Network analysis; Pseudogene function; PGG network database; EXPRESSION ANALYSIS; CYTOSCAPE; ANNOTATION; DATABASE; BIOLOGY; MODELS; PTEN;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Pseudogenes are fossil relatives of genes. Pseudogenes have long been thought of as "junk DNAs", since they do not code proteins in nounal tissues. Although most of the human pseudogenes do not have noticeable functions, similar to 20% of them exhibit transcriptional activity. There has been evidence showing that some pseudogenes adopted functions as lncRNAs and work as regulators of gene expression. Furthermore, pseudogenes can even be "reactivated" in some conditions, such as cancer initiation. Some pseudogenes are transcribed in specific cancer types, and some are even translated into proteins as observed in several cancer cell lines. All the above have shown that pseudogenes could have functional roles or potentials in the genome. Evaluating the relationships between pseudogenes and their gene counterparts could help us reveal the evolutionary path of pseudogenes and associate pseudogenes with functional potentials. It also provides an insight into the regulatory networks involving pseudogenes with transcriptional and even translational activities. In this study, we develop a novel approach integrating graph analysis, sequence alignment and functional analysis to evaluate pseudogene-gene relationships, and apply it to human gene homologs and pseudogenes. We generated a comprehensive set of 445 pseudogene-gene (PGG) families from the original 3,281 gene families (13.56%). Of these 438 (98.4% PGG, 13.3% total) were non-trivial (containing more than one pseudogene). Each PGG family contains multiple genes and pseudogenes with high sequence similarity. For each family, we generate a sequence alignment network and phylogenetic trees recapitulating the evolutionary paths. We find evidence supporting the evolution history of olfactory family (both genes and pseudogenes) in human, which also supports the validity of our analysis method. Next, we evaluate these networks in respect to the gene ontology from which we identify functions enriched in these pseudogene-gene families and infer functional impact of pseudogenes involved in the networks. This demonstrates the application of our PGG network database in the study of pseudogene function in disease context.
引用
收藏
页码:536 / 547
页数:12
相关论文
共 48 条
[1]   Anaplastic Thyroid Carcinoma: A ceRNA Analysis Pointed to a Crosstalk between SOX2, TP53, and microRNA Biogenesis [J].
Arancio, Walter ;
Carina, Valeria ;
Pizzolanti, Giuseppe ;
Tomasello, Laura ;
Pitrone, Maria ;
Baiamonte, Concetta ;
Amato, Marco Calogero ;
Giordano, Carla .
INTERNATIONAL JOURNAL OF ENDOCRINOLOGY, 2015, 2015
[2]   Distributed processing of very large datasets with DataCutter [J].
Beynon, MD ;
Kurc, T ;
Catalyurek, U ;
Chang, CL ;
Sussman, A ;
Saltz, J .
PARALLEL COMPUTING, 2001, 27 (11) :1457-1478
[3]   gff2sequence, a new user friendly tool for the generation of genomic sequences [J].
Camiolo, Salvatore ;
Porceddu, Andrea .
BIODATA MINING, 2013, 6
[4]   Competing endogenous RNA interplay in cancer: mechanism, methodology, and perspectives [J].
Cheng, Dong-Liang ;
Xiang, Yuan-Yuan ;
Ji, Li-juan ;
Lu, Xiao-Jie .
TUMOR BIOLOGY, 2015, 36 (02) :479-488
[5]  
Chirag Jain SK, 2014, 21 INT C HIGH PERF C
[6]   Processed pseudogenes acquired somatically during cancer development [J].
Cooke, Susanna L. ;
Shlien, Adam ;
Marshall, John ;
Pipinikas, Christodoulos P. ;
Martincorena, Inigo ;
Tubio, Jose M. C. ;
Li, Yilong ;
Menzies, Andrew ;
Mudie, Laura ;
Ramakrishna, Manasa ;
Yates, Lucy ;
Davies, Helen ;
Bolli, Niccolo ;
Bignell, Graham R. ;
Tarpey, Patrick S. ;
Behjati, Sam ;
Nik-Zainal, Serena ;
Papaemmanuil, Elli ;
Teixeira, Vitor H. ;
Raine, Keiran ;
O'Meara, Sarah ;
Dodoran, Maryam S. ;
Teague, Jon W. ;
Butler, Adam P. ;
Iacobuzio-Donahue, Christine ;
Santarius, Thomas ;
Grundy, Richard G. ;
Malkin, David ;
Greaves, Mel ;
Munshi, Nikhil ;
Flanagan, Adrienne M. ;
Bowtell, David ;
Martin, Sancha ;
Larsimont, Denis ;
Reis-Filho, Jorge S. ;
Boussioutas, Alex ;
Taylor, Jack A. ;
Hayes, D. Neil ;
Janes, Sam M. ;
Futreal, P. Andrew ;
Stratton, Michael R. ;
McDermott, Ultan ;
Campbell, Peter J. .
NATURE COMMUNICATIONS, 2014, 5
[7]   Genetic restriction of HIV-1 infection and progression to AIDS by a deletion allele of the CKR5 structural gene [J].
Dean, M ;
Carrington, M ;
Winkler, C ;
Huttley, GA ;
Smith, MW ;
Allikmets, R ;
Goedert, JJ ;
Buchbinder, SP ;
Vittinghoff, E ;
Gomperts, E ;
Donfield, S ;
Vlahov, D ;
Kaslow, R ;
Saah, A ;
Rinaldo, C ;
Detels, R ;
OBrien, SJ .
SCIENCE, 1996, 273 (5283) :1856-1862
[8]   A gene ontology inferred from molecular networks [J].
Dutkowski, Janusz ;
Kramer, Michael ;
Surma, Michal A. ;
Balakrishnan, Rama ;
Cherry, J. Michael ;
Krogan, Nevan J. ;
Ideker, Trey .
NATURE BIOTECHNOLOGY, 2013, 31 (01) :38-+
[9]   MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797
[10]   Loss of olfactory receptor genes coincides with the acquisition of full trichromatic vision in primates [J].
Gilad, Y ;
Wiebel, V ;
Przeworski, M ;
Lancet, D ;
Pääbo, S .
PLOS BIOLOGY, 2004, 2 (01) :120-125