Enhancing Gene Co-Expression Network Inference for the Malaria Parasite Plasmodium falciparum

被引:0
作者
Li, Qi [1 ,2 ,3 ]
Button-Simons, Katrina A. [3 ,4 ]
Sievert, Mackenzie A. C. [3 ,4 ]
Chahoud, Elias [1 ,5 ]
Foster, Gabriel F. [4 ]
Meis, Kaitlynn [4 ]
Ferdig, Michael T. [3 ,4 ]
Milenkovic, Tijana [1 ,2 ,3 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[2] Univ Notre Dame, Eck Inst Global Hlth, Notre Dame, IN 46556 USA
[3] Univ Notre Dame, Lucy Family Inst Data & Soc, Notre Dame, IN 46556 USA
[4] Univ Notre Dame, Dept Biol Sci, Notre Dame, IN 46556 USA
[5] Univ Notre Dame, Dept Preprofess Studies, Notre Dame, IN 46556 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
malaria; P; falciparum; gene co-expression networks; network inference; gene function prediction; PROTEIN-INTERACTION NETWORK; MISSING VALUE ESTIMATION; ARTEMISININ RESISTANCE; FUNCTIONAL ANNOTATION; ENDOCYTOSIS; REGULARIZATION;
D O I
10.3390/genes15060685
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: Malaria results in more than 550,000 deaths each year due to drug resistance in the most lethal Plasmodium (P.) species P. falciparum. A full P. falciparum genome was published in 2002, yet 44.6% of its genes have unknown functions. Improving the functional annotation of genes is important for identifying drug targets and understanding the evolution of drug resistance. Results: Genes function by interacting with one another. So, analyzing gene co-expression networks can enhance functional annotations and prioritize genes for wet lab validation. Earlier efforts to build gene co-expression networks in P. falciparum have been limited to a single network inference method or gaining biological understanding for only a single gene and its interacting partners. Here, we explore multiple inference methods and aim to systematically predict functional annotations for all P. falciparum genes. We evaluate each inferred network based on how well it predicts existing gene-Gene Ontology (GO) term annotations using network clustering and leave-one-out crossvalidation. We assess overlaps of the different networks' edges (gene co-expression relationships), as well as predicted functional knowledge. The networks' edges are overall complementary: 47-85% of all edges are unique to each network. In terms of the accuracy of predicting gene functional annotations, all networks yielded relatively high precision (as high as 87% for the network inferred using mutual information), but the highest recall reached was below 15%. All networks having low recall means that none of them capture a large amount of all existing gene-GO term annotations. In fact, their annotation predictions are highly complementary, with the largest pairwise overlap of only 27%. We provide ranked lists of inferred gene-gene interactions and predicted gene-GO term annotations for future use and wet lab validation by the malaria community. Conclusions: The different networks seem to capture different aspects of the P. falciparum biology in terms of both inferred interactions and predicted gene functional annotations. Thus, relying on a single network inference method should be avoided when possible. Supplementary data: Attached.
引用
收藏
页数:30
相关论文
共 82 条
[61]   Predicting functional and regulatory divergence of a drug resistance transporter gene in the human malaria parasite [J].
Siwo, Geoffrey H. ;
Tan, Asako ;
Button-Simons, Katrina A. ;
Samarakoon, Upeka ;
Checkley, Lisa A. ;
Pinapati, Richard S. ;
Ferdig, Michael T. .
BMC GENOMICS, 2015, 16
[62]   Endocytosis in Plasmodium and Toxoplasma Parasites [J].
Spielmann, Tobias ;
Gras, Simon ;
Sabitzki, Ricarda ;
Meissner, Markus .
TRENDS IN PARASITOLOGY, 2020, 36 (06) :520-532
[63]   pcaMethods - a bioconductor package providing PCA methods for incomplete data [J].
Stacklies, Wolfram ;
Redestig, Henning ;
Scholz, Matthias ;
Walther, Dirk ;
Selbig, Joachim .
BIOINFORMATICS, 2007, 23 (09) :1164-1167
[64]   REVIGO Summarizes and Visualizes Long Lists of Gene Ontology Terms [J].
Supek, Fran ;
Bosnjak, Matko ;
Skunca, Nives ;
Smuc, Tomislav .
PLOS ONE, 2011, 6 (07)
[65]   The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets [J].
Szklarczyk, Damian ;
Gable, Annika L. ;
Nastou, Katerina C. ;
Lyon, David ;
Kirsch, Rebecca ;
Pyysalo, Sampo ;
Doncheva, Nadezhda T. ;
Legeay, Marc ;
Fang, Tao ;
Bork, Peer ;
Jensen, Lars J. ;
von Mering, Christian .
NUCLEIC ACIDS RESEARCH, 2021, 49 (D1) :D605-D612
[66]   Malaria: The Past and the Present [J].
Talapko, Jasminka ;
Skrlec, Ivana ;
Alebic, Tamara ;
Jukic, Melita ;
Vcev, Aleksandar .
MICROORGANISMS, 2019, 7 (06)
[67]   Malaria.tools-comparative genomic and transcriptomic database for Plasmodium species [J].
Tan, Qiao Wen ;
Mutwil, Marek .
NUCLEIC ACIDS RESEARCH, 2020, 48 (D1) :D768-D775
[68]   Malaria Diagnosis: A Brief Review [J].
Tangpukdee, Noppadon ;
Duangdee, Chatnapa ;
Wilairatana, Polrat ;
Krudsood, Srivicha .
KOREAN JOURNAL OF PARASITOLOGY, 2009, 47 (02) :93-102
[69]   Eps15 homology domain containing protein of Plasmodium falciparum (PfEHD) associates with endocytosis and vesicular trafficking towards neutral lipid storage site [J].
Thakur, Vandana ;
Asad, Mohd ;
Jain, Shaifali ;
Hossain, Mohammad E. ;
Gupta, Akanksha ;
Kaur, Inderjeet ;
Rathore, Sumit ;
Ali, Shakir ;
Khan, Nida J. ;
Mohmmed, Asif .
BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR CELL RESEARCH, 2015, 1853 (11) :2856-2869
[70]   Missing value estimation methods for DNA microarrays [J].
Troyanskaya, O ;
Cantor, M ;
Sherlock, G ;
Brown, P ;
Hastie, T ;
Tibshirani, R ;
Botstein, D ;
Altman, RB .
BIOINFORMATICS, 2001, 17 (06) :520-525