ISOGO: Functional annotation of protein-coding splice variants

被引:0
|
作者
Juan A Ferrer-Bonsoms
Ignacio Cassol
Pablo Fernández-Acín
Carlos Castilla
Fernando Carazo
Angel Rubio
机构
[1] Department of Biomedical Engineering and Sciences,
[2] Tecnun-Universidad de Navarra,undefined
[3] Manuel de Lardizábal 15,undefined
[4] Department of Bioengineering,undefined
[5] Facultad de Ingeniería,undefined
[6] Universidad Austral,undefined
来源
Scientific Reports | / 10卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The advent of RNA-seq technologies has switched the paradigm of genetic analysis from a genome to a transcriptome-based perspective. Alternative splicing generates functional diversity in genes, but the precise functions of many individual isoforms are yet to be elucidated. Gene Ontology was developed to annotate gene products according to their biological processes, molecular functions and cellular components. Despite a single gene may have several gene products, most annotations are not isoform-specific and do not distinguish the functions of the different proteins originated from a single gene. Several approaches have tried to automatically annotate ontologies at the isoform level, but this has shown to be a daunting task. We have developed ISOGO (ISOform + GO function imputation), a novel algorithm to predict the function of coding isoforms based on their protein domains and their correlation of expression along 11,373 cancer patients. Combining these two sources of information outperforms previous approaches: it provides an area under precision-recall curve (AUPRC) five times larger than previous attempts and the median AUROC of assigned functions to genes is 0.82. We tested ISOGO predictions on some genes with isoform-specific functions (BRCA1, MADD,VAMP7 and ITSN1) and they were coherent with the literature. Besides, we examined whether the main isoform of each gene -as predicted by APPRIS- was the most likely to have the annotated gene functions and it occurs in 99.4% of the genes. We also evaluated the predictions for isoform-specific functions provided by the CAFA3 challenge and results were also convincing. To make these results available to the scientific community, we have deployed a web application to consult ISOGO predictions (https://biotecnun.unav.es/app/isogo). Initial data, website link, isoform-specific GO function predictions and R code is available at https://gitlab.com/icassol/isogo.
引用
收藏
相关论文
共 50 条
  • [21] Accurate annotation of human protein-coding small open reading frames
    Martinez, Thomas F.
    Chu, Qian
    Donaldson, Cynthia
    Tan, Dan
    Shokhirev, Maxim N.
    Saghatelian, Alan
    NATURE CHEMICAL BIOLOGY, 2020, 16 (04) : 458 - +
  • [22] Non-coding transcript variants of protein-coding genes - what are they good for?
    Dhamija, Sonam
    Menon, Manoj B.
    RNA BIOLOGY, 2018, 15 (08) : 1025 - 1031
  • [23] Deleterious protein-coding variants in diverse cattle breeds of the world
    Subramanian, Sankar
    GENETICS SELECTION EVOLUTION, 2021, 53 (01)
  • [24] Genome-wide analysis of protein-coding variants in leprosy
    Zhang, F.
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2017, 137 (05) : S98 - S98
  • [25] Deleterious protein-coding variants in diverse cattle breeds of the world
    Sankar Subramanian
    Genetics Selection Evolution, 53
  • [26] Identification of Protein-coding Variants Associated with Type 2 Diabetes
    Mahajan, Anubha
    DIABETES, 2015, 64 : A5 - A5
  • [27] The Role of Protein-Coding Variants in South Africans with Exfoliation Glaucoma
    Liu, Yutao
    Qin, Xuejun
    Gibson, Jason
    Williams, Susan
    Rautenbach, Robyn
    Carmichael, Trevor
    Ashley-Koch, Allison
    Allingham, R. Rand
    Hauser, Michael
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2013, 54 (15)
  • [28] A massive effort links protein-coding gene variants to health
    Yukinori Okada
    Qingbo S. Wang
    Nature, 2021, 599 (7886) : 561 - 563
  • [29] Genome-Wide Analysis of Protein-Coding Variants in Leprosy
    Liu, Hong
    Wang, Zhenzhen
    Li, Yi
    Yu, Gongqi
    Fu, Xi'an
    Wang, Chuan
    Liu, Wenting
    Yu, Yongxiang
    Bao, Fangfang
    Irwanto, Astrid
    Liu, Jian
    Chu, Tongsheng
    Andiappan, Anand Kumar
    Maurer-Stroh, Sebastian
    Limviphuvadh, Vachiranee
    Wang, Honglei
    Mi, Zihao
    Sun, Yonghu
    Sun, Lele
    Wang, Ling
    Wang, Chaolong
    You, Jiabao
    Li, Jinghui
    Foo, Jia Nee
    Liany, Herty
    Meah, Wee Yang
    Niu, Guiye
    Yue, Zhenhua
    Zhao, Qing
    Wang, Na
    Yu, Meiwen
    Yu, Wenjun
    Cheng, Xiujun
    Khor, Chiea Chuen
    Sim, Kar Seng
    Aung, Tin
    Wang, Ningli
    Wang, Deyun
    Shi, Li
    Ning, Yong
    Zheng, Zhongyi
    Yang, Rongde
    Li, Jinlan
    Yang, Jun
    Yan, Liangbin
    Shen, Jianping
    Zhang, Guocheng
    Chen, Shumin
    Liu, Jianjun
    Zhang, Furen
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2017, 137 (12) : 2544 - 2551
  • [30] The Functional Meaning of 5′UTR in Protein-Coding Genes
    Ryczek, Natalia
    Lys, Aneta
    Makalowska, Izabela
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (03)