Systematic Functional Annotation Workflow for Insects

被引:11
作者
Bono, Hidemasa [1 ,2 ]
Sakamoto, Takuma [3 ,4 ]
Kasukawa, Takeya [5 ]
Tabunoki, Hiroko [3 ,4 ]
机构
[1] Hiroshima Univ, Genome Editing Innovat Ctr, Lab BioDX, 3-10-23 Kagamiyama, Higashihiroshima 7390046, Japan
[2] Hiroshima Univ, Lab Genome Informat, Grad Sch Integrated Sci Life, 3-10-23 Kagamiyama, Higashihiroshima 7390046, Japan
[3] Tokyo Univ Agr & Technol, Inst Global Innovat Res, 3-5-8 Saiwai Cho, Fuchu, Tokyo 1838509, Japan
[4] Tokyo Univ Agr & Technol, Grad Sch Agr, Dept Sci Biol Prod, 3-5-8 Saiwai Cho, Fuchu, Tokyo 1838509, Japan
[5] RIKEN Ctr Integrat Med Sci, Tsurumi Ku, 1-7-22 Suehiro Cho, Yokohama, Kanagawa 2300045, Japan
基金
日本科学技术振兴机构;
关键词
functional annotation; RNA sequencing; transcriptome assembly; stick insect; silkworm; RECONSTRUCTION; GENERATION; COVERAGE; REMINDER; PIPELINE; TOOL;
D O I
10.3390/insects13070586
中图分类号
Q96 [昆虫学];
学科分类号
摘要
Simple Summary The functions of all genes encoded in the genome should be studied for genome editing. Genome editing technology can speed up insect research for the functional analysis of genes. Our knowledge about the functional information of genes is still incomplete currently, while the genome sequencing of an organism can be completed. The functional information has been annotated based solely on the information that has been obtained from the results of previous biological research. However, this information will be important in determining the target genes for genome editing. In particular, it is very important that this information is in machine-readable form because computer programs mainly parse this information for the understanding of biological systems. In this paper, we describe a workflow-based method for annotating gene functions in insects that makes use of transcribed sequence information as well as reference genome and protein sequence databases. Using the developed workflow, we annotated the functional information of the Japanese stick insect and silkworm, including gene expression as well as sequence analysis. The functional annotation information obtained by the workflow will greatly expand the possibilities of entomological research using genome editing. Next-generation sequencing has revolutionized entomological study, rendering it possible to analyze the genomes and transcriptomes of non-model insects. However, use of this technology is often limited to obtaining the nucleotide sequences of target or related genes, with many of the acquired sequences remaining unused because other available sequences are not sufficiently annotated. To address this issue, we have developed a functional annotation workflow for transcriptome-sequenced insects to determine transcript descriptions, which represents a significant improvement over the previous method (functional annotation pipeline for insects). The developed workflow attempts to annotate not only the protein sequences obtained from transcriptome analysis but also the ncRNA sequences obtained simultaneously. In addition, the workflow integrates the expression-level information obtained from transcriptome sequencing for application as functional annotation information. Using the workflow, functional annotation was performed on the sequences obtained from transcriptome sequencing of the stick insect (Entoria okinawaensis) and silkworm (Bombyx mori), yielding richer functional annotation information than that obtained in our previous study. The improved workflow allows the more comprehensive exploitation of transcriptome data and is applicable to other insects because the workflow has been openly developed on GitHub.
引用
收藏
页数:14
相关论文
共 42 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
[Anonymous], BIOS AN US PROF HIDD, DOI DOI 10.5281/ZENODO.13285213
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   UniProt: the universal protein knowledgebase in 2021 [J].
Bateman, Alex ;
Martin, Maria-Jesus ;
Orchard, Sandra ;
Magrane, Michele ;
Agivetova, Rahat ;
Ahmad, Shadab ;
Alpi, Emanuele ;
Bowler-Barnett, Emily H. ;
Britto, Ramona ;
Bursteinas, Borisas ;
Bye-A-Jee, Hema ;
Coetzee, Ray ;
Cukura, Austra ;
Da Silva, Alan ;
Denny, Paul ;
Dogan, Tunca ;
Ebenezer, ThankGod ;
Fan, Jun ;
Castro, Leyla Garcia ;
Garmiri, Penelope ;
Georghiou, George ;
Gonzales, Leonardo ;
Hatton-Ellis, Emma ;
Hussein, Abdulrahman ;
Ignatchenko, Alexandr ;
Insana, Giuseppe ;
Ishtiaq, Rizwan ;
Jokinen, Petteri ;
Joshi, Vishal ;
Jyothi, Dushyanth ;
Lock, Antonia ;
Lopez, Rodrigo ;
Luciani, Aurelien ;
Luo, Jie ;
Lussi, Yvonne ;
Mac-Dougall, Alistair ;
Madeira, Fabio ;
Mahmoudy, Mahdi ;
Menchi, Manuela ;
Mishra, Alok ;
Moulang, Katie ;
Nightingale, Andrew ;
Oliveira, Carla Susana ;
Pundir, Sangya ;
Qi, Guoying ;
Raj, Shriya ;
Rice, Daniel ;
Lopez, Milagros Rodriguez ;
Saidi, Rabie ;
Sampson, Joseph .
NUCLEIC ACIDS RESEARCH, 2021, 49 (D1) :D480-D489
[5]  
bioconductor, About us
[6]   Reminder to deposit DNA sequences [J].
Blaxter, Mark ;
Danchin, Antoine ;
Savakis, Babis ;
Fukami-Kobayashi, Kaoru ;
Kurokawa, Ken ;
Sugano, Sumio ;
Roberts, Richard J. ;
Salzberg, Steven L. ;
Wu, Chung-I .
SCIENCE, 2016, 352 (6287) :780-780
[7]   Reconstruction of amino acid biosynthesis pathways from the complete genome sequence [J].
Bono, H ;
Ogata, H ;
Goto, S ;
Kanehisa, M .
GENOME RESEARCH, 1998, 8 (03) :203-210
[8]   Meta-Analysis of Oxidative Transcriptomes in Insects [J].
Bono, Hidemasa .
ANTIOXIDANTS, 2021, 10 (03) :1-12
[9]   The transcriptional landscape of the mammalian genome [J].
Carninci, P ;
Kasukawa, T ;
Katayama, S ;
Gough, J ;
Frith, MC ;
Maeda, N ;
Oyama, R ;
Ravasi, T ;
Lenhard, B ;
Wells, C ;
Kodzius, R ;
Shimokawa, K ;
Bajic, VB ;
Brenner, SE ;
Batalov, S ;
Forrest, ARR ;
Zavolan, M ;
Davis, MJ ;
Wilming, LG ;
Aidinis, V ;
Allen, JE ;
Ambesi-Impiombato, X ;
Apweiler, R ;
Aturaliya, RN ;
Bailey, TL ;
Bansal, M ;
Baxter, L ;
Beisel, KW ;
Bersano, T ;
Bono, H ;
Chalk, AM ;
Chiu, KP ;
Choudhary, V ;
Christoffels, A ;
Clutterbuck, DR ;
Crowe, ML ;
Dalla, E ;
Dalrymple, BP ;
de Bono, B ;
Della Gatta, G ;
di Bernardo, D ;
Down, T ;
Engstrom, P ;
Fagiolini, M ;
Faulkner, G ;
Fletcher, CF ;
Fukushima, T ;
Furuno, M ;
Futaki, S ;
Gariboldi, M .
SCIENCE, 2005, 309 (5740) :1559-1563
[10]   Blast2GO:: a universal tool for annotation, visualization and analysis in functional genomics research [J].
Conesa, A ;
Götz, S ;
García-Gómez, JM ;
Terol, J ;
Talón, M ;
Robles, M .
BIOINFORMATICS, 2005, 21 (18) :3674-3676