EGassembler: online bioinformatics service for large-scale processing, clustering and assembling ESTs and genomic DNA fragments

Cited by: 129
Authors
Masoudi-Nejad, Ali [1]
Tonomura, Koichiro
Kawashima, Shuichi
Moriya, Yuki
Suzuki, Masanori
Itoh, Masumi
Kanehisa, Minoru
Endo, Takashi
Goto, Susumu
Affiliations
[1] Kyoto Univ, Inst Chem Res, Bioinformat Ctr, Lab Bioknowledge Syst, Uji, Kyoto 6110011, Japan
[2] Kyoto Univ, Lab Plant Genet, Div Appl Biosci, Kyoto 6068502, Japan
[3] Univ Tokyo, Ctr Human Genome, Lab Genome Database, Tokyo 1088639, Japan
[4] Hitachi Govt & Publ Corp Syst Engn Ltd, Koto Ku, Tokyo 1358633, Japan
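Funding
Japan Society for the Promotion of Science
Keywords
DOI
10.1093/nar/gkl066
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract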
Expressed sequence tag (EST) sequencing has proven to be an economically feasible alternative for gene discovery in species lacking a draft genome sequence. Ongoing large-scale EST sequencing projects feel the need for bioinformatics tools to facilitate uniform EST handling. This brings about a renewed importance for a universal tool for processing and functional annotation of large sets of ESTs. EGassembler (http://egassembler.hgc.jp/) is a web server, which provides an automated as well as a user-customized analysis tool for cleaning, repeat masking, vector trimming, organelle masking, clustering and assembling of ESTs and genomic fragments. The web server is publicly available and provides the community a unique all-in-one online application web service for large-scale ESTs and genomic DNA clustering and assembling. Running on a Sun Fire 15K supercomputer, a significantly large volume of data can be processed in a short period of time. The results can be used to functionally annotate genes, to facilitate splice alignment analysis, to link the transcripts to genetic and physical maps, design microarray chips, to perform transcriptome analysis and to map to KEGG metabolic pathways. The service provides an excellent bioinformatics tool to research groups in wet-lab as well as an all-in-one tool for sequence handling to bioinformatics researchers.
Pages: W459-W462
Page count: 4