Characterization of long cDNA clones from human adult spleen. II. The complete sequences of 81 cDNA clones

Cited by: 7
Authors
Jikuya, H
Takano, J
Kikuno, R
Hirosawa, M
Nagase, T
Nomura, N
Ohara, O
Affiliations
[1] Kazusa DNA Res Inst, Chiba 2920818, Japan
[2] Shimadzu Co Ltd, Nakagyo Ku, Kyoto 6048511, Japan
[3] RIKEN, Res Ctr Allergy & Immunol, Yokohama, Kanagawa, Japan
Keywords
long cDNA; single-pass sequence; cDNA sequencing; spleen
DOI
10.1093/dnares/10.1.49
Chinese Library Classification number
Q3 [Genetics]
Subject classification codes
071007; 090102
Abstract
To accumulate information on the coding sequences (CDSs) of unidentified genes, we have conducted a sequencing project of human long cDNA clones. Both end sequences of approximately 10,000 cDNA clones from two size-fractionated human spleen cDNA libraries (average sizes of 4.5 kb and 5.6 kb) were determined by single-pass sequencing to select cDNAs with unidentified sequences. We herein present the entire sequences of 81 cDNA clones, most of which were selected by two approaches based on their protein-coding potential in silico: 58 cDNA clones were selected as having protein-coding potential at the 5'-end of single-pass sequences by GeneMark analysis, and 20 cDNA clones were selected as expected to encode proteins larger than 100 amino acid residues, based on analysis of the human genome sequences flanked by both end sequences of the cDNAs using the GENSCAN gene prediction program. In addition to these newly identified cDNAs, three cDNA clones were isolated by colony hybridization experiments using probes corresponding to known gene sequences, since these cDNAs are likely to contain considerable amounts of new information regarding genes already annotated. The sequence data indicated that the average sizes of the inserts and corresponding CDSs of the cDNA clones analyzed here were 5.0 kb and 2.0 kb (670 amino acid residues), respectively. From the results of homology and motif searches against the public databases, functional categories could be assigned to 29 predicted gene products; 86% of these (25 gene products) were classified as proteins related to cell signaling/communication, nucleic acid management, and cell structure/motility.
Pages: 49-57
Page count: 9