A semiautomated approach to gene discovery through expressed sequence tag data mining: Discovery of new human transporter genes

Cited by: 13
Authors
Brown, S
Chang, JL
Sadee, W
Babbitt, PC
Affiliations
[1] Univ Calif San Francisco, Sch Pharm, Dept Pharmaceut Chem, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Sch Pharm, Dept Biopharmaceut Sci, San Francisco, CA 94143 USA
[3] MIT, Whitehead Inst, Ctr Genome Res, Cambridge, MA 02141 USA
[4] Ohio State Univ, Med Ctr, Columbus, OH 43210 USA
Source
AAPS PHARMSCI | 2003 / Vol. 5 / Issue 01
Keywords
major facilitator superfamily; transporters; superfamily analysis; expressed sequence tags; data mining
DOI
10.1208/ps050101
Chinese Library Classification number
R9 [Pharmacy];
Discipline classification code
1007;
Abstract
Identification and functional characterization of the genes in the human genome remain a major challenge. A principal source of publicly available information used for this purpose is the National Center for Biotechnology Information database of expressed sequence tags (dbEST), which contains over 4 million human ESTs. To extract the information buried in these data more effectively, we have developed a semiautomated method to mine dbEST for uncharacterized human genes. Starting with a single protein input sequence, a family of related proteins from all species is compiled. This entire family is then used to mine the human EST database for new gene candidates. Evaluation of putative new gene candidates in the context of a family of characterized proteins provides a framework for inference of the structure and function of the new genes. When applied to a test data set of 28 families within the major facilitator superfamily (MFS) of membrane transporters, our protocol found 73 previously characterized human MFS genes and 43 new MFS gene candidates. Development of this approach provided insights into the problems and pitfalls of automated data mining using public databases.
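The two-stage workflow in the abstract (compile a family from one seed protein, then query the ESTs with the whole family) can be illustrated with a minimal Python sketch. It is not the authors' implementation. Everything in it is an assumption made for illustration: the k-mer overlap score stands in for a real sequence-similarity search, the ESTs are treated as already-translated peptide fragments, and the sequences, identifiers, and thresholds are hypothetical toy values.

```python
from typing import Dict, List, Set, Tuple


def kmers(seq: str, k: int = 3) -> Set[str]:
    """Return the set of overlapping k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def similarity(a: str, b: str, k: int = 3) -> float:
    """Toy score: shared k-mers divided by the smaller k-mer set.

    Assumption: a stand-in for a real alignment-based search score.
    """
    ka, kb = kmers(a, k), kmers(b, k)
    if not ka or not kb:
        return 0.0
    return len(ka & kb) / min(len(ka), len(kb))


def compile_family(seed_id: str, proteins: Dict[str, str],
                   hit: float = 0.5) -> Set[str]:
    """Stage 1: grow a family from one seed until no new member is found."""
    family = {seed_id}
    changed = True
    while changed:
        changed = False
        for pid, seq in proteins.items():
            if pid in family:
                continue
            # A protein joins if it resembles any current member.
            if any(similarity(seq, proteins[m]) >= hit for m in family):
                family.add(pid)
                changed = True
    return family


def mine_ests(family: Set[str], proteins: Dict[str, str],
              ests: Dict[str, str], known_human: Set[str],
              hit: float = 0.5, same_gene: float = 0.9
              ) -> Tuple[List[str], List[str]]:
    """Stage 2: score each EST against every family member.

    ESTs that nearly match a characterized human member count as known;
    other family hits are reported as new gene candidates.
    """
    known, candidates = [], []
    for est_id, est_seq in ests.items():
        scores = {m: similarity(est_seq, proteins[m]) for m in family}
        if max(scores.values()) < hit:
            continue  # no family member resembles this EST
        if any(scores[m] >= same_gene for m in family if m in known_human):
            known.append(est_id)
        else:
            candidates.append(est_id)
    return known, candidates


# Hypothetical toy data: not real transporter sequences.
proteins = {
    "seed": "ACDEFGHIKLMNPQRS",       # characterized human protein
    "homolog_a": "ACDEFGHIKLWWWWYY",  # resembles the seed
    "homolog_b": "TTVVTTHIKLWWWWYY",  # resembles homolog_a, not the seed
    "unrelated": "GGGGSSSSGGGGSSSS",
}
ests = {
    "est1": "DEFGHIKLMN",    # fragment of the known human seed
    "est2": "VTTHIKLWWWWF",  # close to homolog_b only
    "est3": "GGGGSSSSGG",    # matches no family member
}

family = compile_family("seed", proteins)
known, candidates = mine_ests(family, proteins, ests, known_human={"seed"})
print(sorted(family), known, candidates)
```

In this toy run, homolog_b is too distant from the seed to be found directly and joins the family only through homolog_a. Because the whole family is used as the query set, est2 is then recovered as a candidate even though it scores poorly against the seed alone.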
Pages: 18