Finding Motifs in DNA Sequences Using Low-Dispersion Sequences

被引:49
作者
Wang, Xun [1 ]
Miao, Ying [1 ]
Cheng, Minquan [2 ]
机构
[1] Univ Tsukuba, Grad Sch Syst & Informat Engn, Tsukuba, Ibaraki 3058577, Japan
[2] Guangxi Normal Univ, Dept Math, Guilin, Peoples R China
基金
美国国家科学基金会;
关键词
random projection; developed almost difference family; uniform projection; motif finding; low-dispersion sequence; EXPECTATION MAXIMIZATION; PROJECTION; DISCOVERY; BINDING;
D O I
10.1089/cmb.2013.0054
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motif finding problems, abstracted as the planted (l, d)-motif finding problem, are a major task in molecular biology-finding functioning units and genes. In 2002, the random projection algorithm was introduced to solve the challenging (15, 4)-motif finding problem by using randomly chosen templates. Two years later, a so-called uniform projection algorithm was developed to improve the random projection algorithm by means of low-dispersion sequences generated by coverings. In this article, we introduce an improved projection algorithm called the low-dispersion projection algorithm, which uses low-dispersion sequences generated by developed almost difference families. Compared with the random projection algorithm, the low-dispersion projection algorithm can solve the (l, d)-motif finding problem with fewer templates without decreasing the success rate.
引用
收藏
页码:320 / 329
页数:10
相关论文
共 18 条
[11]   AN EXPECTATION MAXIMIZATION (EM) ALGORITHM FOR THE IDENTIFICATION AND CHARACTERIZATION OF COMMON SITES IN UNALIGNED BIOPOLYMER SEQUENCES [J].
LAWRENCE, CE ;
REILLY, AA .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1990, 7 (01) :41-51
[12]   DETECTING SUBTLE SEQUENCE SIGNALS - A GIBBS SAMPLING STRATEGY FOR MULTIPLE ALIGNMENT [J].
LAWRENCE, CE ;
ALTSCHUL, SF ;
BOGUSKI, MS ;
LIU, JS ;
NEUWALD, AF ;
WOOTTON, JC .
SCIENCE, 1993, 262 (5131) :208-214
[13]   Analysis of computational approaches for motif discovery [J].
Li, Nan ;
Tompa, Martin .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2006, 1 (1)
[14]  
Pevzner P A, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P269
[15]  
Raghavarao D., 1988, CONSTRUCTIONS COMBIN
[16]   A uniform projection method for motif discovery in DNA sequences [J].
Raphael, B ;
Liu, LT ;
Varghese, G .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2004, 1 (02) :91-94
[17]  
Wang HS, 2008, STAT SINICA, V18, P299
[18]  
[No title captured]