The Performance Assessment Strategy in DC-BTA Multiple Sequence Alignment

Cited by: 0
Authors
Cao, Zhanmao [1 ,2 ]
Xiao, Wenjun [3 ]
Peng, Limin [1 ,2 ]
Affiliations
[1] South China Normal Univ, Sch Comp, Guangzhou 510631, Guangdong, Peoples R China
[2] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
[3] South China Univ Technol, Sch Software Engn, Guangzhou 510006, Guangdong, Peoples R China
Source
ADVANCED MEASUREMENT AND TEST, PARTS 1 AND 2 | 2010 / Vol. 439-440
Keywords
multiple sequence alignment; beam weight; beam area rate; beam position vector; WEIGHT MATRIX; SEARCH;
DOI
10.4028/www.scientific.net/KEM.439-440.35
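Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code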
0808 ; 0809 ;
Abstract
A new performance assessment model is proposed for multiple sequence alignment. The strategy is based on the beam construction of the DC-BTA algorithm, a divide-and-conquer alignment method that uses beams. Beams form blocks of nearly identical columns and contribute the largest similarity weight to the sequences. A formula that computes the areas of all beams covering a sequence assigns a value, or weight, to that sequence. The total beam area is a part of the whole alignment, so a rate between 0 and 1 is computed to assess performance. The scheme is a simple and effective assessment policy in DC-BTA because the beam areas are convenient to collect.
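The abstract does not give the formula itself, so the following is only a minimal sketch of how such a rate could be computed, not the authors' definition. It assumes a beam is a rectangular block spanning a set of sequences (rows) and a run of consecutive alignment columns, that a sequence's weight is the total width of the beams covering it, and that the rate is the number of alignment cells covered by beams divided by the total number of cells. The names `Beam`, `sequence_weight`, and `beam_area_rate` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Beam:
    """Hypothetical beam: a block of near-identical columns (an assumption)."""
    rows: frozenset  # indices of the sequences the beam covers
    start: int       # first alignment column, inclusive
    end: int         # last alignment column, exclusive

    @property
    def width(self) -> int:
        return self.end - self.start


def sequence_weight(seq_index, beams):
    """Assumed weight of one sequence: total width of the beams covering it."""
    return sum(b.width for b in beams if seq_index in b.rows)


def beam_area_rate(beams, num_sequences, alignment_length):
    """Assumed rate: fraction of alignment cells that lie inside some beam."""
    total_cells = num_sequences * alignment_length
    if total_cells <= 0:
        raise ValueError("alignment must contain at least one cell")

    covered = set()
    for b in beams:
        if not (0 <= b.start <= b.end <= alignment_length):
            raise ValueError("beam columns fall outside the alignment")
        for r in b.rows:
            if not (0 <= r < num_sequences):
                raise ValueError("beam row falls outside the alignment")
            # Collect cells in a set so overlapping beams are counted once,
            # which keeps the rate between 0 and 1.
            covered.update((r, c) for c in range(b.start, b.end))

    return len(covered) / total_cells
```

Under these assumptions, an alignment of 4 sequences and 10 columns with one beam over all four sequences in columns 0 to 2 and a second beam over sequences 1 and 2 in columns 5 to 7 has 18 covered cells out of 40, so the rate is 0.45.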
Pages: 35 / +
Number of pages: 2