Exact asymptotic results for the Bernoulli matching model of sequence alignment

被引:43
作者
Majumdar, SN [1 ]
Nechaev, S
机构
[1] Univ Toulouse 3, Phys Theor Lab, CNRS, UMR C5152, F-31062 Toulouse, France
[2] Univ Paris 11, Lab Phys Theor & Modeles Stat, F-91405 Orsay, France
[3] LD Landau Theoret Phys Inst, Moscow 117334, Russia
来源
PHYSICAL REVIEW E | 2005年 / 72卷 / 02期
关键词
D O I
10.1103/PhysRevE.72.020901
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
Finding analytically the statistics of the longest common subsequence (LCS) of a pair of random sequences drawn from c alphabets is a challenging problem in computational evolutionary biology. We present exact asymptotic results for the distribution of the LCS in a simpler, yet nontrivial, variant of the original model called the Bernoulli matching (BM) model. We show that in the BM model, for all c, the distribution of the asymptotic length of the LCS, suitably scaled, is identical to the Tracy-Widom distribution of the largest eigenvalue of a random matrix whose entries are drawn from a Gaussian unitary ensemble.
引用
收藏
页数:4
相关论文
共 35 条
[31]   LONG COMMON SUBSEQUENCES AND THE PROXIMITY OF 2 RANDOM STRINGS [J].
STEELE, JM .
SIAM JOURNAL ON APPLIED MATHEMATICS, 1982, 42 (04) :731-737
[32]   LEVEL-SPACING DISTRIBUTIONS AND THE AIRY KERNEL [J].
TRACY, CA ;
WIDOM, H .
COMMUNICATIONS IN MATHEMATICAL PHYSICS, 1994, 159 (01) :151-174
[33]   STRING-TO-STRING CORRECTION PROBLEM [J].
WAGNER, RA ;
FISCHER, MJ .
JOURNAL OF THE ACM, 1974, 21 (01) :168-173
[34]   PHASE-TRANSITIONS IN SEQUENCE MATCHES AND NUCLEIC-ACID STRUCTURE [J].
WATERMAN, MS ;
GORDON, L ;
ARRATIA, R .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (05) :1239-1243
[35]   ALIGNMENT OF MOLECULAR SEQUENCES SEEN RANDOM PATH ANALYSIS [J].
ZHANG, MQ ;
MARR, TG .
JOURNAL OF THEORETICAL BIOLOGY, 1995, 174 (02) :119-129