IRREGULARITY OF DISTRIBUTION OF AMINO-ACID SUBSTITUTIONS ALONG AMINO-ACID-SEQUENCES OF HOMOLOGOUS PROTEINS

Cited by: 0
Author
KOSTETSKII, PV
Affiliation
Keywords
PHOSPHOLIPASE-A2; RHODOPSIN; CYTOCHROME-B; NA,K-ATPASE
DOI
Not available
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
A set of aligned sequences of homologous proteins is divided into two groups comprising the m and n most closely related sequences. Each position is then characterized by a point variability, equal to the number of mismatches over all possible intergroup pairwise comparisons divided by m·n. The point variability values, averaged over segments of ten consecutive positions, are plotted against the segment number to obtain an intergroup profile of local variability. The area S enclosed between the profile curve and the horizontal line at the mean local variability is compared with the average noise area S(c) for 1000 families of artificial homologous proteins, obtained by permuting the columns of amino acid residues of the initial family. If S exceeds S(c) by more than two standard deviations sigma(c), the variability profile contains peaks and valleys that correspond to significantly variable and conserved segments, respectively. To identify these segments, the area surplus, deltaS = S - (S(c) + 2 sigma(c)), is cut off by two horizontal lines, each detaching an area of deltaS/2. The difference S - S(c), expressed in units of the standard deviation, is proposed as a measure of the overall irregularity of substitutions along homologous protein sequences: OI = (S - S(c))/sigma(c). The method was applied to identify authentic variable and conserved segments in six families of homologous proteins: phospholipases A2, cytochromes b, alpha-subunits of Na,K-ATPases, the L- and M-subunits of the photoreaction center of photobacteria, and rhodopsins. For model families of homologous proteins obtained by k-fold repetition of natural proteins, the overall irregularity was shown to be proportional to the square root of k.
The extent of irregularity of substitutions in families of homologous proteins of different lengths L can be compared by normalizing the overall irregularity of each family to the length of an average protein domain, taken as 250 residues.
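The procedure in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation, and several details are assumptions the abstract does not settle: all function names are hypothetical, gaps are treated as ordinary characters, a trailing segment shorter than ten positions is dropped, the area S is approximated as the sum of absolute deviations of the profile from its mean (unit segment width), and sigma(c) is computed as a population standard deviation. Because point variability depends only on the content of a column, permuting alignment columns is equivalent to permuting the per-position variability values, which is what the sketch does. The step that cuts off the surplus deltaS with two horizontal lines is not covered.

```python
import random
from statistics import mean, pstdev


def point_variability(group_a, group_b):
    """Per-position mismatch count over all m*n intergroup pairs, divided by m*n."""
    m, n = len(group_a), len(group_b)
    length = len(group_a[0])
    return [
        sum(a[i] != b[i] for a in group_a for b in group_b) / (m * n)
        for i in range(length)
    ]


def local_profile(values, window=10):
    """Mean over consecutive non-overlapping segments of `window` positions.

    Assumption: an incomplete trailing segment is dropped.
    """
    return [
        mean(values[i:i + window])
        for i in range(0, len(values) - window + 1, window)
    ]


def profile_area(profile):
    """Area between the profile and the horizontal line at its mean.

    Assumption: approximated as the sum of absolute deviations.
    """
    level = mean(profile)
    return sum(abs(v - level) for v in profile)


def overall_irregularity(group_a, group_b, window=10, n_perm=1000, seed=0):
    """Return (S, S_c, sigma_c, OI) with OI = (S - S_c) / sigma_c."""
    rng = random.Random(seed)
    pv = point_variability(group_a, group_b)
    s = profile_area(local_profile(pv, window))

    noise_areas = []
    for _ in range(n_perm):
        shuffled = pv[:]          # permuting columns permutes these values
        rng.shuffle(shuffled)
        noise_areas.append(profile_area(local_profile(shuffled, window)))

    s_c = mean(noise_areas)
    sigma_c = pstdev(noise_areas)  # assumption: population standard deviation
    oi = (s - s_c) / sigma_c if sigma_c > 0 else float("nan")
    return s, s_c, sigma_c, oi


# Toy alignment (hypothetical data): positions 0-19 conserved, 20-39 variable.
toy_a = ["A" * 20 + "C" * 20, "A" * 20 + "C" * 20]
toy_b = ["A" * 20 + "D" * 20, "A" * 20 + "D" * 20]
S, S_c, sigma_c, OI = overall_irregularity(toy_a, toy_b, n_perm=200)
print(f"S={S:.2f}  S_c={S_c:.2f}  sigma_c={sigma_c:.2f}  OI={OI:.2f}")
```

In the toy alignment all substitutions are confined to the second half of the sequence, so S should clearly exceed S(c) + 2 sigma(c). With real families, the two groups would first have to be chosen from the alignment as described in the abstract.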
Pages: 1294-1302
Page count: 9