DETECTION OF COMPOSITIONAL CONSTRAINTS IN NUCLEIC-ACID SEQUENCES USING NEURAL NETWORKS

Cited by: 0
Authors
GRANJEON, E
TARROUX, P
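Institution
Source
COMPUTER APPLICATIONS IN THE BIOSCIENCES | 1995 / Vol. 11 / No. 01
Keywords
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract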
We describe a neural network method for detecting compositional constraints in introns and exons. In the first part of the algorithm (learning phase), examples of intron and exon sequences are presented to the network and its connections are modified using the backpropagation algorithm. Previous connectionist methods learned exons and introns by using the latter as negative examples for the former. We chose instead to learn introns and exons jointly, using junk DNA as a common counter-example. In the second part (generalization phase), we tested the neural networks in the search for exons and introns in the human globin cluster. Their performance was also checked on the classification of unknown examples. As with previous approaches, this technique discriminates introns and exons: the correlation coefficients are 0.50 and 0.64, respectively, for the best network obtained. Moreover, using junk DNA sequences in the learning phase makes it possible to detect constrained regions inside the intron and exon sequences (i.e. sequences that differ from junk DNA in their nucleic acid composition). This approach could be useful for studying the internal organization of these sequences.
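The abstract reports one correlation coefficient per class but does not say which statistic was used. As an illustration only, the sketch below assumes a one-vs-rest Matthews correlation coefficient, a measure commonly used for sequence classifiers. The function name `class_correlation` and the toy labels are hypothetical and are not taken from the paper.

```python
from math import sqrt


def class_correlation(true_labels, predicted_labels, target):
    """One-vs-rest Matthews correlation coefficient for the class `target`.

    Assumption: the paper's "correlation coefficient" is of this kind;
    the abstract does not confirm it.
    """
    tp = tn = fp = fn = 0
    for truth, guess in zip(true_labels, predicted_labels):
        if truth == target and guess == target:
            tp += 1  # target correctly detected
        elif truth != target and guess != target:
            tn += 1  # non-target correctly rejected
        elif guess == target:
            fp += 1  # non-target wrongly labelled as target
        else:
            fn += 1  # target missed
    denominator = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Convention: report 0.0 when the ratio is undefined (an empty margin).
    if denominator == 0:
        return 0.0
    return (tp * tn - fp * fn) / denominator


# Toy labels invented for illustration (not data from the paper).
# "junk" plays the role of the common counter-example class.
truth = ["exon", "exon", "intron", "intron", "junk", "junk"]
guess = ["exon", "intron", "intron", "intron", "junk", "exon"]

exon_cc = class_correlation(truth, guess, "exon")
intron_cc = class_correlation(truth, guess, "intron")
print(f"exon: {exon_cc:.2f}, intron: {intron_cc:.2f}")
```

Scoring each class against everything else, junk DNA included, yields one coefficient for introns and one for exons, which matches the way the abstract reports two values.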
Pages: 29-37
Number of pages: 9