Comparative study of artificial neural network for classification of hot and cold recombination regions in Saccharomyces cerevisiae

被引:5
作者
Dwivedi, Ashok Kumar [1 ,2 ]
Chouhan, Usha [2 ]
机构
[1] Maulana Azad Natl Inst Technol, Dept Bioinformat Math & Comp Applicat, Bhopal 462003, MP, India
[2] Maulana Azad Natl Inst Technol, Dept Math Bioinformat & Comp Applicat, Bhopal 462003, MP, India
关键词
Recombination; Support vector machine; Artificial neural network; Naive Bayes; Amino acid composition; Classification; Machine learning; Evolution; SUPPORT VECTOR MACHINES; DOUBLE-STRAND BREAKS; CODON USAGE; AFLATOXIN B-1; DROSOPHILA-MELANOGASTER; HUMAN GENOME; HOTSPOTS; YEAST; SELECTION; SPOTS;
D O I
10.1007/s00521-016-2466-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
At the chromosomal level of evolution, recombination is a major factor for genetic variations. However, recombination does not occur with equal frequency at various regions of genome. The recombination has the tendency to occur at specific region with higher frequency and with low frequency at other regions, and former regions are named as hot recombination regions whereas later are called cold regions for recombination. In this paper, we have developed supervised machine learning-based models using artificial neural network, support vector machine and Na < ve Bayes for efficient and effective classification of such hot and cold recombination regions based on the nucleotide composition of sequences. All models were validated and tested using tenfold cross-validation. Furthermore, neural network model was validated using leave one out and random sampling techniques in addition to tenfold cross-validation. Moreover, models were evaluated using receiver-operating curve. Our results indicate that artificial neural network achieves the best result.
引用
收藏
页码:529 / 535
页数:7
相关论文
共 47 条
[1]  
[Anonymous], 2000, NATURE STAT LEARNING, DOI DOI 10.1007/978-1-4757-3264-1
[2]  
[Anonymous], 1996, INTRO BAYESIAN NETWO
[3]   Clustering of meiotic double-strand breaks on yeast chromosome III [J].
Baudat, F ;
Nicolas, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (10) :5213-5218
[4]   A neural computational aid to the diagnosis of acute myocardial infarction [J].
Baxt, WG ;
Shofer, FS ;
Sites, FD ;
Hollander, JE .
ANNALS OF EMERGENCY MEDICINE, 2002, 39 (04) :366-373
[5]   THE RELATIONSHIP BETWEEN BASE COMPOSITION AND CODON USAGE IN BACTERIAL GENES AND ITS USE FOR THE SIMPLE AND RELIABLE IDENTIFICATION OF PROTEIN-CODING SEQUENCES [J].
BIBB, MJ ;
FINDLAY, PR ;
JOHNSON, MW .
GENE, 1984, 30 (1-3) :157-166
[6]   Correlation between nucleotide composition and folding energy of coding sequences with special attention to wobble bases [J].
Biro, Jan C. .
THEORETICAL BIOLOGY AND MEDICAL MODELLING, 2008, 5
[7]   Guanine alkylation by the potent carcinogen aflatoxin B1:: Quantum chemical calculations [J].
Bren, Urban ;
Guengerich, F. Peter ;
Mavri, Janez .
CHEMICAL RESEARCH IN TOXICOLOGY, 2007, 20 (08) :1134-1140
[8]   Cooperative Binding of Aflatoxin B1 by Cytochrome P450 3A4: A Computational Study [J].
Bren, Urban ;
Fuchs, Julian E. ;
Oostenbrink, Chris .
CHEMICAL RESEARCH IN TOXICOLOGY, 2014, 27 (12) :2136-2147
[9]   Empirical studies of quality models in object-oriented systems [J].
Briand, LC ;
Wüst, J .
ADVANCES IN COMPUTERS, VOL 56, 2002, 56 :97-166
[10]   Inherent Stereospecificity in the Reaction of Aflatoxin B1 8,9-Epoxide with Deoxyguanosine and Efficiency of DNA Catalysis [J].
Brown, Kyle L. ;
Bren, Urban ;
Stone, Michael P. ;
Guengerich, F. Peter .
CHEMICAL RESEARCH IN TOXICOLOGY, 2009, 22 (05) :913-917