iSS-PseDNC: Identifying Splicing Sites Using Pseudo Dinucleotide Composition

被引:235
作者
Chen, Wei [1 ,2 ]
Feng, Peng-Mian [3 ]
Lin, Hao [2 ,4 ]
Chou, Kuo-Chen [1 ,2 ,5 ]
机构
[1] Hebei United Univ, Ctr Genom & Computat Biol, Sch Sci, Dept Phys, Tangshan 063000, Peoples R China
[2] Gordon Life Sci Inst, Boston, MA 02478 USA
[3] Hebei United Univ, Sch Publ Hlth, Tangshan 063000, Peoples R China
[4] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Ctr Bioinformat, Key Lab Neuroinformat,Minist Educ, Chengdu 610054, Peoples R China
[5] King Abdulaziz Univ, Ctr Excellence Genom Med Res, Jeddah 21589, Saudi Arabia
关键词
AMINO-ACID-COMPOSITION; MEMBRANE-PROTEIN TYPES; CHOUS PSEAAC FORMULATION; AVERAGE CHEMICAL-SHIFT; MULTI-LABEL CLASSIFIER; GENERAL-FORM; SUBCELLULAR-LOCALIZATION; WEB-SERVER; PHYSICOCHEMICAL PROPERTIES; EVOLUTIONARY INFORMATION;
D O I
10.1155/2014/623149
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
In eukaryotic genes, exons are generally interrupted by introns. Accurately removing introns and joining exons together are essential processes in eukaryotic gene expression. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapid and effective detection of splice sites that play important roles in gene structure annotation and even in RNA splicing. Although a series of computational methods were proposed for splice site identification, most of them neglected the intrinsic local structural properties. In the present study, a predictor called "iSS-PseDNC" was developed for identifying splice sites. In the new predictor, the sequences were formulated by a novel feature-vector called "pseudo dinucleotide composition" (PseDNC) into which six DNA local structural properties were incorporated. It was observed by the rigorous cross-validation tests on two benchmark datasets that the overall success rates achieved by iSS-PseDNC in identifying splice donor site and splice acceptor site were 85.45% and 87.73%, respectively. It is anticipated that iSS-PseDNC may become a useful tool for identifying splice sites and that the six DNA local structural properties described in this paper may provide novel insights for in-depth investigations into the mechanism of RNA splicing.
引用
收藏
页数:12
相关论文
共 110 条
[1]   Generic eukaryotic core promoter prediction using structural features of DNA [J].
Abeel, Thomas ;
Saeys, Yvan ;
Bonnet, Eric ;
Rouze, Pierre ;
Van de Peer, Yves .
GENOME RESEARCH, 2008, 18 (02) :310-323
[2]   The benzylthio-pyrimidine U-31,355, a potent inhibitor of HIV-1 reverse transcriptase [J].
Althaus, IW ;
Chou, KC ;
Lemay, RJ ;
Franks, KM ;
Deibel, MR ;
Kezdy, FJ ;
Resnick, L ;
Busso, ME ;
So, AG ;
Downey, KM ;
Romero, DL ;
Thomas, RC ;
Aristoff, PA ;
Tarpley, WG ;
Reusser, F .
BIOCHEMICAL PHARMACOLOGY, 1996, 51 (06) :743-750
[3]  
ALTHAUS IW, 1993, J BIOL CHEM, V268, P6119
[4]   KINETIC-STUDIES WITH THE NONNUCLEOSIDE HUMAN-IMMUNODEFICIENCY-VIRUS TYPE-1 REVERSE-TRANSCRIPTASE INHIBITOR U-90152E [J].
ALTHAUS, IW ;
CHOU, JJ ;
GONZALES, AJ ;
DEIBEL, MR ;
CHOU, KC ;
KEZDY, FJ ;
ROMERO, DL ;
THOMAS, RC ;
ARISTOFF, PA ;
TARPLEY, WG ;
REUSSER, F .
BIOCHEMICAL PHARMACOLOGY, 1994, 47 (11) :2017-2028
[5]   KINETIC-STUDIES WITH THE NONNUCLEOSIDE HIV-1 REVERSE-TRANSCRIPTASE INHIBITOR-U-88204E [J].
ALTHAUS, IW ;
CHOU, JJ ;
GONZALES, AJ ;
DEIBEL, MR ;
CHOU, KC ;
KEZDY, FJ ;
ROMERO, DL ;
PALMER, JR ;
THOMAS, RC ;
ARISTOFF, PA ;
TARPLEY, WG ;
REUSSER, F .
BIOCHEMISTRY, 1993, 32 (26) :6548-6554
[6]   Kinetic plasticity and the determination of product ratios for kinetic schemes leading to multiple products without rate laws - New methods based on directed graphs [J].
Andraos, John .
CANADIAN JOURNAL OF CHEMISTRY, 2008, 86 (04) :342-357
[7]   Predicting subcellular localization of proteins in a hybridization space [J].
Cai, YD ;
Chou, KC .
BIOINFORMATICS, 2004, 20 (07) :1151-1156
[8]   Support vector machines for predicting membrane protein types by using functional domain composition [J].
Cai, YD ;
Zhou, GP ;
Chou, KC .
BIOPHYSICAL JOURNAL, 2003, 84 (05) :3257-3263
[9]   propy: a tool to generate various modes of Chou's PseAAC [J].
Cao, Dong-Sheng ;
Xu, Qing-Song ;
Liang, Yi-Zeng .
BIOINFORMATICS, 2013, 29 (07) :960-962
[10]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)