In silico prediction of novel miRNAs from genomic sequences remains a challenging problem. This study presents a genome-wide miRNA discovery software package called GenoScan and evaluates two hairpin classification methods. These methods, one ensemble-based and one using logistic regression were benchmarked along with 15 published methods. In addition, the sequence-folding step is addressed by investigating the impact of secondary structure prediction methods and the choice of input sequence length on prediction performance. Both the accuracy of secondary structure predictions and the miRNA prediction are evaluated. In the benchmark of hairpin classification methods, the regression model achieved highest classification accuracy. Of the structure prediction methods evaluated, ContextFold achieved the highest agreement between predicted and experimentally determined structures. However, both the choice of secondary structure prediction method and input sequence length had limited impact on hairpin classification performance.
机构:
Garvan Inst Med Res, Sydney, NSW 2010, Australia
UNSW Australia, Fac Med, St Vincents Clin Sch, Sydney, NSW 2052, AustraliaGarvan Inst Med Res, Sydney, NSW 2010, Australia
Mercer, Tim R.
Clark, Michael B.
论文数: 0引用数: 0
h-index: 0
机构:
Garvan Inst Med Res, Sydney, NSW 2010, Australia
Univ Oxford, Dept Physiol Anat & Genet, MRC Funct Genom Unit, Oxford OX1 3PT, EnglandGarvan Inst Med Res, Sydney, NSW 2010, Australia
Clark, Michael B.
Andersen, Stacey B.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Queensland, Australian Inst Bioengn & Nanotechnol, Brisbane, Qld 4072, AustraliaGarvan Inst Med Res, Sydney, NSW 2010, Australia
Andersen, Stacey B.
Brunck, Marion E.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Queensland, Australian Inst Bioengn & Nanotechnol, Brisbane, Qld 4072, AustraliaGarvan Inst Med Res, Sydney, NSW 2010, Australia
Brunck, Marion E.
论文数: 引用数:
h-index:
机构:
Haerty, Wilfried
Crawford, Joanna
论文数: 0引用数: 0
h-index: 0
机构:
Univ Queensland, Inst Mol Biosci, Brisbane, Qld 4072, AustraliaGarvan Inst Med Res, Sydney, NSW 2010, Australia
Crawford, Joanna
Taft, Ryan J.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Queensland, Inst Mol Biosci, Brisbane, Qld 4072, Australia
Illumina Inc, San Diego, CA 92122 USA
George Washington Univ, Sch Med & Hlth Serv, Dept Integrated Syst Biol, Washington, DC 20037 USA
George Washington Univ, Dept Pediat, Washington, DC 20037 USAGarvan Inst Med Res, Sydney, NSW 2010, Australia
Taft, Ryan J.
Nielsen, Lars K.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Queensland, Australian Inst Bioengn & Nanotechnol, Brisbane, Qld 4072, AustraliaGarvan Inst Med Res, Sydney, NSW 2010, Australia
Nielsen, Lars K.
Dinger, Marcel E.
论文数: 0引用数: 0
h-index: 0
机构:
Garvan Inst Med Res, Sydney, NSW 2010, Australia
UNSW Australia, Fac Med, St Vincents Clin Sch, Sydney, NSW 2052, AustraliaGarvan Inst Med Res, Sydney, NSW 2010, Australia
Dinger, Marcel E.
Mattick, John S.
论文数: 0引用数: 0
h-index: 0
机构:
Garvan Inst Med Res, Sydney, NSW 2010, Australia
UNSW Australia, Fac Med, St Vincents Clin Sch, Sydney, NSW 2052, AustraliaGarvan Inst Med Res, Sydney, NSW 2010, Australia
机构:
Univ Queensland, Translat Res Inst, Diamantina Inst, Brisbane, Qld, Australia
Univ Western Australia, Sch Womens & Infants Hlth, Perth, WA, AustraliaUniv Queensland, Translat Res Inst, Diamantina Inst, Brisbane, Qld, Australia
Warrington, Nicole M.
Evans, David M.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Queensland, Translat Res Inst, Diamantina Inst, Brisbane, Qld, Australia
Univ Bristol, MRC Integrat Epidemiol Unit, Bristol, Avon, England
Univ Bristol, Sch Social & Community Med, Bristol, Avon, EnglandUniv Queensland, Translat Res Inst, Diamantina Inst, Brisbane, Qld, Australia