In silico prediction of novel miRNAs from genomic sequences remains a challenging problem. This study presents a genome-wide miRNA discovery software package called GenoScan and evaluates two hairpin classification methods. These methods, one ensemble-based and one using logistic regression were benchmarked along with 15 published methods. In addition, the sequence-folding step is addressed by investigating the impact of secondary structure prediction methods and the choice of input sequence length on prediction performance. Both the accuracy of secondary structure predictions and the miRNA prediction are evaluated. In the benchmark of hairpin classification methods, the regression model achieved highest classification accuracy. Of the structure prediction methods evaluated, ContextFold achieved the highest agreement between predicted and experimentally determined structures. However, both the choice of secondary structure prediction method and input sequence length had limited impact on hairpin classification performance.
机构:
NYU, Dept Comp Sci, Courant Inst Math Sci, New York, NY 10003 USANYU, Dept Biol, Ctr Genom & Syst Biol, New York, NY 10003 USA
Chen, Huang-Wen
Bandyopadhyay, Sunayan
论文数: 0引用数: 0
h-index: 0
机构:
NYU, Dept Comp Sci, Courant Inst Math Sci, New York, NY 10003 USA
Univ Minnesota Twin Cities, Dept Comp Sci & Engn, Minneapolis, MN 55455 USANYU, Dept Biol, Ctr Genom & Syst Biol, New York, NY 10003 USA
Bandyopadhyay, Sunayan
Shasha, Dennis E.
论文数: 0引用数: 0
h-index: 0
机构:
NYU, Dept Comp Sci, Courant Inst Math Sci, New York, NY 10003 USANYU, Dept Biol, Ctr Genom & Syst Biol, New York, NY 10003 USA
Shasha, Dennis E.
Birnbaum, Kenneth D.
论文数: 0引用数: 0
h-index: 0
机构:
NYU, Dept Biol, Ctr Genom & Syst Biol, New York, NY 10003 USANYU, Dept Biol, Ctr Genom & Syst Biol, New York, NY 10003 USA
机构:
Columbia Univ, Dept Biostat, New York, NY 10027 USA
Columbia Univ, Vagelos Coll Phys & Surg, Dept Med, Div Nephrol, New York, NY USAColumbia Univ, Dept Biostat, New York, NY 10027 USA
Wang, Chen
Wang, Tianying
论文数: 0引用数: 0
h-index: 0
机构:
Colorado State Univ, Ft Collins, CO USAColumbia Univ, Dept Biostat, New York, NY 10027 USA
Wang, Tianying
论文数: 引用数:
h-index:
机构:
Kiryluk, Krzysztof
Wei, Ying
论文数: 0引用数: 0
h-index: 0
机构:
Columbia Univ, Dept Biostat, New York, NY 10027 USAColumbia Univ, Dept Biostat, New York, NY 10027 USA
Wei, Ying
Aschard, Hugues
论文数: 0引用数: 0
h-index: 0
机构:
Univ Paris Cite, Inst Pasteur, Dept Computat Biol, Paris, FranceColumbia Univ, Dept Biostat, New York, NY 10027 USA
Aschard, Hugues
Ionita-Laza, Iuliana
论文数: 0引用数: 0
h-index: 0
机构:
Columbia Univ, Dept Biostat, New York, NY 10027 USA
Lund Univ, Dept Stat, Lund, SwedenColumbia Univ, Dept Biostat, New York, NY 10027 USA
机构:
Univ Maryland, Dept Epidemiol & Biostat, College Pk, MD 20742 USAUniv Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
Wu, Tong Tong
Chen, Yi Fang
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Stat, Stanford, CA 94305 USAUniv Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
Chen, Yi Fang
论文数: 引用数:
h-index:
机构:
Hastie, Trevor
Sobel, Eric
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USAUniv Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
Sobel, Eric
Lange, Kenneth
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
Univ Calif Los Angeles, Dept Biomath, Los Angeles, CA 90095 USAUniv Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
机构:
Univ Informat Technol, VNU HCM, Dept Informat Syst, Ho Chi Minh City, VietnamUniv Informat Technol, VNU HCM, Dept Informat Syst, Ho Chi Minh City, Vietnam
Thanh, Binh Pham
2022 9TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI,
2022,
: 79
-
85