Two-stage clustering based effective sample selection for classification of premiRNAs

被引:0
|
作者
Xuan, Ping [1 ]
Guo, Mao-zu [1 ]
Shi, Lei-lei [2 ]
Wang, Jun [1 ]
Liu, Xiao-yan [1 ]
Li, Wen-bin [3 ]
Han, Ying-peng [3 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Heilongjiang, Peoples R China
[2] Univ Kent, Comp Lab, Canterbury CT2 7NF, Kent, England
[3] Northeast Agr Univ, Soybean Res Inst, Harbin 150030, Heilongjiang, Peoples R China
来源
2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE | 2010年
关键词
PREDICTION; MICRORNAS; FEATURES; REAL;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
To solve the class imbalance problem in classification of pre-miRNAs with ab initio method, a novel sample selection method is proposed according to the characteristics of pre-miRNAs. Real/pseudo premiRNAs are clustered based on their stem similarity and their distribution in high dimensional sample space respectively. The training samples are selected according to the sample density of each cluster. Experimental results are validated by the cross validation and other testing datasets composed of human real/pseudo pre-miRNAs. When compared with the previous study, microPred, our classifier miRNAPred is nearly 12% greater in total accuracy. Our sample selection algorithm is useful to construct more efficient classifier for classification of real premiRNAs and pseudo hairpin sequences.
引用
收藏
页码:549 / 552
页数:4
相关论文
共 50 条
  • [1] Sequence Classification: A Regression Based Generalization of Two-stage Clustering
    Farin, Nusrat Jahan
    Mansoor, Nafees
    Momen, Sifat
    Mobin, Iftekharul
    Mohammed, Nabeel
    2016 INTERNATIONAL WORKSHOP ON COMPUTATIONAL INTELLIGENCE (IWCI), 2016, : 126 - 130
  • [2] Two-Stage Feature Selection for Text Classification
    Ozgur, Levent
    Gungor, Tunga
    INFORMATION SCIENCES AND SYSTEMS 2015, 2016, 363 : 329 - 337
  • [3] Clustering based Two-Stage Text Classification Requiring Minimal Training Data
    Zhang, Xue
    Xiao, Wangxin
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2012, 9 (04) : 1627 - 1643
  • [4] Adaptive Two-Stage Feature Selection for Sentiment Classification
    Chi, Xu
    Cambria, Erik
    Siew, Tan Puay
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 1238 - 1243
  • [5] On Two-Stage Feature Selection Methods for Text Classification
    Uysal, Alper Kursat
    IEEE ACCESS, 2018, 6 : 43233 - 43251
  • [6] Two-stage Feature Selection Method for Text Classification
    Li Xi
    Dai Hang
    Wang Mingwen
    MINES 2009: FIRST INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY, VOL 1, PROCEEDINGS, 2009, : 234 - +
  • [7] Two-Stage Traffic Clustering Based on HNSW
    Zhang, Xu
    Niu, Xinzheng
    Fournier-Viger, Philippe
    Wang, Bing
    ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 609 - 620
  • [8] Revisiting two-stage feature selection based on coverage policies for text classification
    Mendez-Molina, Arquimides
    Li Ona-Garcia, Ana
    Ariel Carrasco-Ochoa, Jesus
    Martinez-Trinidad, Jose Fco.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (05) : 2949 - 2957
  • [9] A two-stage Markov blanket based feature selection algorithm for text classification
    Javed, Kashif
    Maruf, Sameen
    Babri, Haroon A.
    NEUROCOMPUTING, 2015, 157 : 91 - 104
  • [10] Two-stage distributed generation optimal sizing with clustering-based node selection
    Rotaru, Florina
    Chicco, Gianfranco
    Grigoras, Gheorghe
    Cortina, Gheorghe
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2012, 40 (01) : 120 - 129