Two-stage clustering based effective sample selection for classification of premiRNAs

被引:0
|
作者
Xuan, Ping [1 ]
Guo, Mao-zu [1 ]
Shi, Lei-lei [2 ]
Wang, Jun [1 ]
Liu, Xiao-yan [1 ]
Li, Wen-bin [3 ]
Han, Ying-peng [3 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Heilongjiang, Peoples R China
[2] Univ Kent, Comp Lab, Canterbury CT2 7NF, Kent, England
[3] Northeast Agr Univ, Soybean Res Inst, Harbin 150030, Heilongjiang, Peoples R China
来源
2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE | 2010年
关键词
PREDICTION; MICRORNAS; FEATURES; REAL;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
To solve the class imbalance problem in classification of pre-miRNAs with ab initio method, a novel sample selection method is proposed according to the characteristics of pre-miRNAs. Real/pseudo premiRNAs are clustered based on their stem similarity and their distribution in high dimensional sample space respectively. The training samples are selected according to the sample density of each cluster. Experimental results are validated by the cross validation and other testing datasets composed of human real/pseudo pre-miRNAs. When compared with the previous study, microPred, our classifier miRNAPred is nearly 12% greater in total accuracy. Our sample selection algorithm is useful to construct more efficient classifier for classification of real premiRNAs and pseudo hairpin sequences.
引用
收藏
页码:549 / 552
页数:4
相关论文
共 50 条
  • [31] Recent two-stage sample selection procedures with an application to the gender wage gap
    Christofides, LN
    Li, Q
    Liu, ZJ
    Min, IS
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2003, 21 (03) : 396 - 405
  • [32] A novel two-stage wrapper feature selection approach based on greedy search for text sentiment classification
    Sagbas, Ensar Arif
    NEUROCOMPUTING, 2024, 590
  • [33] Two-stage feature selection for classification of gene expression data based on an improved Salp Swarm Algorithm
    Qin, Xiwen
    Zhang, Shuang
    Yin, Dongmei
    Chen, Dongxue
    Dong, Xiaogang
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (12) : 13747 - 13781
  • [34] Two-Stage Decomposition Method Based on Cooperation Coevolution for Feature Selection on High-Dimensional Classification
    Wang, Yanli
    Qu, Boyang
    Liang, Jing
    Wei, Yunpeng
    Yue, Caitong
    Hu, Yi
    Song, Hui
    IEEE ACCESS, 2019, 7 : 163191 - 163201
  • [35] Two-stage gene selection for support vector machine classification of microarray data
    Xia, Xiao-Lei
    Li, Kang
    Irwin, George W.
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2009, 8 (02) : 164 - 171
  • [36] A Two-Stage Feature Selection Algorithm Based on Redundancy and Relevance
    Antioquia, Arren Matthew C.
    Azcarraga, Arnulfo P.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [37] Variable Selection based on a Two-stage Projection Pursuit Algorithm
    Jiang, Shu
    Xie, Yijun
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 3: BIOINFORMATICS, 2020, : 188 - 193
  • [38] A wavelength selection method based on two-stage correlation coefficient
    Wan, Yan
    Chen, Zhengguang
    Jiao, Feng
    Chinese Journal of Analysis Laboratory, 2023, 42 (10) : 1332 - 1340
  • [39] A two-stage method for MUAP classification based on EMG decomposition
    Katsis, Christos D.
    Exarchos, Themis P.
    Papaloukas, Costas
    Goletsis, Yorgos
    Fotiadis, Dimitrios I.
    Sarmas, Ioannis
    COMPUTERS IN BIOLOGY AND MEDICINE, 2007, 37 (09) : 1232 - 1240
  • [40] A two-stage transformer based network for motor imagery classification
    Chaudhary, Priyanshu
    Dhankhar, Nischay
    Singhal, Amit
    Rana, K. P. S.
    MEDICAL ENGINEERING & PHYSICS, 2024, 128