Variable Ranking Feature Selection for the Identification of Nucleosome Related Sequences

被引:1
|
作者
Lo Bosco, Giosue [1 ,2 ]
Rizzo, Riccardo [3 ]
Fiannaca, Antonino [3 ]
La Rosa, Massimo [3 ]
Urso, Alfonso [3 ]
机构
[1] Univ Palermo, Dipartimento Matemat & Informat, UNIPA, Palermo, Italy
[2] Ist Euromediterraneo Sci & Tecnol, IEMEST, Dipartimento Sci Innovaz Tecnol, Palermo, Italy
[3] CNR, Natl Res Council Italy, ICAR, Palermo, Italy
来源
NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2018 | 2018年 / 909卷
关键词
Deep learning models; Feature selection; DNA sequences; Epigenomic; Nucleosomes;
D O I
10.1007/978-3-030-00063-9_30
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Several recent works have shown that K-mer sequence representation of a DNA sequence can be used for classification or identification of nucleosome positioning related sequences. This representation can be computationally expensive when k grows, making the complexity in spaces of exponential dimension. This issue affects significantly the classification task computed by a general machine learning algorithm used for the purpose of sequence classification. In this paper, we investigate the advantage offered by the so-called Variable Ranking Feature Selection method to select the most informative k - mers associated to a set of DNA sequences, for the final purpose of nucleosome/linker classification by a deep learning network. Results computed on three public datasets show the effectiveness of the adopted feature selection method.
引用
收藏
页码:314 / 324
页数:11
相关论文
共 50 条
  • [21] Variable selection from a feature representing protein sequences: a case of classification on bacterial type IV secreted effectors
    Jian Zhang
    Lixin Lv
    Donglei Lu
    Denan Kong
    Mohammed Abdoh Ali Al-Alashaari
    Xudong Zhao
    BMC Bioinformatics, 21
  • [22] ENSEMBLE FEATURE SELECTION APPROACH BASED ON FEATURE RANKING FOR RICE SEED IMAGES CLASSIFICATION
    Dzi Lam Tran Tuan
    Surinwarangkoon, Thongchai
    Meethongjan, Kittikhun
    Vinh Truong Hoang
    ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2020, 18 (03) : 198 - 206
  • [23] Feature Ranking and Differential Evolution for Feature Selection in Brushless DC Motor Fault Diagnosis
    Lee, Chun-Yao
    Hung, Chen-Hsu
    SYMMETRY-BASEL, 2021, 13 (07):
  • [24] Variable selection from a feature representing protein sequences: a case of classification on bacterial type IV secreted effectors
    Zhang, Jian
    Lv, Lixin
    Lu, Donglei
    Kong, Denan
    Al-Alashaari, Mohammed Abdoh Ali
    Zhao, Xudong
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [25] Evaluating Feature Selection for Stress Identification
    Deng, Yong
    Wu, Zhonghai
    Chu, Chao-Hsien
    Yang, Tao
    2012 IEEE 13TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2012, : 584 - 591
  • [26] Feature Selection for Forensic Handwriting Identification
    Amaral, Aline Maria M. M.
    de Almendra Freitas, Cinthia Obladen
    Bortolozzi, Flavio
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 922 - 926
  • [27] Improved Feature Selection Algorithm for Biological Sequences Classification
    Guannoni, Naoual
    Mhamdi, Faouzi
    Elloumi, Mourad
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 689 - 700
  • [28] k-best feature selection and ranking via stochastic approximation
    Akman, David V.
    Malekipirbazari, Milad
    Yenice, Zeren D.
    Yeo, Anders
    Adhikari, Niranjan
    Wong, Yong Kai
    Abbasi, Babak
    Gumus, Alev Taskin
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [29] Phenotype feature selection for crop breeding evaluation based on ranking relevance
    Liu, Zhongqiang
    Zhao, Xiangyu
    Wang, Kaiyi
    Li, Minzan
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2015, 46 : 283 - 289
  • [30] A new ranking-based stability measure for feature selection algorithms
    Rakesh, Deepak Kumar
    Anwit, Raj
    Jana, Prasanta K.
    SOFT COMPUTING, 2023, 27 (09) : 5377 - 5396