Variable Ranking Feature Selection for the Identification of Nucleosome Related Sequences

被引:1
|
作者
Lo Bosco, Giosue [1 ,2 ]
Rizzo, Riccardo [3 ]
Fiannaca, Antonino [3 ]
La Rosa, Massimo [3 ]
Urso, Alfonso [3 ]
机构
[1] Univ Palermo, Dipartimento Matemat & Informat, UNIPA, Palermo, Italy
[2] Ist Euromediterraneo Sci & Tecnol, IEMEST, Dipartimento Sci Innovaz Tecnol, Palermo, Italy
[3] CNR, Natl Res Council Italy, ICAR, Palermo, Italy
来源
NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2018 | 2018年 / 909卷
关键词
Deep learning models; Feature selection; DNA sequences; Epigenomic; Nucleosomes;
D O I
10.1007/978-3-030-00063-9_30
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Several recent works have shown that K-mer sequence representation of a DNA sequence can be used for classification or identification of nucleosome positioning related sequences. This representation can be computationally expensive when k grows, making the complexity in spaces of exponential dimension. This issue affects significantly the classification task computed by a general machine learning algorithm used for the purpose of sequence classification. In this paper, we investigate the advantage offered by the so-called Variable Ranking Feature Selection method to select the most informative k - mers associated to a set of DNA sequences, for the final purpose of nucleosome/linker classification by a deep learning network. Results computed on three public datasets show the effectiveness of the adopted feature selection method.
引用
收藏
页码:314 / 324
页数:11
相关论文
共 50 条
  • [41] Feature extraction of weighted data for implicit variable selection
    Sanchez, Luis
    Martinez, Fernando
    Castellanos, German
    Salazar, Augusto
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2007, 4673 : 840 - 847
  • [42] Informative variable identifier: Expanding interpretability in feature selection
    Munoz-Romero, Sergio
    Gorostiaga, Arantza
    Soguero-Ruiz, Cristina
    Mora-Jimenez, Inmaculada
    Rojo-Alvarez, Jose Luis
    PATTERN RECOGNITION, 2020, 98
  • [43] A Comparative Study of Feature-Ranking and Feature-Subset Selection Techniques for Improved Fault Prediction
    Rathore, Santosh Singh
    Gupta, Atul
    PROCEEDINGS OF THE 7TH INDIA SOFTWARE ENGINEERING CONFERENCE 2014, ISEC '14, 2014,
  • [44] Feature Selection for Power Quality Event Identification
    Hafiz, Faizal
    Swain, Akshya
    Naik, Chirag
    Patel, Nitish
    TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 2984 - 2989
  • [45] Specific Emitter Identification Based on Feature Selection
    Xu, Yingsen
    Wang, Shilian
    Lu, Luxi
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL II: SIGNAL PROCESSING, 2020, 516 : 999 - 1008
  • [46] The Study of Feature Selection Strategy in Electrocardiogram Identification
    Chen, Chen
    Zheng, Gang
    Dai, Min
    BIOMETRIC RECOGNITION (CCBR 2014), 2014, 8833 : 489 - 497
  • [47] Systematicity in linguistic feature selection: Repair sequences and subsequent accommodation
    O'Neal, George
    JOURNAL OF ENGLISH AS A LINGUA FRANCA, 2019, 8 (02) : 211 - 233
  • [48] Feature selection for identification of spot welding processes
    Haapalainen, Eija
    Laurinen, Perttu
    Junno, Heli
    Tuovinen, Lauri
    Roning, Juha
    ICINCO 2006: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS: INTELLIGENT CONTROL SYSTEMS AND OPTIMIZATION, 2006, : 40 - +
  • [49] Feature selection for splice site prediction: A new method using EDA-based feature ranking
    Yvan Saeys
    Sven Degroeve
    Dirk Aeyels
    Pierre Rouzé
    Yves Van de Peer
    BMC Bioinformatics, 5
  • [50] A new representation in genetic programming with hybrid feature ranking criterion for high-dimensional feature selection
    Li, Jiayi
    Zhang, Fan
    Ma, Jianbin
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (04)