Models of the Gene Must Inform Data-Mining Strategies in Genomics

被引:4
|
作者
Huminiecki, Lukasz [1 ]
机构
[1] Polish Acad Sci, Inst Genet & Anim Biotechnol, Dept Mol Biol, PL-00901 Warsaw, Poland
基金
欧盟地平线“2020”;
关键词
gene concept; scientific method; experimentalism; reductionism; anti-reductionism; data-mining; NETWORK MEDICINE; EXPRESSION;
D O I
10.3390/e22090942
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The gene is a fundamental concept of genetics, which emerged with the Mendelian paradigm of heredity at the beginning of the 20th century. However, the concept has since diversified. Somewhat different narratives and models of the gene developed in several sub-disciplines of genetics, that is in classical genetics, population genetics, molecular genetics, genomics, and, recently, also, in systems genetics. Here, I ask how the diversity of the concept impacts data-integration and data-mining strategies for bioinformatics, genomics, statistical genetics, and data science. I also consider theoretical background of the concept of the gene in the ideas of empiricism and experimentalism, as well as reductionist and anti-reductionist narratives on the concept. Finally, a few strategies of analysis from published examples of data-mining projects are discussed. Moreover, the examples are re-interpreted in the light of the theoretical material. I argue that the choice of an optimal level of abstraction for the gene is vital for a successful genome analysis.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Cancer gene search with data-mining and genetic algorithms
    Shah, Shital
    Kusiak, Andrew
    COMPUTERS IN BIOLOGY AND MEDICINE, 2007, 37 (02) : 251 - 261
  • [2] Reduced Models in Chemical Kinetics via Nonlinear Data-Mining
    Chiavazzo, Eliodoro
    Gear, Charles W.
    Dsilva, Carmeline J.
    Rabin, Neta
    Kevrekidis, Ioannis G.
    PROCESSES, 2014, 2 (01): : 112 - 140
  • [3] Inferring Market Strategies: Applying Data-Mining to Analysis of Financial Markets
    Luis Gordillo-Ruiz, Jose
    Martinez-Miranda, Enrique
    Stephens, Christopher R.
    COMPUTACION Y SISTEMAS, 2012, 16 (02): : 221 - 231
  • [4] Collation and data-mining of literature bioactivity data for drug discovery
    Bellis, Louisa J.
    Akhtar, Ruth
    Al-Lazikani, Bissan
    Atkinson, Francis
    Bento, A. Patricia
    Chambers, Jon
    Davies, Mark
    Gaulton, Anna
    Hersey, Anne
    Ikeda, Kazuyoshi
    Krueger, Felix A.
    Light, Yvonne
    McGlinchey, Shaun
    Santos, Rita
    Stauch, Benjamin
    Overington, John P.
    BIOCHEMICAL SOCIETY TRANSACTIONS, 2011, 39 : 1365 - 1370
  • [5] Data-mining application architecture
    Petersohn, H
    WIRTSCHAFTSINFORMATIK, 2004, 46 (01): : 15 - 21
  • [6] Rethinking Network Management : Models, Data-Mining and Self-Learning
    Wallin, Stefan
    Ahlund, Christer
    Nordlander, Johan
    2012 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (NOMS), 2012, : 880 - 886
  • [7] Data-Mining models for the Diagnosis of EMG-based Neuromuscular Diseases
    Pandey, Babita
    Mishra, R. B.
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2011, 6 (02) : 109 - 128
  • [8] Data-mining behavioural data from the web
    Balogh, Zoltan
    PROCEEDINGS OF 2016 10TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT & APPLICATIONS (SKIMA), 2016, : 122 - 127
  • [9] Molecular data-mining: a challenge for chemometrics
    Buydens, LMC
    Reijmers, TH
    Beckers, MLM
    Wehrens, R
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1999, 49 (02) : 121 - 133
  • [10] Data-mining Approach for Battery Materials
    Ghadbeigi, Leila
    Sparks, Taylor D.
    Harada, Jaye K.
    Lettiere, Bethany R.
    2015 IEEE CONFERENCE ON TECHNOLOGIES FOR SUSTAINABILITY (SUSTECH), 2015, : 239 - 244