Models of the Gene Must Inform Data-Mining Strategies in Genomics

Times cited: 4
Author
Huminiecki, Lukasz [1]
Affiliation
[1] Polish Acad Sci, Inst Genet & Anim Biotechnol, Dept Mol Biol, PL-00901 Warsaw, Poland
Funding
European Union Horizon 2020
Keywords
gene concept; scientific method; experimentalism; reductionism; anti-reductionism; data-mining; NETWORK MEDICINE; EXPRESSION;
DOI
10.3390/e22090942
Chinese Library Classification number
O4 [Physics]
Discipline classification code
0702
Abstract
The gene is a fundamental concept of genetics, which emerged with the Mendelian paradigm of heredity at the beginning of the 20th century. However, the concept has since diversified. Somewhat different narratives and models of the gene developed in several sub-disciplines of genetics, that is, in classical genetics, population genetics, molecular genetics, genomics, and, recently, in systems genetics. Here, I ask how the diversity of the concept impacts data-integration and data-mining strategies for bioinformatics, genomics, statistical genetics, and data science. I also consider the theoretical background of the concept of the gene in the ideas of empiricism and experimentalism, as well as reductionist and anti-reductionist narratives on the concept. Finally, a few strategies of analysis from published examples of data-mining projects are discussed, and the examples are re-interpreted in the light of the theoretical material. I argue that the choice of an optimal level of abstraction for the gene is vital for a successful genome analysis.
Pages: 16