Data-mining discovery of pattern and process in ecological systems

被引:110
作者
Hochachka, Wesley M. [1 ]
Caruana, Rich
Fink, Danniel
Munson, Art
Riedewald, Mirek
Sorokina, Darla
Kelling, Steve
机构
[1] Cornell Univ, Ornithol Lab, Ithaca, NY 14850 USA
[2] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
关键词
bagging; data mining; decision trees; exploratory data analysis; hypothesis generation; machine learning; prediction;
D O I
10.2193/2006-503
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Most ecologists use statistical methods as their main analytical tools when analyzing data to identify relationships between a response and a set of predictors; thus, they treat all analyses as hypothesis tests or exercises in parameter estimation. However, little or no prior knowledge about a system can lead to creation of a statistical model or models that do not accurately describe major sources of variation in the response variable. We suggest that under such circumstances data mining is more appropriate for analysis. lit this paper we 1) present the distinctions between data-mining (usually exploratory) analyses and parametric statistical (confirmatory) analyses, 2) illustrate 3 strengths of data-mining tools for generating hypotheses from data, and 3) suggest useful ways in which data mining and statistical analyses can be integrated into a thorough analysis of data to facilitate rapid creation of accurate models and to guide further research.
引用
收藏
页码:2427 / 2437
页数:11
相关论文
共 50 条
[21]   Poker Learner: Players Modeling Through Data-Mining [J].
Silva, Nuno ;
Reis, Luis Paulo .
2015 10TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2015,
[22]   Data-mining behavioural data from the web [J].
Balogh, Zoltan .
PROCEEDINGS OF 2016 10TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT & APPLICATIONS (SKIMA), 2016, :122-127
[23]   Data-mining synthesised schedulers for hard real-time systems [J].
Kloukinas, C .
19TH INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, PROCEEDINGS, 2004, :14-23
[24]   Data-mining Approach for Battery Materials [J].
Ghadbeigi, Leila ;
Sparks, Taylor D. ;
Harada, Jaye K. ;
Lettiere, Bethany R. .
2015 IEEE CONFERENCE ON TECHNOLOGIES FOR SUSTAINABILITY (SUSTECH), 2015, :239-244
[25]   Attention technology for subjective data-mining [J].
Lien, Chen-Hsi ;
Su, Juei-Yang .
Eleventh ISSAT International Conference Reliability and Quality in Design, Proceedings, 2005, :131-134
[26]   DATA-MINING BASED FAULT DETECTION [J].
Ma Hongguang Han Chongzhao (Xi’an Jiaotong University .
Journal of Electronics(China), 2005, (06) :39-45
[27]   Molecular data-mining: a challenge for chemometrics [J].
Buydens, LMC ;
Reijmers, TH ;
Beckers, MLM ;
Wehrens, R .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1999, 49 (02) :121-133
[28]   Data-Mining Possibilities in Blended Learning [J].
Baksa-Hasko, Gabriella ;
Baranyai, Brigitta .
TEACHING AND LEARNING IN A DIGITAL WORLD, 2018, 716 :174-183
[29]   Fehlende Daten beim Data-Mining [J].
Dieter William Joenssen ;
Thomas Müllerleile .
HMD Praxis der Wirtschaftsinformatik, 2014, 51 (4) :458-468
[30]   Theory and support for process frameworks of knowledge discovery and data mining from ERP systems [J].
Bendoly, E .
INFORMATION & MANAGEMENT, 2003, 40 (07) :639-647