The interpretation of geochemical survey data

被引:271
作者
Grunsky, Eric C. [1 ]
机构
[1] Geol Survey Canada, Nat Resources Canada, Ottawa, ON K1A 0E9, Canada
关键词
geochemistry; data analysis; visualization; statistical methods; data interpretation; review; MULTIPLE DETECTION LIMITS; HOWARTH ERROR ANALYSIS; S-LANGUAGE SOFTWARE; WATER-QUALITY DATA; COMPOSITIONAL DATA; STATISTICAL-ANALYSIS; LITHOGEOCHEMICAL DATA; NORMATIVE MINERALOGY; PISOLITIC LATERITE; SEDIMENTARY-ROCKS;
D O I
10.1144/1467-7873/09-210
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Geochemical data are generally derived from government and industry geochemical surveys that cover areas at various spatial resolutions. These survey data are difficult to assemble and integrate due to their heterogeneous mixture of media, size fractions, methods of digestion and analytical instrumentation. These assembled sets of data often contain thousands of observations with as many as 50 or more elements. Although the assembly of these data is a challenge, the resulting integrated datasets provide an opportunity to discover a wide range of geochemical processes that are associated with underlying geology, alteration, landscape modification, weathering and mineralization. The use of data analysis and statistical visualization methods, combined with geographical information systems, provides an effective environment for process identification and pattern discovery in these large sets of data. Modern methods of evaluating data for associations, structures and patterns are grouped under the term 'data mining'. Mining data includes the application of multivariate data analysis and statistical techniques, combined with geographical information systems, and can significantly assist the task of data interpretation and subsequent model building. Geochemical data require special handling when measures of association are required. Because of its compositional nature logratios are required to eliminate the effects of closure on geochemical data. Exploratory multivariate methods include: scatterplot matrices (SPLOM), adjusting for censored and missing data, detecting atypical observations, computing robust means, correlations and covariances, principal component analysis, cluster analysis and knowledge based indices of association. Modelled multivariate methods include discriminant analysis, analysis of variance, classification and regression trees neural networks and related techniques. Many of these topics are covered with examples to demonstrate their application.
引用
收藏
页码:27 / 74
页数:48
相关论文
共 210 条
  • [1] RELATIVE VARIATION DIAGRAMS FOR DESCRIBING PATTERNS OF COMPOSITIONAL VARIABILITY
    AITCHISON, J
    [J]. MATHEMATICAL GEOLOGY, 1990, 22 (04): : 487 - 511
  • [2] Aitchison J., 1997, Proceedings of IAMG '97: The third annual conference of the International Association for Mathematical Geology, International Center for Numerical Methods in Engineering, V97, P3
  • [3] Aitchison John., 1986, STAT ANAL COMPOSITIO, DOI [10.1007/978-94-009-4109-0, DOI 10.1007/978-94-009-4109-0, 10.1111/j.2517-6161.1982.tb01195.x]
  • [4] [Anonymous], EXPLORE
  • [5] [Anonymous], 2002, Principal components analysis
  • [6] [Anonymous], 2003, User's Guide to Principal Components
  • [7] [Anonymous], 1980, CLUSTER ANAL
  • [8] [Anonymous], 1988, Principles of Multivariate Analysis
  • [9] [Anonymous], 1984, GEOCHEMICAL EXPLORAT
  • [10] [Anonymous], 1979, Economic Geology Report