Linear correlation discovery in databases: a data mining approach

被引:11
作者
Chiang, RHL
Cecil, CEH
Lim, EP
机构
[1] Univ Cincinnati, Coll Business, Dept Informat Syst, Cincinnati, OH 45221 USA
[2] Nanyang Technol Univ, Nanyang Business Sch, Singapore 639798, Singapore
[3] Nanyang Technol Univ, Sch Comp Engn, Ctr Adv Informat Syst, Singapore 639798, Singapore
关键词
knowledge discovery in database; linear correlation; association measurement; data mining;
D O I
10.1016/j.datak.2004.09.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Very little research in knowledge discovery has studied how to incorporate statistical methods to automate linear correlation discovery (LCD). We present an automatic LCD methodology that adopts statistical measurement functions to discover correlations from databases' attributes. Our methodology automatically pairs attribute groups having potential linear correlations, measures the linear correlation of each pair of attribute groups, and confirms the discovered correlation. The methodology is evaluated in two sets of experiments. The results demonstrate the methodology's ability to facilitate linear correlation discovery for databases with a large amount of data. (c) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:311 / 337
页数:27
相关论文
共 50 条
  • [1] Data mining and knowledge discovery in databases - An overview
    MacKinnon, MJ
    Glick, N
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 1999, 41 (03) : 255 - 275
  • [2] Data mining: Efficiency of using sequence databases for polymorphism discovery
    Cox, DG
    Boillot, C
    Canzian, F
    HUMAN MUTATION, 2001, 17 (02) : 141 - 150
  • [3] A new data clustering approach for data mining in large databases
    Tsai, CF
    Wu, HC
    Tsai, CW
    I-SPAN'02: INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND NETWORKS, PROCEEDINGS, 2002, : 315 - 320
  • [4] Data mining and clustering in chemical process databases for monitoring and knowledge discovery
    Thomas, Michael C.
    Zhu, Wenbo
    Romagnoli, Jose A.
    JOURNAL OF PROCESS CONTROL, 2018, 67 : 160 - 175
  • [5] ACODF: a novel data clustering approach for data mining in large databases
    Tsai, CF
    Tsai, CW
    Wu, HC
    Yang, T
    JOURNAL OF SYSTEMS AND SOFTWARE, 2004, 73 (01) : 133 - 145
  • [6] Proposal of Honeypot-Based Data Mining Methods for the Discovery of Intrusions in Big Data Databases
    Kanga, Koffi
    Kamagate, Beman Hamidja
    Kabore, Raogo
    Oumtanaga, Souleymane
    TOWARDS NEW E-INFRASTRUCTURE AND E-SERVICES FOR DEVELOPING COUNTRIES, AFRICOMM 2023, PT I, 2025, 587 : 312 - 332
  • [7] Data mining via linguistic summaries of databases: An interactive approach
    Kacprzyk, J
    Zadrozny, S
    NEW PARADIGM OF KNOWLEDGE ENGINEERING BY SOFT COMPUTING, 2001, 5 : 325 - 345
  • [8] A Data Mining Approach for Biomarker Discovery Using Transcriptomics in Endometriosis
    Akter, Sadia
    Xu, Dong
    Nagel, Susan C.
    Joshi, Trupti
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 969 - 972
  • [9] Data mining and knowledge discovery in databases for urban solid waste management: A scientific literature review
    Dias, Janaina Lopes
    Sott, Michele Kremer
    Ferrao, Caroline Cipolatto
    Furtado, Joao Carlos
    Ribas Moraes, Jorge Andre
    WASTE MANAGEMENT & RESEARCH, 2021, 39 (11) : 1331 - 1340
  • [10] Causal Rule Mining for Knowledge Discovery from Databases
    Bhoopathi, Harchana
    Rama, B.
    2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2017, : 978 - 984