TOP-10 DATA MINING CASE STUDIES

Cited by: 4
Authors
Melli, Gabor [1]
Wu, Xindong [2]
Beinat, Paul [3]
Bonchi, Francesco [4]
Cao, Longbing [5]
Duan, Rong [6]
Faloutsos, Christos [7]
Ghani, Rayid [8]
Kitts, Brendan [9]
Goethals, Bart [10]
Mclachlan, Geoff [11]
Pei, Jian [12]
Srivastava, Ashok [13]
Zaiane, Osmar [14]
Affiliations
[1] PredictionWorks Inc, Seattle, WA 98126 USA
[2] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
[3] NeuronWorks Int, Hurstville, NSW 2220, Australia
[4] Yahoo Res, Barcelona, Spain
[5] Univ Technol Sydney, Sydney, NSW 2007, Australia
[6] AT&T Labs, Florham Pk, NJ USA
[7] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[8] Accenture Technol Labs, Chicago, IL 60601 USA
[9] Lucid Commerce, Seattle, WA 98104 USA
[10] Univ Antwerp, Dept Math & Comp Sci, Antwerp, Belgium
[11] Univ Queensland, Dept Math, Brisbane, Qld 4072, Australia
[12] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
[13] NASA, Washington, DC USA
[14] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
Funding
U.S. National Science Foundation
Keywords
Data mining; cost-benefit analysis; case study
DOI
10.1142/S021962201240007X
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We report on the panel discussion held at the ICDM'10 conference on the top 10 data mining case studies in order to provide a snapshot of where and how data mining techniques have made significant real-world impact. The tasks covered by the 10 case studies range from the detection of anomalies such as cancer, fraud, and system failures to the optimization of organizational operations, and include the automated extraction of information from unstructured sources. From the 10 cases we find that supervised methods prevail while unsupervised techniques play a supporting role. Further, significant domain knowledge is generally required to achieve a completed solution. Finally, we find that successful applications are more commonly associated with continual improvement than with single "aha moments" of knowledge ("nugget") discovery.
Pages: 389-400
Page count: 12