Domain-Driven Data Mining: Challenges and Prospects

被引:73
作者
Cao, Longbing [1 ,2 ]
机构
[1] Univ Technol Sydney, Ctr Quantum Computat & Intelligent Syst, Sydney, NSW 2007, Australia
[2] Univ Technol Sydney, Fac Engn & Informat Technol, Sydney, NSW 2007, Australia
基金
澳大利亚研究理事会;
关键词
Data mining; domain-driven data mining (D-3 M); actionable knowledge discovery and delivery; ACTIONABLE KNOWLEDGE; ASSOCIATION RULES; AGENTS;
D O I
10.1109/TKDE.2010.32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional data mining research mainly focus]es on developing, demonstrating, and pushing the use of specific algorithms and models. The process of data mining stops at pattern identification. Consequently, a widely seen fact is that 1) many algorithms have been designed of which very few are repeatable and executable in the real world, 2) often many patterns are mined but a major proportion of them are either commonsense or of no particular interest to business, and 3) end users generally cannot easily understand and take them over for business use. In summary, we see that the findings are not actionable, and lack soft power in solving real-world complex problems. Thorough efforts are essential for promoting the actionability of knowledge discovery in real-world smart decision making. To this end, domain-driven data mining (D-3 M) has been proposed to tackle the above issues, and promote the paradigm shift from "data-centered knowledge discovery" to "domain-driven, actionable knowledge delivery." In D-3 M, ubiquitous intelligence is incorporated into the mining process and models, and a corresponding problem-solving system is formed as the space for knowledge discovery and delivery. Based on our related work, this paper presents an overview of driving forces, theoretical frameworks, architectures, techniques, case studies, and open issues of D-3 M. We understand D-3 M discloses many critical issues with no thorough and mature solutions available for now, which indicates the challenges and prospects for this new topic.
引用
收藏
页码:755 / 769
页数:15
相关论文
共 42 条
[1]  
Ankerst M :., 2002, ACM SIGKDD Explorations Newsletter, V4, P110, DOI [DOI 10.1145/772862.772883, 10.1145/772862.772883]
[2]  
Brazdil P, 2008, METALEARNING APPL DA
[3]  
CAO J, 2005, INTERDISPLINARE MATH, V4, P65
[4]  
Cao L., 2009, Domain Driven Data Mining
[5]  
Cao L., 2008, Data Mining for Business Applications
[6]  
CAO L, 2007, INT J BUSINESS INTEL, V2, P496, DOI DOI 10.1504/IJBIDM.2007.016385
[7]  
CAO L, 2008, KNOWLEDGE PROCESSING, P193
[8]   The evolution of KDD: Towards domain-driven data mining* [J].
Cao, Longbing ;
Zhang, Chengqi .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2007, 21 (04) :677-692
[9]  
Cao LB, 2007, IEEE INTELL SYST, V22, P78, DOI 10.1109/MIS.2007.67
[10]   Flexible Frameworks for Actionable Knowledge Discovery [J].
Cao, Longbing ;
Zhao, Yanchang ;
Zhang, Huaifeng ;
Luo, Dan ;
Zhang, Chengqi ;
Park, E. K. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (09) :1299-1312