Domain-Driven Data Mining: Challenges and Prospects

Cited by: 73
Author
Cao, Longbing [1, 2]
Affiliations
[1] Univ Technol Sydney, Ctr Quantum Computat & Intelligent Syst, Sydney, NSW 2007, Australia
[2] Univ Technol Sydney, Fac Engn & Informat Technol, Sydney, NSW 2007, Australia
Funding
Australian Research Council
Keywords
Data mining; domain-driven data mining (D³M); actionable knowledge discovery and delivery; ACTIONABLE KNOWLEDGE; ASSOCIATION RULES; AGENTS
DOI
10.1109/TKDE.2010.32
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Traditional data mining research mainly focuses on developing, demonstrating, and pushing the use of specific algorithms and models. The process of data mining stops at pattern identification. Consequently, a widely seen fact is that 1) many algorithms have been designed of which very few are repeatable and executable in the real world, 2) often many patterns are mined but a major proportion of them are either commonsense or of no particular interest to business, and 3) end users generally cannot easily understand and take them over for business use. In summary, we see that the findings are not actionable, and lack soft power in solving real-world complex problems. Thorough efforts are essential for promoting the actionability of knowledge discovery in real-world smart decision making. To this end, domain-driven data mining (D³M) has been proposed to tackle the above issues, and promote the paradigm shift from "data-centered knowledge discovery" to "domain-driven, actionable knowledge delivery." In D³M, ubiquitous intelligence is incorporated into the mining process and models, and a corresponding problem-solving system is formed as the space for knowledge discovery and delivery. Based on our related work, this paper presents an overview of driving forces, theoretical frameworks, architectures, techniques, case studies, and open issues of D³M. We understand D³M discloses many critical issues with no thorough and mature solutions available for now, which indicates the challenges and prospects for this new topic.
Pages: 755-769
Page count: 15