Learning Context-Sensitive Domain Ontologies from Folksonomies: A Cognitively Motivated Method

被引:8
作者
Lau, Raymond Y. K. [1 ]
Zhao, J. Leon [1 ]
Zhang, Wenping [1 ]
Cai, Yi [2 ]
Ngai, Eric W. T. [3 ]
机构
[1] City Univ Hong Kong, Coll Business, Dept Informat Syst, Kowloon, Hong Kong, Peoples R China
[2] S China Univ Technol, Sch Software Engn, Guangzhou 510641, Guangdong, Peoples R China
[3] Hong Kong Polytech Univ, Dept Management & Mkt, Hong Kong, Hong Kong, Peoples R China
关键词
folksonomies; ontology learning; machine learning; artificial intelligence; knowledge management; MODEL; TEXT;
D O I
10.1287/ijoc.2015.0644
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Ontology is the backbone of the Semantic Web, helping users search for relevant resources from the Web of linked data. The existing context-free mapping approach between tags and concepts fails to address the problems of social synonymy and social polysemy when ontologies are induced from folksonomies. The novel contributions of this paper are threefold. First, grounded in the cognitively motivated category utility measure, a novel basic-level concept mining algorithm is developed to construct semantically rich concept vectors to alleviate the problem of social synonymy. Second, contextual aspects of ontology learning are exploited via probabilistic topic modeling to address the problem of social polysemy. Third, a novel context-sensitive domain ontology learning algorithm that combines link- and content-based semantic analysis is developed to identify both taxonomic and associative relations among concepts. To the best of our knowledge, this is the first successful research that exploits a cognitively motivated method to learn context-sensitive domain ontologies from folksonomies. By using the Open Directory Project ontology as a benchmark, we examined the effectiveness of the proposed algorithms based on social annotations crawled from three different folksonomy sites. Our experimental results show that the proposed ontology learning system significantly outperforms the best baseline system by 13.83% in terms of taxonomic F-measure. The practical implication of our research is that high-quality ontologies are constructed with minimal human intervention to facilitate concept-driven retrieval of linked data and the knowledge-based interoperability among enterprises.
引用
收藏
页码:561 / 578
页数:18
相关论文
共 42 条
[1]  
[Anonymous], 1992, COLING 1992, DOI DOI 10.3115/992133.992154
[2]  
[Anonymous], P 9 ANN INT ACM SIGI
[3]  
[Anonymous], P 2006 COLL WEB TAGG
[4]  
[Anonymous], 2010, P 19 ACM INT C INFOR
[5]  
[Anonymous], 2006, 200610 STANF INFOLAB
[6]  
[Anonymous], 2013, Proceedings of the 21st ACM international conference on Multimedia
[7]  
Bizer C, 2011, SEMANTIC SERVICES, INTEROPERABILITY AND WEB APPLICATIONS: EMERGING CONCEPTS, P205, DOI 10.4018/978-1-60960-593-3.ch008
[8]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[9]   A Cluster-Based Context-Tree Model for Multivariate Data Streams with Applications to Anomaly Detection [J].
Brice, Pierre ;
Jiang, Wei ;
Wan, Guohua .
INFORMS JOURNAL ON COMPUTING, 2011, 23 (03) :364-376
[10]   Contextual cueing: Implicit learning and memory of visual context guides spatial attention [J].
Chun, MM ;
Jian, YH .
COGNITIVE PSYCHOLOGY, 1998, 36 (01) :28-71