Improving Multilingual Semantic Interoperation in Cross-Organizational Enterprise Systems Through Concept Disambiguation

被引:83
作者
Guo, Jingzhi [1 ]
Xu, Li Da [2 ,3 ]
Xiao, Guangyi [1 ]
Gong, Zhiguo [1 ]
机构
[1] Univ Macau, Taipa, Peoples R China
[2] Old Dominion Univ, Norfolk, VA 23529 USA
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Data engineering; enterprise systems; industrial informatics; interenterprise multilingual interoperation; knowledge management; text analysis; understandability; SIMILARITY; WIKIPEDIA; ONTOLOGY; WORDNET;
D O I
10.1109/TII.2012.2188899
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For the multilingual semantic interoperations in cross-organizational enterprise systems and e-commerce systems, semantic consistency is a research issue that has not been well resolved. This paper contributes to improving multilingual semantic interoperation by proposing a concept-connected near synonym (NSG) framework for concept disambiguation. NSG framework provides a vocabulary preprocessing process of collaborative vocabulary editing, which further ensures semantically consistent vocabulary for building semantically consistent business processes and documents between context-different information systems. The vocabulary preprocessing offered by NSG automates the process of finding potential near synonym sets and identifying collaboratively editable near synonym sets. The realization of NSG framework includes a probability model that computes concept values between concepts based on a newly introduced semantic relatedness method-SRCT. In this paper, SRCT-based methods are implemented and compared with some existing semantic relatedness methods. Experiments have shown that SRCT-based methods outperform the existing methods. This paper has made an improvement on the existing methods of semantic relatedness and reduces the collaboration cost of collaborative vocabulary editing.
引用
收藏
页码:647 / 658
页数:12
相关论文
共 45 条
[21]   Managing Process Model Complexity Via Abstract Syntax Modifications [J].
La Rosa, Marcello ;
Wohed, Petia ;
Mendling, Jan ;
ter Hofstede, Arthur H. M. ;
Reijers, Hajo A. ;
van der Aalst, Wil M. P. .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2011, 7 (04) :614-629
[22]   Managing Process Model Complexity via Concrete Syntax Modifications [J].
La Rosa, Marcello ;
ter Hofstede, Arthur H. M. ;
Wohed, Petia ;
Reijers, Hajo A. ;
Mendling, Jan ;
van der Aalst, Wil M. P. .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2011, 7 (02) :255-265
[23]   A solution to Plato's problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge [J].
Landauer, TK ;
Dumais, ST .
PSYCHOLOGICAL REVIEW, 1997, 104 (02) :211-240
[24]  
Leacock C, 1998, LANG SPEECH & COMMUN, P265
[25]  
Lesk M., 1986, P 5 ANN INT C SYSTEM, P24
[26]   Integration of hybrid wireless networks in cloud services oriented enterprise information systems [J].
Li, Shancang ;
Xu, Lida ;
Wang, Xinheng ;
Wang, Jue .
ENTERPRISE INFORMATION SYSTEMS, 2012, 6 (02) :165-187
[27]   Sentence similarity based on semantic nets and corpus statistics [J].
Li, Yuhua ;
McLean, David ;
Bandar, Zuhair A. ;
O'Shea, James D. ;
Crockett, Keeley .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (08) :1138-1150
[28]  
Lin D., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P296
[29]   A Survey on Applications of Agent Technology in Industrial Process Control [J].
Metzger, Mieczyslaw ;
Polakow, Grzegorz .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2011, 7 (04) :570-581
[30]   WORDNET - A LEXICAL DATABASE FOR ENGLISH [J].
MILLER, GA .
COMMUNICATIONS OF THE ACM, 1995, 38 (11) :39-41