A Novel Method for Measuring Structure and Semantic Similarity of XML Documents Based on Extended Adjacency Matrix

Cited by: 2
Authors
Zhang, Xue-Liang [1 ]
Yang, Ting [1 ]
Fan, Bao-Quan [1 ]
Wang, Xu [1 ]
Wei, Jin-Mao [1 ]
Affiliations
[1] Nankai Univ, Coll Informat Tech Sci, Tianjin 300071, Peoples R China
Source
INTERNATIONAL CONFERENCE ON APPLIED PHYSICS AND INDUSTRIAL ENGINEERING 2012, PT B | 2012 / Vol. 24
Keywords
similarity; XML; semantic; structure; adjacency matrix;
DOI
10.1016/j.phpro.2012.02.215
Chinese Library Classification
T [Industrial Technology];
Discipline classification code
08 ;
Abstract
Similarity measurement of XML documents is crucial for approximate search and document classification in XML-oriented applications. Several methods have been proposed for this purpose. Nevertheless, few of them capture both structure and semantic information well enough to measure the similarity of XML documents effectively. In this paper, we present a new method for computing the structure and semantic similarity of XML documents based on the extended adjacency matrix (EAM). Unlike a general adjacency matrix, an EAM stores the structure information not only of adjacent layers but also of ancestor-descendant layers. To measure the similarity of two XML documents, the proposed method first stores their structure and semantic information in two extended adjacency matrices (M_1, M_2). It then computes the similarity of the two documents as cos(M_1, M_2). Experimental results on benchmark data show that the method achieves high efficiency and accuracy. (C) 2011 Published by Elsevier B.V. Selection and/or peer-review under responsibility of ICAPIE Organization Committee.
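The abstract does not give the exact construction of the EAM, so the following is only a minimal sketch of the general idea under stated assumptions: the matrix is stored sparsely as a mapping from (ancestor tag, descendant tag) pairs to weights, a pair separated by d levels contributes decay^(d-1) (the `decay` parameter and this weighting are hypothetical, not taken from the paper), and the cosine is taken over the flattened matrices. Tags are matched by exact name only, so the semantic component of the published method is not modeled here.

```python
import math
import xml.etree.ElementTree as ET
from collections import defaultdict


def extended_adjacency(xml_text, decay=0.5):
    """Return a sparse matrix {(ancestor_tag, descendant_tag): weight}.

    Assumption (not from the paper): a pair separated by d levels
    contributes decay ** (d - 1), so parent-child links weigh 1 and
    more distant ancestor-descendant links weigh less.
    """
    root = ET.fromstring(xml_text)
    matrix = defaultdict(float)

    def visit(node, ancestors):
        # `ancestors` is ordered from the root down to the direct parent.
        for distance, ancestor_tag in enumerate(reversed(ancestors), start=1):
            matrix[(ancestor_tag, node.tag)] += decay ** (distance - 1)
        for child in node:
            visit(child, ancestors + [node.tag])

    visit(root, [])
    return matrix


def cosine(m1, m2):
    """Cosine of two sparse matrices treated as flattened vectors."""
    dot = sum(weight * m2.get(key, 0.0) for key, weight in m1.items())
    norm1 = math.sqrt(sum(w * w for w in m1.values()))
    norm2 = math.sqrt(sum(w * w for w in m2.values()))
    if norm1 == 0.0 or norm2 == 0.0:
        return 0.0  # a single-element document has no tag pairs
    return dot / (norm1 * norm2)


if __name__ == "__main__":
    doc_a = "<book><title/><author><name/></author></book>"
    doc_b = "<book><title/><author/></book>"
    similarity = cosine(extended_adjacency(doc_a), extended_adjacency(doc_b))
    print(f"similarity = {similarity:.3f}")
```

A sparse dictionary is used instead of a dense array so that two documents with different tag vocabularies can be compared without first aligning row and column indices.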
Pages: 1452-1461
Page count: 10