Semantics-guided clustering of heterogeneous XML schemas

被引:0
作者
De Meo, Pasquale [1 ]
Quattrone, Giovanni [1 ]
Terracina, Giorgio [2 ]
Ursino, Domenico [1 ]
机构
[1] Univ Mediterranea Reggio Calabria, DIMET, Via Graziella, I-89060 Reggio Di Calabria, Italy
[2] Univ Calabria, Dipartimento Matemat, I-87036 Arcavacata Di Rende, Italy
来源
JOURNAL ON DATA SEMANTICS IX | 2007年 / 4601卷
关键词
EFFICIENT; ALGORITHM; DOCUMENTS; FRAMEWORK; TREE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we illustrate an approach for clustering semantically heterogeneous XML Schemas. The proposed approach is driven by the semantics of the involved Schemas that is defined by means of the interschema properties existing among concepts represented therein; interschema properties taken into account by our approach are synonymies (indicating that two concepts have the same meaning), hyponymies (denoting that a concept has a more specific meaning than another one), and overlappings (indicating that two concepts are neither synonyms nor one hyponym of the other, but represent, to some extent, the same reality). An important feature of our approach consists of its capability of being integrated with almost all the clustering algorithms already proposed in the literature. Both a theoretical and an experimental analysis on the complexity of our approach are presented in the paper. They show that our approach is scalable and particularly suited in application contexts characterized by a great number and a large variety of XML Schemas.
引用
收藏
页码:39 / +
页数:5
相关论文
共 45 条
[1]  
Beil F., 2002, Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, P436, DOI DOI 10.1145/775047.775110
[2]  
Bergamaschi S., 1999, SIGMOD Record, V28, P54, DOI 10.1145/309844.309897
[3]   Global viewing of heterogeneous data sources [J].
Castano, S ;
De Antonellis, V ;
di Vimercati, SD .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2001, 13 (02) :277-297
[4]   XML document indexes: A classification [J].
Catania, B ;
Maddalena, A ;
Vakali, A .
IEEE INTERNET COMPUTING, 2005, 9 (05) :64-71
[5]  
Chung C.W., 2002, P ACM SIGMOD INT C M, P121, DOI DOI 10.1145/564691.564706
[6]   Detecting changes in XML documents [J].
Cobéna, G ;
Abiteboul, S ;
Marian, A .
18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, :41-52
[7]  
Costa G, 2004, LECT NOTES ARTIF INT, V3202, P137
[8]  
Crescenzi V., 2001, Proceedings of the 27th International Conference on Very Large Data Bases, P109
[9]   A methodology for clustering XML documents by structure [J].
Dalamagas, T ;
Cheng, T ;
Winkel, KJ ;
Sellis, T .
INFORMATION SYSTEMS, 2006, 31 (03) :187-228
[10]   Extraction of synonymies, hyponymies, overlappings and homonymies from XML Schemas at various "severity" levels [J].
De Meo, P ;
Quattrone, G ;
Terracina, G ;
Ursino, D .
INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2004, :389-394