A semantic network approach to semi-structured documents repositories

被引:0
作者
Christophides, V [1 ]
Dorr, M [1 ]
Fundulaki, I [1 ]
机构
[1] Fdn Res & Technol Hellas, Inst Comp Sci, GR-71110 Iraklion, Crete, Greece
来源
RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES | 1997年 / 1324卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Using database technology for the administration of digital Libraries offers many advantages in a multi-user and distributed environment. However, conventional DBMS are not particularly suited to manage semi-structured data with heterogeneous, irregular, evolving structures as in the case of SGML documents found in digital libraries. To overcome the difficulties imposed by the rigid schema of conventional systems, several schema-less approaches have been proposed. Using instead unconstrained, extensible schemata offered by object-oriented semantic network systems, we are able both to map document specific structures as database classes, and to model the associated constraint information as integrated schema annotations. In this paper we present the benefits of this approach to create, access and process heterogeneous SGML documents, and in particular to exploit the shared semantics of evolving SGML structures. A respective application is currently being implemented in the context of the AQUARELLE project.
引用
收藏
页码:305 / 324
页数:20
相关论文
共 50 条
[21]   Lexical semantic SLVM for semi-structured document classification [J].
Wang, Luda ;
Long, Jun ;
Li, Zude ;
He, Ye .
Journal of Information and Computational Science, 2015, 12 (01) :307-316
[22]   An automated integration approach for semi-structured and structured data [J].
Lim, SJ ;
Ng, YK .
PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON COOPERATIVE DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2000, :12-21
[23]   WebDP: Understanding Discourse Structures in Semi-Structured Web Documents [J].
Liu, Peilin ;
Lin, Hongyu ;
Liao, Meng ;
Xiang, Hao ;
Han, Xianpei ;
Sun, Le .
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, :10235-10258
[24]   Supplementing domain knowledge to BERT with semi-structured information of documents [J].
Chen, Jing ;
Wei, Zhihua ;
Wang, Jiaqi ;
Wang, Rui ;
Gong, Chuanyang ;
Zhang, Hongyun ;
Miao, Duoqian .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
[25]   Recognition techniques for extracting information from semi-structured documents [J].
Della Ventura, A ;
Gagliardi, I ;
Zonta, B .
DOCUMENT RECOGNITION AND RETRIEVAL VIII, 2001, 4307 :130-137
[26]   OLERA: OnLine extraction rule analysis for semi-structured documents [J].
Chang, CH ;
Kuo, SC .
PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, VOLS 1AND 2, 2004, :736-742
[27]   Clustering method via independent components for semi-structured documents [J].
Wang, Tong ;
Liu, Da-Xin ;
Lin, Xuanzuo ;
Sun, Wei .
DATA MINING, INTRUSION DETECTION, INFORMATION ASSURANCE, AND DATA NETWORKS SECURITY 2006, 2006, 6241
[28]   Joint Distributed Representation of Text and Structure of Semi-Structured Documents [J].
Laddha, Abhishek ;
Joshi, Salil ;
Shaikh, Samiulla ;
Mehta, Sameep .
HT'18: PROCEEDINGS OF THE 29TH ACM CONFERENCE ON HYPERTEXT AND SOCIAL MEDIA, 2018, :25-32
[29]   Building RDF ontologies from semi-structured legal documents [J].
Amato, Flora ;
Mazzeo, Antonino ;
Penta, Antonio ;
Picariello, Antonio .
CISIS 2008: THE SECOND INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, PROCEEDINGS, 2008, :997-1002
[30]   EGA: An algorithm for automatic semi-structured Web documents extraction [J].
Li, LY ;
Tang, SW ;
Yang, DQ ;
Wang, TJ ;
Su, ZH .
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2004, 2973 :787-798