Making Unstructured Data SPARQL Using Semantic Indexing in Oracle Database

被引:5
|
作者
Das, Souripriya [1 ]
Sundara, Seema [1 ]
Perry, Matthew [1 ]
Srinivasan, Jagannathan [1 ]
Banerjee, Jayanta [1 ]
Yalamanchi, Aravind [1 ]
机构
[1] Oracle, Nashua, NH 03062 USA
关键词
D O I
10.1109/ICDE.2012.59
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper describes the Semantic Indexing feature introduced in Oracle Database 11g Release 2 for indexing unstructured text (document) columns. This capability enables searching for concepts (such as people, places, organizations, and events), in addition to words or phrases, with further options for sense disambiguation and term expansion by consulting knowledge captured in OWL/RDF ontologies. The distinguishing aspects of our approach are: 1) Indexing: Instead of building a traditional inverted index of (annotated) token and/or named entity occurrences, we extract the entities, associations, and events present in a text column data and store them as RDF named graphs in the Oracle Database Semantic Store. This base content can be further augmented with knowledge bases and inferred triples (obtained by applying domain-specific ontologies and rulebases). 2) Querying: Instead of relying on proprietary extensions for specifying a search, we allow users to specify a complete SPARQL query pattern that can capture arbitrarily complex relationships between query terms. We have implemented this feature by introducing a sem_contains SQL operator and the associated sem_indextype indexing scheme. The indexing scheme employs an extensible architecture that supports indexing of unstructured text using native as well as third party information extraction tools. The paper presents a model for the semantic indexing and querying, describes the feature, and outlines its implementation leveraging Oracle's native support for RDF/OWL storage, inferencing, and querying. We also report a study involving use of this feature on a TREC collection of over 130,000 news articles.
引用
收藏
页码:1405 / 1416
页数:12
相关论文
共 50 条
  • [1] Semantic integration of relational data using SPARQL
    Wang, Jinpeng
    Miao, Zhuang
    Zhang, Yafei
    Lu, Jianjiang
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL I, PROCEEDINGS, 2008, : 422 - 426
  • [2] SemIndex plus : A semantic indexing scheme for structured, unstructured, and partly structured data
    Tekli, Joe
    Chbeir, Richard
    Traina, Agma J. M.
    Traina, Caetano, Jr.
    KNOWLEDGE-BASED SYSTEMS, 2019, 164 : 378 - 403
  • [3] Using SPARQL and SPIN for Data Quality Management on the Semantic Web
    Fuerber, Christian
    Hepp, Martin
    BUSINESS INFORMATION SYSTEMS, PROCEEDINGS, 2010, 47 : 35 - 46
  • [4] A Comparative Study of NLP based Semantic Web Standard model using SPARQL database
    Rao, Chennamsetty Madhusudhana
    Babu, J. Ravindra
    Pimo, S. John
    Dixit, Asmita
    Jaiswal, Sushma
    Jamshed, Aatif
    2021 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES (ICCS 2021), 2021, : 1 - 6
  • [5] Semantic SPARQL query in a relational database based on ontology construction
    Hazber, Mohamed A. G.
    Li, Ruixuan
    Gu, Xiwu
    Xu, Guandong
    Li, Yuhua
    2015 11TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2015, : 25 - 32
  • [6] A Framework for Searching Semantic Data and Services with SPARQL
    Mouhoub, Mohamed Lamine
    Grigori, Daniela
    Manouvrier, Maude
    SERVICE-ORIENTED COMPUTING, ICSOC 2014, 2014, 8831 : 123 - 138
  • [7] Indexing medium-dimensionality data in oracle
    Kanth, KVR
    Ravada, S
    Sharma, J
    Banerjee, J
    SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999: SIGMOD99: PROCEEDINGS OF THE 1999 ACM SIGMOD - INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 1999, : 521 - 522
  • [8] Efficient Indexing and Searching Framework for Unstructured Data
    Aye, Kyar Nyo
    Thein, Ni Lar
    FOURTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2011): MACHINE VISION, IMAGE PROCESSING, AND PATTERN ANALYSIS, 2012, 8349
  • [9] Collaborative SPARQL Query Processing for Decentralized Semantic Data
    Grall, Arnaud
    Skaf-Molli, Hala
    Molli, Pascal
    Perrin, Matthieu
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2020, PT I, 2020, 12391 : 320 - 335
  • [10] Querying Heterogeneous Relational Database using SPARQL
    Wang, Jinpeng
    Miao, Zhuang
    Zhang, Yafei
    Zhou, Bo
    PROCEEDINGS OF THE 8TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, 2009, : 475 - 480