Automata Approach to XML Data Indexing

被引:1
|
作者
Sestakova, Eliska [1 ]
Janousek, Jan [1 ]
机构
[1] Czech Tech Univ, Fac Informat Technol, Thakurova 9, Prague 16000 6, Czech Republic
来源
INFORMATION | 2018年 / 9卷 / 01期
关键词
XML; XPath; index; indexing; tree; automaton; finite state automaton; finite state machine;
D O I
10.3390/info9010012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The internal structure of XML documents can be viewed as a tree. Trees are among the fundamental and well-studied data structures in computer science. They express a hierarchical structure and are widely used in many applications. This paper focuses on the problem of processing tree data structures; particularly, it studies the XML index problem. Although there exist many state-of-the-art methods, the XML index problem still belongs to the active research areas. However, existing methods usually lack clear references to a systematic approach to the standard theory of formal languages and automata. Therefore, we present some new methods solving the XML index problem using the automata theory. These methods are simple and allow one to efficiently process a small subset of XPath. Thus, having an XML data structure, our methods can be used efficiently as auxiliary data structures that enable answering a particular set of queries, e.g., XPath queries using any combination of the child and descendant-or-self axes. Given an XML tree model with n nodes, the searching phase uses the index, reads an input query of size m, finds the answer in time O (m) and does not depend on the size of the original XML document.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] An encoding scheme for indexing XML data
    Zhang, WS
    Liu, DX
    Li, J
    2004 IEEE INTERNATIONAL CONFERNECE ON E-TECHNOLOGY, E-COMMERE AND E-SERVICE, PROCEEDINGS, 2004, : 525 - 528
  • [2] Indexing XML data with a schema graph
    Luoma, O
    Proceedings of the IASTED International Conference on Databases and Applications, 2004, : 274 - 279
  • [3] Implementation of XPath axes in the multi-dimensional approach to indexing XML data
    Krátky, M
    Pokorny, J
    Snásel, V
    CURRENT TRENDS IN DATABASE TECHNOLOGY - EDBT 2004 WORKSHOPS, PROCEEDINGS, 2004, 3268 : 219 - 229
  • [4] Indexing and Querying the Compressed XML Data (IQCX)
    Senthilkumar, Radha
    Suganya, N.
    Kiruthika, I.
    Kannan, A.
    ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, 2011, 198 : 497 - 506
  • [5] An XML data model for inverted image indexing
    So, S
    Leung, C
    Tse, P
    INTERNET IMAGING IV, 2003, 5018 : 190 - 197
  • [6] Multi-resolution indexing for XML data
    Maghamez, A
    Hu, GZ
    Third ACIS International Conference on Software Engineering Research, Managment and Applications, Proceedings, 2005, : 206 - 211
  • [7] An indexing method for wireless broadcast XML data
    Chung, Yon Dohn
    Lee, Ji Yeon
    INFORMATION SCIENCES, 2007, 177 (09) : 1931 - 1953
  • [8] Indexing XML data for path expression queries
    Hu, G
    Tang, C
    SOFTWARE ENGINEERING RESEARCH AND APPLICATIONS, 2004, 3026 : 332 - 348
  • [9] RRSi: indexing XML data for proximity twig queries
    Ng, Patrick K. L.
    Ng, Vincent T. Y.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 17 (02) : 193 - 216
  • [10] Adaptive indexing structure on XML data stored in RDBMS
    College of Computer Science, Zhejiang University, Hangzhou 310027, China
    J. Comput. Inf. Syst., 2008, 1 (351-360):