Intelligent search in Big Data

Cited by: 1
Authors
Birialtsev, E. [1 ]
Bukharaev, N. [2 ]
Gusenkov, A. [2 ]
Affiliations
[1] R&D Director Gradient Ltd, Kazan, Russia
[2] Kazan Fed Univ, Inst Computat Math & Informat Technol, Kazan, Russia
Source
BIGDATA CONFERENCE (FORMERLY INTERNATIONAL CONFERENCE ON BIG DATA AND ITS APPLICATIONS) | 2017 / Vol. 913
DOI
10.1088/1742-6596/913/1/012010
Chinese Library Classification (CLC)
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
An approach to data integration aimed at ontology-based intelligent search in Big Data is considered for the case in which information objects are represented as relational databases (RDBs), structurally marked up by their schemas. The source of information for constructing the ontology and, later, for organizing the search is natural-language text, treated as semi-structured data. For RDBs, this text consists of the comments on the names of tables and their attributes. A formal definition of an RDB integration model in terms of ontologies is given. Within the framework of this model, a universal RDB representation ontology, an ontology of the oil production subject domain, and a linguistic thesaurus of the subject domain language are built. A technique for automatically generating SQL queries for subject domain specialists is proposed. On this basis, an information system for the RDBs of the TATNEFT oil-producing company was implemented. In operation, the system returned relevant results for the majority of queries.
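The idea of using table and attribute comments as the search vocabulary can be illustrated with a minimal sketch. It is not the authors' implementation: the schema, the comments, the thesaurus entries, the stop-word list, and the function names below are all hypothetical, and plain word overlap stands in for the ontology-based matching described in the abstract.

```python
# Hypothetical schema metadata: column names mapped to their natural-language comments.
SCHEMA_COMMENTS = {
    "wells": {
        "well_id": "unique identifier",
        "depth_m": "depth in metres",
        "field_name": "oil field name",
    },
}

# Hypothetical thesaurus: maps domain synonyms to the terms used in the comments.
THESAURUS = {"deposit": "field", "deepness": "depth"}

# Words that carry no search meaning in this toy example.
STOP_WORDS = {"and", "in", "of", "the"}


def normalize(text):
    """Lower-case the text, drop stop words, and replace synonyms via the thesaurus."""
    words = (w for w in text.lower().split() if w not in STOP_WORDS)
    return {THESAURUS.get(w, w) for w in words}


def generate_sql(request):
    """Return a SELECT over the columns whose comments share a term with the request."""
    terms = normalize(request)
    for table, comments in SCHEMA_COMMENTS.items():
        columns = [col for col, comment in comments.items()
                   if terms & normalize(comment)]
        if columns:
            return f"SELECT {', '.join(columns)} FROM {table}"
    return None  # no column comment matched the request


print(generate_sql("deposit name and deepness"))
```

A real system along the lines of the abstract would replace the word-overlap test with reasoning over the subject domain ontology and would also have to resolve joins between tables, which this sketch leaves out.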
Pages: 8