The Model of Semantic Similarity Estimation for the Problems of Big Data Search and Structuring

被引:0
|
作者
Bova, Victoria [1 ]
Kureichik, Vladimir [1 ]
Leshchanov, Dmitry [1 ]
机构
[1] Southern Fed Univ, Dept CAD, Rostov Na Donu, Russia
来源
2017 11TH IEEE INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT 2017) | 2017年
基金
俄罗斯科学基金会;
关键词
Semantic similarity; ontology; semantic network; graph model; semantic meta-model; big data; clustering; PROBLEM-ORIENTED KNOWLEDGE; INFORMATION; ALGORITHMS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The main problem in the field of Big Data search and processing involves constantly growing complexity of its identification and structuring for the purpose of representation in the form suitable for understanding and further use. To solve this problem authors propose to use method of multilevel semantic net building to define connections between data meta descriptions in large distributed information arrays. The semantic model developed on the basis of the method provides visibility and compact presentation of structure of semantic relations between mass data arrays elements. Semantic meta descriptions are considered as sets of triples "subject-predicate object" in terms of subject area ontology of distributed operative databases and the query. Authors propose the model to search and estimate semantically similar elements of distributed databases based on clustering of semantic nets represented as graph models on corresponding levels: subject area level, search profile level and document meta-descriptions level. The relevance (semantic similarity) estimation method is based on closeness assessment of data in distributed information arrays of document and query semantic nets. To analyze the developed method authors carried out a set of computational experiments. Obtained data proved theoretical significance and application perspective of such approach.
引用
收藏
页码:27 / 31
页数:5
相关论文
共 50 条
  • [31] Similarity Grouping in Big Data Systems
    Silva, Yasin N.
    Sandoval, Manuel
    Prado, Diana
    Wallace, Xavier
    Rong, Chuitian
    SIMILARITY SEARCH AND APPLICATIONS (SISAP 2019), 2019, 11807 : 212 - 220
  • [32] Fuzzy Semantic Similarity in Linked Data using Wikipedia Infobox
    Zadeh, Parisa D. Hossein
    Reformat, Marek Z.
    PROCEEDINGS OF THE 2013 JOINT IFSA WORLD CONGRESS AND NAFIPS ANNUAL MEETING (IFSA/NAFIPS), 2013, : 395 - 400
  • [33] Exploratory search on big data
    MOE Key Laboratory of Data Engineering and Knowledge Engineering, Renmin University of China, Beijing
    100872, China
    不详
    100872, China
    Tongxin Xuebao, 12
  • [34] Towards geospatial semantic search: exploiting latent semantic relations in geospatial data
    Li, Wenwen
    Goodchild, Michael F.
    Raskin, Robert
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2014, 7 (01) : 17 - 37
  • [35] Keywords Semantic Extension in Semantic Search Model
    Yu, Xuejun
    Lv, Jing
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER, NETWORKS AND COMMUNICATION ENGINEERING (ICCNCE 2013), 2013, 30 : 367 - 370
  • [36] Analysis of Ontology Semantic Tagging Method for Semantic Web-Oriented Big Data
    Xu, Hongsheng
    Jiang, Shengli
    Zheng, Cong
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 1147 - 1150
  • [37] Big Data Integration: A Semantic Mediation Architecture Using Summary
    Aggoune, Aicha
    Bouramoul, Abdelkrim
    Kholladi, Mohamed-Khiereddine
    2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 21 - 25
  • [38] The Semantic Retrieval Model of Manufacturing Resource Based on Rules and Similarity
    Wei, Junying
    Zhong, Peisi
    MECHANICAL AND ELECTRONICS ENGINEERING III, PTS 1-5, 2012, 130-134 : 483 - 486
  • [39] Decomposing social and semantic networks in emerging "big data" research
    Park, Han Woo
    Leydesdorff, Loet
    JOURNAL OF INFORMETRICS, 2013, 7 (03) : 756 - 765
  • [40] Semantic Information Retrieval Systems Costing in Big Data Environment
    Mahmood, Khalid
    Rahmah, M.
    Ahmed, Md Manjur
    Raza, Muhammad Ahsan
    RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING (SCDM 2020), 2020, 978 : 192 - 201