Data Mining from NoSQL Document-Append Style Storages

被引:1
|
作者
Lomotey, Richard K. [1 ]
Deters, Ralph [1 ]
机构
[1] Univ Saskatchewan, Dept Comp Sci, Saskatoon, SK S7N 0W0, Canada
关键词
Data mining; NoSQL; Bayesian Rule; Unstructured data; Apriori; Big Data;
D O I
10.1109/ICWS.2014.62
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The modern data economy, which has been described as "Big Data", has changed the status quo on digital content creation and storage. While data storage has followed the schema-dictated approach for decades, the recent nature of digital content, which is widely unstructured, creates the need to adopt different storage techniques. Thus, the NoSQL database systems have been proposed to accommodate most of the content being generated today. One of such NoSQL databases that have received significant enterprise adoption is the document-append style storage. The emerging concern and challenge however is that, research and tools that can aid data mining processes from such NoSQL databases is generally lacking. Even though document-append style storages allow data accessibility as Web services and over URL/I, building a corresponding data mining tool deviates from the underlying techniques governing web crawlers. Also, existing data mining tools that have been designed for schema-based storages (e.g., RDBMS) are misfits. Hence, our goal in this work is to design a unique data analytics tool that enables knowledge discovery through information retrieval from document-append style storage. The tool is algorithmically built on the inference-based Apriori, which aids us to achieve optimization of the search duration. Preliminary test results of the proposed tool also show high accuracy in comparison to other approaches that were previously proposed.
引用
收藏
页码:385 / 392
页数:8
相关论文
共 50 条
  • [21] Exploring data structure alternatives in the RDB to NoSQL document store conversion process
    Kuszera, Evandro Miguel
    Peres, Leticia Mara
    Del Fabro, Marcos Didonet
    INFORMATION SYSTEMS, 2022, 105
  • [22] A formal algebra for document-oriented NoSQL data warehouses: formalisation and evaluation
    Senda Bouaziz
    Soumaya Boukettaya
    Ahlem Nabli
    Faiez Gargouri
    Cluster Computing, 2025, 28 (3)
  • [23] Online mining changes of items over continuous append-only and dynamic data streams
    Li, HF
    Lee, SY
    Shan, MK
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2005, 11 (08) : 1411 - 1425
  • [24] Model Transformation From Object Relational Database to NoSQL Document Database
    Fouad, Toufik
    Mohamed, Bahaj
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON NETWORKING, INFORMATION SYSTEMS & SECURITY (NISS19), 2019,
  • [25] Extraction of Semantic Links from a Document-Oriented NoSQL Database
    Abdelhedi F.
    Rajhi H.
    Zurfluh G.
    SN Computer Science, 4 (2)
  • [26] Enhanced Elearning Application for Data Mining in a NoSQL Distributed Database Management System
    Valentin, Pupezescu
    Mailena-Catalina, Dragomir
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON VIRTUAL LEARNING, ICVL 2019, 2019, : 476 - 482
  • [27] Data mining applied to music style classification
    Nie Y.-B.
    International Journal of Simulation: Systems, Science and Technology, 2016, 17 (02): : 19.1 - 19.6
  • [28] Digital Archiving and Data Mining of Historic Document
    Tabrizi, M. H. N.
    2008 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING, 2008, : 19 - 23
  • [29] METADATA-DRIVEN DATA MIGRATION FROM OBJECT-RELATIONAL DATABASE TO NOSQL DOCUMENT-ORIENTED DATABASE
    Aggoune, Aicha
    Namoune, Mohamed Sofiane
    COMPUTER SCIENCE-AGH, 2022, 23 (04): : 495 - 519
  • [30] Art Design Style Mining Based on Deep Learning and Data Mining
    Feng J.
    Wang Z.
    Computer-Aided Design and Applications, 2024, 21 (S19): : 33 - 47