Unstructured Data Extraction in Distributed NoSQL

被引:0
作者
Lomotey, Richard K. [1 ]
Deters, Ralph [1 ]
机构
[1] Univ Saskatchewan, Dept Comp Sci, Saskatoon, SK S7N 0W0, Canada
来源
2013 7TH IEEE INTERNATIONAL CONFERENCE ON DIGITAL ECOSYSTEMS AND TECHNOLOGIES (DEST) | 2013年
关键词
Unstructured data; big data; Hidden Markov Model (HMM); terms extraction; NoSQL; Re-usable dictionary; Association rules;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
While "Big data" has brought good tidings in terms of easy accessibility to voluminous data, we are faced with challenges too. The existing Knowledge Discovery in Database (KDD) processes which have been proposed for schema-oriented data sources are no longer applicable since today's data is unstructured. Previously, we deployed a tool called TouchR which relies on the Hidden Markov Model (HMM) to extract terms from unstructured data sources (specifically, NoSQL databases). This paper has advanced on the initially deployed version where we infroduced re-usable dictionary and association rules to improve on the quality of the extracted terms. Also, the tool in its present stage is more adaptable to the user search based on the most frequently searched term.
引用
收藏
页码:160 / 165
页数:6
相关论文
共 50 条
  • [1] Performance Evaluation of Unstructured NoSQL data over distributed framework
    Nyati, Suyog S.
    Pawar, Shivanand
    Ingle, Rajesh
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1623 - 1627
  • [2] USING NoSQL FOR PROCESSING UNSTRUCTURED BIG DATA
    Balakayeva, G. T.
    Phillips, C.
    Darkenbayev, D. K.
    Turdaliyev, M.
    NEWS OF THE NATIONAL ACADEMY OF SCIENCES OF THE REPUBLIC OF KAZAKHSTAN-SERIES OF GEOLOGY AND TECHNICAL SCIENCES, 2019, (06): : 12 - 21
  • [3] Terms Extraction from Unstructured Data Silos
    Lomotey, Richard K.
    Deters, Ralph
    2013 8TH INTERNATIONAL CONFERENCE ON SYSTEM OF SYSTEMS ENGINEERING (SOSE), 2013, : 19 - 24
  • [4] SECURITY ANALYSIS OF UNSTRUCTURED DATA IN NOSQL MONGODB DATABASE
    Kumar, Jitender
    Garg, Varsha
    2017 INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES FOR SMART NATION (IC3TSN), 2017, : 300 - 305
  • [5] Terms Mining in Document-Based NoSQL: Response to Unstructured Data
    Lomotey, Richard K.
    Deters, Ralph
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 661 - 668
  • [6] USE OF NOSQL TECHNOLOGY FOR ANALYSIS OF UNSTRUCTURED SPATIAL DATA
    Polakova, Monta
    Vitols, Gatis
    RESEARCH FOR RURAL DEVELOPMENT 2018, VOL 2, 2018, : 267 - 270
  • [7] Processing of association rules with ontology in distributed NoSQL systems
    Dahmani, Djilali
    Belalem, Ghalem
    Rahal, Sidi Ahmed
    WEB INTELLIGENCE, 2019, 17 (04) : 285 - 296
  • [8] Handling Big Data using NoSQL
    Bhogal, Jagdev
    Choksi, Imran
    2015 IEEE 29TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS WAINA 2015, 2015, : 393 - 398
  • [9] Big Data: The NoSQL and RDBMS review
    Zafar, Rashid
    Yafi, Eiad
    Zuhairi, Megat F.
    Dao, Hassan
    2016 PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICICTM), 2016, : 120 - 126
  • [10] Processing of Unstructured data for Information Extraction
    Ingle, Vaishali A.
    3RD NIRMA UNIVERSITY INTERNATIONAL CONFERENCE ON ENGINEERING (NUICONE 2012), 2012,