Managing Large Scale Unstructured Data with RDBMS

Cited by: 0
Authors
Jiang, Zhe [1]
Luo, Yi [2]
Wu, Naihu [1]
He, Chunjiang [3]
Yuan, Pingpeng [2]
Jin, Hai [2]
Affiliations
[1] Shandong Elect Power Res Inst, Jinan 250002, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Wuhan 430074, Peoples R China
[3] China Elect Power Res Inst, Beijing, Peoples R China
Source
2013 IEEE 11TH INTERNATIONAL CONFERENCE ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING (DASC) | 2013
Keywords
Unstructured data; Relational database; Query processing; Column-oriented database
DOI
10.1109/DASC.2013.135
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the rapid development of information technology, the need for unstructured data storage and processing is growing rapidly, which places new requirements on database storage. Traditional row-oriented relational databases appear inadequate for querying and analyzing such data. In this paper, we propose a novel approach to storing unstructured data in a relational database. By splitting the VALUE property of the unstructured KEY/VALUE data and recreating two-dimensional data from it, the original data can be stored in relational databases. The system introduced in this paper is designed to handle this task. In addition, the system rebuilds SQL as its query language, which keeps it compatible with relational databases. In query experiments on unstructured data, the system decomposes the SQL statements submitted by users well and generates correct sub-query statements. The experimental results also show that the system performs well.
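The idea of splitting a VALUE and rebuilding two-dimensional data can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the schema or the VALUE encoding, so the `name=value;name=value` format, the `kv_cells` table, and the function names below are all assumptions made for illustration, using Python's standard `sqlite3` module.

```python
import sqlite3


def split_value(value, sep=";", kv_sep="="):
    """Split a VALUE string such as 'a=1;b=2' into (attribute, value) pairs.

    The 'name=value;...' encoding is an assumption, not taken from the paper.
    """
    pairs = []
    for part in value.split(sep):
        if not part.strip():
            continue  # skip empty fragments, e.g. from a trailing separator
        name, _, val = part.partition(kv_sep)
        pairs.append((name.strip(), val.strip()))
    return pairs


def load_records(conn, records):
    """Store KEY/VALUE records as (row_key, attribute, value) rows.

    'kv_cells' is a hypothetical table name chosen for this sketch.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS kv_cells ("
        "row_key TEXT, attribute TEXT, value TEXT, "
        "PRIMARY KEY (row_key, attribute))"
    )
    for key, value in records.items():
        conn.executemany(
            "INSERT OR REPLACE INTO kv_cells VALUES (?, ?, ?)",
            [(key, attr, val) for attr, val in split_value(value)],
        )
    conn.commit()


def query_attribute(conn, attribute, value):
    """Return the keys whose given attribute equals the given value."""
    rows = conn.execute(
        "SELECT row_key FROM kv_cells "
        "WHERE attribute = ? AND value = ? ORDER BY row_key",
        (attribute, value),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    load_records(conn, {
        "doc1": "type=report;year=2013",
        "doc2": "type=memo;year=2013",
        "doc3": "type=report",
    })
    print(query_attribute(conn, "type", "report"))
```

Storing one row per (key, attribute) pair lets records with different attribute sets share a single relational table, at the cost of needing one sub-query or self-join per attribute when a user query filters on several attributes at once.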
Pages: 613-620
Page count: 8