An Efficient Data Duplication System based on Hadoop Distributed File System

Cited by: 4
Authors
Veeraiah, D. [1 ]
Rao, J. Nageswara [1 ]
Affiliations
[1] Lakireddy Bali Reddy Coll Engn Autonomous, Dept Comp Sci & Engn, Mylavaram 521230, Andhra Pradesh, India
Source
PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT-2020) | 2020
Keywords
Data Locality; Data Duplication; Hadoop; Access Predication;
DOI
10.1109/icict48043.2020.9112567
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
HDFS (Hadoop Distributed File System) is the part of Apache Hadoop that stores large data sets reliably. It is used to process massive-scale data in parallel, and it ensures data availability by replicating data across different nodes. However, the replication policy of HDFS does not consider the popularity of the data, and the popularity of files tends to change over time. Hence, maintaining a fixed replication factor can reduce the storage efficiency of HDFS. An Efficient Data Duplication System Based on HDFS is proposed, which considers the popularity of the files stored in HDFS before replicating them. The proposed technique reduces storage consumption by up to 45% without affecting availability or fault tolerance in HDFS.
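The idea in the abstract, choosing a replication factor per file from its popularity instead of using one fixed factor, can be illustrated with a minimal sketch. This is not the authors' algorithm: the `FileStats` record, the `choose_replication` function, the access-count thresholds, and the sample files are all hypothetical, and the sketch does not model how the paper preserves fault tolerance for rarely accessed files. The only HDFS fact it relies on is the default replication factor of 3.

```python
from dataclasses import dataclass

DEFAULT_REPLICATION = 3  # HDFS default replication factor (dfs.replication)


@dataclass
class FileStats:
    path: str
    size_bytes: int
    recent_accesses: int  # hypothetical popularity measure, e.g. reads in the last day


def choose_replication(stats, cold_threshold=10, hot_threshold=100):
    """Pick a replication factor from popularity (thresholds are assumptions)."""
    if stats.recent_accesses >= hot_threshold:
        return DEFAULT_REPLICATION + 1  # popular file: one extra replica
    if stats.recent_accesses < cold_threshold:
        return DEFAULT_REPLICATION - 1  # unpopular file: one replica fewer
    return DEFAULT_REPLICATION


def storage_used(files, policy):
    """Total raw bytes consumed when each file is stored with policy(file) replicas."""
    return sum(f.size_bytes * policy(f) for f in files)


# Hypothetical sample files; sizes are arbitrary units.
files = [
    FileStats("/logs/2019.csv", 400, 2),
    FileStats("/logs/2020.csv", 400, 50),
    FileStats("/dim/users.parquet", 200, 500),
]

fixed = storage_used(files, lambda f: DEFAULT_REPLICATION)
adaptive = storage_used(files, choose_replication)
print(f"fixed={fixed} adaptive={adaptive}")
```

On a real cluster, a factor chosen this way could be applied to an existing file with `hdfs dfs -setrep`; the savings in this toy data set say nothing about the 45% figure reported in the abstract.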
Pages: 197-200
Page count: 4