Efficient & Accurate Scheduling Algorithm For Cloudera Hadoop

被引:1
作者
Yadav, Swati [1 ]
Vishwakarma, Santosh [1 ]
Verma, Ashok [1 ]
机构
[1] Gyan Ganga Inst Sci & Technol, Dept Comp Sci, Jabalpur, India
来源
2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN) | 2015年
关键词
MapReduce; huge Data; Hadoop; Cloud Computing; Improved Scheduling;
D O I
10.1109/CICN.2015.322
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
the term immense data was coined to capture which suggests of this rising trend. To boot to its sheer volume, immense data to boot exhibits completely different distinctive characteristics as compared with ancient data. For instance, immense data is typically unstructured and wish extra amount analysis. This development incorporates new system architectures for data acquisition, transmission, storage, and large-scale process mechanisms. Recent technological advancements have semiconductor diode to a deluge of information from distinctive domains (e.g., health care and sciatic sensors, user generated data, net and money corporations, and supply chain systems). the buildup of information over the past twenty years has enlarged to large volumes. Apache Hadoop have introduced a economical and possible tool for distributed computing of such immense data for filtering and extracting massive volumes of knowledge. MapReduce can be a good used parallel computing framework for giant scale process. The two major performance metrics in MapReduce area unit job execution time and cluster production. MapReduce uses inventory accounting job programming by default and completely different programming algorithms area unit being introduced in proprietary domain. This work introduces a metric primarily based programming algorithmic rule to reinforce the potency and utilization of the server resources.
引用
收藏
页码:839 / 844
页数:6
相关论文
共 50 条
[21]   DYNAMIC SCHEDULING ALGORITHM FOR REDUCING START TIME IN HADOOP [J].
Gunasekaran, S. ;
SaiRamesh, L. ;
Sabena, S. ;
Selvakumar, K. ;
Ganapathy, S. ;
Kannan, A. .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
[22]   Deadline scheduling algorithm for sustainable computing in Hadoop environment [J].
Varga, Mihai ;
Petrescu-Nita, Alina ;
Pop, Florin .
COMPUTERS & SECURITY, 2018, 76 :354-366
[23]   Distributed Scheduling Extension on Hadoop [J].
Zeng Dadan ;
Wang Xieqin ;
Jiang Ningkang .
CLOUD COMPUTING, PROCEEDINGS, 2009, 5931 :687-693
[24]   A Review of Scheduling Algorithms in Hadoop [J].
Sharma, Anil ;
Singh, Gurwinder .
PROCEEDINGS OF RECENT INNOVATIONS IN COMPUTING, ICRIC 2019, 2020, 597 :125-135
[25]   Hadoop-MCC: Efficient Multiple Compound Comparison Algorithm Using Hadoop [J].
Hua, Guan-Jie ;
Hung, Che-Lun ;
Tang, Chuan Yi .
COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2018, 21 (02) :84-92
[26]   Energy-Efficient Task Scheduling for CPU-Intensive Streaming Jobs on Hadoop [J].
Jin, Peiquan ;
Hao, Xingjun ;
Wang, Xiaoliang ;
Yue, Lihua .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (06) :1298-1311
[27]   FRESH: Fair and Efficient Slot Configuration and Scheduling for Hadoop Clusters [J].
Wang, Jiayin ;
Yao, Yi ;
Mao, Ying ;
Sheng, Bo ;
Mi, Ningfang .
2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, :761-768
[28]   Improving Scheduling Efficiency of Hadoop YARN Using AFSA Algorithm [J].
Gao Junlei ;
Tang Tie ;
Wu Gangshan .
2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, :919-924
[29]   Profit-oriented task scheduling algorithm in Hadoop cluster [J].
Chai, Xu-qing ;
Dong, Yong-liang ;
Li, Jun-fei .
EURASIP JOURNAL ON EMBEDDED SYSTEMS, 2016,
[30]   Job Scheduling Optimization using BAT Algorithm in Hadoop Environment [J].
Raghav, R. S. ;
Amudhavel, J. ;
Dhavachelvan, P. .
BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2018, 11 (01) :134-139