On the optimization of Hadoop MapReduce default job scheduling through dynamic job prioritization
(ندگان)پدیدآور
Peyravi, NargesMoeini, Aliنوع مدرک
TextResearch Paper
زبان مدرک
Englishچکیده
One of the most popular frameworks for big data processing is Apache Hadoop MapReduce. The default Hadoop scheduler uses queue system. However, it does not consider any specific priority for the jobs required for MapReduce programming model. In this paper, a new dynamic score is developed to improve the performance of the default Hadoop MapReduce scheduler. This dynamic priority score is computed based on effective factors such as job runtime estimation, input data size, waiting time, and length or bustle of the waiting queue. The implementation of the proposed scheduling method, based on this dynamic score, not only improves CPU and memory performance, but also reduced waiting time and average turnaround time by approximately $45%$ and $40%$ respectively, compared to the default Hadoop scheduler.
کلید واژگان
Hadoop MapReduceJob scheduling
Prioritization
dynamic priority score
شماره نشریه
2تاریخ نشر
2020-12-011399-09-11
ناشر
University of Tehranسازمان پدید آورنده
Department of Computer Engineering and Information Technology, Faculty of Engineering, University of Qom, Qom, IranDepartment of Algorithms and Computation, School of Engineering Science, College of Engineering, University of Tehran
شاپا
2476-27762476-2784




