On the optimization of Hadoop MapReduce default job scheduling through dynamic job prioritization

Peyravi, Narges; Moeini, Ali

(ندگان)پدیدآور

Peyravi, NargesMoeini, Ali

دریافت مدرک

FullText

اندازه فایل:

1.065 مگابایت

نوع فايل (MIME):

PDF

نوع مدرک

Text
Research Paper

زبان مدرک

English

نمایش کامل رکورد

چکیده

One of the most popular frameworks for big data processing is Apache Hadoop MapReduce. The default Hadoop scheduler uses queue system. However, it does not consider any specific priority for the jobs required for MapReduce programming model. In this paper, a new dynamic score is developed to improve the performance of the default Hadoop MapReduce scheduler. This dynamic priority score is computed based on effective factors such as job runtime estimation, input data size, waiting time, and length or bustle of the waiting queue. The implementation of the proposed scheduling method, based on this dynamic score, not only improves CPU and memory performance, but also reduced waiting time and average turnaround time by approximately $45%$ and $40%$ respectively, compared to the default Hadoop scheduler.

کلید واژگان

Hadoop MapReduce
Job scheduling
Prioritization
dynamic priority score

شماره نشریه

تاریخ نشر

2020-12-01
1399-09-11

ناشر

University of Tehran

سازمان پدید آورنده

Department of Computer Engineering and Information Technology, Faculty of Engineering, University of Qom, Qom, Iran
Department of Algorithms and Computation, School of Engineering Science, College of Engineering, University of Tehran

شاپا

2476-2776
2476-2784

URI

https://jac.ut.ac.ir/article_79266.html
https://iranjournals.nlai.ir/handle/123456789/708509