Accelerating DAG-Style Job Execution via Optimizing Resource Pipeline Scheduling

Yubin Duan; Ning Wang; Jie Wu

doi:10.1007/s11390-021-1488-4

AI Chat Paper

Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.

Chat more with AI

| Sign up

Browse by Subject

Search for peer-reviewed journals with full access.

Journals A - Z

About Us

Discover the SciOpen Platform and Achieve Your Research Goals with Ease.

About Us

Publish with Us

Support

Search articles, authors, keywords, DOl and etc.

Published Date

Reset Search

{{expandStatus?'Exit ':''}}Advanced Search

Journals A - Z

About Us

Publish with Us

Support

Article Link

Cite

EndNote(RIS) BibTeX

Collect

Submit Manuscript

Show Outline

Outline

Show full outline

Hide outline

Outline

Show full outline

Hide outline

Regular Paper

Accelerating DAG-Style Job Execution via Optimizing Resource Pipeline Scheduling

Yubin Duan^¹, Ning Wang^², Jie Wu^¹(

)

Department of Computer and Information Sciences, Temple University, Philadelphia 19122, U.S.A.

Department of Computer Science, Rowan University, Glassboro 08028, U.S.A.

A preliminary version was published in the Proceedings of IEEE MASS 2020.

Show Author Information

Abstract

The volume of information that needs to be processed in big data clusters increases rapidly nowadays. It is critical to execute the data analysis in a time-efficient manner. However, simply adding more computation resources may not speed up the data analysis significantly. The data analysis jobs usually consist of multiple stages which are organized as a directed acyclic graph (DAG). The precedence relationships between stages cause scheduling challenges. General DAG scheduling is a well-known NP-hard problem. Moreover, we observe that in some parallel computing frameworks such as Spark, the execution of a stage in DAG contains multiple phases that use different resources. We notice that carefully arranging the execution of those resources in pipeline can reduce their idle time and improve the average resource utilization. Therefore, we propose a resource pipeline scheme with the objective of minimizing the job makespan. For perfectly parallel stages, we propose a contention-free scheduler with detailed theoretical analysis. Moreover, we extend the contention-free scheduler for three-phase stages, considering the computation phase of some stages can be partitioned. Additionally, we are aware that job stages in real-world applications are usually not perfectly parallel. We need to frequently adjust the parallelism levels during the DAG execution. Considering reinforcement learning (RL) techniques can adjust the scheduling policy on the fly, we investigate a scheduler based on RL for online arrival jobs. The RL-based scheduler can adjust the resource contention adaptively. We evaluate both contention-free and RL-based schedulers on a Spark cluster. In the evaluation, a real-world cluster trace dataset is used to simulate different DAG styles. Evaluation results show that our pipelined scheme can significantly improve CPU and network utilization.

Keywords

data center cluster directed acyclic graph scheduling makespan minimization pipeline

Electronic Supplementary Material

Download File(s)

1488_ESM.pdf (154.5 KB)

References

【1】

Crossref Google Scholar

Journal of Computer Science and Technology

Volume 37 Issue 4,
July 2022

Pages 852-868

DOI: 10.1007/s11390-021-1488-4

	{{item.num}}
{{version.versionName}} Author Response
{{version.versionName}} Review comment

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Cite this Report

. . , , {{reviewData.reportCite.doi}}

Cite this article:

Duan Y, Wang N, Wu J. Accelerating DAG-Style Job Execution via Optimizing Resource Pipeline Scheduling. Journal of Computer Science and Technology, 2022, 37(4): 852-868. https://doi.org/10.1007/s11390-021-1488-4

1029

Views

Crossref

Web of Science

Scopus

CSCD

Google Scholar
Citation

Received: 06 April 2021

Revised: 01 September 2021

Accepted: 23 November 2021

Published: 25 July 2022