Evolution Strategies-Guided Deep Reinforcement Learning for Dynamic Hybrid Flow-Shop Scheduling Problem

Lin Luo; Xuesong Yan; Qinghua Wu; Victor S. Sheng

doi:10.26599/TST.2024.9010141

AI Chat Paper

Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.

Chat more with AI

| Sign up

Browse by Subject

Search for peer-reviewed journals with full access.

Journals A - Z

About Us

Discover the SciOpen Platform and Achieve Your Research Goals with Ease.

About Us

Publish with Us

Support

Search articles, authors, keywords, DOl and etc.

Published Date

Reset Search

{{expandStatus?'Exit ':''}}Advanced Search

Journals A - Z

About Us

Publish with Us

Support

PDF (6.1 MB)

Cite

EndNote(RIS) BibTeX

Collect

Submit Manuscript

AI Chat Paper

Show Outline

Outline

Show full outline

Hide outline

Outline

Show full outline

Hide outline

Open Access

Evolution Strategies-Guided Deep Reinforcement Learning for Dynamic Hybrid Flow-Shop Scheduling Problem

Lin Luo^¹, Xuesong Yan^², Qinghua Wu^³(

), Victor S. Sheng^⁴

1School of Computer Science, China University of Geosciences, Wuhan 430078, China

2School of Computer Science, China University of Geosciences, Wuhan 430078, China, and also with Engineering Research Center of Natural Resource Information Management and Digital Twin Engineering Software, Ministry of Education, Wuhan 430074, China

3Faculty of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan 430205, China

4Department of Computer Science, Texas Tech University, Lubbock, TX 79409-3104, USA

Show Author Information

Abstract

Flexible manufacturing faces the challenge of increasing productivity and conserving resources, especially in complex production environments with dynamic event. This paper addresses a dynamic Hybrid Flow-shop Scheduling Problem (HFSP) with unrelated parallel machines using a Deep Reinforcement Learning (DRL) approach to intelligently allocate continuous new job arrivals while minimizing the total weighted tardiness cost. In this paper, Evolution Strategies-guided Deep Reinforcement Learning (ES-DRL) scheduling model is proposed by designing appropriate state features, scheduling actions, and training strategies. In addition, goal-directed composite rules are proposed to provide effective scheduling actions. Meanwhile, the state transition in the environment is adjusted by introducing key state. The ES-DRL model is then trained to make decisions, indicating the reasoning behind the system design. Experimental results show that ES-DRL outperforms the other comparison algorithms regarding significance. In addition, the experiments are extended to the multi-factories system to further validate the scalability and adaptability of the scheduling model, and this extension also yields encouraging results. These results affirm the universal applicability of ES-DRL for dynamic HFSP.

Keywords

Hybrid Flow-shop Scheduling Problem (HFSP)real-time scheduling Deep Reinforcement Learning (DRL)evolution strategies intelligent manufacturing multi-factories

References

【1】

Crossref Google Scholar

Tsinghua Science and Technology

Volume 31 Issue 1,
February 2026

Pages 125-141

DOI: 10.26599/TST.2024.9010141

	{{item.num}}
{{version.versionName}} Author Response
{{version.versionName}} Review comment

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Cite this Report

. . , , {{reviewData.reportCite.doi}}

Cite this article:

Luo L, Yan X, Wu Q, et al. Evolution Strategies-Guided Deep Reinforcement Learning for Dynamic Hybrid Flow-Shop Scheduling Problem. Tsinghua Science and Technology, 2026, 31(1): 125-141. https://doi.org/10.26599/TST.2024.9010141

Part of a topical collection:

Special Issue on Learning-Driven Optimization for Complex Systems

3969

Views

303

Downloads

Crossref

Web of Science

Scopus

CSCD

Google Scholar
Citation

Received: 27 April 2024

Revised: 30 May 2024

Accepted: 30 July 2024

Published: 25 August 2025

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).