AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (6.1 MB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access

Evolution Strategies-Guided Deep Reinforcement Learning for Dynamic Hybrid Flow-Shop Scheduling Problem

School of Computer Science, China University of Geosciences, Wuhan 430078, China
School of Computer Science, China University of Geosciences, Wuhan 430078, China, and also with Engineering Research Center of Natural Resource Information Management and Digital Twin Engineering Software, Ministry of Education, Wuhan 430074, China
Faculty of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan 430205, China
Department of Computer Science, Texas Tech University, Lubbock, TX 79409-3104, USA
Show Author Information

Abstract

Flexible manufacturing faces the challenge of increasing productivity and conserving resources, especially in complex production environments with dynamic event. This paper addresses a dynamic Hybrid Flow-shop Scheduling Problem (HFSP) with unrelated parallel machines using a Deep Reinforcement Learning (DRL) approach to intelligently allocate continuous new job arrivals while minimizing the total weighted tardiness cost. In this paper, Evolution Strategies-guided Deep Reinforcement Learning (ES-DRL) scheduling model is proposed by designing appropriate state features, scheduling actions, and training strategies. In addition, goal-directed composite rules are proposed to provide effective scheduling actions. Meanwhile, the state transition in the environment is adjusted by introducing key state. The ES-DRL model is then trained to make decisions, indicating the reasoning behind the system design. Experimental results show that ES-DRL outperforms the other comparison algorithms regarding significance. In addition, the experiments are extended to the multi-factories system to further validate the scalability and adaptability of the scheduling model, and this extension also yields encouraging results. These results affirm the universal applicability of ES-DRL for dynamic HFSP.

References

【1】
【1】
 
 
Tsinghua Science and Technology
Pages 125-141

{{item.num}}

Comments on this article

Go to comment

< Back to all reports

Review Status: {{reviewData.commendedNum}} Commended , {{reviewData.revisionRequiredNum}} Revision Required , {{reviewData.notCommendedNum}} Not Commended Under Peer Review

Review Comment

Close
Close
Cite this article:
Luo L, Yan X, Wu Q, et al. Evolution Strategies-Guided Deep Reinforcement Learning for Dynamic Hybrid Flow-Shop Scheduling Problem. Tsinghua Science and Technology, 2026, 31(1): 125-141. https://doi.org/10.26599/TST.2024.9010141
Part of a topical collection:

3969

Views

303

Downloads

8

Crossref

9

Web of Science

0

Scopus

0

CSCD

Received: 27 April 2024
Revised: 30 May 2024
Accepted: 30 July 2024
Published: 25 August 2025
© The author(s) 2026.

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).