Deep Reinforcement Learning for Distributed Flow Shop Scheduling with Flexible Maintenance

A common situation arising in flow shops is that the job processing order must be the same on each machine; this is referred to as a permutation flow shop scheduling problem (PFSSP). Although many algorithms have been designed to solve PFSSPs, machine availability is typically ignored. Healthy machi...

Full description

Bibliographic Details
Main Authors: Wang, H. (Author), Wu, W. (Author), Yan, Q. (Author)
Format: Article
Language:English
Published: MDPI 2022
Subjects:
Online Access:View Fulltext in Publisher
LEADER 02359nam a2200217Ia 4500
001 0.3390-machines10030210
008 220421s2022 CNT 000 0 und d
020 |a 20751702 (ISSN) 
245 1 0 |a Deep Reinforcement Learning for Distributed Flow Shop Scheduling with Flexible Maintenance 
260 0 |b MDPI  |c 2022 
856 |z View Fulltext in Publisher  |u https://doi.org/10.3390/machines10030210 
520 3 |a A common situation arising in flow shops is that the job processing order must be the same on each machine; this is referred to as a permutation flow shop scheduling problem (PFSSP). Although many algorithms have been designed to solve PFSSPs, machine availability is typically ignored. Healthy machine conditions are essential for the production process, which can ensure productivity and quality; thus, machine deteriorating effects and periodic preventive maintenance (PM) activities are considered in this paper. Moreover, distributed production networks, which can manufacture products quickly, are of increasing interest to factories. To this end, this paper investigates an integrated optimization of the distributed PFSSP with flexible PM. With the introduction of machine maintenance constraints in multi-factory production scheduling, the complexity and computation time of solving the problem increases substantially in large-scale arithmetic cases. In order to solve it, a deep Q network-based solution framework is designed with a diminishing greedy rate in this paper. The proposed solution framework is compared to the DQN with fixed greedy rate, in addition to two well-known metaheuristic algorithms, including the genetic algorithm and the iterated greedy algorithm. Numerical studies show that the application of the proposed approach in the studied production-maintenance joint scheduling problem exhibits strong solution performance and generalization abilities. Moreover, a suitable maintenance interval is also obtained, in addition to some managerial insights. © 2022 by the authors. Licensee MDPI, Basel, Switzerland. 
650 0 4 |a deep reinforcement learning 
650 0 4 |a distributed manufacturing 
650 0 4 |a machine deterioration 
650 0 4 |a permutation flow shop scheduling 
650 0 4 |a preventive maintenance 
700 1 0 |a Wang, H.  |e author 
700 1 0 |a Wu, W.  |e author 
700 1 0 |a Yan, Q.  |e author 
773 |t Machines