In a high-mix and low-volume manufacturing facility, heterogeneous jobs introduce frequent reconfiguration of machines which increases the chance of unplanned machine breakdowns. As machines are often nonidentical and their performance degrades over time, it is critical to consider the heterogeneity and non-stationarity of the machines during scheduling. We propose a reinforcement learning-based framework with a novel sampling method to train the agent to schedule heterogeneous jobs on non-stationary unreliable parallel machines to minimize weighted tardiness. The results indicate that the new sampling approach expedites the learning process and the resulting policy significantly outperforms static dispatching rules.
All Science Journal Classification (ASJC) codes
- Chemical Engineering (miscellaneous)