An application of reinforcement learning to manufacturing scheduling problems

2 Author(s)
Y. Tanaka (Sch. of Inf. Sci., Japan Advanced Inst. of Sci. & Technol., Ishikawa, Japan); T. Yoshida

The feasibility of applying reinforcement learning to a flow shop scheduling problem whose objective is to minimize the maximum completion time (makespan) is studied for two and three machines. Finding an optimal solution in this problem domain is generally hard with more than two machines, whereas with exactly two machines an optimal schedule is given by Johnson's algorithm. Implementing several reinforcement learning formulations reveals the following points. First, a good formulation may sometimes lead an agent to acquire the optimal rule that minimizes the objective function. Second, an agent can learn and obtain improved schedules even when the formulation is not perfect. Third, the same formulation is sound not only for the two-machine problem but also for the three-machine problem, where Johnson's algorithm does not necessarily give an optimal solution. Consequently, reinforcement learning has the potential to help find an approximate solution, or sometimes the optimal solution, in a relatively simple way. The capability of a reinforcement learning agent, however, depends mostly on the problem formulation, which is devised using theoretical solution methods and heuristics. At the same time, the agent has great flexibility to obtain improved schedules under a formulation with less prior knowledge.
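For readers unfamiliar with the two-machine baseline mentioned in the abstract, the following is a minimal sketch of Johnson's rule together with a makespan calculation for a permutation flow shop. It is not taken from the paper: the function names `makespan` and `johnson_order`, the `times` layout, and the sample processing times are all assumptions chosen for illustration, and the sketch does not attempt to reproduce the authors' reinforcement learning formulation.

```python
from itertools import permutations


def makespan(order, times):
    """Maximum completion time of a permutation flow shop schedule.

    times[j][k] is the processing time of job j on machine k
    (hypothetical layout, chosen for this sketch).
    """
    n_machines = len(times[0])
    finish = [0] * n_machines  # completion time of the latest job on each machine
    for j in order:
        for k in range(n_machines):
            # A job starts on machine k once the machine is free and the
            # job has finished on the previous machine.
            ready = finish[k - 1] if k > 0 else 0
            finish[k] = max(finish[k], ready) + times[j][k]
    return finish[-1]


def johnson_order(times):
    """Johnson's rule for exactly two machines.

    Jobs whose first operation is no longer than their second go first,
    in increasing order of the first operation; the remaining jobs go
    last, in decreasing order of the second operation.
    """
    first = sorted(
        (j for j, (a, b) in enumerate(times) if a <= b),
        key=lambda j: times[j][0],
    )
    last = sorted(
        (j for j, (a, b) in enumerate(times) if a > b),
        key=lambda j: times[j][1],
        reverse=True,
    )
    return first + last


if __name__ == "__main__":
    # Illustrative data only: (machine 1 time, machine 2 time) per job.
    times = [(3, 6), (5, 2), (1, 2), (6, 6), (7, 5)]
    order = johnson_order(times)
    best = min(makespan(p, times) for p in permutations(range(len(times))))
    print("Johnson order:", order)
    print("Johnson makespan:", makespan(order, times))
    print("Brute-force optimum:", best)
```

The `makespan` function accepts any number of machines, so the same routine could score schedules for a three-machine instance, where Johnson's rule no longer guarantees optimality and a learned or heuristic ordering would have to be compared against exhaustive search on small cases.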

Published in:

1999 IEEE International Conference on Systems, Man, and Cybernetics (IEEE SMC '99), Conference Proceedings, Volume 4

Date of Conference:

1999