TY - GEN
T1 - Event-based optimization for dispatching policies in material handling systems of general assembly lines
AU - Zhao, Yanjia
AU - Zhao, Qianchuan
AU - Jia, Qing Shan
AU - Guan, Xiaohong
AU - Cao, Xi Ren
PY - 2008
Y1 - 2008
N2 - A material handling (MH) system of a general assembly line dispatching parts from inventory to working buffers could be complicated and costly to operate. Generally it is extremely difficult to find the optimal dispatching policy due to the complicated system dynamics and the large problem size. In this paper, we formulate the dispatching problem as a Markov decision process (MDP), and use event-based optimization framework to overcome the difficulty caused by problem dimensionality and size. By exploiting the problem structures, we focus on responding to certain events instead of all state transitions, so that the number of aggregated potential function (i.e., value function) is scaled to the square of the system size despite of the exponential growth of the state space. This effectively reduces the computational requirements to a level that is acceptable in practice. We then develop a sample path based algorithm to estimate the potentials, and implement a gradient-based policy optimization procedure. Numerical results demonstrate that the policies obtained by the event-based optimization approach significantly outperform the current dispatching method in production.
AB - A material handling (MH) system of a general assembly line dispatching parts from inventory to working buffers could be complicated and costly to operate. Generally it is extremely difficult to find the optimal dispatching policy due to the complicated system dynamics and the large problem size. In this paper, we formulate the dispatching problem as a Markov decision process (MDP), and use event-based optimization framework to overcome the difficulty caused by problem dimensionality and size. By exploiting the problem structures, we focus on responding to certain events instead of all state transitions, so that the number of aggregated potential function (i.e., value function) is scaled to the square of the system size despite of the exponential growth of the state space. This effectively reduces the computational requirements to a level that is acceptable in practice. We then develop a sample path based algorithm to estimate the potentials, and implement a gradient-based policy optimization procedure. Numerical results demonstrate that the policies obtained by the event-based optimization approach significantly outperform the current dispatching method in production.
UR - https://www.scopus.com/pages/publications/62949129183
U2 - 10.1109/CDC.2008.4739093
DO - 10.1109/CDC.2008.4739093
M3 - Conference Paper published in a book
AN - SCOPUS:62949129183
SN - 9781424431243
T3 - Proceedings of the IEEE Conference on Decision and Control
SP - 2173
EP - 2178
BT - Proceedings of the 47th IEEE Conference on Decision and Control, CDC 2008
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 47th IEEE Conference on Decision and Control, CDC 2008
Y2 - 9 December 2008 through 11 December 2008
ER -