RP-DQN: An application of Q-Learning to Vehicle Routing Problems

04/25/2021
by   Ahmad Bdeir, et al.
0

In this paper we present a new approach to tackle complex routing problems with an improved state representation that utilizes the model complexity better than previous methods. We enable this by training from temporal differences. Specifically Q-Learning is employed. We show that our approach achieves state-of-the-art performance for autoregressive policies that sequentially insert nodes to construct solutions on the CVRP. Additionally, we are the first to tackle the MDVRP with machine learning methods and demonstrate that this problem type greatly benefits from our approach over other ML methods.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset