File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

A deep reinforcement learning approach for solving the Traveling Salesman Problem with Drone

Author(s)
Bogyrbayeva, AigerimYoon, TaehyunKo, HanbumLim, SungbinYun, HyokunKwon, Changhyun
Issued Date
2023-03
DOI
10.1016/j.trc.2022.103981
URI
https://scholarworks.unist.ac.kr/handle/201301/64755
Citation
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, v.148, pp.103981
Abstract
Reinforcement learning has recently shown promise in learning quality solutions in many combinatorial optimization problems. In particular, the attention-based encoder-decoder models show high effectiveness on various routing problems, including the Traveling Salesman Problem (TSP). Unfortunately, they perform poorly for the TSP with Drone (TSP-D), requiring routing a heterogeneous fleet of vehicles in coordination-a truck and a drone. In TSP-D, the two vehicles are moving in tandem and may need to wait at a node for the other vehicle to join. State-less attention-based decoder fails to make such coordination between vehicles. We propose a hybrid model that uses an attention encoder and a Long Short-Term Memory (LSTM) network decoder, in which the decoder's hidden state can represent the sequence of actions made. We empirically demonstrate that such a hybrid model improves upon a purely attention-based model for both solution quality and computational efficiency. Our experiments on the min-max Capacitated Vehicle Routing Problem (mmCVRP) also confirm that the hybrid model is more suitable for the coordinated routing of multiple vehicles than the attention-based model. The proposed model demonstrates comparable results as the operations research baseline methods.
Publisher
PERGAMON-ELSEVIER SCIENCE LTD
ISSN
0968-090X
Keyword (Author)
Vehicle routingTraveling salesman problemDronesReinforcement learningNeural networks
Keyword
NEIGHBORHOOD SEARCHOPTIMIZATIONLOGISTICSTRUCK

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.