CTD: Cascaded Temporal Difference Learning for the Mean-Standard Deviation Shortest Path Problem

Page view(s)
16
Checked on Sep 20, 2024
CTD: Cascaded Temporal Difference Learning for the Mean-Standard Deviation Shortest Path Problem
Title:
CTD: Cascaded Temporal Difference Learning for the Mean-Standard Deviation Shortest Path Problem
Journal Title:
IEEE Transactions on Intelligent Transportation Systems
Publication Date:
27 July 2021
Citation:
Guo, H., Hou, X., & Peng, Q. (2022). CTD: Cascaded Temporal Difference Learning for the Mean-Standard Deviation Shortest Path Problem. IEEE Transactions on Intelligent Transportation Systems, 23(8), 10868–10886. https://doi.org/10.1109/tits.2021.3096829
Abstract:
This paper investigates the reliable shortest path (RSP) planning problem from the reinforcement learning perspective. Different from canonical path planning methods, which require at least the first- order statistic (mean) and second-order statistic (variance) information of travel time distribution, we target at the RSP planning problem without the assumption of knowing any travel time distribution characteristic beforehand, and propose a cascaded temporal difference learning (CTD) method, which simultaneously estimates the mean and variance of the executing path and thereby gradually makes improvements through the generalized policy iteration (GPI) scheme, as the ego vehicle interacts with the environment. Extensive simulation results demonstrate the applicability of the proposed method for RSP learning in various transportation networks.
License type:
Publisher Copyright
Funding Info:
There was no specific funding for the research done
Description:
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
ISSN:
1524-9050
1558-0016
Files uploaded:

File Size Format Action
3ctd-t-its-published.pdf 8.37 MB PDF Open