SEGAC: Sample Efficient Generalized Actor Critic for the Stochastic On-Time Arrival Problem

Hongliang Guo, Zhi He,Wenda Sheng,Zhiguang Cao,Yingjie Zhou,Weinan Gao

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS（2024）

引用 0|浏览14

暂无评分

摘要

This paper studies the problem in transportation networks and introduces a novel reinforcement learning-based algorithm, namely. Different from almost all canonical sota solutions, which are usually computationally expensive and lack generalizability to unforeseen destination nodes, segac offers the following appealing characteristics. segac updates the ego vehicle's navigation policy in a sample efficient manner, reduces the variance of both value network and policy network during training, and is automatically adaptive to new destinations. Furthermore, the pre-trained segac policy network enables its real-time decision-making ability within seconds, outperforming state-of-the-art sota algorithms in simulations across various transportation networks. We also successfully deploy segac to two real metropolitan transportation networks, namely Chengdu and Beijing, using real traffic data, with satisfying results.

查看译文

关键词

Navigation,Reliability,Transportation,Optimization,Gaussian distribution,Routing,Real-time systems,Generalized actor critic,stochastic on-time arrival (SOTA),sample efficiency,variance reduction

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要