Reinforcement Learning-Based Dynamic Order Recommendation for On-Demand Food Delivery

Xing Wang, Ling Wang, Chenxin Dong, Hao Ren, Ke Xing

TSINGHUA SCIENCE AND TECHNOLOGY (2024)

Abstract

On-demand food delivery (OFD) is becoming increasingly popular. As a core order assignment mode in OFD, order recommendation directly influences the platform's delivery efficiency and the riders' delivery experience. This paper addresses the dynamic nature of the order recommendation problem and proposes a reinforcement learning solution. An actor-critic network based on long short-term memory (LSTM) units is designed to handle order-grabbing conflicts between riders. In addition, three rider sequencing rules are proposed to match the time steps of the LSTM unit to different riders. To evaluate the proposed method, extensive experiments are conducted on real data from the Meituan delivery platform. The results demonstrate that the proposed reinforcement learning-based order recommendation method can significantly increase the number of grabbed orders and reduce the number of order-grabbing conflicts, improving delivery efficiency for the platform and the delivery experience for riders.

Keywords

on-demand food delivery, order recommendation, reinforcement learning, actor-critic network, long short term memory
