Reinforcement learning in BitTorrent systems

Rafit Izhak-Ratzin,Hyunggon Park,Mihaela van der Schaar

Shanghai（2011）

引用 19|浏览16

暂无评分

摘要

In this paper, we propose a BitTorrent-like protocol that replaces the peer selection mechanisms in the regular BitTorrent protocol with a novel reinforcement learning based mechanism. The inherent operation of P2P systems, which involves repeated interactions among peers over a long time period, allows peers to efficiently identify free-riders as well as desirable collaborators by learning the behavior of their associated peers. Thus, it can help peers improve their download rates and discourage free-riding (FR), while improving fairness. We model the peers' interactions in the BitTorrent-like network as a repeated interaction game, where we explicitly consider the strategic behavior of the peers. A peer that applies the reinforcement learning based mechanism uses a partial history of the observations on associated peers' statistical reciprocal behaviors to determine its best responses and estimate the corresponding impact on its expected utility. The policy determines the peer's resource reciprocations with other peers, which would maximize the peer's long-term performance.

查看译文

关键词

learning (artificial intelligence),peer-to-peer computing,protocols,P2P systems,bittorrent protocol,peer selection mechanisms,reinforcement learning,strategic behavior,BitTorrent,P2P,reinforcement learning

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要