The Cross-Entropy Method Optimizes for Quantiles.
ICML'13: Proceedings of the 30th International Conference on Machine Learning - Volume 28 (2013)
Abstract
Cross-entropy optimization (CE) has proven to be a powerful tool for search in control environments. In the basic scheme, a distribution over proposed solutions is repeatedly adapted by evaluating a sample of solutions and refocusing the distribution on a percentage of those with the highest scores. We show that, in the kind of noisy evaluation environments that are common in decision-making domains, this percentage-based refocusing does not optimize the expected utility of solutions, but instead a quantile metric. We provide a variant of CE (Proportional CE) that effectively optimizes the expected value. We show using variants of established noisy environments that Proportional CE can be used in place of CE and can improve solution quality.
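The basic scheme described above can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the 1-D Gaussian search distribution, the function names `cross_entropy_search` and `noisy_score`, the toy objective with its peak at 3, and all parameter values are assumptions chosen for the example. Each candidate is scored only once, so the top fraction is selected on noisy observations, which is the percentage-based refocusing step the abstract says targets a quantile rather than the expected value.

```python
import random
import statistics


def cross_entropy_search(score, mean=0.0, std=5.0, n_samples=100,
                         elite_frac=0.1, n_iters=20, seed=0):
    """Basic CE over a 1-D Gaussian (an illustrative choice, not from the paper)."""
    rng = random.Random(seed)
    n_elite = max(2, int(n_samples * elite_frac))
    for _ in range(n_iters):
        # Sample candidate solutions from the current distribution.
        candidates = [rng.gauss(mean, std) for _ in range(n_samples)]
        # Evaluate each candidate once; the observed score may be noisy.
        scored = sorted(((score(x, rng), x) for x in candidates), reverse=True)
        # Keep the fixed top fraction (the "elite" set) by observed score.
        elites = [x for _, x in scored[:n_elite]]
        # Refocus the distribution on the elites.
        mean = statistics.fmean(elites)
        std = statistics.stdev(elites) + 1e-6  # small floor keeps std positive
    return mean


def noisy_score(x, rng):
    # Hypothetical objective: peak at x = 3 plus Gaussian evaluation noise.
    return -(x - 3.0) ** 2 + rng.gauss(0.0, 0.5)


best = cross_entropy_search(noisy_score)
print(f"estimated optimum: {best:.3f}")
```

The Proportional CE variant is not sketched here because the abstract does not give its update rule.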
Keywords
quantiles, cross-entropy