Fanglue Subject Navigation

Search results: 1-2 of 2 records found for "Management Contextual Bandits". Query time: 0.109 seconds.

Thompson Sampling for Contextual Bandits with Linear Payoffs
Keywords: Thompson Sampling; Contextual Bandits; Linear Payoffs
Date: 2012/11/23

Thompson Sampling is one of the oldest heuristics for multi-armed bandit problems. It is a randomized algorithm based on Bayesian ideas, and has recently generated significant interest after several s...

Efficient Optimal Learning for Contextual Bandits
Keywords: Efficient Optimal Learning; Contextual Bandits
Date: 2011/7/6

We address the problem of learning in an online setting where the learner repeatedly observes features, selects among a set of actions, and receives reward for the action taken.
