搜索结果: 1-1 共查到“统计学其他学科 Bandits”相关记录1条 . 查询时间(0.093 秒)
We consider an adversarial online learning setting where a decision maker can choose an action in every stage of the game. In addition to observing the reward of the chosen action, the decision maker ...