Reinforcement Learning

From Wiki2
Revision as of 15:19, 10 July 2019 by Vnhxz (talk | contribs)
Jump to navigation Jump to search

Multi-Armed Bandit Examples


Image Ranking

Multi-Agent Learning

  • Stochastic games, Nash-Q, Gradient Ascent, WOLF, and Mean-field Q learning

Extra

Git Repos

Literature

RTB image