Difference between revisions of "Reinforcement Learning"

From Wiki2
Line 10:
  ==Multi-Agent Learning==
+ * Stochastic games, Nash-Q, Gradient Ascent, WoLF, and Mean-field Q-learning
  ==Extra==

Revision as of 15:19, 10 July 2019

Multi-Armed Bandit Examples


Image Ranking

Multi-Agent Learning

  • Stochastic games, Nash-Q, Gradient Ascent, WoLF, and Mean-field Q-learning

Extra

Git Repos

Literature

RTB image