Difference between revisions of "Reinforcement Learning"

From Wiki2
 
==Multi-Agent Learning==
 
* Stochastic games, Nash-Q, gradient ascent, WoLF (Win or Learn Fast), mean-field Q-learning, particle swarm intelligence, and Ant Colony Optimization (Colorni et al., 1991)
 
* [https://towardsdatascience.com/smart-incentives-and-game-theory-in-decentralized-multi-agent-reinforcement-learning-systems-58442e508378 Smart incentives and game theory in decentralized multi-agent RL systems]
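
As a rough illustration of the gradient ascent entry above, the following is a minimal sketch of two players doing projected gradient ascent on their expected payoffs in a 2x2 matrix game. It is not taken from any of the linked material: the function names (<code>iga_step</code>, <code>run_iga</code>), the step size, and the prisoner's dilemma payoff values are all assumptions chosen for illustration. Here <code>alpha</code> and <code>beta</code> are the probabilities that the row and column player pick action 0.

```python
# Sketch: gradient ascent for two players in a 2x2 matrix game.
# R[i][j] / C[i][j] are the row / column player's payoffs when the row
# player picks action i and the column player picks action j.
# All names and payoff values below are illustrative assumptions.

def iga_step(alpha, beta, R, C, lr=0.01):
    """One projected gradient step on each player's expected payoff."""
    u_r = R[0][0] + R[1][1] - R[0][1] - R[1][0]
    u_c = C[0][0] + C[1][1] - C[0][1] - C[1][0]
    # Partial derivatives of the expected payoffs w.r.t. alpha and beta.
    grad_alpha = beta * u_r - (R[1][1] - R[0][1])
    grad_beta = alpha * u_c - (C[1][1] - C[1][0])
    # Clip so the strategies remain valid probabilities.
    new_alpha = min(1.0, max(0.0, alpha + lr * grad_alpha))
    new_beta = min(1.0, max(0.0, beta + lr * grad_beta))
    return new_alpha, new_beta


def run_iga(R, C, alpha=0.5, beta=0.5, lr=0.01, steps=1000):
    """Iterate the update from a starting strategy pair."""
    for _ in range(steps):
        alpha, beta = iga_step(alpha, beta, R, C, lr)
    return alpha, beta


# Prisoner's dilemma (assumed payoffs): action 0 = cooperate, 1 = defect.
PD_R = [[3, 0], [5, 1]]
PD_C = [[3, 5], [0, 1]]

if __name__ == "__main__":
    print(run_iga(PD_R, PD_C))
```

In this game defecting is dominant for both players, so both gradients are negative everywhere and the probabilities of cooperating shrink until they are clipped at zero. Games without a pure equilibrium (for example matching pennies) can cycle under the same update, which is the behaviour that variable-learning-rate methods such as WoLF are meant to address.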
 
==Extra==
 

Revision as of 15:29, 10 July 2019

Multi-Armed Bandit Examples


Image Ranking

Multi-Agent Learning

Extra

Git Repos

Literature

RTB image