Difference between revisions of "Reinforcement Learning"

From Wiki2
Jump to navigation Jump to search
Line 30: Line 30:
 
* [https://arxiv.org/pdf/1802.09756.pdf Real Time Bidding] (Distributed Coordinated Multi-agent reinforcement learning)  
 
* [https://arxiv.org/pdf/1802.09756.pdf Real Time Bidding] (Distributed Coordinated Multi-agent reinforcement learning)  
 
[https://chemoinformatician.co.uk/images/RTB_multi-agent.png RTB image]
 
[https://chemoinformatician.co.uk/images/RTB_multi-agent.png RTB image]
* [https://rise.cs.berkeley.edu/blog/scaling-multi-agent-rl-with-rllib/ Berkeley Multi-agent RL]
+
* [https://rise.cs.berkeley.edu/blog/scaling-multi-agent-rl-with-rllib/ Berkeley Multi-agent RL Scaling OpenSource]

Revision as of 15:02, 10 July 2019

Multi-Armed Bandit Examples


Image Ranking

Multi-Agent Learning

Extra

Git Repos

Literature

RTB image