Difference between revisions of "Reinforcement Learning"

From Wiki2
Line 3:
  ==Multi-Armed Bandit Examples==
  * [https://www.analyticsvidhya.com/blog/2018/09/reinforcement-multi-armed-bandit-scratch-python/ Click Through Rate: Random, UCB]
- * [https://www.spotx.tv/resources/blog/developer-blog/introduction-to-multi-armed-bandits-with-applications-in-digital-advertising/ Digital Advertising]
+ * [https://www.spotx.tv/resources/blog/developer-blog/introduction-to-multi-armed-bandits-with-applications-in-digital-advertising/ Digital Advertising] (Epsilon-greedy and Thompson sampling)

Revision as of 16:52, 9 July 2019

Multi-Armed Bandit Examples