Difference between revisions of "Reinforcement Learning"

From Wiki2
Jump to navigation Jump to search
Line 24: Line 24:
 
* [https://arxiv.org/pdf/1706.06978.pdf Zhou et al. 2018] (Alibaba Group, Deep Interest Network, Click Through Rate Prediction)
 
* [https://arxiv.org/pdf/1706.06978.pdf Zhou et al. 2018] (Alibaba Group, Deep Interest Network, Click Through Rate Prediction)
 
* [https://medium.com/@vermashresth/a-primer-on-deep-reinforcement-learning-frameworks-part-1-6c9ab6a0f555 RL Frameworks]
 
* [https://medium.com/@vermashresth/a-primer-on-deep-reinforcement-learning-frameworks-part-1-6c9ab6a0f555 RL Frameworks]
* [https://arxiv.org/pdf/1802.09756.pdf Real Time Bidding]
+
* [https://arxiv.org/pdf/1802.09756.pdf Real Time Bidding] (Multi-agent reinforcement learning)

Revision as of 14:17, 10 July 2019

Multi-Armed Bandit Examples


Image Ranking


Git Repos

Literature