Reinforcement Learning

Revision as of 08:45, 10 July 2019 by Vnhxz (talk | contribs)

Multi-Armed Bandit Examples


Image Ranking


Git Repos

  • basic (softmax, UCB, epsilon-greedy)
  • intermediate (more algorithms, contextual bandits)
  • MobileNet (ranking hotels by extending the MobileNet architecture)
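
For orientation, the three strategies named in the basic entry (softmax, UCB, epsilon-greedy) can be sketched in a few lines of Python. This is a minimal illustration and not the code from the linked repositories: the function names, the Bernoulli reward model, the UCB1 variant of UCB, and the `run` helper are all assumptions made for this example.

```python
import math
import random


def epsilon_greedy(counts, values, epsilon, rng):
    """Explore a uniformly random arm with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        return rng.randrange(len(values))
    return max(range(len(values)), key=lambda a: values[a])


def softmax_probs(values, temperature):
    """Boltzmann distribution over arms; a lower temperature is greedier."""
    top = max(values)  # subtract the max for numerical stability
    weights = [math.exp((v - top) / temperature) for v in values]
    total = sum(weights)
    return [w / total for w in weights]


def softmax_select(counts, values, temperature, rng):
    """Sample an arm in proportion to its softmax probability."""
    probs = softmax_probs(values, temperature)
    return rng.choices(range(len(values)), weights=probs, k=1)[0]


def ucb1(counts, values):
    """Play each arm once, then maximise mean + sqrt(2 ln t / n_a)."""
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    t = sum(counts)
    return max(
        range(len(values)),
        key=lambda a: values[a] + math.sqrt(2 * math.log(t) / counts[a]),
    )


def run(select, true_means, steps, seed=0):
    """Simulate a Bernoulli bandit (assumed reward model for this sketch).

    `select(counts, values, rng)` returns the arm to pull next.
    Returns the pull counts and the estimated mean reward per arm.
    """
    rng = random.Random(seed)
    counts = [0] * len(true_means)
    values = [0.0] * len(true_means)
    for _ in range(steps):
        arm = select(counts, values, rng)
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean
    return counts, values
```

A possible way to compare the strategies, using made-up arm probabilities:

```python
means = [0.2, 0.5, 0.8]  # hypothetical success probability per arm
print(run(lambda c, v, rng: epsilon_greedy(c, v, 0.1, rng), means, 2000))
print(run(lambda c, v, rng: softmax_select(c, v, 0.1, rng), means, 2000))
print(run(lambda c, v, rng: ucb1(c, v), means, 2000))
```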

Literature