Difference between revisions of "Reinforcement Learning"

From Wiki2
Jump to navigation Jump to search
Line 7: Line 7:
  
 
==Image Ranking==
 
==Image Ranking==
* [https://medium.com/idealo-tech-blog/using-deep-learning-to-automatically-rank-millions-of-hotel-images-c7e2d2e5cae2 Hotel Image Ranking]
+
* [https://medium.com/idealo-tech-blog/using-deep-learning-to-automatically-rank-millions-of-hotel-images-c7e2d2e5cae2 Hotel Image Ranking] (asthetic & technical quality of images)
  
  

Revision as of 08:45, 10 July 2019

Multi-Armed Bandit Examples


Image Ranking


Git Repos

  • basic (softmax, UCB, epsilon-greedy)
  • intermediate (more algorithms, contextual bandits)
  • MobileNet (Rank Hotels, extending MobileNet Architecture)

Literature