Difference between revisions of "Reinforcement Learning"

From Wiki2
Line 14:
 
* [https://github.com/david-cortes/contextualbandits intermediate] (more algorithms, contextual bandits)
 
* [https://github.com/idealo/image-quality-assessment/blob/master/data/TID2013/get_labels.py MobileNet] (Rank Hotels, extending MobileNet Architecture)
+ * [https://github.com/google/dopamine Google Dopamine] (Dopamine is a research framework for fast prototyping of reinforcement learning algorithms).
+ * [https://github.com/deepmind/trfl/blob/master/docs/index.md TRFL Reinforcement Learning]
+ * [https://github.com/facebookresearch/ELF Facebook ELF Research RL]
+ * [https://github.com/tensorflow/agents TF-Agents] (TF-Agents is a library for Reinforcement Learning in TensorFlow)
+ * [https://github.com/kengz/SLM-Lab SLM-Lab] (Modular Deep Reinforcement Learning framework in PyTorch)
  
 
===Literature===

Revision as of 09:15, 10 July 2019

Multi-Armed Bandit Examples


Image Ranking


Git Repos

Literature