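  • Basics
    • Basic Concepts
    • Markov Decision Processes
    • Bellman Equations
    • Model-Based Dynamic Programming
  • Value-Function-Based Methods
    • Monte Carlo
    • Temporal Difference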

# Reinforcement Learning

# Courses

  • UCL: Reinforcement Learning (David Silver)
  • UC Berkeley CS285: Deep Reinforcement Learning
  • UWaterloo CS885: Reinforcement Learning
  • Deep Reinforcement Learning (Shusen Wang)

# Books

  • Reinforcement Learning: An Introduction (2nd Edition) (Richard S. Sutton and Andrew G. Barto)
  • Deep Reinforcement Learning (深度强化学习, in Chinese) (Shusen Wang)

# Blogs / Talks / Tutorials

  • A (Long) Peek into Reinforcement Learning (Lilian Weng's Blog)
  • Policy Gradient Algorithms [Chinese translation] (Lilian Weng's Blog)
  • dennybritz/reinforcement-learning (Denny Britz's Repo)

# Code / Library

  • openai/spinningup
  • stable-baselines3
  • ray