Hands-On Intelligent Agents with OpenAI Gym

Exploring the Learning Algorithm Landscape - DDPG (Actor-Critic), PPO (Policy-Gradient), Rainbow (Value-Based)