Reinforcement Learning

An paradigm of ML: How intelligent agent should take actions to accumulate maximum reward

Given an action by the agent,

UnityML

Agent:

Brain:

Academy:

Proximal Policy Optimization:

Model Locomotion