Intro

The approach we explore, called reinforcement learning, is much more focused on goal-directed learning from interaction than are other approaches to machine learning.

Reinforcement learning is learning what to do—how to map situations to actions—so as to maximize a numerical reward signal.

RL v/s Supervised

Reinforcement learning differs fundamentally from supervised learning. While supervised learning dominates current machine learning research, it relies on:

RL v/s Unsupervised

Reinforcement learning differs from unsupervised learning in several key ways:

Tradeoff

One of the key challenges unique to reinforcement learning is the exploration-exploitation trade-off:

This creates a fundamental dilemma: exclusive focus on either strategy leads to failure. The agent must balance both by:

Despite decades of mathematical research, this exploration-exploitation dilemma remains an open challenge in the field.

Agent type approach