Reinforcement Learning Basics

Pro

Hard

Agents learn by interacting with an environment. Rewards shape behavior through the Markov Decision Process framework — the basis of game-playing AI.

Learning Objectives

The optional multiple-choice concept check tracks your understanding. Browse the coding problems below, then sign in when you're ready to solve them.

Discounted Return

~12 min· Hard

Epsilon-Greedy Action Selection