The chain rule made computational. Learn how gradients flow backward through a network to train every parameter simultaneously.
Sign in for the concept check
The optional multiple-choice concept check tracks your understanding. Browse the coding problems below, then sign in when you're ready to solve them.
MSE Loss Gradient
~15 min· Hard
Sigmoid Gradient (Backprop)
ReLU Backward Pass
~10 min· Easy
Linear Layer Weight Gradient
~15 min· Medium