Normalize activations within a mini-batch to keep training stable. BatchNorm lets you use higher learning rates and reduces sensitivity to initialization.
Sign in for the concept check
The optional multiple-choice concept check tracks your understanding. Browse the coding problems below, then sign in when you're ready to solve them.
Batch Normalization Forward
~15 min· Medium
Layer Normalization
~12 min· Easy