Neural Network Backpropagation

Forward pass · chain rule gradient flow · weight updates · layer activations

Epoch: 0  |  Loss (MSE):  |  Phase: idle
Gradient magnitudes shown as edge thickness. Red = large gradient, blue = small. Chain rule: δL/δw = δL/δa · δa/δz · δz/δw.