AI Blog

by Michele Laurelli

Learning Rate

Concept
Definition

A hyperparameter that controls how much the model's weights are adjusted at each update during training.

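The learning rate is the step size in gradient descent: each update moves the weights against the gradient, scaled by this value. Too high, and training becomes unstable or diverges. Too low, and convergence is slow. Commonly tried values are 0.001, 0.01, and 0.1, though the best choice depends on the optimizer, the model, and the data.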
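To make the step-size effect concrete, here is a minimal sketch in plain Python. It assumes a toy objective, f(w) = w**2, chosen purely for illustration. The function name gradient_descent and the step count are likewise assumptions, not part of any library.

```python
def gradient_descent(lr, steps=50, w0=1.0):
    """Minimize the toy objective f(w) = w**2 and return the final w."""
    w = w0
    for _ in range(steps):
        grad = 2.0 * w       # derivative of w**2
        w = w - lr * grad    # the learning rate scales each step
    return w

# Three illustrative settings: too low, reasonable, too high.
for lr in (0.001, 0.1, 1.1):
    print(f"lr={lr}: final w = {gradient_descent(lr):.4g}")
```

On this objective each update multiplies w by (1 - 2 * lr). With lr = 0.001 the factor is 0.998, so w barely moves in 50 steps. With lr = 0.1 the factor is 0.8, so w shrinks quickly toward the minimum at 0. With lr = 1.1 the factor is -1.2, so w flips sign and grows at every step, which is divergence.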

Examples

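1. A learning rate of 0.001, a common default for the Adam optimizer.

2. A learning rate that is too high causes the loss to diverge.

3. Adaptive learning rates, where optimizers such as Adagrad, RMSProp, and Adam scale the step for each parameter.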
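As a rough illustration of the adaptive idea, the sketch below implements an Adagrad-style update in plain Python. It is a simplified version under stated assumptions, not a reference implementation: the function name adagrad, the toy objective f(w) = 50*w0**2 + 0.5*w1**2, and the chosen lr and steps are all hypothetical.

```python
import math

def adagrad(grad_fn, w, lr=0.5, steps=200, eps=1e-8):
    """Adagrad-style update: divide each parameter's step by the
    square root of its accumulated squared gradients."""
    w = list(w)
    accum = [0.0] * len(w)
    for _ in range(steps):
        g = grad_fn(w)
        for i in range(len(w)):
            accum[i] += g[i] ** 2
            w[i] -= lr * g[i] / (math.sqrt(accum[i]) + eps)
    return w

# Hypothetical objective with very different curvature per parameter:
# f(w) = 50*w0**2 + 0.5*w1**2
def grad(w):
    return [100.0 * w[0], 1.0 * w[1]]

print(adagrad(grad, [1.0, 1.0]))
```

Plain gradient descent with a single rate of 0.5 would diverge on the steep first parameter here, since each step would multiply it by 1 - 0.5 * 100 = -49. A rate small enough to be stable for that parameter would make progress on the flat second one slow. Scaling by the accumulated gradient magnitude gives each parameter its own effective step size, so both move toward 0.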

Michele Laurelli - AI Research & Engineering