by Michele Laurelli
A hyperparameter that controls how much a model's weights are updated during training.
The learning rate determines the step size taken in gradient descent. If it is too high, training becomes unstable and may diverge; if it is too low, training converges slowly. Common strategies include a fixed rate, decay schedules, cyclical schedules, and adaptive methods such as Adam.
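To make the update rule concrete, here is a minimal sketch of a single gradient descent step in Python; the weight and gradient values are illustrative, not from the source:

```python
# One gradient descent step: the learning rate scales the gradient,
# so it directly sets the size of the update applied to the weights.
learning_rate = 0.01  # illustrative value

weights = [0.5, -1.2, 3.0]  # current model weights (toy example)
grads = [0.1, -0.4, 0.25]   # gradients of the loss w.r.t. each weight

weights = [w - learning_rate * g for w, g in zip(weights, grads)]
```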
Example: a learning rate of 0.001 with the Adam optimizer.
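As a sketch of that configuration in PyTorch (the framework and the linear model are assumptions for illustration; 0.001 is also Adam's default learning rate in PyTorch):

```python
import torch
import torch.nn as nn

model = nn.Linear(10, 1)  # hypothetical model, for illustration only

# Adam with learning rate 0.001 (PyTorch's default for this optimizer).
optimizer = torch.optim.Adam(model.parameters(), lr=0.001)
```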
Example: a learning rate decay schedule that reduces the step size as training progresses.
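A minimal sketch of one such schedule in PyTorch, assuming exponential decay; the model, base rate, and decay factor are illustrative:

```python
import torch
import torch.nn as nn

model = nn.Linear(10, 1)  # hypothetical model
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

# Multiply the learning rate by 0.9 at the end of every epoch.
scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.9)

for epoch in range(10):
    # ... training steps for the epoch would go here ...
    scheduler.step()  # lr after epoch k is 0.1 * 0.9**(k + 1)
```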
Example: a warm-up phase followed by decay, increasing the learning rate at the start of training before gradually lowering it.
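One way to sketch warm-up then decay in PyTorch is to chain a linear warm-up with cosine decay via SequentialLR; the epoch counts and base rate here are illustrative assumptions:

```python
import torch
import torch.nn as nn

model = nn.Linear(10, 1)  # hypothetical model
optimizer = torch.optim.Adam(model.parameters(), lr=3e-4)

# Linearly ramp the lr from 10% to 100% of 3e-4 over the first 5 epochs...
warmup = torch.optim.lr_scheduler.LinearLR(
    optimizer, start_factor=0.1, total_iters=5)
# ...then decay it toward zero along a cosine curve over the remaining 45.
decay = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=45)

scheduler = torch.optim.lr_scheduler.SequentialLR(
    optimizer, schedulers=[warmup, decay], milestones=[5])

for epoch in range(50):
    # ... training steps for the epoch would go here ...
    scheduler.step()
```

Warm-up is commonly used with adaptive optimizers such as Adam, since very early updates can be noisy before gradient statistics stabilize.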