AI Blog

by Michele Laurelli

Learning Rate

Concept

Definition

Hyperparameter controlling how much to adjust model weights during training.

Step size in gradient descent. Too high: unstable training. Too low: slow convergence. Typical values: 0.001, 0.01, 0.1.

Learning rate 0.001

LR too high causes divergence

Adaptive learning rates

An optimization algorithm that iteratively adjusts parameters to minimize a loss function by following the gradient.

A configuration variable that is set before training and controls the learning process.

Michele Laurelli - AI Research & Engineering