by Michele Laurelli
An adaptive-learning-rate optimization algorithm that combines momentum and RMSprop.
Adam (Adaptive Moment Estimation) computes a per-parameter learning rate from exponentially decaying estimates of the first moment (the mean) and second moment (the uncentered variance) of the gradients, with bias correction applied to both. Its robustness across architectures and hyperparameter choices has made it the default optimizer in many deep learning applications.
Typical applications:
- Training transformers
- Deep neural networks
- Default optimizer in many frameworks
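The update rule described above can be sketched in a few lines. This is a minimal, illustrative scalar version (the function and parameter names are ours, not from any specific framework), using the commonly cited defaults beta1 = 0.9, beta2 = 0.999, eps = 1e-8:

```python
import math

def adam_step(w, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for a single scalar parameter (illustrative sketch).

    m and v are the running first- and second-moment estimates; t is the
    1-based step count, used for bias correction.
    """
    m = beta1 * m + (1 - beta1) * grad           # first moment (momentum term)
    v = beta2 * v + (1 - beta2) * grad ** 2      # second moment (RMSprop term)
    m_hat = m / (1 - beta1 ** t)                 # bias-corrected first moment
    v_hat = v / (1 - beta2 ** t)                 # bias-corrected second moment
    w = w - lr * m_hat / (math.sqrt(v_hat) + eps)  # adaptive parameter update
    return w, m, v

# Usage: minimize f(w) = (w - 3)^2, whose gradient is 2 * (w - 3).
w, m, v = 0.0, 0.0, 0.0
for t in range(1, 5001):
    grad = 2 * (w - 3)
    w, m, v = adam_step(w, grad, m, v, t, lr=0.01)
print(w)  # converges toward the minimum at w = 3
```

Because the step size is scaled by the square root of the second moment, parameters with consistently large gradients take smaller effective steps, which is what makes the learning rate "adaptive" per parameter.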