by Michele Laurelli
The number of training examples processed in a single iteration (one forward and backward pass) of model training.
Batch size affects training speed, memory usage, and model convergence. Small batches produce noisier gradient estimates; large batches give more stable gradients but are more memory-intensive. Common values: 32, 64, 128, and 256.
Batch size 32 for limited GPU memory
Batch size 256 for faster training
Mini-batch gradient descent
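The trade-off above can be sketched in code. The following is a minimal, dependency-free illustration of mini-batch gradient descent on a toy linear-regression task; the function name `minibatch_sgd` and all hyperparameter values are illustrative choices, not part of any particular library:

```python
import random

def minibatch_sgd(xs, ys, batch_size=32, lr=0.1, epochs=500):
    """Fit y = w*x + b by mini-batch gradient descent.

    Each iteration uses `batch_size` examples to estimate the gradient:
    smaller batches mean noisier updates, larger batches mean smoother
    updates but more memory per step.
    """
    w, b = 0.0, 0.0
    data = list(zip(xs, ys))
    for _ in range(epochs):
        random.shuffle(data)  # reshuffle examples each epoch
        for i in range(0, len(data), batch_size):
            batch = data[i:i + batch_size]
            # Average the squared-error gradient over the mini-batch.
            gw = sum(2 * (w * x + b - y) * x for x, y in batch) / len(batch)
            gb = sum(2 * (w * x + b - y) for x, y in batch) / len(batch)
            w -= lr * gw
            b -= lr * gb
    return w, b

# Toy data generated from y = 3x + 1; the fit should recover w≈3, b≈1.
random.seed(0)
xs = [i / 100 for i in range(100)]
ys = [3 * x + 1 for x in xs]
w, b = minibatch_sgd(xs, ys, batch_size=32)
```

Changing `batch_size` here changes how many examples contribute to each update: with 32 you get several noisy updates per epoch, with 100 (the full dataset) a single smooth one.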