by Michele Laurelli
A problem where gradients become extremely large during training, causing unstable updates and divergence.
Exploding gradients arise during backpropagation when derivatives with magnitude greater than 1 are multiplied together repeatedly across many layers or time steps, so the gradient grows exponentially with depth. The resulting oversized weight updates destabilize training. Common remedies include gradient clipping, careful weight initialization, and batch normalization (a minimal clipping sketch appears after the symptom list below).
Typical signs: divergence when training RNNs, NaN values appearing in the weights, and a wildly oscillating loss.
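Gradient clipping is the most direct of the remedies above. Below is a minimal sketch in PyTorch, assuming a toy RNN and random data purely for illustration; the model, shapes, and hyperparameters are hypothetical placeholders. The key call, `torch.nn.utils.clip_grad_norm_`, rescales the gradient in place after backpropagation so its global L2 norm cannot exceed `max_norm`.

```python
import torch
import torch.nn as nn

# Hypothetical toy setup, for illustration only.
model = nn.RNN(input_size=8, hidden_size=32, batch_first=True)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
criterion = nn.MSELoss()

inputs = torch.randn(16, 50, 8)    # batch of 16 sequences, 50 time steps each
targets = torch.randn(16, 50, 32)  # dummy targets matching the RNN output shape

for step in range(100):
    optimizer.zero_grad()
    outputs, _ = model(inputs)
    loss = criterion(outputs, targets)
    loss.backward()
    # Rescale the gradient so its global L2 norm is at most 1.0,
    # preventing one huge gradient from producing a destabilizing update.
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
    optimizer.step()
```

Clipping by norm, as here, preserves the gradient's direction and only shrinks its magnitude; clipping each component to a fixed range is a cruder alternative that can change the update direction.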