AI Blog

AI Blog

by Michele Laurelli

Softmax Function

/ˈsɒftmæks/
Concept
Definition

An activation function that converts a vector of values into a probability distribution summing to 1.

Softmax is used in multi-class classification output layers. It exponentiates each value and normalizes by the sum, producing interpretable probabilities for each class.

Examples

1

Multi-class classification

2

Language model next-token prediction

3

Image classification output

Michele Laurelli - AI Research & Engineering