by Michele Laurelli
A scalar value that indicates how much focus a model places on a specific part of the input when producing an output.
Attention weights are computed by taking dot products of query and key vectors and normalizing the resulting scores with softmax, so the weights for each query sum to one. A higher weight means the corresponding input element contributes more to the output, which lets models focus on relevant information dynamically.
Typical uses include:
- Machine translation: attention over source-language words when generating each target word
- Image captioning: attention over image regions when generating each caption word
- Document summarization: attention over the passages most relevant to the summary
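The computation described above can be sketched in a few lines. This is a minimal illustration (not a production implementation) of attention weights for a single query over a set of keys, using NumPy; the function name, toy dimensions, and random data are assumptions for the example.

```python
import numpy as np

def attention_weights(query, keys):
    """Attention weights for one query over a set of keys.

    Dot products between the query and each key are scaled by sqrt(d_k)
    and normalized with softmax, so the weights are nonnegative and sum to 1.
    """
    d_k = keys.shape[-1]
    scores = keys @ query / np.sqrt(d_k)  # one scalar score per key
    scores = scores - scores.max()        # subtract max for numerical stability
    exp_scores = np.exp(scores)
    return exp_scores / exp_scores.sum()

# Toy example: one 4-dimensional query attending over three keys.
rng = np.random.default_rng(0)
q = rng.normal(size=4)
K = rng.normal(size=(3, 4))
w = attention_weights(q, K)
print(w)  # three scalars summing to 1
```

Each entry of `w` is one attention weight: the key most similar to the query (by dot product) receives the largest weight, and the softmax keeps all weights positive and normalized.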