AI Blog

AI Blog

by Michele Laurelli

BLEU Score

Metric
Definition

Metric for machine translation quality comparing n-gram overlap between generated and reference translations.

Measures precision of n-grams (1-4) with brevity penalty. Score ranges 0-1 or 0-100. Higher is better.

Examples

1

Machine translation evaluation

2

Text generation quality

3

MT model comparison

Michele Laurelli - AI Research & Engineering