by Michele Laurelli
Metric for machine translation quality comparing n-gram overlap between generated and reference translations.
Measures precision of n-grams (1-4) with brevity penalty. Score ranges 0-1 or 0-100. Higher is better.
Machine translation evaluation
Text generation quality
MT model comparison