AI Blog

by Michele Laurelli

Nucleus Sampling (Top-P)

Technique

Definition

Text generation sampling from smallest set of tokens whose cumulative probability exceeds threshold P.

Dynamically adjusts vocabulary size based on probability distribution. More flexible than top-k. Common: P=0.9-0.95.

P=0.9 for diverse generation

Dynamic vocabulary selection

Better than top-k

Text generation technique sampling from only the K most likely next tokens.

A neural network with billions of parameters trained on massive text datasets to understand and generate human language.

Michele Laurelli - AI Research & Engineering