Token Budget

Token Budget is the practical limit you set for prompt and completion tokens to balance quality, latency, and cost.

Related terms

Related terms

  • Reasoning Effort

    AI

    Reasoning Effort is a controllable depth setting for model thinking, balancing answer quality, latency, and cost.