Prompt Caching

Prompt Caching stores reusable prompt context so repeated requests can skip redundant processing and improve performance.

Related terms

Related terms

  • Caching

    Performance

    The process of storing frequently accessed data in temporary storage for faster retrieval on subsequent requests. Browser caching stores assets locally while server caching reduces database queries and processing time. Understanding caching helps troubleshoot why changes don't appear immediately—you may need to clear cache or wait for CDN propagation.

  • HTTP Headers

    Publishing

    Metadata sent with HTTP requests and responses that controls behavior such as caching, security policies, and content handling. Common examples include Cache-Control, Content-Security-Policy, and Strict-Transport-Security.

  • Token Budget

    AI

    Token Budget is the practical limit you set for prompt and completion tokens to balance quality, latency, and cost.

  • Generative UI

    AI