API Reference
Query Cache
Generate a completion using a pre-populated cache.
Args: name: Name of the cache to query. query: Prompt text to send to the model. max_tokens: Optional maximum number of tokens to generate. temperature: Optional sampling temperature for the model. auth: Authentication context for permission checks.
Returns:
A :class:CompletionResponse
object containing the model output.
POST
Headers
Path Parameters
Response
200
application/json
Successful Response
Response from completion generation