Generates a response based on a list of messages sent by the user to the model. The list of messages may include system messages, user messages, and assistant messages.
This endpoint allows other tools in the developer ecosystem to leverage poolside as a drop-in replacement for other models.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
An array of messages representing the conversation.
1The model to use for the completion.
Whether to include the stop strings in output text. Defaults to false.
Whether to return log probabilities of the output tokens or not.
The maximum number of tokens to generate in the completion.
The maximum number of tokens to generate in the completion.
Sets a minimum probability threshold relative to the most likely token.
0 <= x <= 1Whether to return the generated tokens as token IDs instead of text. Defaults to false.
If specified, our system will make a best effort to sample deterministically, such that repeated requests with the same seed and parameters should return the same result. Determinism is not guaranteed.
Whether to skip special tokens in the output.
An array of sequences where the API will stop generating further tokens.
If true, the response will be streamed as a series of events instead of a single JSON object.
Options for streaming response. Only set this when you set stream: true.
What sampling temperature to use, between 0 and 2.
0 <= x <= 2A list of tools the model may call. Currently, only functions are supported as a tool. Use this to provide a list of functions the model may generate JSON inputs for.
Limits the model to consider only the top K most likely tokens at each step.
An alternative to sampling with temperature, called nucleus sampling.
x <= 1Successful response - JSON when stream=false, SSE when stream=true
A list of chat completions from the model.
1The Unix timestamp (in seconds) when the completion was created.
The model used for the chat completion.
The object type 'chat.completion' or ''chat.completion.chunk'
A URL to the JSON Schema for this object.
"https://example.com/openai/schemas/ChatCompletion.json"
Usage statistics for the chat completion.