Get a completion along with the probabilities of alternative tokens at each position. This endpoint is not recommended for new use cases and is maintained for legacy compatibility only.
Use /chat/completions instead for the best and newest models.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
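As a minimal sketch, the Bearer header can be built like this (the helper name and the example token are illustrative, not part of the API):

```python
# Hypothetical helper: builds the Authorization header for this API.
def auth_headers(token: str) -> dict:
    """Return request headers carrying a Bearer auth token."""
    return {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }

headers = auth_headers("my-secret-token")
# headers["Authorization"] is "Bearer my-secret-token"
```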
The model to use for the completion.
The prompt to generate completions for.
Whether to include the stop strings in output text. Defaults to false.
The maximum number of tokens to generate in the completion.
Sets a minimum probability threshold relative to the most likely token.
0 <= x <= 1
Whether to return the generated tokens as token IDs instead of text. Defaults to false.
If specified, our system will make a best effort to sample deterministically, such that repeated requests with the same seed and parameters should return the same result. Determinism is not guaranteed.
Whether to skip special tokens in the output.
An array of sequences where the API will stop generating further tokens.
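The effect of stop sequences (and of the include-stop-strings option above) can be sketched with a toy truncation function; this is an illustration of the documented behavior, not the server's implementation:

```python
def apply_stop(text: str, stop: list, include_stop_str_in_output: bool = False) -> str:
    """Truncate text at the earliest stop sequence, optionally keeping it."""
    cut = len(text)
    matched = ""
    # Find the earliest occurrence of any stop sequence.
    for s in stop:
        idx = text.find(s)
        if idx != -1 and idx < cut:
            cut, matched = idx, s
    if matched and include_stop_str_in_output:
        cut += len(matched)  # keep the stop string in the output text
    return text[:cut]

print(apply_stop("Hello world\nEND extra", ["END", "\n"]))  # -> "Hello world"
```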
If true, the response will be streamed as a series of events instead of a single JSON object.
Options for the streaming response. Only set this when stream is true.
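When stream is true, the response body is a series of server-sent events. A minimal parser sketch (the `[DONE]` sentinel and the event payload shape are assumptions for illustration):

```python
import json

def parse_sse(raw: str):
    """Yield JSON payloads from a server-sent-events stream body."""
    for line in raw.splitlines():
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank keep-alive lines and comments
        data = line[len("data:"):].strip()
        if data == "[DONE]":  # common end-of-stream sentinel; an assumption here
            break
        yield json.loads(data)

# Illustrative stream body, not captured from the real API.
sample = (
    'data: {"choices": [{"text": "Hel"}]}\n\n'
    'data: {"choices": [{"text": "lo"}]}\n\n'
    "data: [DONE]\n"
)
text = "".join(event["choices"][0]["text"] for event in parse_sse(sample))
# text == "Hello"
```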
What sampling temperature to use, between 0 and 2.
0 <= x <= 2
Limits the model to consider only the top K most likely tokens at each step.
An alternative to sampling with temperature, called nucleus sampling.
x <= 1
Successful response: JSON when stream=false, SSE when stream=true.
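The sampling parameters above (temperature, top_k, top_p, and the minimum-probability threshold) all prune or reshape the token distribution. A toy sketch of how they might interact, under the stated semantics; the actual server-side implementation may differ:

```python
import math

def filter_distribution(logits, temperature=1.0, top_k=0, top_p=1.0, min_p=0.0):
    """Toy sketch: prune a token distribution with the sampling knobs above.

    Returns surviving (token_index, probability) pairs, renormalized.
    """
    # Temperature rescales logits before the softmax (lower -> sharper).
    scaled = [l / max(temperature, 1e-6) for l in logits]
    m = max(scaled)
    exps = [math.exp(l - m) for l in scaled]
    total = sum(exps)
    probs = sorted(
        ((i, e / total) for i, e in enumerate(exps)),
        key=lambda pair: pair[1],
        reverse=True,
    )
    if top_k > 0:
        probs = probs[:top_k]           # keep only the K most likely tokens
    if min_p > 0.0:
        floor = min_p * probs[0][1]     # threshold relative to the top token
        probs = [p for p in probs if p[1] >= floor]
    kept, cumulative = [], 0.0
    for pair in probs:                  # nucleus: smallest set with mass >= top_p
        kept.append(pair)
        cumulative += pair[1]
        if cumulative >= top_p:
            break
    z = sum(p for _, p in kept)
    return [(i, p / z) for i, p in kept]
```

For example, `top_k=2` keeps exactly the two most likely tokens, while a small `top_p` keeps only the head of the distribution.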
The list of completion choices.
The Unix timestamp (in seconds) of when the completion was created.
The model used for the completion.
The object type, which is always 'text_completion'.
A URL to the JSON Schema for this object.
"https://example.com/openai/schemas/Completion.json"
Usage statistics for the completion request.
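Putting the response fields above together, a hypothetical non-streaming response body might be parsed like this (the field values are illustrative, not captured from the real API):

```python
import json

# Illustrative response body shaped by the documented fields:
# $schema, object, created, model, choices, usage.
body = json.loads("""
{
  "$schema": "https://example.com/openai/schemas/Completion.json",
  "object": "text_completion",
  "created": 1700000000,
  "model": "example-model",
  "choices": [{"text": "Hello", "index": 0, "finish_reason": "stop"}],
  "usage": {"prompt_tokens": 3, "completion_tokens": 1, "total_tokens": 4}
}
""")
assert body["object"] == "text_completion"
completion_text = body["choices"][0]["text"]
```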