Completions

Given a prompt, the model will return one or more predicted completions, and can also return the probabilities of alternative tokens at each position.

Create completion

completions.create() -> Completion

POST/completions

ModelsExpand Collapse

class Completion: …

Represents a completion response from the API. Note: both the streamed and non-streamed response objects share the same shape (unlike the chat endpoint).

id: str

A unique identifier for the completion.

choices: List[CompletionChoice]

The list of completion choices the model generated for the input prompt.

finish_reason: Literal["stop", "length", "content_filter"]

The reason the model stopped generating tokens. This will be stop if the model hit a natural stop point or a provided stop sequence, length if the maximum number of tokens specified in the request was reached, or content_filter if content was omitted due to a flag from our content filters.

One of the following:

"stop"

"length"

"content_filter"

index: int

logprobs: Optional[Logprobs]

text_offset: Optional[List[int]]

token_logprobs: Optional[List[float]]

tokens: Optional[List[str]]

top_logprobs: Optional[List[Dict[str, float]]]

text: str

created: int

The Unix timestamp (in seconds) of when the completion was created.

formatunixtime

model: str

The model used for completion.

object: Literal["text_completion"]

The object type, which is always “text_completion”

system_fingerprint: Optional[str]

This fingerprint represents the backend configuration that the model runs with.

Can be used in conjunction with the seed request parameter to understand when backend changes have been made that might impact determinism.

usage: Optional[CompletionUsage]

Usage statistics for the completion request.

class CompletionChoice: …

finish_reason: Literal["stop", "length", "content_filter"]

One of the following:

"stop"

"length"

"content_filter"

index: int

logprobs: Optional[Logprobs]

text_offset: Optional[List[int]]

token_logprobs: Optional[List[float]]

tokens: Optional[List[str]]

top_logprobs: Optional[List[Dict[str, float]]]

text: str

class CompletionUsage: …

Usage statistics for the completion request.

completion_tokens: int

Number of tokens in the generated completion.

prompt_tokens: int

Number of tokens in the prompt.

total_tokens: int

Total number of tokens used in the request (prompt + completion).

completion_tokens_details: Optional[CompletionTokensDetails]

Breakdown of tokens used in a completion.

accepted_prediction_tokens: Optional[int]

When using Predicted Outputs, the number of tokens in the prediction that appeared in the completion.

audio_tokens: Optional[int]

Audio input tokens generated by the model.

reasoning_tokens: Optional[int]

Tokens generated by the model for reasoning.

rejected_prediction_tokens: Optional[int]

When using Predicted Outputs, the number of tokens in the prediction that did not appear in the completion. However, like reasoning tokens, these tokens are still counted in the total completion tokens for purposes of billing, output, and context window limits.

prompt_tokens_details: Optional[PromptTokensDetails]

Breakdown of tokens used in the prompt.

audio_tokens: Optional[int]

Audio input tokens present in the prompt.

cached_tokens: Optional[int]

Cached tokens present in the prompt.

Suggested

Completions

Create completion

ModelsExpand Collapse