Transcriptions
Turn audio into text or text into audio.
Create transcription
ModelsExpand Collapse
class Transcription: …Represents a transcription response returned by model, based on the provided input.
Represents a transcription response returned by model, based on the provided input.
logprobs: Optional[List[Logprob]]The log probabilities of the tokens in the transcription. Only returned with the models gpt-4o-transcribe and gpt-4o-mini-transcribe if logprobs is added to the include array.
The log probabilities of the tokens in the transcription. Only returned with the models gpt-4o-transcribe and gpt-4o-mini-transcribe if logprobs is added to the include array.
class TranscriptionDiarized: …Represents a diarized transcription response returned by the model, including the combined transcript and speaker-segment annotations.
Represents a diarized transcription response returned by the model, including the combined transcript and speaker-segment annotations.
Segments of the transcript annotated with timestamps and speaker labels.
Segments of the transcript annotated with timestamps and speaker labels.
class TranscriptionSegment: …
Average logprob of the segment. If the value is lower than -1, consider the logprobs failed.
Compression ratio of the segment. If the value is greater than 2.4, consider the compression failed.
Emitted when a diarized transcription returns a completed segment with speaker information. Only emitted when you create a transcription with stream set to true and response_format set to diarized_json.
Emitted when a diarized transcription returns a completed segment with speaker information. Only emitted when you create a transcription with stream set to true and response_format set to diarized_json.
class TranscriptionTextSegmentEvent: …Emitted when a diarized transcription returns a completed segment with speaker information. Only emitted when you create a transcription with stream set to true and response_format set to diarized_json.
Emitted when a diarized transcription returns a completed segment with speaker information. Only emitted when you create a transcription with stream set to true and response_format set to diarized_json.
class TranscriptionTextDeltaEvent: …Emitted when there is an additional text delta. This is also the first event emitted when the transcription starts. Only emitted when you create a transcription with the Stream parameter set to true.
Emitted when there is an additional text delta. This is also the first event emitted when the transcription starts. Only emitted when you create a transcription with the Stream parameter set to true.
logprobs: Optional[List[Logprob]]The log probabilities of the delta. Only included if you create a transcription with the include[] parameter set to logprobs.
The log probabilities of the delta. Only included if you create a transcription with the include[] parameter set to logprobs.
class TranscriptionTextDoneEvent: …Emitted when the transcription is complete. Contains the complete transcription text. Only emitted when you create a transcription with the Stream parameter set to true.
Emitted when the transcription is complete. Contains the complete transcription text. Only emitted when you create a transcription with the Stream parameter set to true.
logprobs: Optional[List[Logprob]]The log probabilities of the individual tokens in the transcription. Only included if you create a transcription with the include[] parameter set to logprobs.
The log probabilities of the individual tokens in the transcription. Only included if you create a transcription with the include[] parameter set to logprobs.
class TranscriptionTextDeltaEvent: …Emitted when there is an additional text delta. This is also the first event emitted when the transcription starts. Only emitted when you create a transcription with the Stream parameter set to true.
Emitted when there is an additional text delta. This is also the first event emitted when the transcription starts. Only emitted when you create a transcription with the Stream parameter set to true.
logprobs: Optional[List[Logprob]]The log probabilities of the delta. Only included if you create a transcription with the include[] parameter set to logprobs.
The log probabilities of the delta. Only included if you create a transcription with the include[] parameter set to logprobs.
class TranscriptionTextDoneEvent: …Emitted when the transcription is complete. Contains the complete transcription text. Only emitted when you create a transcription with the Stream parameter set to true.
Emitted when the transcription is complete. Contains the complete transcription text. Only emitted when you create a transcription with the Stream parameter set to true.
logprobs: Optional[List[Logprob]]The log probabilities of the individual tokens in the transcription. Only included if you create a transcription with the include[] parameter set to logprobs.
The log probabilities of the individual tokens in the transcription. Only included if you create a transcription with the include[] parameter set to logprobs.
class TranscriptionTextSegmentEvent: …Emitted when a diarized transcription returns a completed segment with speaker information. Only emitted when you create a transcription with stream set to true and response_format set to diarized_json.
Emitted when a diarized transcription returns a completed segment with speaker information. Only emitted when you create a transcription with stream set to true and response_format set to diarized_json.
class TranscriptionVerbose: …Represents a verbose json transcription response returned by model, based on the provided input.
Represents a verbose json transcription response returned by model, based on the provided input.
Segments of the transcribed text and their corresponding details.
Segments of the transcribed text and their corresponding details.
Average logprob of the segment. If the value is lower than -1, consider the logprobs failed.
Compression ratio of the segment. If the value is greater than 2.4, consider the compression failed.