Transcriptions
Turn audio into text or text into audio.
Create transcription
Models
class Transcription:
Represents a transcription response returned by the model, based on the provided input.
Optional<List<Logprob>> logprobs
The log probabilities of the tokens in the transcription. Only returned with the models gpt-4o-transcribe and gpt-4o-mini-transcribe if logprobs is added to the include array.
class TranscriptionDiarized:
Represents a diarized transcription response returned by the model, including the combined transcript and speaker-segment annotations.
List<TranscriptionDiarizedSegment> segments
Segments of the transcript annotated with timestamps and speaker labels.
The type of task that was run. Always transcribe.
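As an illustration of how diarized segments might be consumed, the sketch below totals speaking time per speaker. The DiarizedSegment record and its field names (speaker, start, end, text) are hypothetical stand-ins chosen for this example, not the SDK's actual TranscriptionDiarizedSegment type; the reference above only states that segments carry timestamps and speaker labels.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class SpeakerTime {
    // Hypothetical stand-in for a diarized segment: a speaker label,
    // start/end timestamps in seconds, and the segment text.
    record DiarizedSegment(String speaker, double start, double end, String text) {}

    // Sums the duration of every segment per speaker, preserving the order
    // in which speakers first appear.
    static Map<String, Double> secondsPerSpeaker(List<DiarizedSegment> segments) {
        Map<String, Double> totals = new LinkedHashMap<>();
        for (DiarizedSegment s : segments) {
            totals.merge(s.speaker(), s.end() - s.start(), Double::sum);
        }
        return totals;
    }

    public static void main(String[] args) {
        List<DiarizedSegment> segments = List.of(
            new DiarizedSegment("A", 0.0, 2.0, "Hi there."),
            new DiarizedSegment("B", 2.0, 5.0, "Hello, how are you?"),
            new DiarizedSegment("A", 5.0, 6.0, "Fine."));
        System.out.println(secondsPerSpeaker(segments));
    }
}
```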
class TranscriptionSegment:
Average logprob of the segment. If the value is lower than -1, consider the logprobs failed.
Compression ratio of the segment. If the value is greater than 2.4, consider the compression failed.
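The two thresholds above can be turned into a simple quality check. The following is a minimal sketch, assuming a hypothetical Segment record that models only an id and the two quality fields; it is not the SDK's TranscriptionSegment class, and the accessor names are illustrative.

```java
import java.util.List;
import java.util.stream.Collectors;

class SegmentQuality {
    // Hypothetical stand-in for a transcription segment.
    record Segment(int id, double avgLogprob, double compressionRatio) {}

    // Thresholds taken from the field descriptions above.
    static final double MIN_AVG_LOGPROB = -1.0;
    static final double MAX_COMPRESSION_RATIO = 2.4;

    // A segment is suspect if its average logprob is lower than -1
    // or its compression ratio is greater than 2.4.
    static boolean isSuspect(Segment s) {
        return s.avgLogprob() < MIN_AVG_LOGPROB
            || s.compressionRatio() > MAX_COMPRESSION_RATIO;
    }

    // Returns only the segments that fail either check, in input order.
    static List<Segment> suspectSegments(List<Segment> segments) {
        return segments.stream()
            .filter(SegmentQuality::isSuspect)
            .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<Segment> segments = List.of(
            new Segment(0, -0.3, 1.2),
            new Segment(1, -1.5, 1.1),
            new Segment(2, -0.2, 3.0));
        for (Segment s : suspectSegments(segments)) {
            System.out.println("Review segment " + s.id());
        }
    }
}
```

Values exactly at a threshold (-1 or 2.4) pass the check, since the descriptions say "lower than" and "greater than".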
class TranscriptionStreamEvent:
A union type; each event is exactly one of the following variants.
class TranscriptionTextSegmentEvent:
Emitted when a diarized transcription returns a completed segment with speaker information. Only emitted when you create a transcription with stream set to true and response_format set to diarized_json.
class TranscriptionTextDeltaEvent:
Emitted when there is an additional text delta. This is also the first event emitted when the transcription starts. Only emitted when you create a transcription with the Stream parameter set to true.
The type of the event. Always transcript.text.delta.
Optional<List<Logprob>> logprobs
The log probabilities of the delta. Only included if you create a transcription with the include[] parameter set to logprobs.
class TranscriptionTextDoneEvent:
Emitted when the transcription is complete. Contains the complete transcription text. Only emitted when you create a transcription with the Stream parameter set to true.
The type of the event. Always transcript.text.done.
Optional<List<Logprob>> logprobs
The log probabilities of the individual tokens in the transcription. Only included if you create a transcription with the include[] parameter set to logprobs.
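To show how the delta and done events relate, here is a minimal sketch of assembling a transcript from a stream. The StreamEvent interface and the TextDelta and TextDone records are hypothetical simplifications written for this example; they are not the SDK's event classes, and a real stream would be read incrementally instead of from a list.

```java
import java.util.List;

class TranscriptAssembler {
    // Hypothetical, simplified event types for illustration only.
    interface StreamEvent {}
    record TextDelta(String delta) implements StreamEvent {}
    record TextDone(String text) implements StreamEvent {}

    // Appends each delta as it arrives. When the done event arrives, its text
    // is returned, since it carries the complete transcription. If the stream
    // ends without a done event, the partial text collected so far is returned.
    static String assemble(List<StreamEvent> events) {
        StringBuilder partial = new StringBuilder();
        for (StreamEvent event : events) {
            if (event instanceof TextDelta d) {
                partial.append(d.delta());
            } else if (event instanceof TextDone done) {
                return done.text();
            }
        }
        return partial.toString();
    }

    public static void main(String[] args) {
        List<StreamEvent> events = List.of(
            new TextDelta("Hello, "),
            new TextDelta("world."),
            new TextDone("Hello, world."));
        System.out.println(assemble(events));
    }
}
```

Keeping the accumulated deltas is useful for showing live partial text, while the done event is treated as the authoritative final result.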
class TranscriptionVerbose:
Represents a verbose json transcription response returned by the model, based on the provided input.
Segments of the transcribed text and their corresponding details.
Average logprob of the segment. If the value is lower than -1, consider the logprobs failed.
Compression ratio of the segment. If the value is greater than 2.4, consider the compression failed.