GPT-4o mini Transcribe Model

Models

GPT-4o mini Transcribe

Default

Speech-to-text model powered by GPT-4o mini

Performance

High

Speed

Fast

Price

$1.25•$5

Input•Output

Input

Audio, text

Output

Text

GPT-4o mini Transcribe is a speech-to-text model that uses GPT-4o mini to transcribe audio. It offers improvements to word error rate and better language recognition and accuracy compared to original Whisper models. Use it for more accurate transcripts.

16,000 context window

2,000 max output tokens

Jun 01, 2024 knowledge cutoff

Pricing

Pricing is based on the number of tokens used, or other metrics based on the model type. For tool-specific models, like search and computer use, there’s a fee per tool call. See details in the pricing page.

Audio tokens

Per 1M tokens

Input

$1.25

Output

$5.00

Quick comparison

Input

Output

GPT-4o Transcribe

$2.50

GPT-4o mini Transcribe

$1.25

Modalities

Text

Input and output

Image

Not supported

Audio

Input only

Video

Not supported

Endpoints

Chat Completions

v1/chat/completions

Responses

v1/responses

Realtime

v1/realtime

Realtime translation

v1/realtime/translations

Realtime transcription

v1/realtime/transcription_sessions

Assistants

v1/assistants

Batch

v1/batch

Fine-tuning

v1/fine-tuning

Embeddings

v1/embeddings

Image generation

v1/images/generations

Videos

v1/videos

Image edit

v1/images/edits

Speech generation

v1/audio/speech

Transcription

v1/audio/transcriptions

Translation

v1/audio/translations

Moderation

v1/moderations

Completions (legacy)

v1/completions

Snapshots

Snapshots let you lock in a specific version of the model so that performance and behavior remain consistent. Below is a list of all available snapshots and aliases for GPT-4o mini Transcribe.

gpt-4o-mini-transcribe

gpt-4o-mini-transcribe-2025-12-15

gpt-4o-mini-transcribe-2025-03-20

gpt-4o-mini-transcribe-2025-12-15

Rate limits

Rate limits ensure fair and reliable access to the API by placing specific caps on requests, tokens, audio duration, or other usage within a given time period. Your usage tier determines how high these limits are set and automatically increases as you send more requests and spend more on the API.

Tier	RPM	TPM
Free	Not supported
Tier 1	500	50,000
Tier 2	2,000	150,000
Tier 3	5,000	600,000
Tier 4	10,000	2,000,000
Tier 5	10,000	8,000,000

Suggested

Get started

Core concepts

Agents SDK

Tools

Run and scale

Evaluation

Realtime and audio

Specialized models

Going live

Legacy APIs

Resources

Getting Started

Using Codex

Configuration

Administration

Automation

Learn

Releases

Core Concepts

Plan

Build

Deploy

Conversion apps

Guides

Resources

Get started

Guides

File Upload

API

Measurement

Advertiser API

API Reference

Recent

Topics

Topics

Contribute

Categories

Topics

Programs

Events

Spaces