Audio & Voice

Transcribing User Audio with a Separate Realtime Request
Cookbook for transcribing user audio with out-of-band Realtime sessions.

DevDay — realtime breakout
DevDay session focused on realtime agent capabilities.

New audio models intro
Overview video of new audio models for speech and transcription.

openai.fm
Code samples for speech processing from the openai.fm repo.

Realtime & Twilio starter app
Starter app integrating realtime agents with Twilio.

Realtime agent demo
Video introduction to the TypeScript Agents SDK.

Realtime console
Console application demonstrating Realtime API usage.

Realtime guide
Comprehensive guide to building realtime interactions.

Realtime intro
Introduction to building realtime voice applications.

Realtime solar system
Demo of realtime agent interactions in a solar system example.

Realtime tool delegation guide
Guide on delegating tasks through tools in realtime agents.

Realtime transcription guide
Guide for implementing realtime speech transcription.

Realtime translation guide
Guide to performing realtime speech translation.

Speech-to-text guide
Guide for building speech recognition pipelines.

Speech-to-text intro
Introduction to speech recognition with OpenAI.

Voice agents guide
Guide to building voice agents using the speech-to-speech API.

Voice applications intro
Introduction to building voice-enabled applications with OpenAI.

Audio & speech guide
Overview of approaches for audio processing and speech in applications.

Realtime agents starter app
Starter app demonstrating realtime agent capabilities.

Comparing Speech-to-Text Methods with the OpenAI API
Cookbook comparing speech-to-text methods to help you choose the right approach.

Multi-Language One-Way Translation with the Realtime API
Cookbook for building one-way speech translation with the Realtime API.