stt microphone live example #2254

ChitranshS · 2025-03-24T16:07:20Z

I understand that this repository is auto-generated and my pull request may not be merged

Changes being requested

This PR adds a real-time speech-to-text example script demonstrating how to use OpenAI's WebSocket-based transcription API. The script:

Captures audio from the microphone in real-time
Streams the audio data to OpenAI's transcription API via WebSockets
Processes and displays transcription events as they occur
Handles speech detection events (speech start/stop)
Properly manages resources and connections

This example would be valuable for users who want to implement real-time transcription functionality in their applications using the OpenAI API.

Additional context & links

This implementation uses:

websockets for WebSocket communication
sounddevice for microphone input
numpy for audio data processing
pydantic for data validation and configuration

The script demonstrates best practices for real-time audio streaming and event handling with OpenAI's transcription API, including proper connection management, error handling, and resource cleanup.

wronkiew · 2025-03-30T02:05:25Z

I was not able to get this to work. One thing I ran into is it requires websockets==10.1 for extra_headers support. But once running the transcription endpoint only returned a session update event, no transcribed text. Did you get more events? I confirmed it is recording good audio.
I got a variant based on this example to return transcription events.

stt microphone live example

750ef90

ChitranshS requested a review from a team as a code owner March 24, 2025 16:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

stt microphone live example #2254

stt microphone live example #2254

Uh oh!

ChitranshS commented Mar 24, 2025

wronkiew commented Mar 30, 2025

stt microphone live example #2254

Are you sure you want to change the base?

stt microphone live example #2254

Uh oh!

Conversation

ChitranshS commented Mar 24, 2025

Changes being requested

Additional context & links

wronkiew commented Mar 30, 2025