AI Edge Bootcamp

Voice Changer App

Transform voices in dialogue audio using AI

Upload Audio

Select or upload a dialogue audio file to convert

or select an existing file

Choose Provider

Select the AI service to convert your audio

OpenAI

Whisper STT + GPT-4o TTS

API key required

ElevenLabs

Direct S2S or per-speaker TTS

API key required

Your key is used only for this session and is never stored.

Your key is used only for this session and is never stored.

Direct S2S

Faster. Converts audio directly. One voice for all speakers.

Per-Speaker

Transcribes first. Assign a different voice to each speaker.

Find Voice IDs in your ElevenLabs dashboard.

Transcription typically takes 15–30 seconds…

Assign Voices

Found speakers in your audio. Assign a voice to each one.

Transcript loaded in memory — assign a voice to each detected speaker below.

The output will be in English, but spoken with the accent of a native speaker of the selected language.

Synthesis can take up to a minute for longer audio…

Audio Generated!

Your voice-converted audio is ready

voice_output.mp3

Download Audio
Provider:
Audio processed in memory — no files stored on server

Powered by OpenAI Whisper · GPT-4o TTS · ElevenLabs

Built by AI Edge Bootcamp