The voice API

Gabriel needed a voice, so we built one. Now the same stack is opening up to the public. The endpoints are OpenAI-compatible, so code that already calls the OpenAI audio API can call ours by changing the base URL, API key, and model name.

compat: OpenAI SDKs

streaming: HTTP + WebSocket

formats: mp3, wav, opus, pcm

price: free in early access

Text to Speech

live

Powered by Pocket TTS. It's the voice you hear in our VRChat projects, and it does a lot more than read text out loud.

  • OpenAI-compatible /v1/audio/speech endpoint
  • Voice cloning from your own audio samples
  • Real-time streaming over HTTP and WebSocket
  • PCM, WAV, MP3, and Opus output
  • Per-key rate limits and usage tracking
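
The HTTP streaming bullet above can be sketched with the OpenAI Python SDK's streaming-response helper. This is a minimal sketch, not a reference implementation: it assumes the endpoint accepts the same response_format parameter as OpenAI's, and that client is an OpenAI SDK client configured with the base URL and key from the quick start. The helper name stream_speech is hypothetical.

```python
from pathlib import Path
from typing import TYPE_CHECKING

if TYPE_CHECKING:  # only needed for type hints; nothing is imported at runtime
    from openai import OpenAI


def stream_speech(client: "OpenAI", text: str, voice: str, out_path: Path) -> int:
    """Stream synthesized audio to disk chunk by chunk; return bytes written."""
    written = 0
    # with_streaming_response defers reading the body, so chunks can be
    # written as they arrive instead of buffering the whole file in memory.
    with client.audio.speech.with_streaming_response.create(
        model="pocket-tts",
        voice=voice,
        input=text,
        response_format="wav",  # assumption: same parameter name as OpenAI's API
    ) as response, out_path.open("wb") as f:
        for chunk in response.iter_bytes(chunk_size=4096):
            f.write(chunk)
            written += len(chunk)
    return written
```

Called as stream_speech(client, "hello world", "your-cloned-voice", Path("speech.wav")), this should leave a playable file once the response finishes, provided the assumptions above hold.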

Speech to Text

in testing

Built on parakeet.cpp. Fast, accurate transcription without a datacenter bill. Currently in internal testing; public access is coming soon.

  • Same base URL, same API keys as TTS
  • OpenAI-compatible /v1/audio/transcriptions
  • Built on parakeet.cpp for speed
  • Streaming transcription planned
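
Since speech-to-text is still in internal testing, the following is only a sketch of what a call might look like once it opens up, assuming the endpoint mirrors OpenAI's transcription request and response. The page does not state a model ID, so the caller passes one in. The helper name transcribe is hypothetical, and client is an OpenAI SDK client pointed at the service.

```python
from pathlib import Path
from typing import TYPE_CHECKING

if TYPE_CHECKING:  # only needed for type hints; nothing is imported at runtime
    from openai import OpenAI


def transcribe(client: "OpenAI", audio_path: Path, model: str) -> str:
    """Upload one audio file and return the transcribed text."""
    # The model ID is not published yet, so it is supplied by the caller.
    with audio_path.open("rb") as f:
        result = client.audio.transcriptions.create(model=model, file=f)
    # With the default JSON response format, the SDK exposes the text here.
    return result.text
```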

want early access? say hi: hello@hoppou.ai

quick start

# python, with the openai sdk

from openai import OpenAI

client = OpenAI(
    base_url="https://api.hoppou.ai/tts/v1",
    api_key="your-hoppou-key",
)

audio = client.audio.speech.create(
    model="pocket-tts",
    voice="your-cloned-voice",
    input="hello world",
)
audio.write_to_file("speech.mp3")

people point this at

discord bots, vrchat ais, game npcs, accessibility tools, home assistants, content pipelines, twitch alerts

01

Get a key

Sign in with Discord on the developer portal and create an API key. Keys are free while we're in early access.

02

Clone a voice

Upload a clean audio sample on the portal and you get a voice ID you can use in any speech request.

03

Ship it

Point your existing OpenAI SDK at our base URL. Streaming, WebSocket, and batch all work the same way.

questions people ask

Is it actually OpenAI-compatible?

Yes. Same request and response shapes as /v1/audio/speech and /v1/audio/transcriptions. Point your existing SDK at our base URL, swap the model name, done. If something behaves differently, that's a bug and we want to hear about it.
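
To make "same request shape" concrete, here is a sketch of the raw HTTP call an SDK would issue, built with the Python standard library only. The field names follow OpenAI's /v1/audio/speech request; whether every optional field (such as response_format) is honored here is an assumption. The helper name build_speech_request is hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://api.hoppou.ai/tts/v1"


def build_speech_request(api_key: str, text: str, voice: str,
                         response_format: str = "mp3") -> urllib.request.Request:
    """Build (but do not send) a POST to the speech endpoint."""
    body = {
        "model": "pocket-tts",
        "voice": voice,
        "input": text,
        "response_format": response_format,  # assumption: mirrors OpenAI's field
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/audio/speech",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the result to urllib.request.urlopen and reading the response body should yield the audio bytes, if the service behaves as described above.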

What does it cost?

Nothing right now. Keys are free during early access while we figure out what fair limits look like. There's no credit card field anywhere on this site.

Can I clone any voice?

You can clone voices you have the rights to: your own, ones you made, or ones you have permission for. Don't clone real people without their OK. We pull keys for that.

How is this different from the big providers?

It's small and fast, and it's the same stack our VRChat AIs talk through in production every day. No sales call, no dashboard maze, no surprise invoice.

When does speech-to-text open up?

It's in internal testing on parakeet.cpp right now. Email hello@hoppou.ai if you want to kick the tires early.

get started

Ready when you are

Grab a key, point your SDK north, and you're making noise in a couple minutes. Questions first? We actually answer email.