Self-hosted · Production-ready · Low latency
Voice AI APIs for
Indian Languages
Production-ready Speech-to-Text and Text-to-Speech APIs supporting 22+ Indian languages. Enterprise-grade accuracy with sub-2-second latency.
Powerful Voice AI APIs
Built for Indian languages with state-of-the-art models, optimized for production workloads.
Speech-to-Text
Convert audio to text with high accuracy across Indian languages.
- 22 Indian languages via IndicConformer
- English via faster-whisper
- WebSocket streaming support
- <2s latency, optimized inference
Text-to-Speech
Generate natural-sounding speech from text in multiple Indian languages.
- 10 Indian languages supported
- MMS-TTS VITS model architecture
- <2s synthesis time
- High-quality WAV output
Supported Languages
Comprehensive coverage across India's linguistic diversity.
Tamil
Hindi
Telugu
Malayalam
Kannada
Bengali
Gujarati
Marathi
Punjabi
English
Assamese
Bodo
Dogri
Konkani
Kashmiri
Maithili
Manipuri
Nepali
Odia
Sanskrit
Santali
Sindhi
Urdu
Simple, Transparent Pricing
Start free and scale as you grow. All plans include access to both STT and TTS APIs.
Free
Free
- 50 STT requests/month
- 50 TTS requests/month
- 5 req/min rate limit
- Community support
Starter
₹
499
/mo
- 1,000 STT requests/month
- 1,000 TTS requests/month
- 20 req/min rate limit
- Email support
Most Popular
Pro
₹
2999
/mo
- 10,000 STT requests/month
- 10,000 TTS requests/month
- 60 req/min rate limit
- Priority support
- WebSocket streaming
Enterprise
₹
9999
/mo
- Unlimited STT requests
- Unlimited TTS requests
- 200 req/min rate limit
- Dedicated support
- WebSocket streaming
- Custom models
Try It Live
Test our APIs directly in your browser. No signup required.
Press and release to record audio. Max 10 seconds.
Your transcribed text will appear here...
Simple API Integration
Get started with just a few lines of code.
Speech-to-Text
curl
curl -X POST \
http://voice.chozha.io/api/v1/stt/transcribe \
-H "X-API-Key: YOUR_API_KEY" \
-F "file=@audio.wav" \
-F "language=ta"
Text-to-Speech
curl
curl -X POST \
http://voice.chozha.io/api/v1/tts/synthesize \
-H "X-API-Key: YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{"text":"வணக்கம்","language":"ta"}' \
--output speech.wav