**xAI Grok Speech STT/TTS APIs Launch** [developing]
Key Questions
When were the Grok STT and TTS APIs launched and what is their pricing?
The Grok Speech to Text (STT) and Text to Speech (TTS) APIs were launched on December 8, 2026. Pricing is $0.10-0.20 per hour for STT and $4.20 per million characters for TTS, undercutting competitors by 60%.
What features does the Grok STT API offer?
The STT API supports 25+ languages, diarization, and timestamps, outperforming ElevenLabs and Deepgram with 5% entity error. It is ideal for low-cost real-time voice agents and podcast SaaS.
How does the Grok TTS API perform compared to competitors?
The TTS API provides natural control and prosody at a competitive price. It undercuts rivals by 60% while maintaining high quality, suitable for B2C applications on platforms like HF, Replicate, and Fal.ai amid surges in Gemini and VoxCPM2.
Grok STT/TTS APIs launched Dec 8 2026 at $0.10-0.20/hr STT (25+ langs/diarization/timestamps outperforming ElevenLabs/Deepgram) and $4.20/M chars TTS (natural control), undercutting competitors by 60% with 5% entity error/prosody; ideal low-cost HF/Replicate/Fal.ai B2C real-time voice agent/podcast SaaS amid Gemini/VoxCPM2 surge.