xAI has launched its Grok Speech-to-Text (STT) and Text-to-Speech (TTS) APIs, offering advanced audio processing capabilities. The Grok STT API provides accurate, low-latency transcription services with features like word-level timestamps and speaker diarization, supporting over 25 languages. It is priced at $0.10 per hour for batch processing and $0.20 per hour for streaming. Benchmark tests indicate its performance surpasses that of leading models such as ElevenLabs and Deepgram.
The Grok TTS API delivers fast, natural speech synthesis with detailed control via voice tags, priced at $4.20 per million characters. Both APIs leverage the technology stack used in Grok Voice, Tesla vehicles, and Starlink support, highlighting xAI's commitment to integrating cutting-edge audio solutions across its platforms.
xAI Unveils Grok STT and TTS APIs with Competitive Pricing
Disclaimer: The content provided on Phemex News is for informational purposes only. We do not guarantee the quality, accuracy, or completeness of the information sourced from third-party articles. The content on this page does not constitute financial or investment advice. We strongly encourage you to conduct you own research and consult with a qualified financial advisor before making any investment decisions.
