xAI has launched its Grok Speech-to-Text (STT) and Text-to-Speech (TTS) APIs, offering advanced audio processing capabilities. The Grok STT API provides accurate, low-latency transcription services with features like word-level timestamps and speaker diarization, supporting over 25 languages. It is priced at $0.10 per hour for batch processing and $0.20 per hour for streaming. Benchmark tests indicate its performance surpasses that of leading models such as ElevenLabs and Deepgram. The Grok TTS API delivers fast, natural speech synthesis with detailed control via voice tags, priced at $4.20 per million characters. Both APIs leverage the technology stack used in Grok Voice, Tesla vehicles, and Starlink support, highlighting xAI's commitment to integrating cutting-edge audio solutions across its platforms.