xai / grok-text-to-speech
Convert text to natural-sounding speech with xAI's Grok TTS. 5 voices, 20 languages, expressive speech tags, and high-fidelity MP3 / WAV / telephony audio output.
xai / grok-speech-to-text
Transcribe audio to text with xAI's Grok. Handles 25 languages, word-level timestamps, speaker diarization, multichannel audio, and files up to 500 MB.
xai / grok-imagine-r2v
Generate videos guided by reference images using xAI's Grok Imagine Video model
xai / grok-imagine-video-extension
Extend videos with xAI's Grok Imagine Video model. Provide a source video and describe what happens next.
xai / grok-imagine-image
SOTA image model from xAI
xai / grok-imagine-video
Generate videos using xAI's Grok Imagine Video model
xai / grok-4
Grok 4 is xAI’s most advanced reasoning model. Excels at logical thinking and in-depth analysis. Ideal for insightful discussions and complex problem-solving.