xai/grok-speech-to-text

Transcribe audio to text with xAI's Grok. Handles 25 languages, word-level timestamps, speaker diarization, multichannel audio, and files up to 500 MB.

8 runs

Want to make some of these yourself?

Run this model