
⛓️‍💥 Whisper Unchained

Break free from:
- 🔐 Token requirements → Zero setup
- 📏 File size limits → 1 GB supported
- 🌍 English-only translation → 37 languages
- 🔧 Multiple API calls → One call. Everything.

Unchained from limitations. Unleashed for production.

⛓️ What You’re Chained To (With Other Models)

Limitation	Them (Chained)	Whisper Unchained
File Size	~100 MB max 🔒	1 GB ⛓️‍💥
Translation	English only 🔒	37 languages ⛓️‍💥
Setup	Token required 🔒	Zero setup ⛓️‍💥
Formats	JSON only 🔒	JSON + Text + SRT + VTT ⛓️‍💥
Complexity	13+ parameters 🔒	3 simple ⛓️‍💥
Cost	Multiple APIs 🔒	One call ⛓️‍💥

⛓️‍💥 Breaking Free From BS

🔓 Unchained from Token Hell

Chained models: “Sign up. Accept terms. Get HuggingFace token. Configure permissions.”
Unchained: Paste audio URL. Done.

🌍 Unchained from English-Only

Chained models: “Translate to English!”
Unchained: Spanish → Japanese. French → Arabic. ANY → ANY (37 languages).

📦 Unchained from Format Juggling

Chained models: “Here’s JSON. Go convert it yourself.”
Unchained: JSON + SRT + VTT auto-generated. Every. Single. Time.

🐘 Unchained from File Limits

Chained models: “Split your 500 MB podcast into 5 parts.”
Unchained: Upload the whole 1 GB file. We handle it.

🚀 What You Actually Get

Every single API call returns:

{
  "transcript": "Full text transcription",
  "translation": "Translated to your target language",
  "language": "Auto-detected source language",
  "duration": "Audio length in seconds",
  "json": "Structured data with segments, words, speakers",
  "srt": "Ready-to-use video subtitles",
  "vtt": "Ready-to-use web subtitles"
}

7 outputs. 1 API call. Zero extra work.
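
Want those outputs on disk? Here is a minimal sketch, assuming the result arrives as a Python dict with the field names shown above. The helper name `save_outputs` and the file extensions are our own picks for illustration, not part of the API.

```python
import json
from pathlib import Path


def save_outputs(result: dict, stem: str, out_dir: str) -> dict:
    """Write the text-like fields of a result to files and return their paths."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    # Result field -> file extension (the extensions are an assumption).
    targets = {
        "transcript": ".txt",
        "translation": ".translated.txt",
        "srt": ".srt",
        "vtt": ".vtt",
        "json": ".json",
    }

    written = {}
    for field, ext in targets.items():
        value = result.get(field)
        if value is None or value == "":
            continue  # e.g. no translation was requested
        if not isinstance(value, str):
            # Assumption: "json" may arrive as a parsed object, not a string.
            value = json.dumps(value, ensure_ascii=False, indent=2)
        path = out / f"{stem}{ext}"
        path.write_text(value, encoding="utf-8")
        written[field] = path
    return written
```

Call it as `save_outputs(result, "episode-12", "out")` and drop the `.srt` or `.vtt` straight into your video editor or web player.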

💪 Real Power Features

🌍 True Multi-Language Translation

Them: “We translate to English!”
Us: Translate ANY language to ANY of 37 languages (DeepL powered)
Spanish → Japanese? ✅ French → Arabic? ✅ German → Korean? ✅

📏 Enterprise File Sizes

Them: Split your 500 MB podcast into 5 parts
Us: Upload the whole 1 GB file. We handle it.
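
Not sure your file fits? A quick pre-flight check can be sketched like this. Assumptions, stated loudly: "1 GB" is read here as 10**9 bytes (the README does not say decimal or binary), the host of your audio file answers HEAD requests with a `Content-Length` header, and the names `within_limit` and `remote_size` are ours.

```python
import urllib.request

MAX_BYTES = 1_000_000_000  # Assumption: "1 GB" interpreted as 10**9 bytes


def within_limit(size_bytes: int, limit: int = MAX_BYTES) -> bool:
    """True if the file is non-empty and no larger than the limit."""
    return 0 < size_bytes <= limit


def remote_size(url: str, timeout: float = 10.0):
    """Return Content-Length from a HEAD request, or None if the server omits it."""
    req = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        length = resp.headers.get("Content-Length")
    return int(length) if length is not None else None
```

If `remote_size` returns `None`, the server did not report a size, so the check is inconclusive rather than failed.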

🎤 Unlimited Speaker Detection

Them: “Specify min/max speakers”
Us: Auto-detects as many speakers as exist
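
Speaker labels are most useful once consecutive segments are merged into turns. A minimal sketch follows, and the segment shape is an assumption: this README does not document the structure of the `json` output, so the `speaker`, `start`, `end`, and `text` keys below are hypothetical. Adjust them to what your results actually contain.

```python
def speaker_turns(segments: list) -> list:
    """Merge consecutive segments from the same speaker into one turn."""
    turns = []
    for seg in segments:
        # Hypothetical keys: "speaker", "start", "end", "text".
        speaker = seg.get("speaker", "UNKNOWN")
        text = seg.get("text", "").strip()
        if turns and turns[-1]["speaker"] == speaker:
            # Same speaker keeps talking: extend the current turn.
            turns[-1]["text"] = f'{turns[-1]["text"]} {text}'.strip()
            turns[-1]["end"] = seg.get("end", turns[-1]["end"])
        else:
            # New speaker: start a new turn.
            turns.append({
                "speaker": speaker,
                "start": seg.get("start"),
                "end": seg.get("end"),
                "text": text,
            })
    return turns
```

No speaker count goes in, and none is needed: the function simply follows whatever labels appear in the segments.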

🎯 118 Languages Supported

Full Whisper language coverage
Auto-detection when you don’t know the language
Works on ANY audio content

⚡ 3 Parameters. That’s It.

{
  "audio_url": "https://your-file.mp3",
  "language": "Auto-detect",
  "translate_to": "Spanish"
}

Only audio_url is required. language and translate_to are optional.

No batch_size. No vad_onset. No temperature. No HuggingFace tokens.

Just the essentials.
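
Building that input in code can be sketched as below. The helper `build_input` is hypothetical, and two choices in it are assumptions rather than documented behavior: `language` falls back to "Auto-detect" when you do not pass it, and `translate_to` is left out of the payload when you do not want a translation.

```python
from typing import Optional
from urllib.parse import urlparse


def build_input(audio_url: str, language: str = "Auto-detect",
                translate_to: Optional[str] = None) -> dict:
    """Assemble the three-parameter input, rejecting obviously bad URLs."""
    parsed = urlparse(audio_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"audio_url must be an http(s) URL, got {audio_url!r}")

    payload = {"audio_url": audio_url, "language": language}
    if translate_to:
        # Assumption: omit the key entirely when no translation is wanted.
        payload["translate_to"] = translate_to
    return payload
```

For example, `build_input("https://your-file.mp3", translate_to="Spanish")` produces the same three fields shown above.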

🎯 Perfect For

Content Creators
- Transcribe + translate + subtitle your videos in ONE call
- No more exporting to 3 different services

Podcasters
- 1 GB file support = full episodes, no splitting
- Speaker diarization included, not extra

Businesses
- Meeting transcripts with speaker labels
- Translate to team’s languages automatically

Developers
- 1 endpoint replaces 3+ services
- Clean API, zero token management

💰 Stop Paying for Chains

Chained to multiple services:
1. Transcription API: $$
2. Translation API: $$
3. Subtitle converter: $$
4. Large file storage: $$$
5. HuggingFace subscription: $$

Total: 💸💸💸

Unchained:
1. One API call: Everything ✅

Total: 💸

⚡ Break Free. Start Now.

No setup. No tokens. No limits.

{
  "audio_url": "https://your-1gb-file.mp3",
  "language": "Auto-detect",
  "translate_to": "Spanish"
}

Output: Transcription + Translation + JSON + SRT + VTT + Speakers

One call. Unchained.

⛓️‍💥 Break free from limitations | Built by SIÁN Agency
