thomasmol / whisper-diarization

⚡️ Blazing fast audio transcription with speaker diarization | Whisper Large V3 Turbo | word & sentence level timestamps | prompt

  • Public
  • 1.1M runs
  • L40S
  • GitHub
  • Paper
  • License

Run thomasmol/whisper-diarization with an API

Use one of our client libraries to get started quickly.

Set the REPLICATE_API_TOKEN environment variable

export REPLICATE_API_TOKEN=<paste-your-token-here>

Learn more about authentication

Install Replicate’s Node.js client library

npm install replicate
Learn more about setup

Run thomasmol/whisper-diarization using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

import Replicate from "replicate";
const replicate = new Replicate();

const input = {
    file: "https://replicate.delivery/pbxt/JcL0ttZLlbchC0tL9ZtB20phzeXCSuMm0EJNdLYElgILoZci/AI%20should%20be%20open-sourced.mp3",
    prompt: "LLama, AI, Meta.",
    file_url: "",
    language: "en",
    num_speakers: 2
};

const output = await replicate.run("thomasmol/whisper-diarization:d8bc5908738ebd84a9bb7d77d94b9c5e5a3d867886791d7171ddb60455b4c6af", { input });
console.log(output)
//=> {"language":"en","segments":[{"end":4.48,"text":"Let me a...
Learn more