Input

Run this model in Node.js with one line of code:

npx create-replicate --model=gianpaj/cog-orpheus-3b-0.1-ft

or set up a project from scratch

Install Replicate’s Node.js client library:

npm install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import and set up the client:

import Replicate from "replicate";
import fs from "node:fs";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});

Run gianpaj/cog-orpheus-3b-0.1-ft using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

const output = await replicate.run(
  "gianpaj/cog-orpheus-3b-0.1-ft:666dc0c400952f2c18f0a46233dca2053ebef622754769878cd5497e20714650",
  {
    input: {
      text: "Hola, me llamo Javi, encantado de conocerte <giggle>",
      top_p: 0.95,
      voice: "javi",
      temperature: 0.6,
      max_new_tokens: 1200,
      repetition_penalty: 1.1
    }
  }
);

// To access the file URL:
console.log(output.url()); //=> "http://example.com"

// To write the file to disk:
fs.writeFile("my-image.png", output);

To learn more, take a look at the guide on getting started with Node.js.

Install Replicate’s Python client library:

pip install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import the client:

import replicate

Run gianpaj/cog-orpheus-3b-0.1-ft using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

output = replicate.run(
    "gianpaj/cog-orpheus-3b-0.1-ft:666dc0c400952f2c18f0a46233dca2053ebef622754769878cd5497e20714650",
    input={
        "text": "Hola, me llamo Javi, encantado de conocerte <giggle>",
        "top_p": 0.95,
        "voice": "javi",
        "temperature": 0.6,
        "max_new_tokens": 1200,
        "repetition_penalty": 1.1
    }
)

# To access the file URL:
print(output.url())
#=> "http://example.com"

# To write the file to disk:
with open("my-image.png", "wb") as file:
    file.write(output.read())

To learn more, take a look at the guide on getting started with Python.

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Run gianpaj/cog-orpheus-3b-0.1-ft using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "gianpaj/cog-orpheus-3b-0.1-ft:666dc0c400952f2c18f0a46233dca2053ebef622754769878cd5497e20714650",
    "input": {
      "text": "Hola, me llamo Javi, encantado de conocerte <giggle>",
      "top_p": 0.95,
      "voice": "javi",
      "temperature": 0.6,
      "max_new_tokens": 1200,
      "repetition_penalty": 1.1
    }
  }' \
  https://api.replicate.com/v1/predictions

To learn more, take a look at Replicate’s HTTP API reference docs.

Output

Video Player is loading.

Current Time 00:00:000

Duration 00:00:000

Loaded: 0%

Stream Type LIVE

Remaining Time 00:00:000

Generated in

7.4 seconds

Tweak it ShareReport View full prediction

Run time and cost

This model runs on Nvidia L40S GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

Spanish and Italian model: 3b-es_it-ft-research_release https://huggingface.co/canopylabs/3b-es_it-ft-research_release

Orpheus 3B 0.1 Finetuned

Note on emotional tags: - Italian supports sigh, laugh, cough, sniffle, groan, yawn, gemito, gasp - Spanish supports groan, chuckle, gasp, resoplido, laugh, yawn, cough

More info: https://canopylabs.ai/releases/orpheus_can_speak_any_language

Orpheus TTS is a state-of-the-art, Llama-based Speech-LLM designed for high-quality, empathetic text-to-speech generation. This model has been finetuned to deliver human-level speech synthesis, achieving exceptional clarity, expressiveness, and real-time streaming performances.

Model Details

Model Capabilities

Human-Like Speech: Natural intonation, emotion, and rhythm that is superior to SOTA closed source models
Zero-Shot Voice Cloning: Clone voices without prior fine-tuning
Guided Emotion and Intonation: Control speech and emotion characteristics with simple tags
Low Latency: ~200ms streaming latency for realtime applications, reducible to ~100ms with input streaming

Model Sources

GitHub Repo: https://github.com/canopyai/Orpheus-TTS
Blog Posts: https://canopylabs.ai/releases

Model Misuse

Do not use our models for impersonation without consent, misinformation or deception (including fake news or fraudulent calls), or any illegal or harmful activity. By using this model, you agree to follow all applicable laws and ethical guidelines. We disclaim responsibility for any use.

gianpaj / cog-orpheus-3b-0.1-ft

Input

Output

Run time and cost

Readme

Orpheus 3B 0.1 Finetuned

Model Details

Model Capabilities

Model Sources

Model Misuse

Logs (ypr5eq58pnrm80cp5ttv01ghwg)