Prediction

Official model

minimax/speech-02-turbo

by67sg9dxdrm80cpjat9x3apxw

Status

Succeeded

Source

Web

Total duration

2.4s

Created

8 months ago

Webhook

–

Input

text: Speech-02-series is a Text-to-Audio and voice cloning technology that offers voice synthesis, emotional expression, and multilingual capabilities. The HD version is optimized for high-fidelity applications like voiceovers and audiobooks. While the turbo one is designed for real-time applications with low latency. When using this model on Replicate, each character represents 1 token.
voice_id: Deep_Voice_Man
speed: 1
volume: 1
pitch: 0
emotion: angry
english_normalization: true
sample_rate: 32000
bitrate: 128000
channel: mono
language_boost: English

{
  "bitrate": 128000,
  "channel": "mono",
  "emotion": "angry",
  "english_normalization": true,
  "language_boost": "English",
  "pitch": 0,
  "sample_rate": 32000,
  "speed": 1,
  "text": "Speech-02-series is a Text-to-Audio and voice cloning technology that offers voice synthesis, emotional expression, and multilingual capabilities.\n\nThe HD version is optimized for high-fidelity applications like voiceovers and audiobooks. While the turbo one is designed for real-time applications with low latency.\n\nWhen using this model on Replicate, each character represents 1 token.",
  "voice_id": "Deep_Voice_Man",
  "volume": 1
}

Install Replicate’s Node.js client library:

npm install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=r8_Ym5**********************************

This is your API token. Keep it to yourself.

Import and set up the client:

import Replicate from "replicate";
import fs from "node:fs";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});

Run minimax/speech-02-turbo using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

const input = {
  bitrate: 128000,
  channel: "mono",
  emotion: "angry",
  english_normalization: true,
  language_boost: "English",
  pitch: 0,
  sample_rate: 32000,
  speed: 1,
  text: "Speech-02-series is a Text-to-Audio and voice cloning technology that offers voice synthesis, emotional expression, and multilingual capabilities.\n\nThe HD version is optimized for high-fidelity applications like voiceovers and audiobooks. While the turbo one is designed for real-time applications with low latency.\n\nWhen using this model on Replicate, each character represents 1 token.",
  voice_id: "Deep_Voice_Man",
  volume: 1
};

const output = await replicate.run("minimax/speech-02-turbo", { input });

// To access the file URL:
console.log(output.url()); //=> "http://example.com"

// To write the file to disk:
fs.writeFile("my-image.png", output);

To learn more, take a look at the guide on getting started with Node.js.

Install Replicate’s Python client library:

pip install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=r8_Ym5**********************************

This is your API token. Keep it to yourself.

Import the client:

import replicate

Run minimax/speech-02-turbo using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

output = replicate.run(
    "minimax/speech-02-turbo",
    input={
        "bitrate": 128000,
        "channel": "mono",
        "emotion": "angry",
        "english_normalization": True,
        "language_boost": "English",
        "pitch": 0,
        "sample_rate": 32000,
        "speed": 1,
        "text": "Speech-02-series is a Text-to-Audio and voice cloning technology that offers voice synthesis, emotional expression, and multilingual capabilities.\n\nThe HD version is optimized for high-fidelity applications like voiceovers and audiobooks. While the turbo one is designed for real-time applications with low latency.\n\nWhen using this model on Replicate, each character represents 1 token.",
        "voice_id": "Deep_Voice_Man",
        "volume": 1
    }
)

# To access the file URL:
print(output.url())
#=> "http://example.com"

# To write the file to disk:
with open("my-image.png", "wb") as file:
    file.write(output.read())

To learn more, take a look at the guide on getting started with Python.

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=r8_Ym5**********************************

This is your API token. Keep it to yourself.

Run minimax/speech-02-turbo using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "input": {
      "bitrate": 128000,
      "channel": "mono",
      "emotion": "angry",
      "english_normalization": true,
      "language_boost": "English",
      "pitch": 0,
      "sample_rate": 32000,
      "speed": 1,
      "text": "Speech-02-series is a Text-to-Audio and voice cloning technology that offers voice synthesis, emotional expression, and multilingual capabilities.\\n\\nThe HD version is optimized for high-fidelity applications like voiceovers and audiobooks. While the turbo one is designed for real-time applications with low latency.\\n\\nWhen using this model on Replicate, each character represents 1 token.",
      "voice_id": "Deep_Voice_Man",
      "volume": 1
    }
  }' \
  https://api.replicate.com/v1/models/minimax/speech-02-turbo/predictions

To learn more, take a look at Replicate’s HTTP API reference docs.

Output

Generated in

2.4 seconds

Input tokens

380

Output tokens

Tokens per second

0.42 tokens / second

Time to first token

10 milliseconds

Tweak it Iterate in playground Report