You're looking at a specific version of this model. Jump to the model overview.
Input
Run this model in Node.js with one line of code:
npm install replicate
REPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
import Replicate from "replicate";
import fs from "node:fs";
const replicate = new Replicate({
auth: process.env.REPLICATE_API_TOKEN,
});
Run adirik/styletts2 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
"adirik/styletts2:53fd5081feae9440974d1ef9cae83bf7af5fe18be1646343f37e559f5f80a613",
{
input: {
beta: 0.7,
seed: 0,
text: "StyleTTS 2 is a text-to-speech model that leverages style diffusion and adversarial training with large speech language models to achieve human-level text-to-speech synthesis.",
alpha: 0.3,
diffusion_steps: 10,
embedding_scale: 1.5
}
}
);
// To access the file URL:
console.log(output.url()); //=> "http://example.com"
// To write the file to disk:
fs.writeFile("my-image.png", output);
To learn more, take a look at the guide on getting started with Node.js.
pip install replicate
REPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
import replicate
Run adirik/styletts2 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
"adirik/styletts2:53fd5081feae9440974d1ef9cae83bf7af5fe18be1646343f37e559f5f80a613",
input={
"beta": 0.7,
"seed": 0,
"text": "StyleTTS 2 is a text-to-speech model that leverages style diffusion and adversarial training with large speech language models to achieve human-level text-to-speech synthesis.",
"alpha": 0.3,
"diffusion_steps": 10,
"embedding_scale": 1.5
}
)
print(output)
To learn more, take a look at the guide on getting started with Python.
REPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run adirik/styletts2 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
-H "Authorization: Bearer $REPLICATE_API_TOKEN" \
-H "Content-Type: application/json" \
-H "Prefer: wait" \
-d $'{
"version": "adirik/styletts2:53fd5081feae9440974d1ef9cae83bf7af5fe18be1646343f37e559f5f80a613",
"input": {
"beta": 0.7,
"seed": 0,
"text": "StyleTTS 2 is a text-to-speech model that leverages style diffusion and adversarial training with large speech language models to achieve human-level text-to-speech synthesis.",
"alpha": 0.3,
"diffusion_steps": 10,
"embedding_scale": 1.5
}
}' \
https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
brew install cog
If you don’t have Homebrew, there are other installation options available.
Run this to download the model and run it in your local environment:
cog predict r8.im/adirik/styletts2@sha256:53fd5081feae9440974d1ef9cae83bf7af5fe18be1646343f37e559f5f80a613 \
-i 'beta=0.7' \
-i 'seed=0' \
-i 'text="StyleTTS 2 is a text-to-speech model that leverages style diffusion and adversarial training with large speech language models to achieve human-level text-to-speech synthesis."' \
-i 'alpha=0.3' \
-i 'diffusion_steps=10' \
-i 'embedding_scale=1.5'
To learn more, take a look at the Cog documentation.
Run this to download the model and run it in your local environment:
docker run -d -p 5000:5000 --gpus=all r8.im/adirik/styletts2@sha256:53fd5081feae9440974d1ef9cae83bf7af5fe18be1646343f37e559f5f80a613
curl -s -X POST \ -H "Content-Type: application/json" \ -d $'{ "input": { "beta": 0.7, "seed": 0, "text": "StyleTTS 2 is a text-to-speech model that leverages style diffusion and adversarial training with large speech language models to achieve human-level text-to-speech synthesis.", "alpha": 0.3, "diffusion_steps": 10, "embedding_scale": 1.5 } }' \ http://localhost:5000/predictions
To learn more, take a look at the Cog documentation.
Add a payment method to run this model.
Each run costs approximately $0.077. Alternatively, try out our featured models for free.
By signing in, you agree to our
terms of service and privacy policy
Output
- Chapters
- descriptions off, selected
- captions settings, opens captions settings dialog
- captions off, selected
This is a modal window.
Beginning of dialog window. Escape will cancel and close the window.
End of dialog window.
{
"completed_at": "2023-11-20T19:35:54.404786Z",
"created_at": "2023-11-20T19:35:47.182253Z",
"data_removed": false,
"error": null,
"id": "ehv426rccpf2wzjpe4yon6oahq",
"input": {
"beta": 0.7,
"seed": 0,
"text": "StyleTTS 2 is a text-to-speech model that leverages style diffusion and adversarial training with large speech language models to achieve human-level text-to-speech synthesis.",
"alpha": 0.3,
"diffusion_steps": 10,
"embedding_scale": 1.5
},
"logs": null,
"metrics": {
"predict_time": 5.425799,
"total_time": 7.222533
},
"output": "https://replicate.delivery/pbxt/CehZad4ot3zvXSeotg8BVPf0rKSMdNusZ5eD6HepJwOQjzSPC/out.mp3",
"started_at": "2023-11-20T19:35:48.978987Z",
"status": "succeeded",
"urls": {
"get": "https://api.replicate.com/v1/predictions/ehv426rccpf2wzjpe4yon6oahq",
"cancel": "https://api.replicate.com/v1/predictions/ehv426rccpf2wzjpe4yon6oahq/cancel"
},
"version": "53fd5081feae9440974d1ef9cae83bf7af5fe18be1646343f37e559f5f80a613"
}