suno-ai / bark

🔊 Text-Prompted Generative Audio Model

  • Public
  • 297.5K runs
  • T4
  • GitHub
  • License

Input

string
Shift + Return to add a new line

Input prompt

Default: "Hello, my name is Suno. And, uh — and I like pizza. [laughs] But I also have other interests such as playing tic tac toe."

string

history choice for audio cloning, choose from the list

file

Provide your own .npz file with history choice for audio cloning, this will override the previous history_prompt setting

number

generation temperature (1.0 more diverse, 0.0 more conservative)

Default: 0.7

number

generation temperature (1.0 more diverse, 0.0 more conservative)

Default: 0.7

boolean

return full generation as a .npz file to be used as a history prompt

Default: false

Output

Video Player is loading.
Current Time 00:00:000
Duration 00:00:000
Loaded: 0%
Stream Type LIVE
Remaining Time 00:00:000
 
1x
Generated in

This example was created by a different version, suno-ai/bark:f23937d7.

Run time and cost

This model costs approximately $0.033 to run on Replicate, or 30 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia T4 GPU hardware. Predictions typically complete within 148 seconds. The predict time for this model varies significantly based on the inputs.

Readme

🐶 Bark

Original repo: https://github.com/suno-ai/bark

Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. The model can also produce nonverbal communications like laughing, sighing and crying. To support the research community, we are providing access to pretrained model checkpoints ready for inference.

🙏 Appreciation

  • nanoGPT for a dead-simple and blazing fast implementation of GPT-style models
  • EnCodec for a state-of-the-art implementation of a fantastic audio codec
  • AudioLM for very related training and inference code
  • Vall-E, AudioLM and many other ground-breaking papers that enabled the development of Bark

© License

Bark is licensed under the MIT License.

Please contact us at bark@suno.ai to request access to a larger version of the model.