zsxkib / flux-music

🎼FluxMusic Text-to-Music Generation with Rectified Flow Transformer🎶

  • Public
  • 1.3K runs
  • L40S
  • GitHub
  • Weights
  • Paper
  • License

Input

string
Shift + Return to add a new line

Text prompt for music generation

Default: "The song is an epic blend of space-rock, rock, and post-rock genres."

string
Shift + Return to add a new line

Text prompt for negative guidance (unconditioned prompt)

Default: "low quality, gentle"

number
(minimum: 0, maximum: 20)

Classifier-free guidance scale

Default: 7

string

Select the model version to use

Default: "base"

integer
(minimum: 1, maximum: 200)

Number of sampling steps

Default: 50

boolean

Whether to save the spectrogram image

Default: false

integer

Random seed. Leave blank to randomize the seed

Output

wav

Video Player is loading.
Current Time 00:00:000
Duration 00:00:000
Loaded: 0%
Stream Type LIVE
Remaining Time 00:00:000
 
1x

melspectrogram

melspectrogram
Generated in

Run time and cost

This model costs approximately $0.052 to run on Replicate, or 19 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 54 seconds. The predict time for this model varies significantly based on the inputs.

Readme

This model doesn't have a readme.