nateraw / audio-super-resolution
AudioSR: Versatile Audio Super-resolution at Scale
Prediction
nateraw/audio-super-resolution:0e453d5e4c2e0ef4f8d38a6167053dda09cf3c8dbca2355cde61dca55a915bc5IDolvui3lb3zratd7oxlp3urq6yyStatusSucceededSourceWebHardwareA40 (Large)Total durationCreatedby @naterawInput
- seed
- 1234
- ddim_steps
- 200
- input_file
- Video Player is loading.Current Time 00:00:000/Duration 00:00:000Loaded: 0%Stream Type LIVERemaining Time -00:00:0001x
- Chapters
- descriptions off, selected
- captions settings, opens captions settings dialog
- captions off, selected
This is a modal window.
Beginning of dialog window. Escape will cancel and close the window.
End of dialog window.
- guidance_scale
- 3.5
{ "seed": 1234, "ddim_steps": 200, "input_file": "https://replicate.delivery/pbxt/JYuc1GkrIZ9SUZ8DdP4UEPjjWxtEEODL4H4uqBjsinrjfed0/20230905-104523.wav", "guidance_scale": 3.5 }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run nateraw/audio-super-resolution using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "nateraw/audio-super-resolution:0e453d5e4c2e0ef4f8d38a6167053dda09cf3c8dbca2355cde61dca55a915bc5", { input: { seed: 1234, ddim_steps: 200, input_file: "https://replicate.delivery/pbxt/JYuc1GkrIZ9SUZ8DdP4UEPjjWxtEEODL4H4uqBjsinrjfed0/20230905-104523.wav", guidance_scale: 3.5 } } ); // To access the file URL: console.log(output.url()); //=> "http://example.com" // To write the file to disk: fs.writeFile("my-image.png", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run nateraw/audio-super-resolution using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "nateraw/audio-super-resolution:0e453d5e4c2e0ef4f8d38a6167053dda09cf3c8dbca2355cde61dca55a915bc5", input={ "seed": 1234, "ddim_steps": 200, "input_file": "https://replicate.delivery/pbxt/JYuc1GkrIZ9SUZ8DdP4UEPjjWxtEEODL4H4uqBjsinrjfed0/20230905-104523.wav", "guidance_scale": 3.5 } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Run nateraw/audio-super-resolution using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "nateraw/audio-super-resolution:0e453d5e4c2e0ef4f8d38a6167053dda09cf3c8dbca2355cde61dca55a915bc5", "input": { "seed": 1234, "ddim_steps": 200, "input_file": "https://replicate.delivery/pbxt/JYuc1GkrIZ9SUZ8DdP4UEPjjWxtEEODL4H4uqBjsinrjfed0/20230905-104523.wav", "guidance_scale": 3.5 } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
Video Player is loading.Current Time 00:00:000/Duration 00:00:000Loaded: 0%Stream Type LIVERemaining Time -00:00:0001x- Chapters
- descriptions off, selected
- captions settings, opens captions settings dialog
- captions off, selected
This is a modal window.
Beginning of dialog window. Escape will cancel and close the window.
End of dialog window.
{ "completed_at": "2023-09-20T18:45:01.710156Z", "created_at": "2023-09-20T18:44:20.862312Z", "data_removed": false, "error": null, "id": "olvui3lb3zratd7oxlp3urq6yy", "input": { "seed": 1234, "ddim_steps": 200, "input_file": "https://replicate.delivery/pbxt/JYuc1GkrIZ9SUZ8DdP4UEPjjWxtEEODL4H4uqBjsinrjfed0/20230905-104523.wav", "guidance_scale": 3.5 }, "logs": "Warning: audio is longer than 10.24 seconds, may degrade the model performance. It's recommand to truncate your audio to 5.12 seconds before input to AudioSR to get the best performance.\n/root/.pyenv/versions/3.9.18/lib/python3.9/site-packages/audiosr/utils.py:109: FutureWarning: Pass sr=48000, n_fft=2048, n_mels=256, fmin=20, fmax=24000 as keyword args. From version 0.10 passing these as positional arguments will result in an error\nmel = librosa_mel_fn(sampling_rate, filter_length, n_mel, mel_fmin, mel_fmax)\nRunning DDIM Sampling with 200 timesteps\nDDIM Sampler: 0%| | 0/200 [00:00<?, ?it/s]\nDDIM Sampler: 0%| | 1/200 [00:00<00:36, 5.52it/s]\nDDIM Sampler: 1%| | 2/200 [00:00<00:35, 5.53it/s]\nDDIM Sampler: 2%|▏ | 3/200 [00:00<00:35, 5.53it/s]\nDDIM Sampler: 2%|▏ | 4/200 [00:00<00:35, 5.53it/s]\nDDIM Sampler: 2%|▎ | 5/200 [00:00<00:35, 5.54it/s]\nDDIM Sampler: 3%|▎ | 6/200 [00:01<00:35, 5.54it/s]\nDDIM Sampler: 4%|▎ | 7/200 [00:01<00:34, 5.55it/s]\nDDIM Sampler: 4%|▍ | 8/200 [00:01<00:34, 5.55it/s]\nDDIM Sampler: 4%|▍ | 9/200 [00:01<00:34, 5.55it/s]\nDDIM Sampler: 5%|▌ | 10/200 [00:01<00:34, 5.55it/s]\nDDIM Sampler: 6%|▌ | 11/200 [00:01<00:34, 5.54it/s]\nDDIM Sampler: 6%|▌ | 12/200 [00:02<00:33, 5.55it/s]\nDDIM Sampler: 6%|▋ | 13/200 [00:02<00:33, 5.54it/s]\nDDIM Sampler: 7%|▋ | 14/200 [00:02<00:33, 5.54it/s]\nDDIM Sampler: 8%|▊ | 15/200 [00:02<00:33, 5.53it/s]\nDDIM Sampler: 8%|▊ | 16/200 [00:02<00:33, 5.53it/s]\nDDIM Sampler: 8%|▊ | 17/200 [00:03<00:33, 5.53it/s]\nDDIM Sampler: 9%|▉ | 18/200 [00:03<00:32, 5.53it/s]\nDDIM Sampler: 10%|▉ | 19/200 [00:03<00:32, 5.52it/s]\nDDIM Sampler: 10%|█ | 20/200 [00:03<00:32, 5.53it/s]\nDDIM Sampler: 10%|█ | 21/200 [00:03<00:32, 5.53it/s]\nDDIM Sampler: 11%|█ | 22/200 [00:03<00:32, 5.54it/s]\nDDIM Sampler: 12%|█▏ | 23/200 [00:04<00:31, 5.54it/s]\nDDIM Sampler: 12%|█▏ | 24/200 [00:04<00:31, 5.54it/s]\nDDIM Sampler: 12%|█▎ | 25/200 [00:04<00:31, 5.55it/s]\nDDIM Sampler: 13%|█▎ | 26/200 [00:04<00:31, 5.55it/s]\nDDIM Sampler: 14%|█▎ | 27/200 [00:04<00:31, 5.54it/s]\nDDIM Sampler: 14%|█▍ | 28/200 [00:05<00:31, 5.54it/s]\nDDIM Sampler: 14%|█▍ | 29/200 [00:05<00:30, 5.53it/s]\nDDIM Sampler: 15%|█▌ | 30/200 [00:05<00:30, 5.54it/s]\nDDIM Sampler: 16%|█▌ | 31/200 [00:05<00:30, 5.54it/s]\nDDIM Sampler: 16%|█▌ | 32/200 [00:05<00:30, 5.55it/s]\nDDIM Sampler: 16%|█▋ | 33/200 [00:05<00:30, 5.55it/s]\nDDIM Sampler: 17%|█▋ | 34/200 [00:06<00:29, 5.54it/s]\nDDIM Sampler: 18%|█▊ | 35/200 [00:06<00:29, 5.54it/s]\nDDIM Sampler: 18%|█▊ | 36/200 [00:06<00:29, 5.54it/s]\nDDIM Sampler: 18%|█▊ | 37/200 [00:06<00:29, 5.54it/s]\nDDIM Sampler: 19%|█▉ | 38/200 [00:06<00:29, 5.54it/s]\nDDIM Sampler: 20%|█▉ | 39/200 [00:07<00:29, 5.54it/s]\nDDIM Sampler: 20%|██ | 40/200 [00:07<00:28, 5.55it/s]\nDDIM Sampler: 20%|██ | 41/200 [00:07<00:28, 5.55it/s]\nDDIM Sampler: 21%|██ | 42/200 [00:07<00:28, 5.55it/s]\nDDIM Sampler: 22%|██▏ | 43/200 [00:07<00:28, 5.55it/s]\nDDIM Sampler: 22%|██▏ | 44/200 [00:07<00:28, 5.56it/s]\nDDIM Sampler: 22%|██▎ | 45/200 [00:08<00:27, 5.56it/s]\nDDIM Sampler: 23%|██▎ | 46/200 [00:08<00:27, 5.55it/s]\nDDIM Sampler: 24%|██▎ | 47/200 [00:08<00:27, 5.55it/s]\nDDIM Sampler: 24%|██▍ | 48/200 [00:08<00:27, 5.55it/s]\nDDIM Sampler: 24%|██▍ | 49/200 [00:08<00:27, 5.55it/s]\nDDIM Sampler: 25%|██▌ | 50/200 [00:09<00:27, 5.55it/s]\nDDIM Sampler: 26%|██▌ | 51/200 [00:09<00:26, 5.55it/s]\nDDIM Sampler: 26%|██▌ | 52/200 [00:09<00:26, 5.50it/s]\nDDIM Sampler: 26%|██▋ | 53/200 [00:09<00:26, 5.51it/s]\nDDIM Sampler: 27%|██▋ | 54/200 [00:09<00:26, 5.52it/s]\nDDIM Sampler: 28%|██▊ | 55/200 [00:09<00:26, 5.53it/s]\nDDIM Sampler: 28%|██▊ | 56/200 [00:10<00:26, 5.53it/s]\nDDIM Sampler: 28%|██▊ | 57/200 [00:10<00:25, 5.54it/s]\nDDIM Sampler: 29%|██▉ | 58/200 [00:10<00:25, 5.54it/s]\nDDIM Sampler: 30%|██▉ | 59/200 [00:10<00:25, 5.55it/s]\nDDIM Sampler: 30%|███ | 60/200 [00:10<00:25, 5.54it/s]\nDDIM Sampler: 30%|███ | 61/200 [00:11<00:25, 5.54it/s]\nDDIM Sampler: 31%|███ | 62/200 [00:11<00:24, 5.55it/s]\nDDIM Sampler: 32%|███▏ | 63/200 [00:11<00:24, 5.54it/s]\nDDIM Sampler: 32%|███▏ | 64/200 [00:11<00:24, 5.55it/s]\nDDIM Sampler: 32%|███▎ | 65/200 [00:11<00:24, 5.55it/s]\nDDIM Sampler: 33%|███▎ | 66/200 [00:11<00:24, 5.55it/s]\nDDIM Sampler: 34%|███▎ | 67/200 [00:12<00:24, 5.54it/s]\nDDIM Sampler: 34%|███▍ | 68/200 [00:12<00:23, 5.54it/s]\nDDIM Sampler: 34%|███▍ | 69/200 [00:12<00:23, 5.55it/s]\nDDIM Sampler: 35%|███▌ | 70/200 [00:12<00:23, 5.55it/s]\nDDIM Sampler: 36%|███▌ | 71/200 [00:12<00:23, 5.55it/s]\nDDIM Sampler: 36%|███▌ | 72/200 [00:12<00:23, 5.55it/s]\nDDIM Sampler: 36%|███▋ | 73/200 [00:13<00:22, 5.55it/s]\nDDIM Sampler: 37%|███▋ | 74/200 [00:13<00:22, 5.55it/s]\nDDIM Sampler: 38%|███▊ | 75/200 [00:13<00:22, 5.55it/s]\nDDIM Sampler: 38%|███▊ | 76/200 [00:13<00:22, 5.55it/s]\nDDIM Sampler: 38%|███▊ | 77/200 [00:13<00:22, 5.55it/s]\nDDIM Sampler: 39%|███▉ | 78/200 [00:14<00:21, 5.55it/s]\nDDIM Sampler: 40%|███▉ | 79/200 [00:14<00:21, 5.55it/s]\nDDIM Sampler: 40%|████ | 80/200 [00:14<00:21, 5.55it/s]\nDDIM Sampler: 40%|████ | 81/200 [00:14<00:21, 5.55it/s]\nDDIM Sampler: 41%|████ | 82/200 [00:14<00:21, 5.55it/s]\nDDIM Sampler: 42%|████▏ | 83/200 [00:14<00:21, 5.55it/s]\nDDIM Sampler: 42%|████▏ | 84/200 [00:15<00:20, 5.55it/s]\nDDIM Sampler: 42%|████▎ | 85/200 [00:15<00:20, 5.55it/s]\nDDIM Sampler: 43%|████▎ | 86/200 [00:15<00:20, 5.55it/s]\nDDIM Sampler: 44%|████▎ | 87/200 [00:15<00:20, 5.55it/s]\nDDIM Sampler: 44%|████▍ | 88/200 [00:15<00:20, 5.55it/s]\nDDIM Sampler: 44%|████▍ | 89/200 [00:16<00:20, 5.55it/s]\nDDIM Sampler: 45%|████▌ | 90/200 [00:16<00:19, 5.55it/s]\nDDIM Sampler: 46%|████▌ | 91/200 [00:16<00:19, 5.54it/s]\nDDIM Sampler: 46%|████▌ | 92/200 [00:16<00:19, 5.54it/s]\nDDIM Sampler: 46%|████▋ | 93/200 [00:16<00:19, 5.54it/s]\nDDIM Sampler: 47%|████▋ | 94/200 [00:16<00:19, 5.54it/s]\nDDIM Sampler: 48%|████▊ | 95/200 [00:17<00:18, 5.55it/s]\nDDIM Sampler: 48%|████▊ | 96/200 [00:17<00:18, 5.55it/s]\nDDIM Sampler: 48%|████▊ | 97/200 [00:17<00:18, 5.54it/s]\nDDIM Sampler: 49%|████▉ | 98/200 [00:17<00:18, 5.54it/s]\nDDIM Sampler: 50%|████▉ | 99/200 [00:17<00:18, 5.55it/s]\nDDIM Sampler: 50%|█████ | 100/200 [00:18<00:18, 5.55it/s]\nDDIM Sampler: 50%|█████ | 101/200 [00:18<00:17, 5.55it/s]\nDDIM Sampler: 51%|█████ | 102/200 [00:18<00:17, 5.55it/s]\nDDIM Sampler: 52%|█████▏ | 103/200 [00:18<00:17, 5.55it/s]\nDDIM Sampler: 52%|█████▏ | 104/200 [00:18<00:17, 5.52it/s]\nDDIM Sampler: 52%|█████▎ | 105/200 [00:18<00:17, 5.53it/s]\nDDIM Sampler: 53%|█████▎ | 106/200 [00:19<00:16, 5.54it/s]\nDDIM Sampler: 54%|█████▎ | 107/200 [00:19<00:16, 5.54it/s]\nDDIM Sampler: 54%|█████▍ | 108/200 [00:19<00:16, 5.55it/s]\nDDIM Sampler: 55%|█████▍ | 109/200 [00:19<00:16, 5.55it/s]\nDDIM Sampler: 55%|█████▌ | 110/200 [00:19<00:16, 5.55it/s]\nDDIM Sampler: 56%|█████▌ | 111/200 [00:20<00:16, 5.55it/s]\nDDIM Sampler: 56%|█████▌ | 112/200 [00:20<00:15, 5.54it/s]\nDDIM Sampler: 56%|█████▋ | 113/200 [00:20<00:15, 5.54it/s]\nDDIM Sampler: 57%|█████▋ | 114/200 [00:20<00:15, 5.54it/s]\nDDIM Sampler: 57%|█████▊ | 115/200 [00:20<00:15, 5.53it/s]\nDDIM Sampler: 58%|█████▊ | 116/200 [00:20<00:15, 5.54it/s]\nDDIM Sampler: 58%|█████▊ | 117/200 [00:21<00:14, 5.54it/s]\nDDIM Sampler: 59%|█████▉ | 118/200 [00:21<00:14, 5.54it/s]\nDDIM Sampler: 60%|█████▉ | 119/200 [00:21<00:14, 5.54it/s]\nDDIM Sampler: 60%|██████ | 120/200 [00:21<00:14, 5.54it/s]\nDDIM Sampler: 60%|██████ | 121/200 [00:21<00:14, 5.54it/s]\nDDIM Sampler: 61%|██████ | 122/200 [00:22<00:14, 5.54it/s]\nDDIM Sampler: 62%|██████▏ | 123/200 [00:22<00:13, 5.54it/s]\nDDIM Sampler: 62%|██████▏ | 124/200 [00:22<00:13, 5.55it/s]\nDDIM Sampler: 62%|██████▎ | 125/200 [00:22<00:13, 5.55it/s]\nDDIM Sampler: 63%|██████▎ | 126/200 [00:22<00:13, 5.55it/s]\nDDIM Sampler: 64%|██████▎ | 127/200 [00:22<00:13, 5.55it/s]\nDDIM Sampler: 64%|██████▍ | 128/200 [00:23<00:12, 5.55it/s]\nDDIM Sampler: 64%|██████▍ | 129/200 [00:23<00:12, 5.55it/s]\nDDIM Sampler: 65%|██████▌ | 130/200 [00:23<00:12, 5.55it/s]\nDDIM Sampler: 66%|██████▌ | 131/200 [00:23<00:12, 5.54it/s]\nDDIM Sampler: 66%|██████▌ | 132/200 [00:23<00:12, 5.54it/s]\nDDIM Sampler: 66%|██████▋ | 133/200 [00:23<00:12, 5.54it/s]\nDDIM Sampler: 67%|██████▋ | 134/200 [00:24<00:11, 5.55it/s]\nDDIM Sampler: 68%|██████▊ | 135/200 [00:24<00:11, 5.55it/s]\nDDIM Sampler: 68%|██████▊ | 136/200 [00:24<00:11, 5.55it/s]\nDDIM Sampler: 68%|██████▊ | 137/200 [00:24<00:11, 5.55it/s]\nDDIM Sampler: 69%|██████▉ | 138/200 [00:24<00:11, 5.55it/s]\nDDIM Sampler: 70%|██████▉ | 139/200 [00:25<00:10, 5.55it/s]\nDDIM Sampler: 70%|███████ | 140/200 [00:25<00:10, 5.54it/s]\nDDIM Sampler: 70%|███████ | 141/200 [00:25<00:10, 5.54it/s]\nDDIM Sampler: 71%|███████ | 142/200 [00:25<00:10, 5.54it/s]\nDDIM Sampler: 72%|███████▏ | 143/200 [00:25<00:10, 5.54it/s]\nDDIM Sampler: 72%|███████▏ | 144/200 [00:25<00:10, 5.54it/s]\nDDIM Sampler: 72%|███████▎ | 145/200 [00:26<00:09, 5.54it/s]\nDDIM Sampler: 73%|███████▎ | 146/200 [00:26<00:09, 5.54it/s]\nDDIM Sampler: 74%|███████▎ | 147/200 [00:26<00:09, 5.55it/s]\nDDIM Sampler: 74%|███████▍ | 148/200 [00:26<00:09, 5.55it/s]\nDDIM Sampler: 74%|███████▍ | 149/200 [00:26<00:09, 5.54it/s]\nDDIM Sampler: 75%|███████▌ | 150/200 [00:27<00:09, 5.53it/s]\nDDIM Sampler: 76%|███████▌ | 151/200 [00:27<00:08, 5.53it/s]\nDDIM Sampler: 76%|███████▌ | 152/200 [00:27<00:08, 5.52it/s]\nDDIM Sampler: 76%|███████▋ | 153/200 [00:27<00:08, 5.53it/s]\nDDIM Sampler: 77%|███████▋ | 154/200 [00:27<00:08, 5.54it/s]\nDDIM Sampler: 78%|███████▊ | 155/200 [00:27<00:08, 5.54it/s]\nDDIM Sampler: 78%|███████▊ | 156/200 [00:28<00:07, 5.54it/s]\nDDIM Sampler: 78%|███████▊ | 157/200 [00:28<00:07, 5.54it/s]\nDDIM Sampler: 79%|███████▉ | 158/200 [00:28<00:07, 5.54it/s]\nDDIM Sampler: 80%|███████▉ | 159/200 [00:28<00:07, 5.53it/s]\nDDIM Sampler: 80%|████████ | 160/200 [00:28<00:07, 5.54it/s]\nDDIM Sampler: 80%|████████ | 161/200 [00:29<00:07, 5.54it/s]\nDDIM Sampler: 81%|████████ | 162/200 [00:29<00:06, 5.54it/s]\nDDIM Sampler: 82%|████████▏ | 163/200 [00:29<00:06, 5.54it/s]\nDDIM Sampler: 82%|████████▏ | 164/200 [00:29<00:06, 5.53it/s]\nDDIM Sampler: 82%|████████▎ | 165/200 [00:29<00:06, 5.54it/s]\nDDIM Sampler: 83%|████████▎ | 166/200 [00:29<00:06, 5.54it/s]\nDDIM Sampler: 84%|████████▎ | 167/200 [00:30<00:05, 5.54it/s]\nDDIM Sampler: 84%|████████▍ | 168/200 [00:30<00:05, 5.54it/s]\nDDIM Sampler: 84%|████████▍ | 169/200 [00:30<00:05, 5.55it/s]\nDDIM Sampler: 85%|████████▌ | 170/200 [00:30<00:05, 5.55it/s]\nDDIM Sampler: 86%|████████▌ | 171/200 [00:30<00:05, 5.54it/s]\nDDIM Sampler: 86%|████████▌ | 172/200 [00:31<00:05, 5.54it/s]\nDDIM Sampler: 86%|████████▋ | 173/200 [00:31<00:04, 5.54it/s]\nDDIM Sampler: 87%|████████▋ | 174/200 [00:31<00:04, 5.54it/s]\nDDIM Sampler: 88%|████████▊ | 175/200 [00:31<00:04, 5.55it/s]\nDDIM Sampler: 88%|████████▊ | 176/200 [00:31<00:04, 5.55it/s]\nDDIM Sampler: 88%|████████▊ | 177/200 [00:31<00:04, 5.54it/s]\nDDIM Sampler: 89%|████████▉ | 178/200 [00:32<00:03, 5.55it/s]\nDDIM Sampler: 90%|████████▉ | 179/200 [00:32<00:03, 5.55it/s]\nDDIM Sampler: 90%|█████████ | 180/200 [00:32<00:03, 5.55it/s]\nDDIM Sampler: 90%|█████████ | 181/200 [00:32<00:03, 5.55it/s]\nDDIM Sampler: 91%|█████████ | 182/200 [00:32<00:03, 5.54it/s]\nDDIM Sampler: 92%|█████████▏| 183/200 [00:33<00:03, 5.54it/s]\nDDIM Sampler: 92%|█████████▏| 184/200 [00:33<00:02, 5.54it/s]\nDDIM Sampler: 92%|█████████▎| 185/200 [00:33<00:02, 5.54it/s]\nDDIM Sampler: 93%|█████████▎| 186/200 [00:33<00:02, 5.54it/s]\nDDIM Sampler: 94%|█████████▎| 187/200 [00:33<00:02, 5.54it/s]\nDDIM Sampler: 94%|█████████▍| 188/200 [00:33<00:02, 5.53it/s]\nDDIM Sampler: 94%|█████████▍| 189/200 [00:34<00:01, 5.53it/s]\nDDIM Sampler: 95%|█████████▌| 190/200 [00:34<00:01, 5.52it/s]\nDDIM Sampler: 96%|█████████▌| 191/200 [00:34<00:01, 5.53it/s]\nDDIM Sampler: 96%|█████████▌| 192/200 [00:34<00:01, 5.53it/s]\nDDIM Sampler: 96%|█████████▋| 193/200 [00:34<00:01, 5.52it/s]\nDDIM Sampler: 97%|█████████▋| 194/200 [00:35<00:01, 5.53it/s]\nDDIM Sampler: 98%|█████████▊| 195/200 [00:35<00:00, 5.53it/s]\nDDIM Sampler: 98%|█████████▊| 196/200 [00:35<00:00, 5.54it/s]\nDDIM Sampler: 98%|█████████▊| 197/200 [00:35<00:00, 5.54it/s]\nDDIM Sampler: 99%|█████████▉| 198/200 [00:35<00:00, 5.54it/s]\nDDIM Sampler: 100%|█████████▉| 199/200 [00:35<00:00, 5.55it/s]\nDDIM Sampler: 100%|██████████| 200/200 [00:36<00:00, 5.55it/s]\nDDIM Sampler: 100%|██████████| 200/200 [00:36<00:00, 5.54it/s]", "metrics": { "predict_time": 40.907754, "total_time": 40.847844 }, "output": "https://replicate.delivery/pbxt/aYf1eA1fZVVv7I8dT1QHZBJBahbS234jAwfN0UbrEsaz67YGB/out.wav", "started_at": "2023-09-20T18:44:20.802402Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/olvui3lb3zratd7oxlp3urq6yy", "cancel": "https://api.replicate.com/v1/predictions/olvui3lb3zratd7oxlp3urq6yy/cancel" }, "version": "0e453d5e4c2e0ef4f8d38a6167053dda09cf3c8dbca2355cde61dca55a915bc5" }
Generated inWarning: audio is longer than 10.24 seconds, may degrade the model performance. It's recommand to truncate your audio to 5.12 seconds before input to AudioSR to get the best performance. /root/.pyenv/versions/3.9.18/lib/python3.9/site-packages/audiosr/utils.py:109: FutureWarning: Pass sr=48000, n_fft=2048, n_mels=256, fmin=20, fmax=24000 as keyword args. From version 0.10 passing these as positional arguments will result in an error mel = librosa_mel_fn(sampling_rate, filter_length, n_mel, mel_fmin, mel_fmax) Running DDIM Sampling with 200 timesteps DDIM Sampler: 0%| | 0/200 [00:00<?, ?it/s] DDIM Sampler: 0%| | 1/200 [00:00<00:36, 5.52it/s] DDIM Sampler: 1%| | 2/200 [00:00<00:35, 5.53it/s] DDIM Sampler: 2%|▏ | 3/200 [00:00<00:35, 5.53it/s] DDIM Sampler: 2%|▏ | 4/200 [00:00<00:35, 5.53it/s] DDIM Sampler: 2%|▎ | 5/200 [00:00<00:35, 5.54it/s] DDIM Sampler: 3%|▎ | 6/200 [00:01<00:35, 5.54it/s] DDIM Sampler: 4%|▎ | 7/200 [00:01<00:34, 5.55it/s] DDIM Sampler: 4%|▍ | 8/200 [00:01<00:34, 5.55it/s] DDIM Sampler: 4%|▍ | 9/200 [00:01<00:34, 5.55it/s] DDIM Sampler: 5%|▌ | 10/200 [00:01<00:34, 5.55it/s] DDIM Sampler: 6%|▌ | 11/200 [00:01<00:34, 5.54it/s] DDIM Sampler: 6%|▌ | 12/200 [00:02<00:33, 5.55it/s] DDIM Sampler: 6%|▋ | 13/200 [00:02<00:33, 5.54it/s] DDIM Sampler: 7%|▋ | 14/200 [00:02<00:33, 5.54it/s] DDIM Sampler: 8%|▊ | 15/200 [00:02<00:33, 5.53it/s] DDIM Sampler: 8%|▊ | 16/200 [00:02<00:33, 5.53it/s] DDIM Sampler: 8%|▊ | 17/200 [00:03<00:33, 5.53it/s] DDIM Sampler: 9%|▉ | 18/200 [00:03<00:32, 5.53it/s] DDIM Sampler: 10%|▉ | 19/200 [00:03<00:32, 5.52it/s] DDIM Sampler: 10%|█ | 20/200 [00:03<00:32, 5.53it/s] DDIM Sampler: 10%|█ | 21/200 [00:03<00:32, 5.53it/s] DDIM Sampler: 11%|█ | 22/200 [00:03<00:32, 5.54it/s] DDIM Sampler: 12%|█▏ | 23/200 [00:04<00:31, 5.54it/s] DDIM Sampler: 12%|█▏ | 24/200 [00:04<00:31, 5.54it/s] DDIM Sampler: 12%|█▎ | 25/200 [00:04<00:31, 5.55it/s] DDIM Sampler: 13%|█▎ | 26/200 [00:04<00:31, 5.55it/s] DDIM Sampler: 14%|█▎ | 27/200 [00:04<00:31, 5.54it/s] DDIM Sampler: 14%|█▍ | 28/200 [00:05<00:31, 5.54it/s] DDIM Sampler: 14%|█▍ | 29/200 [00:05<00:30, 5.53it/s] DDIM Sampler: 15%|█▌ | 30/200 [00:05<00:30, 5.54it/s] DDIM Sampler: 16%|█▌ | 31/200 [00:05<00:30, 5.54it/s] DDIM Sampler: 16%|█▌ | 32/200 [00:05<00:30, 5.55it/s] DDIM Sampler: 16%|█▋ | 33/200 [00:05<00:30, 5.55it/s] DDIM Sampler: 17%|█▋ | 34/200 [00:06<00:29, 5.54it/s] DDIM Sampler: 18%|█▊ | 35/200 [00:06<00:29, 5.54it/s] DDIM Sampler: 18%|█▊ | 36/200 [00:06<00:29, 5.54it/s] DDIM Sampler: 18%|█▊ | 37/200 [00:06<00:29, 5.54it/s] DDIM Sampler: 19%|█▉ | 38/200 [00:06<00:29, 5.54it/s] DDIM Sampler: 20%|█▉ | 39/200 [00:07<00:29, 5.54it/s] DDIM Sampler: 20%|██ | 40/200 [00:07<00:28, 5.55it/s] DDIM Sampler: 20%|██ | 41/200 [00:07<00:28, 5.55it/s] DDIM Sampler: 21%|██ | 42/200 [00:07<00:28, 5.55it/s] DDIM Sampler: 22%|██▏ | 43/200 [00:07<00:28, 5.55it/s] DDIM Sampler: 22%|██▏ | 44/200 [00:07<00:28, 5.56it/s] DDIM Sampler: 22%|██▎ | 45/200 [00:08<00:27, 5.56it/s] DDIM Sampler: 23%|██▎ | 46/200 [00:08<00:27, 5.55it/s] DDIM Sampler: 24%|██▎ | 47/200 [00:08<00:27, 5.55it/s] DDIM Sampler: 24%|██▍ | 48/200 [00:08<00:27, 5.55it/s] DDIM Sampler: 24%|██▍ | 49/200 [00:08<00:27, 5.55it/s] DDIM Sampler: 25%|██▌ | 50/200 [00:09<00:27, 5.55it/s] DDIM Sampler: 26%|██▌ | 51/200 [00:09<00:26, 5.55it/s] DDIM Sampler: 26%|██▌ | 52/200 [00:09<00:26, 5.50it/s] DDIM Sampler: 26%|██▋ | 53/200 [00:09<00:26, 5.51it/s] DDIM Sampler: 27%|██▋ | 54/200 [00:09<00:26, 5.52it/s] DDIM Sampler: 28%|██▊ | 55/200 [00:09<00:26, 5.53it/s] DDIM Sampler: 28%|██▊ | 56/200 [00:10<00:26, 5.53it/s] DDIM Sampler: 28%|██▊ | 57/200 [00:10<00:25, 5.54it/s] DDIM Sampler: 29%|██▉ | 58/200 [00:10<00:25, 5.54it/s] DDIM Sampler: 30%|██▉ | 59/200 [00:10<00:25, 5.55it/s] DDIM Sampler: 30%|███ | 60/200 [00:10<00:25, 5.54it/s] DDIM Sampler: 30%|███ | 61/200 [00:11<00:25, 5.54it/s] DDIM Sampler: 31%|███ | 62/200 [00:11<00:24, 5.55it/s] DDIM Sampler: 32%|███▏ | 63/200 [00:11<00:24, 5.54it/s] DDIM Sampler: 32%|███▏ | 64/200 [00:11<00:24, 5.55it/s] DDIM Sampler: 32%|███▎ | 65/200 [00:11<00:24, 5.55it/s] DDIM Sampler: 33%|███▎ | 66/200 [00:11<00:24, 5.55it/s] DDIM Sampler: 34%|███▎ | 67/200 [00:12<00:24, 5.54it/s] DDIM Sampler: 34%|███▍ | 68/200 [00:12<00:23, 5.54it/s] DDIM Sampler: 34%|███▍ | 69/200 [00:12<00:23, 5.55it/s] DDIM Sampler: 35%|███▌ | 70/200 [00:12<00:23, 5.55it/s] DDIM Sampler: 36%|███▌ | 71/200 [00:12<00:23, 5.55it/s] DDIM Sampler: 36%|███▌ | 72/200 [00:12<00:23, 5.55it/s] DDIM Sampler: 36%|███▋ | 73/200 [00:13<00:22, 5.55it/s] DDIM Sampler: 37%|███▋ | 74/200 [00:13<00:22, 5.55it/s] DDIM Sampler: 38%|███▊ | 75/200 [00:13<00:22, 5.55it/s] DDIM Sampler: 38%|███▊ | 76/200 [00:13<00:22, 5.55it/s] DDIM Sampler: 38%|███▊ | 77/200 [00:13<00:22, 5.55it/s] DDIM Sampler: 39%|███▉ | 78/200 [00:14<00:21, 5.55it/s] DDIM Sampler: 40%|███▉ | 79/200 [00:14<00:21, 5.55it/s] DDIM Sampler: 40%|████ | 80/200 [00:14<00:21, 5.55it/s] DDIM Sampler: 40%|████ | 81/200 [00:14<00:21, 5.55it/s] DDIM Sampler: 41%|████ | 82/200 [00:14<00:21, 5.55it/s] DDIM Sampler: 42%|████▏ | 83/200 [00:14<00:21, 5.55it/s] DDIM Sampler: 42%|████▏ | 84/200 [00:15<00:20, 5.55it/s] DDIM Sampler: 42%|████▎ | 85/200 [00:15<00:20, 5.55it/s] DDIM Sampler: 43%|████▎ | 86/200 [00:15<00:20, 5.55it/s] DDIM Sampler: 44%|████▎ | 87/200 [00:15<00:20, 5.55it/s] DDIM Sampler: 44%|████▍ | 88/200 [00:15<00:20, 5.55it/s] DDIM Sampler: 44%|████▍ | 89/200 [00:16<00:20, 5.55it/s] DDIM Sampler: 45%|████▌ | 90/200 [00:16<00:19, 5.55it/s] DDIM Sampler: 46%|████▌ | 91/200 [00:16<00:19, 5.54it/s] DDIM Sampler: 46%|████▌ | 92/200 [00:16<00:19, 5.54it/s] DDIM Sampler: 46%|████▋ | 93/200 [00:16<00:19, 5.54it/s] DDIM Sampler: 47%|████▋ | 94/200 [00:16<00:19, 5.54it/s] DDIM Sampler: 48%|████▊ | 95/200 [00:17<00:18, 5.55it/s] DDIM Sampler: 48%|████▊ | 96/200 [00:17<00:18, 5.55it/s] DDIM Sampler: 48%|████▊ | 97/200 [00:17<00:18, 5.54it/s] DDIM Sampler: 49%|████▉ | 98/200 [00:17<00:18, 5.54it/s] DDIM Sampler: 50%|████▉ | 99/200 [00:17<00:18, 5.55it/s] DDIM Sampler: 50%|█████ | 100/200 [00:18<00:18, 5.55it/s] DDIM Sampler: 50%|█████ | 101/200 [00:18<00:17, 5.55it/s] DDIM Sampler: 51%|█████ | 102/200 [00:18<00:17, 5.55it/s] DDIM Sampler: 52%|█████▏ | 103/200 [00:18<00:17, 5.55it/s] DDIM Sampler: 52%|█████▏ | 104/200 [00:18<00:17, 5.52it/s] DDIM Sampler: 52%|█████▎ | 105/200 [00:18<00:17, 5.53it/s] DDIM Sampler: 53%|█████▎ | 106/200 [00:19<00:16, 5.54it/s] DDIM Sampler: 54%|█████▎ | 107/200 [00:19<00:16, 5.54it/s] DDIM Sampler: 54%|█████▍ | 108/200 [00:19<00:16, 5.55it/s] DDIM Sampler: 55%|█████▍ | 109/200 [00:19<00:16, 5.55it/s] DDIM Sampler: 55%|█████▌ | 110/200 [00:19<00:16, 5.55it/s] DDIM Sampler: 56%|█████▌ | 111/200 [00:20<00:16, 5.55it/s] DDIM Sampler: 56%|█████▌ | 112/200 [00:20<00:15, 5.54it/s] DDIM Sampler: 56%|█████▋ | 113/200 [00:20<00:15, 5.54it/s] DDIM Sampler: 57%|█████▋ | 114/200 [00:20<00:15, 5.54it/s] DDIM Sampler: 57%|█████▊ | 115/200 [00:20<00:15, 5.53it/s] DDIM Sampler: 58%|█████▊ | 116/200 [00:20<00:15, 5.54it/s] DDIM Sampler: 58%|█████▊ | 117/200 [00:21<00:14, 5.54it/s] DDIM Sampler: 59%|█████▉ | 118/200 [00:21<00:14, 5.54it/s] DDIM Sampler: 60%|█████▉ | 119/200 [00:21<00:14, 5.54it/s] DDIM Sampler: 60%|██████ | 120/200 [00:21<00:14, 5.54it/s] DDIM Sampler: 60%|██████ | 121/200 [00:21<00:14, 5.54it/s] DDIM Sampler: 61%|██████ | 122/200 [00:22<00:14, 5.54it/s] DDIM Sampler: 62%|██████▏ | 123/200 [00:22<00:13, 5.54it/s] DDIM Sampler: 62%|██████▏ | 124/200 [00:22<00:13, 5.55it/s] DDIM Sampler: 62%|██████▎ | 125/200 [00:22<00:13, 5.55it/s] DDIM Sampler: 63%|██████▎ | 126/200 [00:22<00:13, 5.55it/s] DDIM Sampler: 64%|██████▎ | 127/200 [00:22<00:13, 5.55it/s] DDIM Sampler: 64%|██████▍ | 128/200 [00:23<00:12, 5.55it/s] DDIM Sampler: 64%|██████▍ | 129/200 [00:23<00:12, 5.55it/s] DDIM Sampler: 65%|██████▌ | 130/200 [00:23<00:12, 5.55it/s] DDIM Sampler: 66%|██████▌ | 131/200 [00:23<00:12, 5.54it/s] DDIM Sampler: 66%|██████▌ | 132/200 [00:23<00:12, 5.54it/s] DDIM Sampler: 66%|██████▋ | 133/200 [00:23<00:12, 5.54it/s] DDIM Sampler: 67%|██████▋ | 134/200 [00:24<00:11, 5.55it/s] DDIM Sampler: 68%|██████▊ | 135/200 [00:24<00:11, 5.55it/s] DDIM Sampler: 68%|██████▊ | 136/200 [00:24<00:11, 5.55it/s] DDIM Sampler: 68%|██████▊ | 137/200 [00:24<00:11, 5.55it/s] DDIM Sampler: 69%|██████▉ | 138/200 [00:24<00:11, 5.55it/s] DDIM Sampler: 70%|██████▉ | 139/200 [00:25<00:10, 5.55it/s] DDIM Sampler: 70%|███████ | 140/200 [00:25<00:10, 5.54it/s] DDIM Sampler: 70%|███████ | 141/200 [00:25<00:10, 5.54it/s] DDIM Sampler: 71%|███████ | 142/200 [00:25<00:10, 5.54it/s] DDIM Sampler: 72%|███████▏ | 143/200 [00:25<00:10, 5.54it/s] DDIM Sampler: 72%|███████▏ | 144/200 [00:25<00:10, 5.54it/s] DDIM Sampler: 72%|███████▎ | 145/200 [00:26<00:09, 5.54it/s] DDIM Sampler: 73%|███████▎ | 146/200 [00:26<00:09, 5.54it/s] DDIM Sampler: 74%|███████▎ | 147/200 [00:26<00:09, 5.55it/s] DDIM Sampler: 74%|███████▍ | 148/200 [00:26<00:09, 5.55it/s] DDIM Sampler: 74%|███████▍ | 149/200 [00:26<00:09, 5.54it/s] DDIM Sampler: 75%|███████▌ | 150/200 [00:27<00:09, 5.53it/s] DDIM Sampler: 76%|███████▌ | 151/200 [00:27<00:08, 5.53it/s] DDIM Sampler: 76%|███████▌ | 152/200 [00:27<00:08, 5.52it/s] DDIM Sampler: 76%|███████▋ | 153/200 [00:27<00:08, 5.53it/s] DDIM Sampler: 77%|███████▋ | 154/200 [00:27<00:08, 5.54it/s] DDIM Sampler: 78%|███████▊ | 155/200 [00:27<00:08, 5.54it/s] DDIM Sampler: 78%|███████▊ | 156/200 [00:28<00:07, 5.54it/s] DDIM Sampler: 78%|███████▊ | 157/200 [00:28<00:07, 5.54it/s] DDIM Sampler: 79%|███████▉ | 158/200 [00:28<00:07, 5.54it/s] DDIM Sampler: 80%|███████▉ | 159/200 [00:28<00:07, 5.53it/s] DDIM Sampler: 80%|████████ | 160/200 [00:28<00:07, 5.54it/s] DDIM Sampler: 80%|████████ | 161/200 [00:29<00:07, 5.54it/s] DDIM Sampler: 81%|████████ | 162/200 [00:29<00:06, 5.54it/s] DDIM Sampler: 82%|████████▏ | 163/200 [00:29<00:06, 5.54it/s] DDIM Sampler: 82%|████████▏ | 164/200 [00:29<00:06, 5.53it/s] DDIM Sampler: 82%|████████▎ | 165/200 [00:29<00:06, 5.54it/s] DDIM Sampler: 83%|████████▎ | 166/200 [00:29<00:06, 5.54it/s] DDIM Sampler: 84%|████████▎ | 167/200 [00:30<00:05, 5.54it/s] DDIM Sampler: 84%|████████▍ | 168/200 [00:30<00:05, 5.54it/s] DDIM Sampler: 84%|████████▍ | 169/200 [00:30<00:05, 5.55it/s] DDIM Sampler: 85%|████████▌ | 170/200 [00:30<00:05, 5.55it/s] DDIM Sampler: 86%|████████▌ | 171/200 [00:30<00:05, 5.54it/s] DDIM Sampler: 86%|████████▌ | 172/200 [00:31<00:05, 5.54it/s] DDIM Sampler: 86%|████████▋ | 173/200 [00:31<00:04, 5.54it/s] DDIM Sampler: 87%|████████▋ | 174/200 [00:31<00:04, 5.54it/s] DDIM Sampler: 88%|████████▊ | 175/200 [00:31<00:04, 5.55it/s] DDIM Sampler: 88%|████████▊ | 176/200 [00:31<00:04, 5.55it/s] DDIM Sampler: 88%|████████▊ | 177/200 [00:31<00:04, 5.54it/s] DDIM Sampler: 89%|████████▉ | 178/200 [00:32<00:03, 5.55it/s] DDIM Sampler: 90%|████████▉ | 179/200 [00:32<00:03, 5.55it/s] DDIM Sampler: 90%|█████████ | 180/200 [00:32<00:03, 5.55it/s] DDIM Sampler: 90%|█████████ | 181/200 [00:32<00:03, 5.55it/s] DDIM Sampler: 91%|█████████ | 182/200 [00:32<00:03, 5.54it/s] DDIM Sampler: 92%|█████████▏| 183/200 [00:33<00:03, 5.54it/s] DDIM Sampler: 92%|█████████▏| 184/200 [00:33<00:02, 5.54it/s] DDIM Sampler: 92%|█████████▎| 185/200 [00:33<00:02, 5.54it/s] DDIM Sampler: 93%|█████████▎| 186/200 [00:33<00:02, 5.54it/s] DDIM Sampler: 94%|█████████▎| 187/200 [00:33<00:02, 5.54it/s] DDIM Sampler: 94%|█████████▍| 188/200 [00:33<00:02, 5.53it/s] DDIM Sampler: 94%|█████████▍| 189/200 [00:34<00:01, 5.53it/s] DDIM Sampler: 95%|█████████▌| 190/200 [00:34<00:01, 5.52it/s] DDIM Sampler: 96%|█████████▌| 191/200 [00:34<00:01, 5.53it/s] DDIM Sampler: 96%|█████████▌| 192/200 [00:34<00:01, 5.53it/s] DDIM Sampler: 96%|█████████▋| 193/200 [00:34<00:01, 5.52it/s] DDIM Sampler: 97%|█████████▋| 194/200 [00:35<00:01, 5.53it/s] DDIM Sampler: 98%|█████████▊| 195/200 [00:35<00:00, 5.53it/s] DDIM Sampler: 98%|█████████▊| 196/200 [00:35<00:00, 5.54it/s] DDIM Sampler: 98%|█████████▊| 197/200 [00:35<00:00, 5.54it/s] DDIM Sampler: 99%|█████████▉| 198/200 [00:35<00:00, 5.54it/s] DDIM Sampler: 100%|█████████▉| 199/200 [00:35<00:00, 5.55it/s] DDIM Sampler: 100%|██████████| 200/200 [00:36<00:00, 5.55it/s] DDIM Sampler: 100%|██████████| 200/200 [00:36<00:00, 5.54it/s]
Prediction
nateraw/audio-super-resolution:9c3d3e39fb0cb6aea677264881d8073f835336137b39fdea4e94093319379535IDqqynzilbxm2t5h6eaywx2p7g54StatusSucceededSourceWebHardwareA40 (Large)Total durationCreatedInput
- seed
- 42
- ddim_steps
- 50
- input_file
- Video Player is loading.Current Time 00:00:000/Duration 00:00:000Loaded: 0%Stream Type LIVERemaining Time -00:00:0001x
- Chapters
- descriptions off, selected
- captions settings, opens captions settings dialog
- captions off, selected
This is a modal window.
Beginning of dialog window. Escape will cancel and close the window.
End of dialog window.
- guidance_scale
- 3.5
{ "seed": 42, "ddim_steps": 50, "input_file": "https://replicate.delivery/pbxt/JYv70XQsiZBbSmknfMhGoEb4QYbuyJ9hJkfgjyzCvh4TzPmT/music.wav", "guidance_scale": 3.5 }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run nateraw/audio-super-resolution using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "nateraw/audio-super-resolution:9c3d3e39fb0cb6aea677264881d8073f835336137b39fdea4e94093319379535", { input: { seed: 42, ddim_steps: 50, input_file: "https://replicate.delivery/pbxt/JYv70XQsiZBbSmknfMhGoEb4QYbuyJ9hJkfgjyzCvh4TzPmT/music.wav", guidance_scale: 3.5 } } ); // To access the file URL: console.log(output.url()); //=> "http://example.com" // To write the file to disk: fs.writeFile("my-image.png", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run nateraw/audio-super-resolution using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "nateraw/audio-super-resolution:9c3d3e39fb0cb6aea677264881d8073f835336137b39fdea4e94093319379535", input={ "seed": 42, "ddim_steps": 50, "input_file": "https://replicate.delivery/pbxt/JYv70XQsiZBbSmknfMhGoEb4QYbuyJ9hJkfgjyzCvh4TzPmT/music.wav", "guidance_scale": 3.5 } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Run nateraw/audio-super-resolution using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "nateraw/audio-super-resolution:9c3d3e39fb0cb6aea677264881d8073f835336137b39fdea4e94093319379535", "input": { "seed": 42, "ddim_steps": 50, "input_file": "https://replicate.delivery/pbxt/JYv70XQsiZBbSmknfMhGoEb4QYbuyJ9hJkfgjyzCvh4TzPmT/music.wav", "guidance_scale": 3.5 } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
Video Player is loading.Current Time 00:00:000/Duration 00:00:000Loaded: 0%Stream Type LIVERemaining Time -00:00:0001x- Chapters
- descriptions off, selected
- captions settings, opens captions settings dialog
- captions off, selected
This is a modal window.
Beginning of dialog window. Escape will cancel and close the window.
End of dialog window.
{ "completed_at": "2023-09-20T19:19:06.183902Z", "created_at": "2023-09-20T19:17:04.700648Z", "data_removed": false, "error": null, "id": "qqynzilbxm2t5h6eaywx2p7g54", "input": { "seed": 42, "ddim_steps": 50, "input_file": "https://replicate.delivery/pbxt/JYv70XQsiZBbSmknfMhGoEb4QYbuyJ9hJkfgjyzCvh4TzPmT/music.wav", "guidance_scale": 3.5 }, "logs": "\u001b[93m Warning: audio is longer than 10.24 seconds, may degrade the model performance. It's recommand to truncate your audio to 5.12 seconds before input to AudioSR to get the best performance.\u001b[00m\n/src/audiosr/utils.py:109: FutureWarning: Pass sr=48000, n_fft=2048, n_mels=256, fmin=20, fmax=24000 as keyword args. From version 0.10 passing these as positional arguments will result in an error\nmel = librosa_mel_fn(sampling_rate, filter_length, n_mel, mel_fmin, mel_fmax)\nRunning DDIM Sampling with 50 timesteps\nDDIM Sampler: 0%| | 0/50 [00:00<?, ?it/s]\nDDIM Sampler: 2%|▏ | 1/50 [00:06<05:20, 6.54s/it]\nDDIM Sampler: 4%|▍ | 2/50 [00:06<02:14, 2.80s/it]\nDDIM Sampler: 6%|▌ | 3/50 [00:06<01:15, 1.60s/it]\nDDIM Sampler: 8%|▊ | 4/50 [00:07<00:47, 1.04s/it]\nDDIM Sampler: 10%|█ | 5/50 [00:07<00:32, 1.37it/s]\nDDIM Sampler: 12%|█▏ | 6/50 [00:07<00:23, 1.84it/s]\nDDIM Sampler: 14%|█▍ | 7/50 [00:07<00:18, 2.36it/s]\nDDIM Sampler: 16%|█▌ | 8/50 [00:07<00:14, 2.89it/s]\nDDIM Sampler: 18%|█▊ | 9/50 [00:07<00:12, 3.40it/s]\nDDIM Sampler: 20%|██ | 10/50 [00:08<00:10, 3.86it/s]\nDDIM Sampler: 22%|██▏ | 11/50 [00:08<00:09, 4.26it/s]\nDDIM Sampler: 24%|██▍ | 12/50 [00:08<00:08, 4.59it/s]\nDDIM Sampler: 26%|██▌ | 13/50 [00:08<00:07, 4.85it/s]\nDDIM Sampler: 28%|██▊ | 14/50 [00:08<00:07, 5.04it/s]\nDDIM Sampler: 30%|███ | 15/50 [00:09<00:06, 5.19it/s]\nDDIM Sampler: 32%|███▏ | 16/50 [00:09<00:06, 5.29it/s]\nDDIM Sampler: 34%|███▍ | 17/50 [00:09<00:06, 5.37it/s]\nDDIM Sampler: 36%|███▌ | 18/50 [00:09<00:05, 5.42it/s]\nDDIM Sampler: 38%|███▊ | 19/50 [00:09<00:05, 5.46it/s]\nDDIM Sampler: 40%|████ | 20/50 [00:09<00:05, 5.49it/s]\nDDIM Sampler: 42%|████▏ | 21/50 [00:10<00:05, 5.51it/s]\nDDIM Sampler: 44%|████▍ | 22/50 [00:10<00:05, 5.53it/s]\nDDIM Sampler: 46%|████▌ | 23/50 [00:10<00:04, 5.54it/s]\nDDIM Sampler: 48%|████▊ | 24/50 [00:10<00:04, 5.55it/s]\nDDIM Sampler: 50%|█████ | 25/50 [00:10<00:04, 5.55it/s]\nDDIM Sampler: 52%|█████▏ | 26/50 [00:11<00:04, 5.56it/s]\nDDIM Sampler: 54%|█████▍ | 27/50 [00:11<00:04, 5.56it/s]\nDDIM Sampler: 56%|█████▌ | 28/50 [00:11<00:03, 5.56it/s]\nDDIM Sampler: 58%|█████▊ | 29/50 [00:11<00:03, 5.56it/s]\nDDIM Sampler: 60%|██████ | 30/50 [00:11<00:03, 5.56it/s]\nDDIM Sampler: 62%|██████▏ | 31/50 [00:11<00:03, 5.56it/s]\nDDIM Sampler: 64%|██████▍ | 32/50 [00:12<00:03, 5.48it/s]\nDDIM Sampler: 66%|██████▌ | 33/50 [00:12<00:03, 5.50it/s]\nDDIM Sampler: 68%|██████▊ | 34/50 [00:12<00:02, 5.44it/s]\nDDIM Sampler: 70%|███████ | 35/50 [00:12<00:02, 5.23it/s]\nDDIM Sampler: 72%|███████▏ | 36/50 [00:12<00:02, 5.08it/s]\nDDIM Sampler: 74%|███████▍ | 37/50 [00:13<00:02, 4.99it/s]\nDDIM Sampler: 76%|███████▌ | 38/50 [00:13<00:02, 4.85it/s]\nDDIM Sampler: 78%|███████▊ | 39/50 [00:13<00:02, 4.74it/s]\nDDIM Sampler: 80%|████████ | 40/50 [00:13<00:02, 4.71it/s]\nDDIM Sampler: 82%|████████▏ | 41/50 [00:13<00:01, 4.69it/s]\nDDIM Sampler: 84%|████████▍ | 42/50 [00:14<00:01, 4.68it/s]\nDDIM Sampler: 86%|████████▌ | 43/50 [00:14<00:01, 4.68it/s]\nDDIM Sampler: 88%|████████▊ | 44/50 [00:14<00:01, 4.81it/s]\nDDIM Sampler: 90%|█████████ | 45/50 [00:14<00:00, 5.01it/s]\nDDIM Sampler: 92%|█████████▏| 46/50 [00:14<00:00, 5.16it/s]\nDDIM Sampler: 94%|█████████▍| 47/50 [00:15<00:00, 5.28it/s]\nDDIM Sampler: 96%|█████████▌| 48/50 [00:15<00:00, 5.36it/s]\nDDIM Sampler: 98%|█████████▊| 49/50 [00:15<00:00, 5.42it/s]\nDDIM Sampler: 100%|██████████| 50/50 [00:15<00:00, 5.46it/s]\nDDIM Sampler: 100%|██████████| 50/50 [00:15<00:00, 3.19it/s]", "metrics": { "predict_time": 32.909901, "total_time": 121.483254 }, "output": "https://pbxt.replicate.delivery/73xEQwFed8UXSCQpBJuLYgg8ym0vjp2nSidaqWKwcxhUvHzIA/out.wav", "started_at": "2023-09-20T19:18:33.274001Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/qqynzilbxm2t5h6eaywx2p7g54", "cancel": "https://api.replicate.com/v1/predictions/qqynzilbxm2t5h6eaywx2p7g54/cancel" }, "version": "9c3d3e39fb0cb6aea677264881d8073f835336137b39fdea4e94093319379535" }
Generated inWarning: audio is longer than 10.24 seconds, may degrade the model performance. It's recommand to truncate your audio to 5.12 seconds before input to AudioSR to get the best performance. /src/audiosr/utils.py:109: FutureWarning: Pass sr=48000, n_fft=2048, n_mels=256, fmin=20, fmax=24000 as keyword args. From version 0.10 passing these as positional arguments will result in an error mel = librosa_mel_fn(sampling_rate, filter_length, n_mel, mel_fmin, mel_fmax) Running DDIM Sampling with 50 timesteps DDIM Sampler: 0%| | 0/50 [00:00<?, ?it/s] DDIM Sampler: 2%|▏ | 1/50 [00:06<05:20, 6.54s/it] DDIM Sampler: 4%|▍ | 2/50 [00:06<02:14, 2.80s/it] DDIM Sampler: 6%|▌ | 3/50 [00:06<01:15, 1.60s/it] DDIM Sampler: 8%|▊ | 4/50 [00:07<00:47, 1.04s/it] DDIM Sampler: 10%|█ | 5/50 [00:07<00:32, 1.37it/s] DDIM Sampler: 12%|█▏ | 6/50 [00:07<00:23, 1.84it/s] DDIM Sampler: 14%|█▍ | 7/50 [00:07<00:18, 2.36it/s] DDIM Sampler: 16%|█▌ | 8/50 [00:07<00:14, 2.89it/s] DDIM Sampler: 18%|█▊ | 9/50 [00:07<00:12, 3.40it/s] DDIM Sampler: 20%|██ | 10/50 [00:08<00:10, 3.86it/s] DDIM Sampler: 22%|██▏ | 11/50 [00:08<00:09, 4.26it/s] DDIM Sampler: 24%|██▍ | 12/50 [00:08<00:08, 4.59it/s] DDIM Sampler: 26%|██▌ | 13/50 [00:08<00:07, 4.85it/s] DDIM Sampler: 28%|██▊ | 14/50 [00:08<00:07, 5.04it/s] DDIM Sampler: 30%|███ | 15/50 [00:09<00:06, 5.19it/s] DDIM Sampler: 32%|███▏ | 16/50 [00:09<00:06, 5.29it/s] DDIM Sampler: 34%|███▍ | 17/50 [00:09<00:06, 5.37it/s] DDIM Sampler: 36%|███▌ | 18/50 [00:09<00:05, 5.42it/s] DDIM Sampler: 38%|███▊ | 19/50 [00:09<00:05, 5.46it/s] DDIM Sampler: 40%|████ | 20/50 [00:09<00:05, 5.49it/s] DDIM Sampler: 42%|████▏ | 21/50 [00:10<00:05, 5.51it/s] DDIM Sampler: 44%|████▍ | 22/50 [00:10<00:05, 5.53it/s] DDIM Sampler: 46%|████▌ | 23/50 [00:10<00:04, 5.54it/s] DDIM Sampler: 48%|████▊ | 24/50 [00:10<00:04, 5.55it/s] DDIM Sampler: 50%|█████ | 25/50 [00:10<00:04, 5.55it/s] DDIM Sampler: 52%|█████▏ | 26/50 [00:11<00:04, 5.56it/s] DDIM Sampler: 54%|█████▍ | 27/50 [00:11<00:04, 5.56it/s] DDIM Sampler: 56%|█████▌ | 28/50 [00:11<00:03, 5.56it/s] DDIM Sampler: 58%|█████▊ | 29/50 [00:11<00:03, 5.56it/s] DDIM Sampler: 60%|██████ | 30/50 [00:11<00:03, 5.56it/s] DDIM Sampler: 62%|██████▏ | 31/50 [00:11<00:03, 5.56it/s] DDIM Sampler: 64%|██████▍ | 32/50 [00:12<00:03, 5.48it/s] DDIM Sampler: 66%|██████▌ | 33/50 [00:12<00:03, 5.50it/s] DDIM Sampler: 68%|██████▊ | 34/50 [00:12<00:02, 5.44it/s] DDIM Sampler: 70%|███████ | 35/50 [00:12<00:02, 5.23it/s] DDIM Sampler: 72%|███████▏ | 36/50 [00:12<00:02, 5.08it/s] DDIM Sampler: 74%|███████▍ | 37/50 [00:13<00:02, 4.99it/s] DDIM Sampler: 76%|███████▌ | 38/50 [00:13<00:02, 4.85it/s] DDIM Sampler: 78%|███████▊ | 39/50 [00:13<00:02, 4.74it/s] DDIM Sampler: 80%|████████ | 40/50 [00:13<00:02, 4.71it/s] DDIM Sampler: 82%|████████▏ | 41/50 [00:13<00:01, 4.69it/s] DDIM Sampler: 84%|████████▍ | 42/50 [00:14<00:01, 4.68it/s] DDIM Sampler: 86%|████████▌ | 43/50 [00:14<00:01, 4.68it/s] DDIM Sampler: 88%|████████▊ | 44/50 [00:14<00:01, 4.81it/s] DDIM Sampler: 90%|█████████ | 45/50 [00:14<00:00, 5.01it/s] DDIM Sampler: 92%|█████████▏| 46/50 [00:14<00:00, 5.16it/s] DDIM Sampler: 94%|█████████▍| 47/50 [00:15<00:00, 5.28it/s] DDIM Sampler: 96%|█████████▌| 48/50 [00:15<00:00, 5.36it/s] DDIM Sampler: 98%|█████████▊| 49/50 [00:15<00:00, 5.42it/s] DDIM Sampler: 100%|██████████| 50/50 [00:15<00:00, 5.46it/s] DDIM Sampler: 100%|██████████| 50/50 [00:15<00:00, 3.19it/s]
Want to make some of these yourself?
Run this model