Readme
This package provides models for music structure analysis, predicting:
- Tempo (BPM)
- Beats
- Downbeats
- Functional segment boundaries
- Functional segment labels (e.g., intro, verse, chorus, bridge, outro)
This is a Cog implementation of the 'All-In-One Music Structure Analyzer' by mir-aidj (Taejun Kim).
To run this model in Node.js, first install Replicate's client library:
npm install replicate
Set the REPLICATE_API_TOKEN environment variable:
export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
import Replicate from "replicate";
const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});
Run sakemin/all-in-one-music-structure-analyzer using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
  "sakemin/all-in-one-music-structure-analyzer:001b4137be6ac67bdc28cb5cffacf128b874f530258d033de23121e785cb7290",
  {
    input: {
      activ: false,
      embed: false,
      model: "harmonix-all",
      sonify: true,
      visualize: true,
      music_input: "https://replicate.delivery/pbxt/K3iP4RhDPayT24NMYswahQ7kYfG1NS4vhNaF3PZVSZoLaSSY/x2mate.com%20-%20Sean%20Lennon.%20Parachute%20%28128%20kbps%29.mp3",
      include_embeddings: false,
      include_activations: false
    }
  }
);
// To access the URL of the first output file (the analysis JSON):
console.log(output[0].url());
// To write that file to disk:
import { writeFile } from "node:fs/promises";
await writeFile("analysis.json", output[0]);
To learn more, take a look at the guide on getting started with Node.js.
pip install replicate
Set the REPLICATE_API_TOKEN environment variable:
export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
import replicate
Run sakemin/all-in-one-music-structure-analyzer using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
    "sakemin/all-in-one-music-structure-analyzer:001b4137be6ac67bdc28cb5cffacf128b874f530258d033de23121e785cb7290",
    input={
        "activ": False,
        "embed": False,
        "model": "harmonix-all",
        "sonify": True,
        "visualize": True,
        "music_input": "https://replicate.delivery/pbxt/K3iP4RhDPayT24NMYswahQ7kYfG1NS4vhNaF3PZVSZoLaSSY/x2mate.com%20-%20Sean%20Lennon.%20Parachute%20%28128%20kbps%29.mp3",
        "include_embeddings": False,
        "include_activations": False
    }
)
print(output)
To learn more, take a look at the guide on getting started with Python.
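Once a run finishes, the first output file is a JSON document holding the analysis. The sketch below shows one way to condense such a document into a short summary. The field names (`bpm`, `beats`, `downbeats`, and `segments` with `start`, `end`, `label`) are an assumption based on the quantities this model predicts; they are not confirmed by this page, so check them against a real output file. The sample values are hypothetical.

```python
import json


def summarize_analysis(analysis: dict) -> dict:
    """Condense an analysis result into a few headline numbers."""
    return {
        "bpm": analysis.get("bpm"),
        "num_beats": len(analysis.get("beats", [])),
        "num_downbeats": len(analysis.get("downbeats", [])),
        # (label, duration in seconds) for each functional segment
        "sections": [
            (seg["label"], round(seg["end"] - seg["start"], 2))
            for seg in analysis.get("segments", [])
        ],
    }


# Hypothetical sample data for illustration; not real model output.
sample = json.loads("""
{
  "bpm": 100,
  "beats": [0.6, 1.2, 1.8, 2.4],
  "downbeats": [0.6],
  "segments": [
    {"start": 0.0, "end": 8.0, "label": "intro"},
    {"start": 8.0, "end": 24.5, "label": "verse"}
  ]
}
""")

summary = summarize_analysis(sample)
print(summary)
```

In practice the dictionary passed to `summarize_analysis` would come from downloading and parsing the JSON file returned by the run, rather than from an inline string.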
Set the REPLICATE_API_TOKEN environment variable:
export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run sakemin/all-in-one-music-structure-analyzer using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "sakemin/all-in-one-music-structure-analyzer:001b4137be6ac67bdc28cb5cffacf128b874f530258d033de23121e785cb7290",
    "input": {
      "activ": false,
      "embed": false,
      "model": "harmonix-all",
      "sonify": true,
      "visualize": true,
      "music_input": "https://replicate.delivery/pbxt/K3iP4RhDPayT24NMYswahQ7kYfG1NS4vhNaF3PZVSZoLaSSY/x2mate.com%20-%20Sean%20Lennon.%20Parachute%20%28128%20kbps%29.mp3",
      "include_embeddings": false,
      "include_activations": false
    }
  }' \
  https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
{
"completed_at": "2023-12-21T09:07:04.696262Z",
"created_at": "2023-12-21T09:04:43.008830Z",
"data_removed": false,
"error": null,
"id": "ofu2ymdbv6iwlqh3htdy2df2rq",
"input": {
"activ": false,
"embed": false,
"model": "harmonix-all",
"sonify": true,
"visualize": true,
"music_input": "https://replicate.delivery/pbxt/K3iP4RhDPayT24NMYswahQ7kYfG1NS4vhNaF3PZVSZoLaSSY/x2mate.com%20-%20Sean%20Lennon.%20Parachute%20%28128%20kbps%29.mp3",
"include_embeddings": false,
"include_activations": false
},
"logs": "=> Found 0 tracks already analyzed and 1 tracks to analyze.\n=> Found 0 tracks already demixed, 1 to demix.\nDownloading: \"https://dl.fbaipublicfiles.com/demucs/hybrid_transformer/955717e8-8726e21a.th\" to /root/.cache/torch/hub/checkpoints/955717e8-8726e21a.th\n 0%| | 0.00/80.2M [00:00<?, ?B/s]\n 0%| | 104k/80.2M [00:00<01:21, 1.03MB/s]\n 1%| | 488k/80.2M [00:00<00:31, 2.68MB/s]\n 2%|▏ | 1.96M/80.2M [00:00<00:09, 8.46MB/s]\n 10%|▉ | 7.84M/80.2M [00:00<00:02, 29.2MB/s]\n 38%|███▊ | 30.9M/80.2M [00:00<00:00, 105MB/s] \n 68%|██████▊ | 54.8M/80.2M [00:00<00:00, 155MB/s]\n 97%|█████████▋| 77.6M/80.2M [00:00<00:00, 182MB/s]\n100%|██████████| 80.2M/80.2M [00:00<00:00, 117MB/s]\nSelected model is a bag of 1 models. You will see that many progress bars per track.\nSeparated tracks will be stored in /src/demix/htdemucs\nSeparating track /tmp/tmphbi1m7kax2mate.com - Sean Lennon. Parachute (128 kbps).mp3\n 0%| | 0.0/204.75 [00:00<?, ?seconds/s]\n 3%|██ | 5.85/204.75 [00:01<01:02, 3.19seconds/s]\n 6%|████ | 11.7/204.75 [00:01<00:27, 7.12seconds/s]\n 9%|████▉ | 17.549999999999997/204.75 [00:02<00:15, 11.75seconds/s]\n 11%|████████▏ | 23.4/204.75 [00:02<00:10, 16.92seconds/s]\n 14%|██████████▏ | 29.25/204.75 [00:02<00:07, 22.37seconds/s]\n 17%|█████████▉ | 35.099999999999994/204.75 [00:02<00:06, 27.74seconds/s]\n 20%|███████████▌ | 40.949999999999996/204.75 [00:02<00:04, 32.84seconds/s]\n 23%|████████████████▍ | 46.8/204.75 [00:02<00:04, 37.21seconds/s]\n 26%|██████████████████▎ | 52.65/204.75 [00:02<00:03, 40.95seconds/s]\n 29%|████████████████████▌ | 58.5/204.75 [00:02<00:03, 43.60seconds/s]\n 31%|██████████████████████▎ | 64.35/204.75 [00:02<00:03, 45.56seconds/s]\n 34%|████████████████████▏ | 70.19999999999999/204.75 [00:03<00:02, 47.30seconds/s]\n 37%|██████████████████████████▎ | 76.05/204.75 [00:03<00:02, 48.58seconds/s]\n 40%|███████████████████████▌ | 81.89999999999999/204.75 [00:03<00:02, 49.59seconds/s]\n 43%|██████████████████████████████▍ | 87.75/204.75 
[00:03<00:02, 50.27seconds/s]\n 46%|████████████████████████████████▉ | 93.6/204.75 [00:03<00:02, 50.95seconds/s]\n 49%|████████████████████████████▋ | 99.44999999999999/204.75 [00:03<00:02, 51.35seconds/s]\n 51%|████████████████████████████████████▌ | 105.3/204.75 [00:03<00:01, 51.61seconds/s]\n 54%|███████████████████████████████▍ | 111.14999999999999/204.75 [00:03<00:01, 51.40seconds/s]\n 57%|████████████████████████████████████████▌ | 117.0/204.75 [00:03<00:01, 51.55seconds/s]\n 60%|██████████████████████████████████████████ | 122.85/204.75 [00:04<00:01, 52.05seconds/s]\n 63%|████████████████████████████████████████████▋ | 128.7/204.75 [00:04<00:01, 52.24seconds/s]\n 66%|██████████████████████████████████████ | 134.54999999999998/204.75 [00:04<00:01, 51.83seconds/s]\n 69%|███████████████████████████████████████▊ | 140.39999999999998/204.75 [00:04<00:01, 51.48seconds/s]\n 71%|██████████████████████████████████████████████████ | 146.25/204.75 [00:04<00:01, 51.40seconds/s]\n 74%|████████████████████████████████████████████████████▋ | 152.1/204.75 [00:04<00:01, 51.40seconds/s]\n 77%|█████████████████████████████████████████████████████▉ | 157.95/204.75 [00:04<00:00, 51.45seconds/s]\n 80%|██████████████████████████████████████████████▍ | 163.79999999999998/204.75 [00:04<00:00, 51.26seconds/s]\n 83%|████████████████████████████████████████████████ | 169.64999999999998/204.75 [00:04<00:00, 51.05seconds/s]\n 86%|████████████████████████████████████████████████████████████▊ | 175.5/204.75 [00:05<00:00, 51.64seconds/s]\n 89%|██████████████████████████████████████████████████████████████ | 181.35/204.75 [00:05<00:00, 52.30seconds/s]\n 91%|████████████████████████████████████████████████████████████████▉ | 187.2/204.75 [00:05<00:00, 51.43seconds/s]\n 94%|██████████████████████████████████████████████████████▋ | 193.04999999999998/204.75 [00:05<00:00, 51.65seconds/s]\n 97%|████████████████████████████████████████████████████████▎ | 198.89999999999998/204.75 [00:05<00:00, 
49.10seconds/s]\n100%|██████████████████████████████████████████████████████████████████████| 204.75/204.75 [00:05<00:00, 50.66seconds/s]\n100%|██████████████████████████████████████████████████████████████████████| 204.75/204.75 [00:05<00:00, 36.00seconds/s]\n=> Found 0 spectrograms already extracted, 1 to extract.\nExtracting spectrograms: 0%| | 0/1 [00:00<?, ?it/s]\nExtracting spectrograms: 100%|██████████| 1/1 [00:06<00:00, 6.23s/it]\nExtracting spectrograms: 100%|██████████| 1/1 [00:06<00:00, 6.23s/it]\nharmonix-fold0-0vra4ys2.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]\nharmonix-fold0-0vra4ys2.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 5.04MB/s]\nharmonix-fold0-0vra4ys2.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 5.03MB/s]\nharmonix-fold1-3ozjhtsj.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]\nharmonix-fold1-3ozjhtsj.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 26.8MB/s]\nharmonix-fold2-gmgo0nsy.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]\nharmonix-fold2-gmgo0nsy.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 77.8MB/s]\nharmonix-fold3-i92b7m8p.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]\nharmonix-fold3-i92b7m8p.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 81.7MB/s]\nharmonix-fold4-1bql5qo0.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]\nharmonix-fold4-1bql5qo0.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 78.5MB/s]\nharmonix-fold5-x4z5zeef.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]\nharmonix-fold5-x4z5zeef.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 77.5MB/s]\nharmonix-fold6-x7t226rq.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]\nharmonix-fold6-x7t226rq.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 76.5MB/s]\nharmonix-fold7-qwwskhg6.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]\nharmonix-fold7-qwwskhg6.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 77.7MB/s]\n0%| | 0/1 [00:00<?, ?it/s]\nAnalyzing tmphbi1m7kax2mate.com - Sean Lennon. Parachute (128 kbps).mp3: 0%| | 0/1 [00:00<?, ?it/s]\nAnalyzing tmphbi1m7kax2mate.com - Sean Lennon. 
Parachute (128 kbps).mp3: 100%|██████████| 1/1 [00:03<00:00, 3.43s/it]\nAnalyzing tmphbi1m7kax2mate.com - Sean Lennon. Parachute (128 kbps).mp3: 100%|██████████| 1/1 [00:03<00:00, 3.43s/it]\nVisualizing results: 0%| | 0/1 [00:00<?, ?it/s]\nVisualizing results: 100%|██████████| 1/1 [00:10<00:00, 10.77s/it]\nVisualizing results: 100%|██████████| 1/1 [00:10<00:00, 10.77s/it]\n=> Plots are successfully saved to ./viz\nSonifying results: 0%| | 0/1 [00:00<?, ?it/s]\nSonifying results: 100%|██████████| 1/1 [00:07<00:00, 7.53s/it]\nSonifying results: 100%|██████████| 1/1 [00:07<00:00, 7.53s/it]\n=> Sonified tracks are successfully saved to ./sonif",
"metrics": {
"predict_time": 65.818421,
"total_time": 141.687432
},
"output": [
"https://replicate.delivery/pbxt/vmeY0lK6XzVOFKjJcGEJFsrK7gpjrcLQOhXzifrjoBp2IbESA/tmphbi1m7kax2mate.com%20-%20Sean%20Lennon.%20Parachute%20%28128%20kbps%29.json",
"https://replicate.delivery/pbxt/989hApZF7c7RB535295mWrCQ43npy2QxfHeMJfoY9jnvR2IkA/tmphbi1m7kax2mate.com%20-%20Sean%20Lennon.%20Parachute%20%28128%20kbps%29.png",
"https://replicate.delivery/pbxt/UGeLnwlwBwT6cKpwjvCSoeYJX5bYH1n8E2elbdffi6e2NyGhE/tmphbi1m7kax2mate.com%20-%20Sean%20Lennon.%20Parachute%20%28128%20kbps%29.sonif.mp3"
],
"started_at": "2023-12-21T09:05:58.877841Z",
"status": "succeeded",
"urls": {
"get": "https://api.replicate.com/v1/predictions/ofu2ymdbv6iwlqh3htdy2df2rq",
"cancel": "https://api.replicate.com/v1/predictions/ofu2ymdbv6iwlqh3htdy2df2rq/cancel"
},
"version": "001b4137be6ac67bdc28cb5cffacf128b874f530258d033de23121e785cb7290"
}
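The timestamps and output URLs in the response above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of Replicate's client: `parse_ts`, `timing`, and `outputs_by_suffix` are hypothetical helper names, and the dictionary is a trimmed copy of the response shown above. It suggests that `predict_time` corresponds to `completed_at - started_at` and `total_time` to `completed_at - created_at`, and that the three outputs can be told apart by file extension.

```python
from datetime import datetime
from pathlib import PurePosixPath
from urllib.parse import unquote, urlparse

# Trimmed copy of the prediction response shown above.
prediction = {
    "created_at": "2023-12-21T09:04:43.008830Z",
    "started_at": "2023-12-21T09:05:58.877841Z",
    "completed_at": "2023-12-21T09:07:04.696262Z",
    "output": [
        "https://replicate.delivery/pbxt/vmeY0lK6XzVOFKjJcGEJFsrK7gpjrcLQOhXzifrjoBp2IbESA/tmphbi1m7kax2mate.com%20-%20Sean%20Lennon.%20Parachute%20%28128%20kbps%29.json",
        "https://replicate.delivery/pbxt/989hApZF7c7RB535295mWrCQ43npy2QxfHeMJfoY9jnvR2IkA/tmphbi1m7kax2mate.com%20-%20Sean%20Lennon.%20Parachute%20%28128%20kbps%29.png",
        "https://replicate.delivery/pbxt/UGeLnwlwBwT6cKpwjvCSoeYJX5bYH1n8E2elbdffi6e2NyGhE/tmphbi1m7kax2mate.com%20-%20Sean%20Lennon.%20Parachute%20%28128%20kbps%29.sonif.mp3",
    ],
}


def parse_ts(ts: str) -> datetime:
    """Parse an ISO 8601 timestamp with a trailing 'Z' (UTC)."""
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))


def timing(pred: dict) -> dict:
    """Split a prediction's lifetime into waiting and running time, in seconds."""
    created = parse_ts(pred["created_at"])
    started = parse_ts(pred["started_at"])
    completed = parse_ts(pred["completed_at"])
    return {
        "waiting": (started - created).total_seconds(),
        "predict": (completed - started).total_seconds(),
        "total": (completed - created).total_seconds(),
    }


def outputs_by_suffix(urls: list) -> dict:
    """Map each output URL to its file extension (.json, .png, .mp3)."""
    return {PurePosixPath(unquote(urlparse(u).path)).suffix: u for u in urls}


times = timing(prediction)
files = outputs_by_suffix(prediction["output"])
print(times)
print(sorted(files))
```

For this response, the remaining time between `created_at` and `started_at` is spent before the prediction starts running, which is consistent with a cold start.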
=> Found 0 tracks already analyzed and 1 tracks to analyze.
=> Found 0 tracks already demixed, 1 to demix.
Downloading: "https://dl.fbaipublicfiles.com/demucs/hybrid_transformer/955717e8-8726e21a.th" to /root/.cache/torch/hub/checkpoints/955717e8-8726e21a.th
0%| | 0.00/80.2M [00:00<?, ?B/s]
0%| | 104k/80.2M [00:00<01:21, 1.03MB/s]
1%| | 488k/80.2M [00:00<00:31, 2.68MB/s]
2%|▏ | 1.96M/80.2M [00:00<00:09, 8.46MB/s]
10%|▉ | 7.84M/80.2M [00:00<00:02, 29.2MB/s]
38%|███▊ | 30.9M/80.2M [00:00<00:00, 105MB/s]
68%|██████▊ | 54.8M/80.2M [00:00<00:00, 155MB/s]
97%|█████████▋| 77.6M/80.2M [00:00<00:00, 182MB/s]
100%|██████████| 80.2M/80.2M [00:00<00:00, 117MB/s]
Selected model is a bag of 1 models. You will see that many progress bars per track.
Separated tracks will be stored in /src/demix/htdemucs
Separating track /tmp/tmphbi1m7kax2mate.com - Sean Lennon. Parachute (128 kbps).mp3
0%| | 0.0/204.75 [00:00<?, ?seconds/s]
3%|██ | 5.85/204.75 [00:01<01:02, 3.19seconds/s]
6%|████ | 11.7/204.75 [00:01<00:27, 7.12seconds/s]
9%|████▉ | 17.549999999999997/204.75 [00:02<00:15, 11.75seconds/s]
11%|████████▏ | 23.4/204.75 [00:02<00:10, 16.92seconds/s]
14%|██████████▏ | 29.25/204.75 [00:02<00:07, 22.37seconds/s]
17%|█████████▉ | 35.099999999999994/204.75 [00:02<00:06, 27.74seconds/s]
20%|███████████▌ | 40.949999999999996/204.75 [00:02<00:04, 32.84seconds/s]
23%|████████████████▍ | 46.8/204.75 [00:02<00:04, 37.21seconds/s]
26%|██████████████████▎ | 52.65/204.75 [00:02<00:03, 40.95seconds/s]
29%|████████████████████▌ | 58.5/204.75 [00:02<00:03, 43.60seconds/s]
31%|██████████████████████▎ | 64.35/204.75 [00:02<00:03, 45.56seconds/s]
34%|████████████████████▏ | 70.19999999999999/204.75 [00:03<00:02, 47.30seconds/s]
37%|██████████████████████████▎ | 76.05/204.75 [00:03<00:02, 48.58seconds/s]
40%|███████████████████████▌ | 81.89999999999999/204.75 [00:03<00:02, 49.59seconds/s]
43%|██████████████████████████████▍ | 87.75/204.75 [00:03<00:02, 50.27seconds/s]
46%|████████████████████████████████▉ | 93.6/204.75 [00:03<00:02, 50.95seconds/s]
49%|████████████████████████████▋ | 99.44999999999999/204.75 [00:03<00:02, 51.35seconds/s]
51%|████████████████████████████████████▌ | 105.3/204.75 [00:03<00:01, 51.61seconds/s]
54%|███████████████████████████████▍ | 111.14999999999999/204.75 [00:03<00:01, 51.40seconds/s]
57%|████████████████████████████████████████▌ | 117.0/204.75 [00:03<00:01, 51.55seconds/s]
60%|██████████████████████████████████████████ | 122.85/204.75 [00:04<00:01, 52.05seconds/s]
63%|████████████████████████████████████████████▋ | 128.7/204.75 [00:04<00:01, 52.24seconds/s]
66%|██████████████████████████████████████ | 134.54999999999998/204.75 [00:04<00:01, 51.83seconds/s]
69%|███████████████████████████████████████▊ | 140.39999999999998/204.75 [00:04<00:01, 51.48seconds/s]
71%|██████████████████████████████████████████████████ | 146.25/204.75 [00:04<00:01, 51.40seconds/s]
74%|████████████████████████████████████████████████████▋ | 152.1/204.75 [00:04<00:01, 51.40seconds/s]
77%|█████████████████████████████████████████████████████▉ | 157.95/204.75 [00:04<00:00, 51.45seconds/s]
80%|██████████████████████████████████████████████▍ | 163.79999999999998/204.75 [00:04<00:00, 51.26seconds/s]
83%|████████████████████████████████████████████████ | 169.64999999999998/204.75 [00:04<00:00, 51.05seconds/s]
86%|████████████████████████████████████████████████████████████▊ | 175.5/204.75 [00:05<00:00, 51.64seconds/s]
89%|██████████████████████████████████████████████████████████████ | 181.35/204.75 [00:05<00:00, 52.30seconds/s]
91%|████████████████████████████████████████████████████████████████▉ | 187.2/204.75 [00:05<00:00, 51.43seconds/s]
94%|██████████████████████████████████████████████████████▋ | 193.04999999999998/204.75 [00:05<00:00, 51.65seconds/s]
97%|████████████████████████████████████████████████████████▎ | 198.89999999999998/204.75 [00:05<00:00, 49.10seconds/s]
100%|██████████████████████████████████████████████████████████████████████| 204.75/204.75 [00:05<00:00, 50.66seconds/s]
100%|██████████████████████████████████████████████████████████████████████| 204.75/204.75 [00:05<00:00, 36.00seconds/s]
=> Found 0 spectrograms already extracted, 1 to extract.
Extracting spectrograms: 0%| | 0/1 [00:00<?, ?it/s]
Extracting spectrograms: 100%|██████████| 1/1 [00:06<00:00, 6.23s/it]
Extracting spectrograms: 100%|██████████| 1/1 [00:06<00:00, 6.23s/it]
harmonix-fold0-0vra4ys2.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]
harmonix-fold0-0vra4ys2.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 5.04MB/s]
harmonix-fold0-0vra4ys2.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 5.03MB/s]
harmonix-fold1-3ozjhtsj.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]
harmonix-fold1-3ozjhtsj.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 26.8MB/s]
harmonix-fold2-gmgo0nsy.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]
harmonix-fold2-gmgo0nsy.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 77.8MB/s]
harmonix-fold3-i92b7m8p.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]
harmonix-fold3-i92b7m8p.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 81.7MB/s]
harmonix-fold4-1bql5qo0.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]
harmonix-fold4-1bql5qo0.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 78.5MB/s]
harmonix-fold5-x4z5zeef.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]
harmonix-fold5-x4z5zeef.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 77.5MB/s]
harmonix-fold6-x7t226rq.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]
harmonix-fold6-x7t226rq.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 76.5MB/s]
harmonix-fold7-qwwskhg6.pth: 0%| | 0.00/1.40M [00:00<?, ?B/s]
harmonix-fold7-qwwskhg6.pth: 100%|██████████| 1.40M/1.40M [00:00<00:00, 77.7MB/s]
0%| | 0/1 [00:00<?, ?it/s]
Analyzing tmphbi1m7kax2mate.com - Sean Lennon. Parachute (128 kbps).mp3: 0%| | 0/1 [00:00<?, ?it/s]
Analyzing tmphbi1m7kax2mate.com - Sean Lennon. Parachute (128 kbps).mp3: 100%|██████████| 1/1 [00:03<00:00, 3.43s/it]
Analyzing tmphbi1m7kax2mate.com - Sean Lennon. Parachute (128 kbps).mp3: 100%|██████████| 1/1 [00:03<00:00, 3.43s/it]
Visualizing results: 0%| | 0/1 [00:00<?, ?it/s]
Visualizing results: 100%|██████████| 1/1 [00:10<00:00, 10.77s/it]
Visualizing results: 100%|██████████| 1/1 [00:10<00:00, 10.77s/it]
=> Plots are successfully saved to ./viz
Sonifying results: 0%| | 0/1 [00:00<?, ?it/s]
Sonifying results: 100%|██████████| 1/1 [00:07<00:00, 7.53s/it]
Sonifying results: 100%|██████████| 1/1 [00:07<00:00, 7.53s/it]
=> Sonified tracks are successfully saved to ./sonif
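Most of the log above is progress-bar output. As a rough sketch, the status messages and the elapsed time of each labelled stage can be pulled out with a small filter. The helper names `status_lines` and `stage_seconds` are hypothetical, and the regular expression assumes the bar layout seen in this log (a `label: 100%|...| n/n [MM:SS<...` line); other layouts are simply ignored.

```python
import re

# Matches a finished, labelled progress bar such as
# "Visualizing results: 100%|████| 1/1 [00:10<00:00, 10.77s/it]".
# Assumes elapsed time is printed as MM:SS, as in the log above.
DONE_BAR = re.compile(r"^(?P<stage>.+?):\s+100%\|.*?\[(?P<min>\d+):(?P<sec>\d+)<")


def status_lines(logs: str) -> list:
    """Return the '=> ...' status messages without the arrow prefix."""
    return [line[3:] for line in logs.splitlines() if line.startswith("=> ")]


def stage_seconds(logs: str) -> dict:
    """Map each labelled stage to the elapsed whole seconds of its finished bar."""
    stages = {}
    for line in logs.splitlines():
        match = DONE_BAR.match(line.strip())
        if match:
            stages[match["stage"]] = int(match["min"]) * 60 + int(match["sec"])
    return stages


# A few lines copied from the log above.
sample_logs = "\n".join([
    "=> Found 0 spectrograms already extracted, 1 to extract.",
    "Extracting spectrograms: 0%| | 0/1 [00:00<?, ?it/s]",
    "Extracting spectrograms: 100%|██████████| 1/1 [00:06<00:00, 6.23s/it]",
    "Visualizing results: 100%|██████████| 1/1 [00:10<00:00, 10.77s/it]",
    "=> Plots are successfully saved to ./viz",
])

messages = status_lines(sample_logs)
stages = stage_seconds(sample_logs)
print(messages)
print(stages)
```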
This model costs approximately $0.056 to run on Replicate, or 17 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.
This model runs on Nvidia A100 (40GB) GPU hardware. Predictions typically complete within 49 seconds. The predict time for this model varies significantly based on the inputs.
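The pricing figures above follow from simple arithmetic, sketched here for budgeting purposes. The per-run cost is the approximate figure quoted above and varies with inputs, so the results are rough estimates only; the function names are hypothetical.

```python
import math

COST_PER_RUN_USD = 0.056  # approximate figure quoted above; varies with inputs


def runs_per_dollar(cost_per_run: float = COST_PER_RUN_USD) -> int:
    """Whole number of runs that fit into one dollar."""
    return math.floor(1 / cost_per_run)


def estimated_cost(runs: int, cost_per_run: float = COST_PER_RUN_USD) -> float:
    """Rough cost in USD for a batch of runs, rounded to cents."""
    return round(runs * cost_per_run, 2)


print(runs_per_dollar())    # 17
print(estimated_cost(100))  # 5.6
```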