zsxkib / hibiki
Hibiki: High-Fidelity Simultaneous Speech-To-Speech Translation
Prediction
zsxkib/hibiki:c72042d58e88a0dbcf77942f58b3b033b23f0d02d78cad22c4960afc3b8cce86
ID: mfa1v2wx6xrma0cmy3gb3b1ckr
Status: Succeeded
Source: Web
Hardware: L40S
Input
- audio_input: https://replicate.delivery/pbxt/MTZ2p9d3wjODaFBTC2qCxinnkFvHBAbVnZDQ9BdPIg8g9ABh/sample_fr_hibiki_crepes.mp3
- max_duration: 0
- cut_start_seconds: 2
- volume_reduction_db: 30
{ "audio_input": "https://replicate.delivery/pbxt/MTZ2p9d3wjODaFBTC2qCxinnkFvHBAbVnZDQ9BdPIg8g9ABh/sample_fr_hibiki_crepes.mp3", "max_duration": 0, "cut_start_seconds": 2, "volume_reduction_db": 30 }
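The page does not say how `volume_reduction_db` is applied. As a rough illustration only, assuming it attenuates one audio track by that many decibels before the tracks are combined into the final mix, the standard decibel-to-amplitude conversion looks like this. The function names `db_to_gain` and `attenuate` are hypothetical and are not part of the model or of Replicate's client.

```python
def db_to_gain(reduction_db: float) -> float:
    """Linear amplitude factor for an attenuation of `reduction_db` decibels."""
    # Amplitude ratios use 20 * log10(ratio), so the inverse is 10 ** (dB / 20).
    return 10 ** (-reduction_db / 20)


def attenuate(samples, reduction_db):
    """Scale raw audio samples (floats) by the attenuation factor.

    Hypothetical helper: how the model really applies the parameter is an assumption.
    """
    gain = db_to_gain(reduction_db)
    return [sample * gain for sample in samples]


if __name__ == "__main__":
    # 30 dB is the value used in the example input above.
    print(round(db_to_gain(30), 4))
    print(attenuate([1.0, -0.5], 20))
```

Under that assumption, a 30 dB reduction scales amplitudes to about 3% of their original level, so the attenuated track would remain faintly audible rather than silent.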
Install Replicate’s Node.js client library:
npm install replicate
Import and set up the client:
import Replicate from "replicate";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});
Run zsxkib/hibiki using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
import { writeFile } from "node:fs/promises";

const output = await replicate.run(
  "zsxkib/hibiki:c72042d58e88a0dbcf77942f58b3b033b23f0d02d78cad22c4960afc3b8cce86",
  {
    input: {
      audio_input: "https://replicate.delivery/pbxt/MTZ2p9d3wjODaFBTC2qCxinnkFvHBAbVnZDQ9BdPIg8g9ABh/sample_fr_hibiki_crepes.mp3",
      max_duration: 0,
      cut_start_seconds: 2,
      volume_reduction_db: 30
    }
  }
);

// To access the file URL:
console.log(output.url());

// To write the audio file to disk:
await writeFile("final_mix.wav", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
pip install replicate
Import the client:
import replicate
Run zsxkib/hibiki using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
    "zsxkib/hibiki:c72042d58e88a0dbcf77942f58b3b033b23f0d02d78cad22c4960afc3b8cce86",
    input={
        "audio_input": "https://replicate.delivery/pbxt/MTZ2p9d3wjODaFBTC2qCxinnkFvHBAbVnZDQ9BdPIg8g9ABh/sample_fr_hibiki_crepes.mp3",
        "max_duration": 0,
        "cut_start_seconds": 2,
        "volume_reduction_db": 30
    }
)
print(output)
To learn more, take a look at the guide on getting started with Python.
Run zsxkib/hibiki using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "zsxkib/hibiki:c72042d58e88a0dbcf77942f58b3b033b23f0d02d78cad22c4960afc3b8cce86",
    "input": {
      "audio_input": "https://replicate.delivery/pbxt/MTZ2p9d3wjODaFBTC2qCxinnkFvHBAbVnZDQ9BdPIg8g9ABh/sample_fr_hibiki_crepes.mp3",
      "max_duration": 0,
      "cut_start_seconds": 2,
      "volume_reduction_db": 30
    }
  }' \
  https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{
  "completed_at": "2025-02-10T17:22:08.245918Z",
  "created_at": "2025-02-10T17:20:39.735000Z",
  "data_removed": false,
  "error": null,
  "id": "mfa1v2wx6xrma0cmy3gb3b1ckr",
  "input": {
    "audio_input": "https://replicate.delivery/pbxt/MTZ2p9d3wjODaFBTC2qCxinnkFvHBAbVnZDQ9BdPIg8g9ABh/sample_fr_hibiki_crepes.mp3",
    "max_duration": 0,
    "cut_start_seconds": 2,
    "volume_reduction_db": 30
  },
  "logs": "\u001b[1;34m[Info]\u001b[0m retrieving checkpoint\n\u001b[1;34m[Info]\u001b[0m loading mimi\n\u001b[1;34m[Info]\u001b[0m mimi loaded\n\u001b[1;34m[Info]\u001b[0m loading moshi\n\u001b[1;34m[Info]\u001b[0m moshi loaded\n\u001b[1;34m[Info]\u001b[0m loading input file /tmp/tmpq1sdmc2dsample_fr_hibiki_crepes.mp3\nInfo: starting the inference loop\nInfo: processed 786 steps in 18s, 22.32ms/step\n<s> today we're going to make pancakes for this you'll need flour eggs milk a pinch of salt sugar and butter to start with put the flour in a bowl with salt and sugar make a well in the middle and pour the eggs into it start mixing slowly when the mixture becomes thick add the cold milk little by little when all the milk is mixed the dough becomes quite smooth if it seems too thick add some milk then add the melted butter cool mix well cook the pancakes in a hot pot about a small layer of dough in the pan make a rotation movement to spread the dough all over the surface sit over the fire and when the rum is in clear red color it's time to turn it over leave the eggs about one minute side and the pancake's ready to be eaten</s>\u001b[1;34m[Info]\u001b[0m writing /tmp/tmpugbtb_hf/out_en-0 with duration 60.3 sec.\n\u001b[1;34m[Info]\u001b[0m writing /tmp/tmpugbtb_hf/out_en-1 with duration 62.9 sec.",
  "metrics": {
    "predict_time": 23.824290533,
    "total_time": 88.510918
  },
  "output": "https://replicate.delivery/xezq/mBH468zlXfxyOqAZhnhG9un6wR432Rr7tHteymXXjQ1Af8boA/final_mix.wav",
  "started_at": "2025-02-10T17:21:44.421628Z",
  "status": "succeeded",
  "urls": {
    "stream": "https://stream.replicate.com/v1/files/bcwr-jgpcccj7qam6znguv3ie53ypkuwvyi6zpqtefmszg737ocve3qtq",
    "get": "https://api.replicate.com/v1/predictions/mfa1v2wx6xrma0cmy3gb3b1ckr",
    "cancel": "https://api.replicate.com/v1/predictions/mfa1v2wx6xrma0cmy3gb3b1ckr/cancel"
  },
  "version": "c72042d58e88a0dbcf77942f58b3b033b23f0d02d78cad22c4960afc3b8cce86"
}
[Info] retrieving checkpoint
[Info] loading mimi
[Info] mimi loaded
[Info] loading moshi
[Info] moshi loaded
[Info] loading input file /tmp/tmpq1sdmc2dsample_fr_hibiki_crepes.mp3
Info: starting the inference loop
Info: processed 786 steps in 18s, 22.32ms/step
<s> today we're going to make pancakes for this you'll need flour eggs milk a pinch of salt sugar and butter to start with put the flour in a bowl with salt and sugar make a well in the middle and pour the eggs into it start mixing slowly when the mixture becomes thick add the cold milk little by little when all the milk is mixed the dough becomes quite smooth if it seems too thick add some milk then add the melted butter cool mix well cook the pancakes in a hot pot about a small layer of dough in the pan make a rotation movement to spread the dough all over the surface sit over the fire and when the rum is in clear red color it's time to turn it over leave the eggs about one minute side and the pancake's ready to be eaten</s>[Info] writing /tmp/tmpugbtb_hf/out_en-0 with duration 60.3 sec.
[Info] writing /tmp/tmpugbtb_hf/out_en-1 with duration 62.9 sec.
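The log lines above carry enough numbers to estimate throughput. The following is a minimal sketch, not part of the model or of Replicate's tooling: it strips the ANSI colour codes found in the raw `logs` field, reads the step count and per-step latency, and compares the inference time with the duration of the audio written. The function name `summarize_logs` and the returned keys are made up for illustration, and the regular expressions assume the log format stays exactly as shown on this page.

```python
import re

# Patterns assume the log wording shown on this page; other versions may differ.
ANSI_ESCAPE = re.compile(r"\x1b\[[0-9;]*m")
STEPS_LINE = re.compile(r"processed (\d+) steps in (\d+)s, ([\d.]+)ms/step")
WRITE_LINE = re.compile(r"writing \S+ with duration ([\d.]+) sec")


def summarize_logs(logs: str) -> dict:
    """Extract step count, inference time and output durations from raw logs."""
    clean = ANSI_ESCAPE.sub("", logs)
    steps_match = STEPS_LINE.search(clean)
    if steps_match is None:
        raise ValueError("no 'processed N steps' line found")
    steps = int(steps_match.group(1))
    ms_per_step = float(steps_match.group(3))
    durations = [float(d) for d in WRITE_LINE.findall(clean)]
    if not durations:
        raise ValueError("no 'writing ... with duration' line found")
    inference_s = steps * ms_per_step / 1000
    longest = max(durations)
    return {
        "steps": steps,
        "inference_s": inference_s,
        "output_durations_s": durations,
        "speedup_vs_realtime": longest / inference_s,
        "steps_per_audio_second": steps / longest,
    }


# Excerpt of the first prediction's logs (transcript and loading lines omitted).
SAMPLE_LOGS = (
    "\x1b[1;34m[Info]\x1b[0m loading input file /tmp/tmpq1sdmc2dsample_fr_hibiki_crepes.mp3\n"
    "Info: starting the inference loop\n"
    "Info: processed 786 steps in 18s, 22.32ms/step\n"
    "\x1b[1;34m[Info]\x1b[0m writing /tmp/tmpugbtb_hf/out_en-0 with duration 60.3 sec.\n"
    "\x1b[1;34m[Info]\x1b[0m writing /tmp/tmpugbtb_hf/out_en-1 with duration 62.9 sec."
)

if __name__ == "__main__":
    summary = summarize_logs(SAMPLE_LOGS)
    for key, value in summary.items():
        print(key, value)
```

For this run the numbers work out to roughly 12.5 steps per second of output audio and inference about 3.6 times faster than real time. Both figures are derived only from the log lines on this page.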
Prediction
zsxkib/hibiki:c72042d58e88a0dbcf77942f58b3b033b23f0d02d78cad22c4960afc3b8cce86
ID: d1a2bzv3rxrme0cmy3ta4hx6p8
Status: Succeeded
Source: Web
Hardware: L40S
Input
- audio_input: https://replicate.delivery/pbxt/MTZNHE3PVfheTWoSVFVA3vvKxUA0is9pgtRFYnfRnRPgQz9K/sample_fr_hibiki_monologue_otis.mp3
- max_duration: 0
- cut_start_seconds: 2
- volume_reduction_db: 30
{ "audio_input": "https://replicate.delivery/pbxt/MTZNHE3PVfheTWoSVFVA3vvKxUA0is9pgtRFYnfRnRPgQz9K/sample_fr_hibiki_monologue_otis.mp3", "max_duration": 0, "cut_start_seconds": 2, "volume_reduction_db": 30 }
Install Replicate’s Node.js client library:
npm install replicate
Import and set up the client:
import Replicate from "replicate";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});
Run zsxkib/hibiki using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
import { writeFile } from "node:fs/promises";

const output = await replicate.run(
  "zsxkib/hibiki:c72042d58e88a0dbcf77942f58b3b033b23f0d02d78cad22c4960afc3b8cce86",
  {
    input: {
      audio_input: "https://replicate.delivery/pbxt/MTZNHE3PVfheTWoSVFVA3vvKxUA0is9pgtRFYnfRnRPgQz9K/sample_fr_hibiki_monologue_otis.mp3",
      max_duration: 0,
      cut_start_seconds: 2,
      volume_reduction_db: 30
    }
  }
);

// To access the file URL:
console.log(output.url());

// To write the audio file to disk:
await writeFile("final_mix.wav", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
pip install replicate
Import the client:
import replicate
Run zsxkib/hibiki using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
    "zsxkib/hibiki:c72042d58e88a0dbcf77942f58b3b033b23f0d02d78cad22c4960afc3b8cce86",
    input={
        "audio_input": "https://replicate.delivery/pbxt/MTZNHE3PVfheTWoSVFVA3vvKxUA0is9pgtRFYnfRnRPgQz9K/sample_fr_hibiki_monologue_otis.mp3",
        "max_duration": 0,
        "cut_start_seconds": 2,
        "volume_reduction_db": 30
    }
)
print(output)
To learn more, take a look at the guide on getting started with Python.
Run zsxkib/hibiki using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "zsxkib/hibiki:c72042d58e88a0dbcf77942f58b3b033b23f0d02d78cad22c4960afc3b8cce86",
    "input": {
      "audio_input": "https://replicate.delivery/pbxt/MTZNHE3PVfheTWoSVFVA3vvKxUA0is9pgtRFYnfRnRPgQz9K/sample_fr_hibiki_monologue_otis.mp3",
      "max_duration": 0,
      "cut_start_seconds": 2,
      "volume_reduction_db": 30
    }
  }' \
  https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{
  "completed_at": "2025-02-10T17:43:21.702520Z",
  "created_at": "2025-02-10T17:42:15.751000Z",
  "data_removed": false,
  "error": null,
  "id": "d1a2bzv3rxrme0cmy3ta4hx6p8",
  "input": {
    "audio_input": "https://replicate.delivery/pbxt/MTZNHE3PVfheTWoSVFVA3vvKxUA0is9pgtRFYnfRnRPgQz9K/sample_fr_hibiki_monologue_otis.mp3",
    "max_duration": 0,
    "cut_start_seconds": 2,
    "volume_reduction_db": 30
  },
  "logs": "\u001b[1;34m[Info]\u001b[0m retrieving checkpoint\n\u001b[1;34m[Info]\u001b[0m loading mimi\n\u001b[1;34m[Info]\u001b[0m mimi loaded\n\u001b[1;34m[Info]\u001b[0m loading moshi\n\u001b[1;34m[Info]\u001b[0m moshi loaded\n\u001b[1;34m[Info]\u001b[0m loading input file /tmp/tmpd90y11yasample_fr_hibiki_monologue_otis.mp3\nInfo: starting the inference loop\nInfo: processed 795 steps in 17s, 21.72ms/step\n<s> You know, I don't think there are any good or bad situations. For me, if I had to summarize my life today with you, I would say that it is first of all encounters, people who have reached out to me, perhaps at a time when I could not, where I was alone at home. And it is quite curious to think that chance encounters forge a destiny, because when we have the taste of the thing, when we have the taste of the thing done well, the beautiful gesture, sometimes we do not find the interlocutor in front of us. I would say the mirror that helps you move forward. So this is not my case, as I said there. Since I, on the contrary, I was able, and I say thank you to life, I say thank you, I sing life, I dance life, I am only love. And finally, when many people today tell me, but how do it to have this humanity? Well, I answered them very simply. I told them that this taste of love, this taste that pushed me today to undertake a mechanical construction with both hands, who knows? Maybe simply to put myself at the service of the community, to make the gift, the gift of this. That's it I'm</s>\u001b[1;34m[Info]\u001b[0m writing /tmp/tmpd6e64nai/out_en-0 with duration 61.9 sec.\n\u001b[1;34m[Info]\u001b[0m writing /tmp/tmpd6e64nai/out_en-1 with duration 63.6 sec.",
  "metrics": {
    "predict_time": 22.223840168,
    "total_time": 65.95152
  },
  "output": "https://replicate.delivery/xezq/9VfQ7P8yqAXjJSLwv3dixITJsrGLvGjz0jtiBXmghvxcZfNUA/final_mix.wav",
  "started_at": "2025-02-10T17:42:59.478680Z",
  "status": "succeeded",
  "urls": {
    "stream": "https://stream.replicate.com/v1/files/bcwr-6rrdnnqi3v5bqbargngco2cxchjgztnwph4w2w3zptyfuqbdm67a",
    "get": "https://api.replicate.com/v1/predictions/d1a2bzv3rxrme0cmy3ta4hx6p8",
    "cancel": "https://api.replicate.com/v1/predictions/d1a2bzv3rxrme0cmy3ta4hx6p8/cancel"
  },
  "version": "c72042d58e88a0dbcf77942f58b3b033b23f0d02d78cad22c4960afc3b8cce86"
}
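The `metrics` in this response appear to follow directly from its three timestamps: `predict_time` matches `completed_at` minus `started_at`, and `total_time` matches `completed_at` minus `created_at`. The sketch below checks that arithmetic with the values from this prediction. The helper names and the `waiting_s` label are illustrative, and the page does not say what happens between creation and start (queueing, model boot, or something else).

```python
from datetime import datetime


def parse_ts(value: str) -> datetime:
    """Parse an ISO 8601 timestamp with a trailing 'Z' (UTC)."""
    # Older Python versions of fromisoformat() reject 'Z', so swap in an offset.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def timing_breakdown(prediction: dict) -> dict:
    """Derive waiting, predict and total durations (seconds) from timestamps."""
    created = parse_ts(prediction["created_at"])
    started = parse_ts(prediction["started_at"])
    completed = parse_ts(prediction["completed_at"])
    return {
        "waiting_s": (started - created).total_seconds(),
        "predict_s": (completed - started).total_seconds(),
        "total_s": (completed - created).total_seconds(),
    }


# Timestamps copied from the response above.
prediction = {
    "created_at": "2025-02-10T17:42:15.751000Z",
    "started_at": "2025-02-10T17:42:59.478680Z",
    "completed_at": "2025-02-10T17:43:21.702520Z",
}

if __name__ == "__main__":
    timing = timing_breakdown(prediction)
    for key, value in timing.items():
        print(f"{key}: {value:.3f}")
```

Here about 43.7 of the 66 seconds elapsed before the prediction started, so the total duration is dominated by waiting rather than by the roughly 22 seconds of prediction time.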
[Info] retrieving checkpoint
[Info] loading mimi
[Info] mimi loaded
[Info] loading moshi
[Info] moshi loaded
[Info] loading input file /tmp/tmpd90y11yasample_fr_hibiki_monologue_otis.mp3
Info: starting the inference loop
Info: processed 795 steps in 17s, 21.72ms/step
<s> You know, I don't think there are any good or bad situations. For me, if I had to summarize my life today with you, I would say that it is first of all encounters, people who have reached out to me, perhaps at a time when I could not, where I was alone at home. And it is quite curious to think that chance encounters forge a destiny, because when we have the taste of the thing, when we have the taste of the thing done well, the beautiful gesture, sometimes we do not find the interlocutor in front of us. I would say the mirror that helps you move forward. So this is not my case, as I said there. Since I, on the contrary, I was able, and I say thank you to life, I say thank you, I sing life, I dance life, I am only love. And finally, when many people today tell me, but how do it to have this humanity? Well, I answered them very simply. I told them that this taste of love, this taste that pushed me today to undertake a mechanical construction with both hands, who knows? Maybe simply to put myself at the service of the community, to make the gift, the gift of this. That's it I'm</s>[Info] writing /tmp/tmpd6e64nai/out_en-0 with duration 61.9 sec.
[Info] writing /tmp/tmpd6e64nai/out_en-1 with duration 63.6 sec.
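In both predictions the English transcript appears in the logs between `<s>` and `</s>` markers. If only the text is needed, a small helper along these lines could pull it out. This is a sketch under the assumption that the markers keep this form and occur once per run; the name `extract_transcript` is hypothetical, and the sample string is a shortened excerpt of the logs above rather than the full output.

```python
import re

# Non-greedy match between the first <s> and the next </s>; DOTALL lets the
# transcript span line breaks.
TRANSCRIPT = re.compile(r"<s>(.*?)</s>", re.DOTALL)


def extract_transcript(logs: str) -> str:
    """Return the text between <s> and </s>, with whitespace collapsed."""
    match = TRANSCRIPT.search(logs)
    if match is None:
        return ""
    return " ".join(match.group(1).split())


# Shortened excerpt of the second prediction's logs, for illustration only.
SAMPLE = (
    "Info: processed 795 steps in 17s, 21.72ms/step\n"
    "<s> You know, I don't think there are any good or bad situations. \n"
    "That's it I'm</s>"
    "\x1b[1;34m[Info]\x1b[0m writing /tmp/tmpd6e64nai/out_en-0 with duration 61.9 sec."
)

if __name__ == "__main__":
    print(extract_transcript(SAMPLE))
```

Returning an empty string when no markers are found keeps the helper safe to call on logs from failed or truncated runs.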