deepfates / hunyuan-her
Hunyuan-Video model finetuned on Her (2013). Trigger word is "HR". Use "A video in the style of HR, HR" at the beginning of your prompt for best results. (Updated 4 months, 3 weeks ago)
- Public
- 116 runs
- Fine-tune
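The description above asks for the trigger phrase at the start of every prompt. A minimal sketch of a helper that enforces this follows; the name `with_trigger` and the constant `TRIGGER_PREFIX` are hypothetical and not part of the model or of Replicate's client.

```python
# Trigger phrase taken verbatim from the model description.
TRIGGER_PREFIX = "A video in the style of HR, HR"


def with_trigger(description: str) -> str:
    """Return a prompt that begins with the trigger phrase exactly once."""
    text = description.strip()
    if text.startswith(TRIGGER_PREFIX):
        # Already prefixed, so leave the prompt unchanged.
        return text
    return f"{TRIGGER_PREFIX} {text}"


if __name__ == "__main__":
    print(with_trigger("The video clip depicts a snowy path."))
```

The returned string can be passed as the `prompt` input in any of the API calls shown on this page.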
Prediction
deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d
- ID: 2db22py6nnrma0cmjpsakraq5m
- Status: Succeeded
- Source: API
- Hardware: H100
Input
- crf: 19
- seed: 12345
- steps: 50
- width: 640
- height: 360
- prompt: A video in the style of HR, HR The video clip depicts a detailed portrait of a woman's face. She has fair skin and bright, intense blue eyes that gaze directly ahead. Her hair is dark and wavy, cascading down her shoulders in a curly pattern. She is dressed in a dark dress with a square neckline adorned with intricate metallic accents. The dress has a detailed, almost mosaic-like pattern, suggesting a ceremonial or formal attire. The woman's expression is stoic and serious, conveying a sense of determination or resolve. The background is dark, contrasting with the intricate details of her dress and the brightness of her eyes. The lighting is soft and diffused, casting a warm glow on
- lora_url: ""
- scheduler: DPMSolverMultistepScheduler
- flow_shift: 9
- frame_rate: 16
- num_frames: 66
- enhance_end: 1
- enhance_start: 0
- force_offload: true
- lora_strength: 1
- enhance_double: true
- enhance_single: true
- enhance_weight: 0.3
- guidance_scale: 6
- denoise_strength: 1
{ "crf": 19, "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip depicts a detailed portrait of a woman's face. She has fair skin and bright, intense blue eyes that gaze directly ahead. Her hair is dark and wavy, cascading down her shoulders in a curly pattern. She is dressed in a dark dress with a square neckline adorned with intricate metallic accents. The dress has a detailed, almost mosaic-like pattern, suggesting a ceremonial or formal attire.\nThe woman's expression is stoic and serious, conveying a sense of determination or resolve. The background is dark, contrasting with the intricate details of her dress and the brightness of her eyes. The lighting is soft and diffused, casting a warm glow on", "lora_url": "", "scheduler": "DPMSolverMultistepScheduler", "flow_shift": 9, "frame_rate": 16, "num_frames": 66, "enhance_end": 1, "enhance_start": 0, "force_offload": true, "lora_strength": 1, "enhance_double": true, "enhance_single": true, "enhance_weight": 0.3, "guidance_scale": 6, "denoise_strength": 1 }
Install Replicate’s Node.js client library:
    npm install replicate
Import and set up the client:
    import Replicate from "replicate";
    import fs from "node:fs";

    const replicate = new Replicate({
      auth: process.env.REPLICATE_API_TOKEN,
    });
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    const output = await replicate.run(
      "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
      {
        input: {
          crf: 19,
          seed: 12345,
          steps: 50,
          width: 640,
          height: 360,
          prompt: "A video in the style of HR, HR The video clip depicts a detailed portrait of a woman's face. She has fair skin and bright, intense blue eyes that gaze directly ahead. Her hair is dark and wavy, cascading down her shoulders in a curly pattern. She is dressed in a dark dress with a square neckline adorned with intricate metallic accents. The dress has a detailed, almost mosaic-like pattern, suggesting a ceremonial or formal attire.\nThe woman's expression is stoic and serious, conveying a sense of determination or resolve. The background is dark, contrasting with the intricate details of her dress and the brightness of her eyes. The lighting is soft and diffused, casting a warm glow on",
          lora_url: "",
          scheduler: "DPMSolverMultistepScheduler",
          flow_shift: 9,
          frame_rate: 16,
          num_frames: 66,
          enhance_end: 1,
          enhance_start: 0,
          force_offload: true,
          lora_strength: 1,
          enhance_double: true,
          enhance_single: true,
          enhance_weight: 0.3,
          guidance_scale: 6,
          denoise_strength: 1
        }
      }
    );

    // To access the file URL:
    console.log(output.url()); //=> "http://example.com"

    // To write the file to disk (this model outputs an MP4 video):
    await fs.promises.writeFile("output.mp4", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
    pip install replicate
Import the client:
    import replicate
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    output = replicate.run(
        "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
        input={
            "crf": 19,
            "seed": 12345,
            "steps": 50,
            "width": 640,
            "height": 360,
            "prompt": "A video in the style of HR, HR The video clip depicts a detailed portrait of a woman's face. She has fair skin and bright, intense blue eyes that gaze directly ahead. Her hair is dark and wavy, cascading down her shoulders in a curly pattern. She is dressed in a dark dress with a square neckline adorned with intricate metallic accents. The dress has a detailed, almost mosaic-like pattern, suggesting a ceremonial or formal attire.\nThe woman's expression is stoic and serious, conveying a sense of determination or resolve. The background is dark, contrasting with the intricate details of her dress and the brightness of her eyes. The lighting is soft and diffused, casting a warm glow on",
            "lora_url": "",
            "scheduler": "DPMSolverMultistepScheduler",
            "flow_shift": 9,
            "frame_rate": 16,
            "num_frames": 66,
            "enhance_end": 1,
            "enhance_start": 0,
            "force_offload": True,
            "lora_strength": 1,
            "enhance_double": True,
            "enhance_single": True,
            "enhance_weight": 0.3,
            "guidance_scale": 6,
            "denoise_strength": 1
        }
    )
    print(output)
To learn more, take a look at the guide on getting started with Python.
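The Python snippet above only prints the output. A minimal sketch for saving the generated video locally follows, assuming a recent client version where `replicate.run` returns a file-like object with a `read()` method; if it returns a URL string instead, the fallback branch downloads it. The helper name `save_video` and the default file name are hypothetical.

```python
from pathlib import Path
from urllib.request import urlopen


def save_video(output, path: str = "output.mp4") -> Path:
    """Write a prediction output to disk and return the destination path."""
    target = Path(path)
    if hasattr(output, "read"):
        # Assumed case: a file-like output object exposing read().
        data = output.read()
    else:
        # Fallback: treat the output as a URL string and download it.
        with urlopen(str(output)) as response:
            data = response.read()
    target.write_bytes(data)
    return target


# Usage (needs REPLICATE_API_TOKEN and network access):
#   import replicate
#   output = replicate.run("deepfates/hunyuan-her:<version>", input={...})
#   print(save_video(output))
```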
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    curl -s -X POST \
      -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
      -H "Content-Type: application/json" \
      -H "Prefer: wait" \
      -d $'{
        "version": "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
        "input": {
          "crf": 19,
          "seed": 12345,
          "steps": 50,
          "width": 640,
          "height": 360,
          "prompt": "A video in the style of HR, HR The video clip depicts a detailed portrait of a woman\'s face. She has fair skin and bright, intense blue eyes that gaze directly ahead. Her hair is dark and wavy, cascading down her shoulders in a curly pattern. She is dressed in a dark dress with a square neckline adorned with intricate metallic accents. The dress has a detailed, almost mosaic-like pattern, suggesting a ceremonial or formal attire.\\nThe woman\'s expression is stoic and serious, conveying a sense of determination or resolve. The background is dark, contrasting with the intricate details of her dress and the brightness of her eyes. The lighting is soft and diffused, casting a warm glow on",
          "lora_url": "",
          "scheduler": "DPMSolverMultistepScheduler",
          "flow_shift": 9,
          "frame_rate": 16,
          "num_frames": 66,
          "enhance_end": 1,
          "enhance_start": 0,
          "force_offload": true,
          "lora_strength": 1,
          "enhance_double": true,
          "enhance_single": true,
          "enhance_weight": 0.3,
          "guidance_scale": 6,
          "denoise_strength": 1
        }
      }' \
      https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2025-01-24T00:29:38.346068Z", "created_at": "2025-01-24T00:25:09.037000Z", "data_removed": false, "error": null, "id": "2db22py6nnrma0cmjpsakraq5m", "input": { "crf": 19, "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip depicts a detailed portrait of a woman's face. She has fair skin and bright, intense blue eyes that gaze directly ahead. Her hair is dark and wavy, cascading down her shoulders in a curly pattern. She is dressed in a dark dress with a square neckline adorned with intricate metallic accents. The dress has a detailed, almost mosaic-like pattern, suggesting a ceremonial or formal attire.\nThe woman's expression is stoic and serious, conveying a sense of determination or resolve. The background is dark, contrasting with the intricate details of her dress and the brightness of her eyes. The lighting is soft and diffused, casting a warm glow on", "lora_url": "", "scheduler": "DPMSolverMultistepScheduler", "flow_shift": 9, "frame_rate": 16, "num_frames": 66, "enhance_end": 1, "enhance_start": 0, "force_offload": true, "lora_strength": 1, "enhance_double": true, "enhance_single": true, "enhance_weight": 0.3, "guidance_scale": 6, "denoise_strength": 1 }, "logs": "Seed set to: 12345\n⚠️ Adjusted dimensions from 640x360 to 640x368 to satisfy model requirements\n⚠️ Adjusted frame count from 66 to 65 to satisfy model requirements\nChecking inputs\n====================================\nChecking weights\n✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models\n✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae\n====================================\nRunning workflow\n[ComfyUI] got prompt\nExecuting node 30, title: HunyuanVideo TextEncode, class type: HyVideoTextEncode\n[ComfyUI] llm prompt attention_mask shape: torch.Size([1, 161]), masked tokens: 142\n[ComfyUI] clipL prompt attention_mask shape: torch.Size([1, 77]), masked tokens: 
77\n[ComfyUI] Input (height, width, video_length) = (368, 640, 65)\n[ComfyUI] The config attributes {'reverse': True, 'solver': 'euler'} were passed to DPMSolverMultistepScheduler, but are not expected and will be ignored. Please verify your scheduler_config.json configuration file.\nExecuting node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler\n[ComfyUI] Sampling 65 frames in 17 latents at 640x368 with 50 inference steps\n[ComfyUI]\n[ComfyUI] 0%| | 0/50 [00:00<?, ?it/s]\n[ComfyUI] 2%|▏ | 1/50 [00:02<01:54, 2.33s/it]\n[ComfyUI] 4%|▍ | 2/50 [00:04<01:37, 2.03s/it]\n[ComfyUI] 6%|▌ | 3/50 [00:06<01:40, 2.15s/it]\n[ComfyUI] 8%|▊ | 4/50 [00:08<01:41, 2.20s/it]\n[ComfyUI] 10%|█ | 5/50 [00:11<01:40, 2.23s/it]\n[ComfyUI] 12%|█▏ | 6/50 [00:13<01:39, 2.25s/it]\n[ComfyUI] 14%|█▍ | 7/50 [00:15<01:37, 2.27s/it]\n[ComfyUI] 16%|█▌ | 8/50 [00:17<01:35, 2.27s/it]\n[ComfyUI] 18%|█▊ | 9/50 [00:20<01:33, 2.28s/it]\n[ComfyUI] 20%|██ | 10/50 [00:22<01:31, 2.28s/it]\n[ComfyUI] 22%|██▏ | 11/50 [00:24<01:29, 2.28s/it]\n[ComfyUI] 24%|██▍ | 12/50 [00:27<01:26, 2.29s/it]\n[ComfyUI] 26%|██▌ | 13/50 [00:29<01:24, 2.29s/it]\n[ComfyUI] 28%|██▊ | 14/50 [00:31<01:22, 2.29s/it]\n[ComfyUI] 30%|███ | 15/50 [00:33<01:20, 2.29s/it]\n[ComfyUI] 32%|███▏ | 16/50 [00:36<01:17, 2.29s/it]\n[ComfyUI] 34%|███▍ | 17/50 [00:38<01:15, 2.29s/it]\n[ComfyUI] 36%|███▌ | 18/50 [00:40<01:13, 2.29s/it]\n[ComfyUI] 38%|███▊ | 19/50 [00:43<01:11, 2.29s/it]\n[ComfyUI] 40%|████ | 20/50 [00:45<01:08, 2.29s/it]\n[ComfyUI] 42%|████▏ | 21/50 [00:47<01:06, 2.29s/it]\n[ComfyUI] 44%|████▍ | 22/50 [00:49<01:04, 2.29s/it]\n[ComfyUI] 46%|████▌ | 23/50 [00:52<01:01, 2.29s/it]\n[ComfyUI] 48%|████▊ | 24/50 [00:54<00:59, 2.29s/it]\n[ComfyUI] 50%|█████ | 25/50 [00:56<00:57, 2.29s/it]\n[ComfyUI] 52%|█████▏ | 26/50 [00:59<00:54, 2.29s/it]\n[ComfyUI] 54%|█████▍ | 27/50 [01:01<00:52, 2.29s/it]\n[ComfyUI] 56%|█████▌ | 28/50 [01:03<00:50, 2.29s/it]\n[ComfyUI] 58%|█████▊ | 29/50 [01:05<00:48, 2.29s/it]\n[ComfyUI] 60%|██████ | 30/50 
[01:08<00:45, 2.29s/it]\n[ComfyUI] 62%|██████▏ | 31/50 [01:10<00:43, 2.29s/it]\n[ComfyUI] 64%|██████▍ | 32/50 [01:12<00:41, 2.29s/it]\n[ComfyUI] 66%|██████▌ | 33/50 [01:15<00:38, 2.29s/it]\n[ComfyUI] 68%|██████▊ | 34/50 [01:17<00:36, 2.29s/it]\n[ComfyUI] 70%|███████ | 35/50 [01:19<00:34, 2.29s/it]\n[ComfyUI] 72%|███████▏ | 36/50 [01:22<00:32, 2.29s/it]\n[ComfyUI] 74%|███████▍ | 37/50 [01:24<00:29, 2.29s/it]\n[ComfyUI] 76%|███████▌ | 38/50 [01:26<00:27, 2.29s/it]\n[ComfyUI] 78%|███████▊ | 39/50 [01:28<00:25, 2.29s/it]\n[ComfyUI] 80%|████████ | 40/50 [01:31<00:22, 2.29s/it]\n[ComfyUI] 82%|████████▏ | 41/50 [01:33<00:20, 2.29s/it]\n[ComfyUI] 84%|████████▍ | 42/50 [01:35<00:18, 2.29s/it]\n[ComfyUI] 86%|████████▌ | 43/50 [01:38<00:16, 2.29s/it]\n[ComfyUI] 88%|████████▊ | 44/50 [01:40<00:13, 2.29s/it]\n[ComfyUI] 90%|█████████ | 45/50 [01:42<00:11, 2.29s/it]\n[ComfyUI] 92%|█████████▏| 46/50 [01:44<00:09, 2.29s/it]\n[ComfyUI] 94%|█████████▍| 47/50 [01:47<00:06, 2.29s/it]\n[ComfyUI] 96%|█████████▌| 48/50 [01:49<00:04, 2.29s/it]\n[ComfyUI] 98%|█████████▊| 49/50 [01:51<00:02, 2.29s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.29s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.28s/it]\n[ComfyUI] Allocated memory: memory=12.300 GB\n[ComfyUI] Max allocated memory: max_memory=15.099 GB\n[ComfyUI] Max reserved memory: max_reserved=16.312 GB\nExecuting node 5, title: HunyuanVideo Decode, class type: HyVideoDecode\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:01<00:01, 1.46s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.24s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.27s/it]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 25.58it/s]\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:00<00:00, 2.55it/s]\n[ComfyUI] Decoding 
rows: 100%|██████████| 2/2 [00:00<00:00, 3.03it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:00<00:00, 2.95it/s]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\nExecuting node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 64.10it/s]\n[ComfyUI] Prompt executed in 132.81 seconds\noutputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 16.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}}\n====================================\nHunyuanVideo_00001.png\nHunyuanVideo_00001.mp4", "metrics": { "predict_time": 141.040719062, "total_time": 269.309068 }, "output": "https://replicate.delivery/xezq/vf7PkqCAexoGW0H8CFpOKjO6RY1JPY7eA9qVjWvvhLbkHSQoA/HunyuanVideo_00001.mp4", "started_at": "2025-01-24T00:27:17.305349Z", "status": "succeeded", "urls": { "stream": "https://stream.replicate.com/v1/files/bsvm-g2xqll4vldpewvgt7u4usq2lqrlytshrby2wt7jfuar6ups4y2fq", "get": "https://api.replicate.com/v1/predictions/2db22py6nnrma0cmjpsakraq5m", "cancel": "https://api.replicate.com/v1/predictions/2db22py6nnrma0cmjpsakraq5m/cancel" }, "version": "a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d" }
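The `created_at`, `started_at`, and `completed_at` timestamps in the output above are enough to separate queue time from run time. A small sketch, with hypothetical function names, that reproduces the reported `predict_time` and `total_time`:

```python
from datetime import datetime


def parse_ts(value: str) -> datetime:
    """Parse an ISO 8601 timestamp with a trailing 'Z' (UTC)."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def timing(created_at: str, started_at: str, completed_at: str) -> dict:
    """Split a prediction's lifetime into queued, running, and total seconds."""
    created, started, completed = map(
        parse_ts, (created_at, started_at, completed_at)
    )
    return {
        "queued": (started - created).total_seconds(),
        "running": (completed - started).total_seconds(),
        "total": (completed - created).total_seconds(),
    }


if __name__ == "__main__":
    # Timestamps copied from the prediction output above.
    print(timing(
        "2025-01-24T00:25:09.037000Z",
        "2025-01-24T00:27:17.305349Z",
        "2025-01-24T00:29:38.346068Z",
    ))
```

For this prediction, roughly 128 s of the 269 s total was spent before the run started; the remaining 141 s matches `predict_time`.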
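The logs report that 640x360 was adjusted to 640x368, that 66 frames became 65, and that 65 frames were sampled in 17 latents. The sketch below reproduces those numbers under two assumptions inferred from the logs only, not from the model's documentation: dimensions are rounded up to a multiple of 16, and frame counts are snapped down to the form 4k + 1. All function names are hypothetical.

```python
def round_up(value: int, multiple: int = 16) -> int:
    """Round value up to the nearest multiple (assumed to be 16 here)."""
    return -(-value // multiple) * multiple


def snap_frames(num_frames: int) -> int:
    """Snap a frame count down to the nearest value of the form 4k + 1."""
    return (num_frames - 1) // 4 * 4 + 1


def latent_count(frames: int) -> int:
    """Number of temporal latents for a 4k + 1 frame count."""
    return (frames - 1) // 4 + 1


if __name__ == "__main__":
    width, height, frames = 640, 360, 66
    adjusted = snap_frames(frames)
    print(round_up(width), round_up(height), adjusted, latent_count(adjusted))
```

Requesting a height of 368 and 65 frames up front would, under the same assumptions, avoid both warnings.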
Prediction
deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d
- ID: w31kxem8qnrm80cmjpfbhscnec
- Status: Succeeded
- Source: API
- Hardware: H100
Input
- crf: 19
- seed: 12345
- steps: 50
- width: 640
- height: 360
- prompt: A video in the style of HR, HR The video clip depicts a serene and picturesque snow-covered landscape. The scene is set in a hilly area with dense snow covering the ground, trees, and rocks. A man dressed in a dark coat and hat is walking along a snowy path, carrying a briefcase. The path is surrounded by large boulders and snow-covered trees, creating a stark contrast between the whiteness of the snow and the dark clothes of the man. The atmosphere is calm and peaceful, with a soft blue light illuminating the scene, suggesting that it might be early morning or late evening. The sound of crunching snow beneath the man's feet can be heard as he makes
- lora_url: ""
- scheduler: DPMSolverMultistepScheduler
- flow_shift: 9
- frame_rate: 16
- num_frames: 66
- enhance_end: 1
- enhance_start: 0
- force_offload: true
- lora_strength: 1
- enhance_double: true
- enhance_single: true
- enhance_weight: 0.3
- guidance_scale: 6
- denoise_strength: 1
{ "crf": 19, "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip depicts a serene and picturesque snow-covered landscape. The scene is set in a hilly area with dense snow covering the ground, trees, and rocks. A man dressed in a dark coat and hat is walking along a snowy path, carrying a briefcase. The path is surrounded by large boulders and snow-covered trees, creating a stark contrast between the whiteness of the snow and the dark clothes of the man.\nThe atmosphere is calm and peaceful, with a soft blue light illuminating the scene, suggesting that it might be early morning or late evening. The sound of crunching snow beneath the man's feet can be heard as he makes", "lora_url": "", "scheduler": "DPMSolverMultistepScheduler", "flow_shift": 9, "frame_rate": 16, "num_frames": 66, "enhance_end": 1, "enhance_start": 0, "force_offload": true, "lora_strength": 1, "enhance_double": true, "enhance_single": true, "enhance_weight": 0.3, "guidance_scale": 6, "denoise_strength": 1 }
Install Replicate’s Node.js client library:
    npm install replicate
Import and set up the client:
    import Replicate from "replicate";
    import fs from "node:fs";

    const replicate = new Replicate({
      auth: process.env.REPLICATE_API_TOKEN,
    });
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    const output = await replicate.run(
      "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
      {
        input: {
          crf: 19,
          seed: 12345,
          steps: 50,
          width: 640,
          height: 360,
          prompt: "A video in the style of HR, HR The video clip depicts a serene and picturesque snow-covered landscape. The scene is set in a hilly area with dense snow covering the ground, trees, and rocks. A man dressed in a dark coat and hat is walking along a snowy path, carrying a briefcase. The path is surrounded by large boulders and snow-covered trees, creating a stark contrast between the whiteness of the snow and the dark clothes of the man.\nThe atmosphere is calm and peaceful, with a soft blue light illuminating the scene, suggesting that it might be early morning or late evening. The sound of crunching snow beneath the man's feet can be heard as he makes",
          lora_url: "",
          scheduler: "DPMSolverMultistepScheduler",
          flow_shift: 9,
          frame_rate: 16,
          num_frames: 66,
          enhance_end: 1,
          enhance_start: 0,
          force_offload: true,
          lora_strength: 1,
          enhance_double: true,
          enhance_single: true,
          enhance_weight: 0.3,
          guidance_scale: 6,
          denoise_strength: 1
        }
      }
    );

    // To access the file URL:
    console.log(output.url()); //=> "http://example.com"

    // To write the file to disk (this model outputs an MP4 video):
    await fs.promises.writeFile("output.mp4", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
    pip install replicate
Import the client:
    import replicate
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    output = replicate.run(
        "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
        input={
            "crf": 19,
            "seed": 12345,
            "steps": 50,
            "width": 640,
            "height": 360,
            "prompt": "A video in the style of HR, HR The video clip depicts a serene and picturesque snow-covered landscape. The scene is set in a hilly area with dense snow covering the ground, trees, and rocks. A man dressed in a dark coat and hat is walking along a snowy path, carrying a briefcase. The path is surrounded by large boulders and snow-covered trees, creating a stark contrast between the whiteness of the snow and the dark clothes of the man.\nThe atmosphere is calm and peaceful, with a soft blue light illuminating the scene, suggesting that it might be early morning or late evening. The sound of crunching snow beneath the man's feet can be heard as he makes",
            "lora_url": "",
            "scheduler": "DPMSolverMultistepScheduler",
            "flow_shift": 9,
            "frame_rate": 16,
            "num_frames": 66,
            "enhance_end": 1,
            "enhance_start": 0,
            "force_offload": True,
            "lora_strength": 1,
            "enhance_double": True,
            "enhance_single": True,
            "enhance_weight": 0.3,
            "guidance_scale": 6,
            "denoise_strength": 1
        }
    )
    print(output)
To learn more, take a look at the guide on getting started with Python.
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    curl -s -X POST \
      -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
      -H "Content-Type: application/json" \
      -H "Prefer: wait" \
      -d $'{
        "version": "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
        "input": {
          "crf": 19,
          "seed": 12345,
          "steps": 50,
          "width": 640,
          "height": 360,
          "prompt": "A video in the style of HR, HR The video clip depicts a serene and picturesque snow-covered landscape. The scene is set in a hilly area with dense snow covering the ground, trees, and rocks. A man dressed in a dark coat and hat is walking along a snowy path, carrying a briefcase. The path is surrounded by large boulders and snow-covered trees, creating a stark contrast between the whiteness of the snow and the dark clothes of the man.\\nThe atmosphere is calm and peaceful, with a soft blue light illuminating the scene, suggesting that it might be early morning or late evening. The sound of crunching snow beneath the man\'s feet can be heard as he makes",
          "lora_url": "",
          "scheduler": "DPMSolverMultistepScheduler",
          "flow_shift": 9,
          "frame_rate": 16,
          "num_frames": 66,
          "enhance_end": 1,
          "enhance_start": 0,
          "force_offload": true,
          "lora_strength": 1,
          "enhance_double": true,
          "enhance_single": true,
          "enhance_weight": 0.3,
          "guidance_scale": 6,
          "denoise_strength": 1
        }
      }' \
      https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2025-01-24T00:11:01.178049Z", "created_at": "2025-01-24T00:03:02.461000Z", "data_removed": false, "error": null, "id": "w31kxem8qnrm80cmjpfbhscnec", "input": { "crf": 19, "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip depicts a serene and picturesque snow-covered landscape. The scene is set in a hilly area with dense snow covering the ground, trees, and rocks. A man dressed in a dark coat and hat is walking along a snowy path, carrying a briefcase. The path is surrounded by large boulders and snow-covered trees, creating a stark contrast between the whiteness of the snow and the dark clothes of the man.\nThe atmosphere is calm and peaceful, with a soft blue light illuminating the scene, suggesting that it might be early morning or late evening. The sound of crunching snow beneath the man's feet can be heard as he makes", "lora_url": "", "scheduler": "DPMSolverMultistepScheduler", "flow_shift": 9, "frame_rate": 16, "num_frames": 66, "enhance_end": 1, "enhance_start": 0, "force_offload": true, "lora_strength": 1, "enhance_double": true, "enhance_single": true, "enhance_weight": 0.3, "guidance_scale": 6, "denoise_strength": 1 }, "logs": "Seed set to: 12345\n⚠️ Adjusted dimensions from 640x360 to 640x368 to satisfy model requirements\n⚠️ Adjusted frame count from 66 to 65 to satisfy model requirements\nChecking inputs\n====================================\nChecking weights\n✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models\n✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae\n====================================\nRunning workflow\n[ComfyUI] got prompt\nExecuting node 30, title: HunyuanVideo TextEncode, class type: HyVideoTextEncode\n[ComfyUI] llm prompt attention_mask shape: torch.Size([1, 161]), masked tokens: 143\n[ComfyUI] clipL prompt attention_mask shape: torch.Size([1, 77]), masked tokens: 77\n[ComfyUI] Input (height, width, 
video_length) = (368, 640, 65)\n[ComfyUI] The config attributes {'reverse': True, 'solver': 'euler'} were passed to DPMSolverMultistepScheduler, but are not expected and will be ignored. Please verify your scheduler_config.json configuration file.\nExecuting node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler\n[ComfyUI] Sampling 65 frames in 17 latents at 640x368 with 50 inference steps\n[ComfyUI]\n[ComfyUI] 0%| | 0/50 [00:00<?, ?it/s]\n[ComfyUI] 2%|▏ | 1/50 [00:02<01:52, 2.30s/it]\n[ComfyUI] 4%|▍ | 2/50 [00:04<01:36, 2.02s/it]\n[ComfyUI] 6%|▌ | 3/50 [00:06<01:40, 2.15s/it]\n[ComfyUI] 8%|▊ | 4/50 [00:08<01:41, 2.21s/it]\n[ComfyUI] 10%|█ | 5/50 [00:11<01:40, 2.24s/it]\n[ComfyUI] 12%|█▏ | 6/50 [00:13<01:39, 2.26s/it]\n[ComfyUI] 14%|█▍ | 7/50 [00:15<01:37, 2.28s/it]\n[ComfyUI] 16%|█▌ | 8/50 [00:17<01:35, 2.29s/it]\n[ComfyUI] 18%|█▊ | 9/50 [00:20<01:33, 2.29s/it]\n[ComfyUI] 20%|██ | 10/50 [00:22<01:31, 2.29s/it]\n[ComfyUI] 22%|██▏ | 11/50 [00:24<01:29, 2.29s/it]\n[ComfyUI] 24%|██▍ | 12/50 [00:27<01:27, 2.30s/it]\n[ComfyUI] 26%|██▌ | 13/50 [00:29<01:24, 2.30s/it]\n[ComfyUI] 28%|██▊ | 14/50 [00:31<01:22, 2.30s/it]\n[ComfyUI] 30%|███ | 15/50 [00:34<01:20, 2.30s/it]\n[ComfyUI] 32%|███▏ | 16/50 [00:36<01:18, 2.30s/it]\n[ComfyUI] 34%|███▍ | 17/50 [00:38<01:15, 2.30s/it]\n[ComfyUI] 36%|███▌ | 18/50 [00:40<01:13, 2.30s/it]\n[ComfyUI] 38%|███▊ | 19/50 [00:43<01:11, 2.30s/it]\n[ComfyUI] 40%|████ | 20/50 [00:45<01:08, 2.30s/it]\n[ComfyUI] 42%|████▏ | 21/50 [00:47<01:06, 2.30s/it]\n[ComfyUI] 44%|████▍ | 22/50 [00:50<01:04, 2.30s/it]\n[ComfyUI] 46%|████▌ | 23/50 [00:52<01:02, 2.30s/it]\n[ComfyUI] 48%|████▊ | 24/50 [00:54<00:59, 2.30s/it]\n[ComfyUI] 50%|█████ | 25/50 [00:57<00:57, 2.30s/it]\n[ComfyUI] 52%|█████▏ | 26/50 [00:59<00:55, 2.30s/it]\n[ComfyUI] 54%|█████▍ | 27/50 [01:01<00:52, 2.30s/it]\n[ComfyUI] 56%|█████▌ | 28/50 [01:03<00:50, 2.30s/it]\n[ComfyUI] 58%|█████▊ | 29/50 [01:06<00:48, 2.30s/it]\n[ComfyUI] 60%|██████ | 30/50 [01:08<00:45, 2.30s/it]\n[ComfyUI] 
62%|██████▏ | 31/50 [01:10<00:43, 2.30s/it]\n[ComfyUI] 64%|██████▍ | 32/50 [01:13<00:41, 2.30s/it]\n[ComfyUI] 66%|██████▌ | 33/50 [01:15<00:39, 2.30s/it]\n[ComfyUI] 68%|██████▊ | 34/50 [01:17<00:36, 2.30s/it]\n[ComfyUI] 70%|███████ | 35/50 [01:20<00:34, 2.30s/it]\n[ComfyUI] 72%|███████▏ | 36/50 [01:22<00:32, 2.30s/it]\n[ComfyUI] 74%|███████▍ | 37/50 [01:24<00:29, 2.30s/it]\n[ComfyUI] 76%|███████▌ | 38/50 [01:26<00:27, 2.30s/it]\n[ComfyUI] 78%|███████▊ | 39/50 [01:29<00:25, 2.30s/it]\n[ComfyUI] 80%|████████ | 40/50 [01:31<00:22, 2.30s/it]\n[ComfyUI] 82%|████████▏ | 41/50 [01:33<00:20, 2.30s/it]\n[ComfyUI] 84%|████████▍ | 42/50 [01:36<00:18, 2.30s/it]\n[ComfyUI] 86%|████████▌ | 43/50 [01:38<00:16, 2.30s/it]\n[ComfyUI] 88%|████████▊ | 44/50 [01:40<00:13, 2.30s/it]\n[ComfyUI] 90%|█████████ | 45/50 [01:42<00:11, 2.30s/it]\n[ComfyUI] 92%|█████████▏| 46/50 [01:45<00:09, 2.30s/it]\n[ComfyUI] 94%|█████████▍| 47/50 [01:47<00:06, 2.30s/it]\n[ComfyUI] 96%|█████████▌| 48/50 [01:49<00:04, 2.30s/it]\n[ComfyUI] 98%|█████████▊| 49/50 [01:52<00:02, 2.30s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.30s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.29s/it]\n[ComfyUI] Allocated memory: memory=12.300 GB\n[ComfyUI] Max allocated memory: max_memory=15.099 GB\n[ComfyUI] Max reserved memory: max_reserved=16.344 GB\nExecuting node 5, title: HunyuanVideo Decode, class type: HyVideoDecode\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:01<00:01, 1.47s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.24s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.28s/it]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 26.04it/s]\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:00<00:00, 2.54it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 
[00:00<00:00, 3.03it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:00<00:00, 2.94it/s]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\nExecuting node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 64.94it/s]\n[ComfyUI] Prompt executed in 134.42 seconds\noutputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 16.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}}\n====================================\nHunyuanVideo_00001.png\nHunyuanVideo_00001.mp4", "metrics": { "predict_time": 143.929283736, "total_time": 478.717049 }, "output": "https://replicate.delivery/xezq/0afgA2rehcsENUyWMU34Qe83FXaqfkGdZAM9Vms3LeJuSGBhC/HunyuanVideo_00001.mp4", "started_at": "2025-01-24T00:08:37.248765Z", "status": "succeeded", "urls": { "stream": "https://stream.replicate.com/v1/files/bsvm-2mi3iazqvv5efxcnpyjcwtus26r32c3kw6qiaodfhljm3pgarhha", "get": "https://api.replicate.com/v1/predictions/w31kxem8qnrm80cmjpfbhscnec", "cancel": "https://api.replicate.com/v1/predictions/w31kxem8qnrm80cmjpfbhscnec/cancel" }, "version": "a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d" }
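The logs in the output above show the 640x360, 66-frame request being run as 640x368 with 65 frames, sampled as 17 latents. The exact rule the model wrapper applies is not shown on this page. The sketch below is only one rule that reproduces those logged numbers, assuming sides are rounded up to a multiple of 16 and frame counts are rounded down to the form 4k + 1; `adjust_request`, `spatial_multiple` and `temporal_stride` are hypothetical names chosen for illustration.

```python
import math


def adjust_request(width, height, num_frames, spatial_multiple=16, temporal_stride=4):
    """Return (width, height, frames, latents) after rounding.

    Assumption (not confirmed by the page): each side is rounded UP to a
    multiple of `spatial_multiple`, and the frame count is rounded DOWN to
    temporal_stride * k + 1.
    """
    adj_w = math.ceil(width / spatial_multiple) * spatial_multiple
    adj_h = math.ceil(height / spatial_multiple) * spatial_multiple
    adj_frames = ((num_frames - 1) // temporal_stride) * temporal_stride + 1
    # One latent for the first frame, then one per `temporal_stride` frames.
    latents = (adj_frames - 1) // temporal_stride + 1
    return adj_w, adj_h, adj_frames, latents


print(adjust_request(640, 360, 66))  # (640, 368, 65, 17)
```

Requesting a height that is already a multiple of 16 and a frame count such as 65 should, under the same assumption, avoid the adjustment warnings.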
Prediction
deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d
Input
- crf: 19
- seed: 12345
- steps: 50
- width: 640
- height: 360
- prompt: A video in the style of HR, HR The video clip depicts a beach scene with several people enjoying their time. In the foreground, a man with curly hair and a mustache is wearing glasses and a red and white checkered shirt. He is sitting on a beach chair and appears to be laughing or smiling, looking off to the side. In the background, there are other people sitting on the beach, some under umbrellas, and others lying on towels. The beach is populated with various beachgoers, and the atmosphere seems relaxed and leisurely. The overall scene conveys a sense of a typical day at the beach with people engaging in typical beach activities.
- lora_url: ""
- scheduler: DPMSolverMultistepScheduler
- flow_shift: 9
- frame_rate: 16
- num_frames: 66
- enhance_end: 1
- enhance_start: 0
- force_offload: true
- lora_strength: 1
- enhance_double: true
- enhance_single: true
- enhance_weight: 0.3
- guidance_scale: 6
- denoise_strength: 1
{ "crf": 19, "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip depicts a beach scene with several people enjoying their time. In the foreground, a man with curly hair and a mustache is wearing glasses and a red and white checkered shirt. He is sitting on a beach chair and appears to be laughing or smiling, looking off to the side. In the background, there are other people sitting on the beach, some under umbrellas, and others lying on towels. The beach is populated with various beachgoers, and the atmosphere seems relaxed and leisurely. The overall scene conveys a sense of a typical day at the beach with people engaging in typical beach activities.", "lora_url": "", "scheduler": "DPMSolverMultistepScheduler", "flow_shift": 9, "frame_rate": 16, "num_frames": 66, "enhance_end": 1, "enhance_start": 0, "force_offload": true, "lora_strength": 1, "enhance_double": true, "enhance_single": true, "enhance_weight": 0.3, "guidance_scale": 6, "denoise_strength": 1 }
Install Replicate’s Node.js client library:
npm install replicate
Import and set up the client:
import Replicate from "replicate";
import fs from "node:fs";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
  "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
  {
    input: {
      crf: 19,
      seed: 12345,
      steps: 50,
      width: 640,
      height: 360,
      prompt: "A video in the style of HR, HR The video clip depicts a beach scene with several people enjoying their time. In the foreground, a man with curly hair and a mustache is wearing glasses and a red and white checkered shirt. He is sitting on a beach chair and appears to be laughing or smiling, looking off to the side. In the background, there are other people sitting on the beach, some under umbrellas, and others lying on towels. The beach is populated with various beachgoers, and the atmosphere seems relaxed and leisurely. The overall scene conveys a sense of a typical day at the beach with people engaging in typical beach activities.",
      lora_url: "",
      scheduler: "DPMSolverMultistepScheduler",
      flow_shift: 9,
      frame_rate: 16,
      num_frames: 66,
      enhance_end: 1,
      enhance_start: 0,
      force_offload: true,
      lora_strength: 1,
      enhance_double: true,
      enhance_single: true,
      enhance_weight: 0.3,
      guidance_scale: 6,
      denoise_strength: 1
    }
  }
);

// To access the file URL:
console.log(output.url()); //=> "http://example.com"

// To write the file to disk (this model returns an MP4 video):
await fs.promises.writeFile("output.mp4", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
pip install replicate
Import the client:
import replicate
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
    "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
    input={
        "crf": 19,
        "seed": 12345,
        "steps": 50,
        "width": 640,
        "height": 360,
        "prompt": "A video in the style of HR, HR The video clip depicts a beach scene with several people enjoying their time. In the foreground, a man with curly hair and a mustache is wearing glasses and a red and white checkered shirt. He is sitting on a beach chair and appears to be laughing or smiling, looking off to the side. In the background, there are other people sitting on the beach, some under umbrellas, and others lying on towels. The beach is populated with various beachgoers, and the atmosphere seems relaxed and leisurely. The overall scene conveys a sense of a typical day at the beach with people engaging in typical beach activities.",
        "lora_url": "",
        "scheduler": "DPMSolverMultistepScheduler",
        "flow_shift": 9,
        "frame_rate": 16,
        "num_frames": 66,
        "enhance_end": 1,
        "enhance_start": 0,
        "force_offload": True,
        "lora_strength": 1,
        "enhance_double": True,
        "enhance_single": True,
        "enhance_weight": 0.3,
        "guidance_scale": 6,
        "denoise_strength": 1
    }
)
print(output)
To learn more, take a look at the guide on getting started with Python.
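The Node.js example writes the result to disk, while the Python example only prints it. A Python counterpart might look like the sketch below. It assumes a recent `replicate` client in which `replicate.run()` returns a file-like object exposing `.read()`; older client versions may return a plain URL string instead, which this helper does not handle. `save_output` is a hypothetical helper name, not part of the client library.

```python
def save_output(output, path):
    """Write a file-like prediction output to `path`; return bytes written.

    Assumption: `output` exposes a .read() method that returns bytes,
    as file-like outputs from recent replicate client versions do.
    """
    data = output.read()
    with open(path, "wb") as f:
        f.write(data)
    return len(data)


# Hypothetical usage after the replicate.run(...) call shown earlier:
# save_output(output, "output.mp4")
```

The filename `output.mp4` is only an example, chosen because the logs report the result format as `video/h264-mp4`.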
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
    "input": {
      "crf": 19,
      "seed": 12345,
      "steps": 50,
      "width": 640,
      "height": 360,
      "prompt": "A video in the style of HR, HR The video clip depicts a beach scene with several people enjoying their time. In the foreground, a man with curly hair and a mustache is wearing glasses and a red and white checkered shirt. He is sitting on a beach chair and appears to be laughing or smiling, looking off to the side. In the background, there are other people sitting on the beach, some under umbrellas, and others lying on towels. The beach is populated with various beachgoers, and the atmosphere seems relaxed and leisurely. The overall scene conveys a sense of a typical day at the beach with people engaging in typical beach activities.",
      "lora_url": "",
      "scheduler": "DPMSolverMultistepScheduler",
      "flow_shift": 9,
      "frame_rate": 16,
      "num_frames": 66,
      "enhance_end": 1,
      "enhance_start": 0,
      "force_offload": true,
      "lora_strength": 1,
      "enhance_double": true,
      "enhance_single": true,
      "enhance_weight": 0.3,
      "guidance_scale": 6,
      "denoise_strength": 1
    }
  }' \
  https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2025-01-24T05:06:11.099198Z", "created_at": "2025-01-24T05:03:01.072000Z", "data_removed": false, "error": null, "id": "2c9sprsbt1rmc0cmjtrsnmqnq8", "input": { "crf": 19, "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip depicts a beach scene with several people enjoying their time. In the foreground, a man with curly hair and a mustache is wearing glasses and a red and white checkered shirt. He is sitting on a beach chair and appears to be laughing or smiling, looking off to the side. In the background, there are other people sitting on the beach, some under umbrellas, and others lying on towels. The beach is populated with various beachgoers, and the atmosphere seems relaxed and leisurely. The overall scene conveys a sense of a typical day at the beach with people engaging in typical beach activities.", "lora_url": "", "scheduler": "DPMSolverMultistepScheduler", "flow_shift": 9, "frame_rate": 16, "num_frames": 66, "enhance_end": 1, "enhance_start": 0, "force_offload": true, "lora_strength": 1, "enhance_double": true, "enhance_single": true, "enhance_weight": 0.3, "guidance_scale": 6, "denoise_strength": 1 }, "logs": "Seed set to: 12345\n⚠️ Adjusted dimensions from 640x360 to 640x368 to satisfy model requirements\n⚠️ Adjusted frame count from 66 to 65 to satisfy model requirements\nChecking inputs\n====================================\nChecking weights\n✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae\n✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models\n====================================\nRunning workflow\n[ComfyUI] got prompt\nExecuting node 7, title: HunyuanVideo VAE Loader, class type: HyVideoVAELoader\nExecuting node 42, title: HunyuanVideo Enhance A Video, class type: HyVideoEnhanceAVideo\n[ComfyUI] Loading text encoder model (clipL) from: /src/ComfyUI/models/clip/clip-vit-large-patch14\nExecuting node 16, title: (Down)Load 
HunyuanVideo TextEncoder, class type: DownloadAndLoadHyVideoTextEncoder\n[ComfyUI] Text encoder to dtype: torch.float16\n[ComfyUI] Loading tokenizer (clipL) from: /src/ComfyUI/models/clip/clip-vit-large-patch14\n[ComfyUI] Loading text encoder model (llm) from: /src/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer\n[ComfyUI]\n[ComfyUI] Loading checkpoint shards: 0%| | 0/4 [00:00<?, ?it/s]\n[ComfyUI] Loading checkpoint shards: 25%|██▌ | 1/4 [00:00<00:01, 1.77it/s]\n[ComfyUI] Loading checkpoint shards: 50%|█████ | 2/4 [00:01<00:01, 1.73it/s]\n[ComfyUI] Loading checkpoint shards: 75%|███████▌ | 3/4 [00:01<00:00, 1.76it/s]\n[ComfyUI] Loading checkpoint shards: 100%|██████████| 4/4 [00:01<00:00, 2.58it/s]\n[ComfyUI] Loading checkpoint shards: 100%|██████████| 4/4 [00:01<00:00, 2.20it/s]\n[ComfyUI] Text encoder to dtype: torch.float16\n[ComfyUI] Loading tokenizer (llm) from: /src/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer\nExecuting node 30, title: HunyuanVideo TextEncode, class type: HyVideoTextEncode\n[ComfyUI] llm prompt attention_mask shape: torch.Size([1, 161]), masked tokens: 135\n[ComfyUI] clipL prompt attention_mask shape: torch.Size([1, 77]), masked tokens: 77\nExecuting node 41, title: HunyuanVideo Lora Select, class type: HyVideoLoraSelect\nExecuting node 1, title: HunyuanVideo Model Loader, class type: HyVideoModelLoader\n[ComfyUI] model_type FLOW\n[ComfyUI] The config attributes {'use_flow_sigmas': True, 'prediction_type': 'flow_prediction'} were passed to FlowMatchDiscreteScheduler, but are not expected and will be ignored. 
Please verify your scheduler_config.json configuration file.\n[ComfyUI] Using accelerate to load and assign model weights to device...\n[ComfyUI] Loading LoRA: lora_comfyui with strength: 1.0\n[ComfyUI] Requested to load HyVideoModel\n[ComfyUI] loaded completely 9.5367431640625e+25 12555.953247070312 True\n[ComfyUI] Input (height, width, video_length) = (368, 640, 65)\nExecuting node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler\n[ComfyUI] The config attributes {'reverse': True, 'solver': 'euler'} were passed to DPMSolverMultistepScheduler, but are not expected and will be ignored. Please verify your scheduler_config.json configuration file.\n[ComfyUI] Sampling 65 frames in 17 latents at 640x368 with 50 inference steps\n[ComfyUI] Scheduler config: FrozenDict([('num_train_timesteps', 1000), ('flow_shift', 9.0), ('reverse', True), ('solver', 'euler'), ('n_tokens', None), ('_use_default_values', ['num_train_timesteps', 'n_tokens'])])\n[ComfyUI]\n[ComfyUI] 0%| | 0/50 [00:00<?, ?it/s]\n[ComfyUI] 2%|▏ | 1/50 [00:02<01:58, 2.42s/it]\n[ComfyUI] 4%|▍ | 2/50 [00:04<01:39, 2.06s/it]\n[ComfyUI] 6%|▌ | 3/50 [00:06<01:41, 2.17s/it]\n[ComfyUI] 8%|▊ | 4/50 [00:08<01:42, 2.22s/it]\n[ComfyUI] 10%|█ | 5/50 [00:11<01:41, 2.25s/it]\n[ComfyUI] 12%|█▏ | 6/50 [00:13<01:39, 2.27s/it]\n[ComfyUI] 14%|█▍ | 7/50 [00:15<01:37, 2.28s/it]\n[ComfyUI] 16%|█▌ | 8/50 [00:18<01:36, 2.30s/it]\n[ComfyUI] 18%|█▊ | 9/50 [00:20<01:34, 2.30s/it]\n[ComfyUI] 20%|██ | 10/50 [00:22<01:32, 2.30s/it]\n[ComfyUI] 22%|██▏ | 11/50 [00:24<01:29, 2.30s/it]\n[ComfyUI] 24%|██▍ | 12/50 [00:27<01:27, 2.30s/it]\n[ComfyUI] 26%|██▌ | 13/50 [00:29<01:25, 2.30s/it]\n[ComfyUI] 28%|██▊ | 14/50 [00:31<01:22, 2.30s/it]\n[ComfyUI] 30%|███ | 15/50 [00:34<01:20, 2.30s/it]\n[ComfyUI] 32%|███▏ | 16/50 [00:36<01:18, 2.30s/it]\n[ComfyUI] 34%|███▍ | 17/50 [00:38<01:16, 2.30s/it]\n[ComfyUI] 36%|███▌ | 18/50 [00:41<01:13, 2.30s/it]\n[ComfyUI] 38%|███▊ | 19/50 [00:43<01:11, 2.30s/it]\n[ComfyUI] 40%|████ | 20/50 [00:45<01:09, 
2.30s/it]\n[ComfyUI] 42%|████▏ | 21/50 [00:48<01:06, 2.30s/it]\n[ComfyUI] 44%|████▍ | 22/50 [00:50<01:04, 2.30s/it]\n[ComfyUI] 46%|████▌ | 23/50 [00:52<01:02, 2.30s/it]\n[ComfyUI] 48%|████▊ | 24/50 [00:54<00:59, 2.30s/it]\n[ComfyUI] 50%|█████ | 25/50 [00:57<00:57, 2.30s/it]\n[ComfyUI] 52%|█████▏ | 26/50 [00:59<00:55, 2.30s/it]\n[ComfyUI] 54%|█████▍ | 27/50 [01:01<00:53, 2.30s/it]\n[ComfyUI] 56%|█████▌ | 28/50 [01:04<00:50, 2.30s/it]\n[ComfyUI] 58%|█████▊ | 29/50 [01:06<00:48, 2.30s/it]\n[ComfyUI] 60%|██████ | 30/50 [01:08<00:46, 2.30s/it]\n[ComfyUI] 62%|██████▏ | 31/50 [01:11<00:43, 2.30s/it]\n[ComfyUI] 64%|██████▍ | 32/50 [01:13<00:41, 2.30s/it]\n[ComfyUI] 66%|██████▌ | 33/50 [01:15<00:39, 2.31s/it]\n[ComfyUI] 68%|██████▊ | 34/50 [01:17<00:36, 2.31s/it]\n[ComfyUI] 70%|███████ | 35/50 [01:20<00:34, 2.30s/it]\n[ComfyUI] 72%|███████▏ | 36/50 [01:22<00:32, 2.30s/it]\n[ComfyUI] 74%|███████▍ | 37/50 [01:24<00:29, 2.30s/it]\n[ComfyUI] 76%|███████▌ | 38/50 [01:27<00:27, 2.31s/it]\n[ComfyUI] 78%|███████▊ | 39/50 [01:29<00:25, 2.30s/it]\n[ComfyUI] 80%|████████ | 40/50 [01:31<00:23, 2.31s/it]\n[ComfyUI] 82%|████████▏ | 41/50 [01:34<00:20, 2.31s/it]\n[ComfyUI] 84%|████████▍ | 42/50 [01:36<00:18, 2.31s/it]\n[ComfyUI] 86%|████████▌ | 43/50 [01:38<00:16, 2.31s/it]\n[ComfyUI] 88%|████████▊ | 44/50 [01:41<00:13, 2.30s/it]\n[ComfyUI] 90%|█████████ | 45/50 [01:43<00:11, 2.30s/it]\n[ComfyUI] 92%|█████████▏| 46/50 [01:45<00:09, 2.30s/it]\n[ComfyUI] 94%|█████████▍| 47/50 [01:47<00:06, 2.30s/it]\n[ComfyUI] 96%|█████████▌| 48/50 [01:50<00:04, 2.30s/it]\n[ComfyUI] 98%|█████████▊| 49/50 [01:52<00:02, 2.30s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.30s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.30s/it]\n[ComfyUI] Allocated memory: memory=12.760 GB\n[ComfyUI] Max allocated memory: max_memory=15.559 GB\n[ComfyUI] Max reserved memory: max_reserved=16.875 GB\nExecuting node 5, title: HunyuanVideo Decode, class type: HyVideoDecode\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 
0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:01<00:01, 1.50s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.25s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.29s/it]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 28.11it/s]\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:00<00:00, 2.48it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:00<00:00, 2.99it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:00<00:00, 2.90it/s]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\nExecuting node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 84.36it/s]\n[ComfyUI] Prompt executed in 148.54 seconds\noutputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 16.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}}\n====================================\nHunyuanVideo_00001.png\nHunyuanVideo_00001.mp4", "metrics": { "predict_time": 163.065343157, "total_time": 190.027198 }, "output": "https://replicate.delivery/xezq/xPdy1MCEzqreXynpyUo3WHHijdOFw4J0dgQHZF2bUIyhjGEKA/HunyuanVideo_00001.mp4", "started_at": "2025-01-24T05:03:28.033855Z", "status": "succeeded", "urls": { "stream": "https://stream.replicate.com/v1/files/bsvm-6qlumqsy2avdszyunijc6yintvlh6n6vr7yektp6rlrl6q6nt3ka", "get": "https://api.replicate.com/v1/predictions/2c9sprsbt1rmc0cmjtrsnmqnq8", "cancel": "https://api.replicate.com/v1/predictions/2c9sprsbt1rmc0cmjtrsnmqnq8/cancel" }, "version": "a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d" }
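The output above carries three timestamps (`created_at`, `started_at`, `completed_at`) alongside `metrics.predict_time` and `metrics.total_time`. The sketch below is one way to derive queue and run durations from those timestamps; `parse_ts` and `prediction_timing` are hypothetical helper names. For this prediction the derived run time agrees with `predict_time` and the derived total agrees with `total_time`, though the page does not document how the metrics themselves are computed.

```python
from datetime import datetime


def parse_ts(ts):
    """Parse an ISO 8601 timestamp that ends in 'Z' (UTC)."""
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))


def prediction_timing(pred):
    """Return queued, running and total durations in seconds."""
    created = parse_ts(pred["created_at"])
    started = parse_ts(pred["started_at"])
    completed = parse_ts(pred["completed_at"])
    return {
        "queued_s": (started - created).total_seconds(),
        "running_s": (completed - started).total_seconds(),
        "total_s": (completed - created).total_seconds(),
    }


# Timestamps copied from the prediction output above.
timing = prediction_timing({
    "created_at": "2025-01-24T05:03:01.072000Z",
    "started_at": "2025-01-24T05:03:28.033855Z",
    "completed_at": "2025-01-24T05:06:11.099198Z",
})
print(timing)
```

Here roughly 27 seconds pass between creation and start, and the remaining time is spent running the workflow.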
Prediction
deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d
ID: bhf645er4xrmc0cmk5x8648afg
Status: Succeeded
Source: API
Hardware: H100
Input
- seed: 12345
- steps: 50
- width: 640
- height: 360
- prompt: A video in the style of HR, HR The video clip features three individuals standing in a red elevator. The person on the left is wearing a purple uniform with gold buttons and a matching cap, standing with one hand on the elevator door. The person in the center is seated in a chair, wearing a light blue suit with a white shirt and a black bow tie. This individual has a mustache and is looking directly at the camera. The person on the right is also wearing a purple uniform with a cap that has the word BOBBY written on it. The background of the elevator is a vibrant red, creating a striking contrast with the purple uniforms. The overall scene
- frame_rate: 16
- num_frames: 66
- lora_strength: 1
- guidance_scale: 6
{ "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip features three individuals standing in a red elevator. The person on the left is wearing a purple uniform with gold buttons and a matching cap, standing with one hand on the elevator door. The person in the center is seated in a chair, wearing a light blue suit with a white shirt and a black bow tie. This individual has a mustache and is looking directly at the camera. The person on the right is also wearing a purple uniform with a cap that has the word BOBBY written on it. The background of the elevator is a vibrant red, creating a striking contrast with the purple uniforms. The overall scene", "frame_rate": 16, "num_frames": 66, "lora_strength": 1, "guidance_scale": 6 }
Install Replicate’s Node.js client library:
npm install replicate
Import and set up the client:
import Replicate from "replicate";
import { writeFile } from "node:fs/promises";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
  "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
  {
    input: {
      seed: 12345,
      steps: 50,
      width: 640,
      height: 360,
      prompt: "A video in the style of HR, HR The video clip features three individuals standing in a red elevator. The person on the left is wearing a purple uniform with gold buttons and a matching cap, standing with one hand on the elevator door. The person in the center is seated in a chair, wearing a light blue suit with a white shirt and a black bow tie. This individual has a mustache and is looking directly at the camera. The person on the right is also wearing a purple uniform with a cap that has the word BOBBY written on it. The background of the elevator is a vibrant red, creating a striking contrast with the purple uniforms. The overall scene",
      frame_rate: 16,
      num_frames: 66,
      lora_strength: 1,
      guidance_scale: 6
    }
  }
);

// To access the file URL:
console.log(output.url()); //=> "http://example.com"

// To write the file to disk (the output is an MP4 video):
await writeFile("output.mp4", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
pip install replicate
Import the client:
import replicate
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
    "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
    input={
        "seed": 12345,
        "steps": 50,
        "width": 640,
        "height": 360,
        "prompt": "A video in the style of HR, HR The video clip features three individuals standing in a red elevator. The person on the left is wearing a purple uniform with gold buttons and a matching cap, standing with one hand on the elevator door. The person in the center is seated in a chair, wearing a light blue suit with a white shirt and a black bow tie. This individual has a mustache and is looking directly at the camera. The person on the right is also wearing a purple uniform with a cap that has the word BOBBY written on it. The background of the elevator is a vibrant red, creating a striking contrast with the purple uniforms. The overall scene",
        "frame_rate": 16,
        "num_frames": 66,
        "lora_strength": 1,
        "guidance_scale": 6
    }
)
print(output)
To learn more, take a look at the guide on getting started with Python.
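The Python snippet above only prints the result. A minimal sketch of saving the video to disk could look like the following. It assumes a `replicate` client version in which `replicate.run` returns a file-like object with a `read()` method; if the client returns a plain URL string instead, the sketch falls back to downloading it. The helper name `save_output` and the default file name `output.mp4` are illustrative and not part of the Replicate API.

```python
from pathlib import Path
from urllib.request import urlopen


def save_output(output, path="output.mp4"):
    """Write a prediction output to disk and return the number of bytes written.

    Assumption: `output` is either a file-like object exposing read()
    or a URL (string or string-convertible) pointing at the video.
    """
    if hasattr(output, "read"):
        data = output.read()
    else:
        # Fall back to treating the output as a URL and downloading it.
        with urlopen(str(output)) as response:
            data = response.read()
    return Path(path).write_bytes(data)
```

With the snippet above, this would be called as `save_output(output)` after `replicate.run(...)` returns.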
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
    "input": {
      "seed": 12345,
      "steps": 50,
      "width": 640,
      "height": 360,
      "prompt": "A video in the style of HR, HR The video clip features three individuals standing in a red elevator. The person on the left is wearing a purple uniform with gold buttons and a matching cap, standing with one hand on the elevator door. The person in the center is seated in a chair, wearing a light blue suit with a white shirt and a black bow tie. This individual has a mustache and is looking directly at the camera. The person on the right is also wearing a purple uniform with a cap that has the word BOBBY written on it. The background of the elevator is a vibrant red, creating a striking contrast with the purple uniforms. The overall scene",
      "frame_rate": 16,
      "num_frames": 66,
      "lora_strength": 1,
      "guidance_scale": 6
    }
  }' \
  https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2025-01-24T18:08:42.159973Z", "created_at": "2025-01-24T18:02:32.359000Z", "data_removed": false, "error": null, "id": "bhf645er4xrmc0cmk5x8648afg", "input": { "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip features three individuals standing in a red elevator. The person on the left is wearing a purple uniform with gold buttons and a matching cap, standing with one hand on the elevator door. The person in the center is seated in a chair, wearing a light blue suit with a white shirt and a black bow tie. This individual has a mustache and is looking directly at the camera. The person on the right is also wearing a purple uniform with a cap that has the word BOBBY written on it. The background of the elevator is a vibrant red, creating a striking contrast with the purple uniforms. The overall scene", "frame_rate": 16, "num_frames": 66, "lora_strength": 1, "guidance_scale": 6 }, "logs": "Seed set to: 12345\n⚠️ Adjusted dimensions from 640x360 to 640x368 to satisfy model requirements\n⚠️ Adjusted frame count from 66 to 65 to satisfy model requirements\nUSING REPLICATE WEIGHTS (preferred method)\n🎯 USING REPLICATE WEIGHTS TAR FILE 🎯\n----------------------------------------\n📦 Processing replicate weights tar file...\n🔄 Will rename LoRA to: replicate_a9a385ff-708f-444b-a29f-37ddb5b0a444.safetensors\n📂 Extracting tar contents...\n✅ Found lora_comfyui.safetensors in tar\n✨ Successfully copied LoRA to: ComfyUI/models/loras/replicate_a9a385ff-708f-444b-a29f-37ddb5b0a444.safetensors\n----------------------------------------\nChecking inputs\n====================================\nChecking weights\n✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models\n✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae\n====================================\nRunning workflow\n[ComfyUI] got prompt\nExecuting node 30, title: HunyuanVideo TextEncode, class type: 
HyVideoTextEncode\n[ComfyUI] llm prompt attention_mask shape: torch.Size([1, 161]), masked tokens: 136\n[ComfyUI] clipL prompt attention_mask shape: torch.Size([1, 77]), masked tokens: 77\nExecuting node 41, title: HunyuanVideo Lora Select, class type: HyVideoLoraSelect\nExecuting node 1, title: HunyuanVideo Model Loader, class type: HyVideoModelLoader\n[ComfyUI] model_type FLOW\n[ComfyUI] The config attributes {'use_flow_sigmas': True, 'prediction_type': 'flow_prediction'} were passed to FlowMatchDiscreteScheduler, but are not expected and will be ignored. Please verify your scheduler_config.json configuration file.\n[ComfyUI] Using accelerate to load and assign model weights to device...\n[ComfyUI] Loading LoRA: replicate_a9a385ff-708f-444b-a29f-37ddb5b0a444 with strength: 1.0\n[ComfyUI] Requested to load HyVideoModel\n[ComfyUI] loaded completely 9.5367431640625e+25 12555.953247070312 True\n[ComfyUI] Input (height, width, video_length) = (368, 640, 65)\nExecuting node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler\n[ComfyUI] The config attributes {'reverse': True, 'solver': 'euler'} were passed to DPMSolverMultistepScheduler, but are not expected and will be ignored. 
Please verify your scheduler_config.json configuration file.\n[ComfyUI] Sampling 65 frames in 17 latents at 640x368 with 50 inference steps\n[ComfyUI] Scheduler config: FrozenDict([('num_train_timesteps', 1000), ('flow_shift', 9.0), ('reverse', True), ('solver', 'euler'), ('n_tokens', None), ('_use_default_values', ['n_tokens', 'num_train_timesteps'])])\n[ComfyUI]\n[ComfyUI] 0%| | 0/50 [00:00<?, ?it/s]\n[ComfyUI] 2%|▏ | 1/50 [00:02<01:52, 2.30s/it]\n[ComfyUI] 4%|▍ | 2/50 [00:04<01:36, 2.01s/it]\n[ComfyUI] 6%|▌ | 3/50 [00:06<01:40, 2.14s/it]\n[ComfyUI] 8%|▊ | 4/50 [00:08<01:41, 2.20s/it]\n[ComfyUI] 10%|█ | 5/50 [00:11<01:40, 2.24s/it]\n[ComfyUI] 12%|█▏ | 6/50 [00:13<01:39, 2.26s/it]\n[ComfyUI] 14%|█▍ | 7/50 [00:15<01:37, 2.27s/it]\n[ComfyUI] 16%|█▌ | 8/50 [00:17<01:35, 2.28s/it]\n[ComfyUI] 18%|█▊ | 9/50 [00:20<01:33, 2.28s/it]\n[ComfyUI] 20%|██ | 10/50 [00:22<01:31, 2.29s/it]\n[ComfyUI] 22%|██▏ | 11/50 [00:24<01:29, 2.29s/it]\n[ComfyUI] 24%|██▍ | 12/50 [00:27<01:27, 2.29s/it]\n[ComfyUI] 26%|██▌ | 13/50 [00:29<01:24, 2.29s/it]\n[ComfyUI] 28%|██▊ | 14/50 [00:31<01:22, 2.29s/it]\n[ComfyUI] 30%|███ | 15/50 [00:33<01:20, 2.29s/it]\n[ComfyUI] 32%|███▏ | 16/50 [00:36<01:18, 2.29s/it]\n[ComfyUI] 34%|███▍ | 17/50 [00:38<01:15, 2.29s/it]\n[ComfyUI] 36%|███▌ | 18/50 [00:40<01:13, 2.29s/it]\n[ComfyUI] 38%|███▊ | 19/50 [00:43<01:11, 2.30s/it]\n[ComfyUI] 40%|████ | 20/50 [00:45<01:08, 2.30s/it]\n[ComfyUI] 42%|████▏ | 21/50 [00:47<01:06, 2.30s/it]\n[ComfyUI] 44%|████▍ | 22/50 [00:50<01:04, 2.30s/it]\n[ComfyUI] 46%|████▌ | 23/50 [00:52<01:01, 2.30s/it]\n[ComfyUI] 48%|████▊ | 24/50 [00:54<00:59, 2.30s/it]\n[ComfyUI] 50%|█████ | 25/50 [00:56<00:57, 2.30s/it]\n[ComfyUI] 52%|█████▏ | 26/50 [00:59<00:55, 2.30s/it]\n[ComfyUI] 54%|█████▍ | 27/50 [01:01<00:52, 2.30s/it]\n[ComfyUI] 56%|█████▌ | 28/50 [01:03<00:50, 2.30s/it]\n[ComfyUI] 58%|█████▊ | 29/50 [01:06<00:48, 2.30s/it]\n[ComfyUI] 60%|██████ | 30/50 [01:08<00:45, 2.30s/it]\n[ComfyUI] 62%|██████▏ | 31/50 [01:10<00:43, 
2.30s/it]\n[ComfyUI] 64%|██████▍ | 32/50 [01:12<00:41, 2.30s/it]\n[ComfyUI] 66%|██████▌ | 33/50 [01:15<00:39, 2.30s/it]\n[ComfyUI] 68%|██████▊ | 34/50 [01:17<00:36, 2.30s/it]\n[ComfyUI] 70%|███████ | 35/50 [01:19<00:34, 2.30s/it]\n[ComfyUI] 72%|███████▏ | 36/50 [01:22<00:32, 2.30s/it]\n[ComfyUI] 74%|███████▍ | 37/50 [01:24<00:29, 2.30s/it]\n[ComfyUI] 76%|███████▌ | 38/50 [01:26<00:27, 2.30s/it]\n[ComfyUI] 78%|███████▊ | 39/50 [01:29<00:25, 2.30s/it]\n[ComfyUI] 80%|████████ | 40/50 [01:31<00:22, 2.30s/it]\n[ComfyUI] 82%|████████▏ | 41/50 [01:33<00:20, 2.30s/it]\n[ComfyUI] 84%|████████▍ | 42/50 [01:35<00:18, 2.30s/it]\n[ComfyUI] 86%|████████▌ | 43/50 [01:38<00:16, 2.30s/it]\n[ComfyUI] 88%|████████▊ | 44/50 [01:40<00:13, 2.30s/it]\n[ComfyUI] 90%|█████████ | 45/50 [01:42<00:11, 2.30s/it]\n[ComfyUI] 92%|█████████▏| 46/50 [01:45<00:09, 2.30s/it]\n[ComfyUI] 94%|█████████▍| 47/50 [01:47<00:06, 2.30s/it]\n[ComfyUI] 96%|█████████▌| 48/50 [01:49<00:04, 2.30s/it]\n[ComfyUI] 98%|█████████▊| 49/50 [01:52<00:02, 2.30s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.30s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.29s/it]\n[ComfyUI] Allocated memory: memory=12.300 GB\n[ComfyUI] Max allocated memory: max_memory=15.099 GB\n[ComfyUI] Max reserved memory: max_reserved=16.344 GB\nExecuting node 5, title: HunyuanVideo Decode, class type: HyVideoDecode\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:01<00:01, 1.46s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.24s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.28s/it]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 25.85it/s]\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:00<00:00, 2.54it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:00<00:00, 3.02it/s]\n[ComfyUI] Decoding 
rows: 100%|██████████| 2/2 [00:00<00:00, 2.94it/s]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\nExecuting node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 65.09it/s]\n[ComfyUI] Prompt executed in 141.61 seconds\noutputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 16.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}}\n====================================\nHunyuanVideo_00001.png\nHunyuanVideo_00001.mp4", "metrics": { "predict_time": 144.213480699, "total_time": 369.800973 }, "output": "https://replicate.delivery/xezq/afIjNwR5m6VPWCrZmu70tfujnLcmRe8niAhVPEEibTfpSihQB/HunyuanVideo_00001.mp4", "started_at": "2025-01-24T18:06:17.946493Z", "status": "succeeded", "urls": { "stream": "https://stream.replicate.com/v1/files/bsvm-wmqegxs7z3ztuefp6ewjjuyvnyyzj3tsq3lk2fv6rilmgoxahsoq", "get": "https://api.replicate.com/v1/predictions/bhf645er4xrmc0cmk5x8648afg", "cancel": "https://api.replicate.com/v1/predictions/bhf645er4xrmc0cmk5x8648afg/cancel" }, "version": "a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d" }
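The logs above report that the requested 640x360 size became 640x368, that the 66 requested frames became 65, and that those 65 frames were sampled as 17 latents. A sketch of arithmetic consistent with those numbers is shown below. The rules it uses (round each side up to a multiple of 16, round the frame count down to the form 4k + 1, one latent per 4 frames plus one) are assumptions inferred from this one log, not documented behavior of the model, and the function names are hypothetical.

```python
def adjust_dimensions(width, height, multiple=16):
    """Round each side up to the nearest multiple (assumed here to be 16)."""
    def round_up(value):
        return -(-value // multiple) * multiple
    return round_up(width), round_up(height)


def adjust_frame_count(num_frames, stride=4):
    """Round down to the nearest count of the form stride * k + 1 (assumed rule)."""
    return max(1, (num_frames - 1) // stride * stride + 1)


def latent_frames(num_frames, stride=4):
    """Latent frame count for an already-adjusted frame count (assumed rule)."""
    return (num_frames - 1) // stride + 1


width, height = adjust_dimensions(640, 360)
frames = adjust_frame_count(66)
print(width, height, frames, latent_frames(frames))  # 640 368 65 17
```

Requesting a height of 368 and 65 frames directly would, under these assumptions, avoid both adjustment warnings.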
Prediction
deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d
- ID: mbgpdvfk29rm80cmk5xr8x9p38
- Status: Succeeded
- Source: API
- Hardware: H100
Input
- seed: 12345
- steps: 50
- width: 640
- height: 360
- prompt: A video in the style of HR, HR The video clip features a woman standing outdoors in what appears to be a historical or rural setting. The lighting is warm and golden, suggesting that it is either early morning or late afternoon. The woman has long, wavy hair that is partially tied back, and she is wearing a blue top with a lace or embroidered neckline. The background is slightly blurred, but it shows wooden structures and a dirt path, indicating a rustic environment. The overall atmosphere is serene and contemplative.
- frame_rate: 16
- num_frames: 66
- lora_strength: 1
- guidance_scale: 6
{ "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip features a woman standing outdoors in what appears to be a historical or rural setting. The lighting is warm and golden, suggesting that it is either early morning or late afternoon. The woman has long, wavy hair that is partially tied back, and she is wearing a blue top with a lace or embroidered neckline. The background is slightly blurred, but it shows wooden structures and a dirt path, indicating a rustic environment. The overall atmosphere is serene and contemplative.", "frame_rate": 16, "num_frames": 66, "lora_strength": 1, "guidance_scale": 6 }
Install Replicate’s Node.js client library:
npm install replicate
Import and set up the client:
import Replicate from "replicate";
import { writeFile } from "node:fs/promises";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
  "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
  {
    input: {
      seed: 12345,
      steps: 50,
      width: 640,
      height: 360,
      prompt: "A video in the style of HR, HR The video clip features a woman standing outdoors in what appears to be a historical or rural setting. The lighting is warm and golden, suggesting that it is either early morning or late afternoon. The woman has long, wavy hair that is partially tied back, and she is wearing a blue top with a lace or embroidered neckline. The background is slightly blurred, but it shows wooden structures and a dirt path, indicating a rustic environment. The overall atmosphere is serene and contemplative.",
      frame_rate: 16,
      num_frames: 66,
      lora_strength: 1,
      guidance_scale: 6
    }
  }
);

// To access the file URL:
console.log(output.url()); //=> "http://example.com"

// To write the file to disk (the output is an MP4 video):
await writeFile("output.mp4", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
pip install replicate
Import the client:
import replicate
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
    "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
    input={
        "seed": 12345,
        "steps": 50,
        "width": 640,
        "height": 360,
        "prompt": "A video in the style of HR, HR The video clip features a woman standing outdoors in what appears to be a historical or rural setting. The lighting is warm and golden, suggesting that it is either early morning or late afternoon. The woman has long, wavy hair that is partially tied back, and she is wearing a blue top with a lace or embroidered neckline. The background is slightly blurred, but it shows wooden structures and a dirt path, indicating a rustic environment. The overall atmosphere is serene and contemplative.",
        "frame_rate": 16,
        "num_frames": 66,
        "lora_strength": 1,
        "guidance_scale": 6
    }
)
print(output)
To learn more, take a look at the guide on getting started with Python.
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
    "input": {
      "seed": 12345,
      "steps": 50,
      "width": 640,
      "height": 360,
      "prompt": "A video in the style of HR, HR The video clip features a woman standing outdoors in what appears to be a historical or rural setting. The lighting is warm and golden, suggesting that it is either early morning or late afternoon. The woman has long, wavy hair that is partially tied back, and she is wearing a blue top with a lace or embroidered neckline. The background is slightly blurred, but it shows wooden structures and a dirt path, indicating a rustic environment. The overall atmosphere is serene and contemplative.",
      "frame_rate": 16,
      "num_frames": 66,
      "lora_strength": 1,
      "guidance_scale": 6
    }
  }' \
  https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
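The same HTTP request can be assembled from Python with only the standard library. The sketch below builds, but does not send, a request that mirrors the curl command above: same endpoint, same three headers, same `version` string and input fields. The function name `build_request` and the `overrides` keyword arguments are illustrative choices, not part of Replicate's API.

```python
import json
from urllib.request import Request

API_URL = "https://api.replicate.com/v1/predictions"
VERSION = (
    "deepfates/hunyuan-her:"
    "a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d"
)


def build_request(prompt, token, **overrides):
    """Build the POST request that the curl command makes, without sending it."""
    inputs = {
        "seed": 12345,
        "steps": 50,
        "width": 640,
        "height": 360,
        "prompt": prompt,
        "frame_rate": 16,
        "num_frames": 66,
        "lora_strength": 1,
        "guidance_scale": 6,
    }
    # Hypothetical convenience: let callers change any input field by keyword.
    inputs.update(overrides)
    body = json.dumps({"version": VERSION, "input": inputs}).encode("utf-8")
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
        "Prefer": "wait",
    }
    return Request(API_URL, data=body, headers=headers, method="POST")
```

Passing the returned object to `urllib.request.urlopen` would send it; the token would normally come from the `REPLICATE_API_TOKEN` environment variable, as in the curl command.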
Output
{ "completed_at": "2025-01-24T18:10:15.605639Z", "created_at": "2025-01-24T18:03:44.786000Z", "data_removed": false, "error": null, "id": "mbgpdvfk29rm80cmk5xr8x9p38", "input": { "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip features a woman standing outdoors in what appears to be a historical or rural setting. The lighting is warm and golden, suggesting that it is either early morning or late afternoon. The woman has long, wavy hair that is partially tied back, and she is wearing a blue top with a lace or embroidered neckline. The background is slightly blurred, but it shows wooden structures and a dirt path, indicating a rustic environment. The overall atmosphere is serene and contemplative.", "frame_rate": 16, "num_frames": 66, "lora_strength": 1, "guidance_scale": 6 }, "logs": "Seed set to: 12345\n⚠️ Adjusted dimensions from 640x360 to 640x368 to satisfy model requirements\n⚠️ Adjusted frame count from 66 to 65 to satisfy model requirements\nUSING REPLICATE WEIGHTS (preferred method)\n🎯 USING REPLICATE WEIGHTS TAR FILE 🎯\n----------------------------------------\n📦 Processing replicate weights tar file...\n🔄 Will rename LoRA to: replicate_f9ce0e6e-d5dc-420c-a4c5-ab602e0d31da.safetensors\n📂 Extracting tar contents...\n✅ Found lora_comfyui.safetensors in tar\n✨ Successfully copied LoRA to: ComfyUI/models/loras/replicate_f9ce0e6e-d5dc-420c-a4c5-ab602e0d31da.safetensors\n----------------------------------------\nChecking inputs\n====================================\nChecking weights\n✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models\n✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae\n====================================\nRunning workflow\n[ComfyUI] got prompt\nExecuting node 30, title: HunyuanVideo TextEncode, class type: HyVideoTextEncode\n[ComfyUI] llm prompt attention_mask shape: torch.Size([1, 161]), masked tokens: 105\n[ComfyUI] clipL 
prompt attention_mask shape: torch.Size([1, 77]), masked tokens: 77\nExecuting node 41, title: HunyuanVideo Lora Select, class type: HyVideoLoraSelect\nExecuting node 1, title: HunyuanVideo Model Loader, class type: HyVideoModelLoader\n[ComfyUI] model_type FLOW\n[ComfyUI] The config attributes {'use_flow_sigmas': True, 'prediction_type': 'flow_prediction'} were passed to FlowMatchDiscreteScheduler, but are not expected and will be ignored. Please verify your scheduler_config.json configuration file.\n[ComfyUI] Using accelerate to load and assign model weights to device...\n[ComfyUI] Loading LoRA: replicate_f9ce0e6e-d5dc-420c-a4c5-ab602e0d31da with strength: 1.0\n[ComfyUI] Requested to load HyVideoModel\n[ComfyUI] loaded completely 9.5367431640625e+25 12555.953247070312 True\n[ComfyUI] Input (height, width, video_length) = (368, 640, 65)\nExecuting node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler\n[ComfyUI] The config attributes {'reverse': True, 'solver': 'euler'} were passed to DPMSolverMultistepScheduler, but are not expected and will be ignored. 
Please verify your scheduler_config.json configuration file.\n[ComfyUI] Sampling 65 frames in 17 latents at 640x368 with 50 inference steps\n[ComfyUI] Scheduler config: FrozenDict([('num_train_timesteps', 1000), ('flow_shift', 9.0), ('reverse', True), ('solver', 'euler'), ('n_tokens', None), ('_use_default_values', ['n_tokens', 'num_train_timesteps'])])\n[ComfyUI]\n[ComfyUI] 0%| | 0/50 [00:00<?, ?it/s]\n[ComfyUI] 2%|▏ | 1/50 [00:02<01:52, 2.30s/it]\n[ComfyUI] 4%|▍ | 2/50 [00:04<01:36, 2.01s/it]\n[ComfyUI] 6%|▌ | 3/50 [00:06<01:40, 2.14s/it]\n[ComfyUI] 8%|▊ | 4/50 [00:08<01:41, 2.20s/it]\n[ComfyUI] 10%|█ | 5/50 [00:10<01:40, 2.23s/it]\n[ComfyUI] 12%|█▏ | 6/50 [00:13<01:39, 2.25s/it]\n[ComfyUI] 14%|█▍ | 7/50 [00:15<01:37, 2.27s/it]\n[ComfyUI] 16%|█▌ | 8/50 [00:17<01:35, 2.27s/it]\n[ComfyUI] 18%|█▊ | 9/50 [00:20<01:33, 2.28s/it]\n[ComfyUI] 20%|██ | 10/50 [00:22<01:31, 2.28s/it]\n[ComfyUI] 22%|██▏ | 11/50 [00:24<01:29, 2.29s/it]\n[ComfyUI] 24%|██▍ | 12/50 [00:27<01:26, 2.29s/it]\n[ComfyUI] 26%|██▌ | 13/50 [00:29<01:24, 2.29s/it]\n[ComfyUI] 28%|██▊ | 14/50 [00:31<01:22, 2.29s/it]\n[ComfyUI] 30%|███ | 15/50 [00:33<01:20, 2.29s/it]\n[ComfyUI] 32%|███▏ | 16/50 [00:36<01:17, 2.29s/it]\n[ComfyUI] 34%|███▍ | 17/50 [00:38<01:15, 2.29s/it]\n[ComfyUI] 36%|███▌ | 18/50 [00:40<01:13, 2.29s/it]\n[ComfyUI] 38%|███▊ | 19/50 [00:43<01:11, 2.29s/it]\n[ComfyUI] 40%|████ | 20/50 [00:45<01:08, 2.29s/it]\n[ComfyUI] 42%|████▏ | 21/50 [00:47<01:06, 2.29s/it]\n[ComfyUI] 44%|████▍ | 22/50 [00:49<01:04, 2.29s/it]\n[ComfyUI] 46%|████▌ | 23/50 [00:52<01:01, 2.29s/it]\n[ComfyUI] 48%|████▊ | 24/50 [00:54<00:59, 2.29s/it]\n[ComfyUI] 50%|█████ | 25/50 [00:56<00:57, 2.29s/it]\n[ComfyUI] 52%|█████▏ | 26/50 [00:59<00:55, 2.29s/it]\n[ComfyUI] 54%|█████▍ | 27/50 [01:01<00:52, 2.29s/it]\n[ComfyUI] 56%|█████▌ | 28/50 [01:03<00:50, 2.29s/it]\n[ComfyUI] 58%|█████▊ | 29/50 [01:05<00:48, 2.29s/it]\n[ComfyUI] 60%|██████ | 30/50 [01:08<00:45, 2.29s/it]\n[ComfyUI] 62%|██████▏ | 31/50 [01:10<00:43, 
2.29s/it]\n[ComfyUI] 64%|██████▍ | 32/50 [01:12<00:41, 2.29s/it]\n[ComfyUI] 66%|██████▌ | 33/50 [01:15<00:38, 2.29s/it]\n[ComfyUI] 68%|██████▊ | 34/50 [01:17<00:36, 2.29s/it]\n[ComfyUI] 70%|███████ | 35/50 [01:19<00:34, 2.29s/it]\n[ComfyUI] 72%|███████▏ | 36/50 [01:22<00:32, 2.29s/it]\n[ComfyUI] 74%|███████▍ | 37/50 [01:24<00:29, 2.29s/it]\n[ComfyUI] 76%|███████▌ | 38/50 [01:26<00:27, 2.29s/it]\n[ComfyUI] 78%|███████▊ | 39/50 [01:28<00:25, 2.29s/it]\n[ComfyUI] 80%|████████ | 40/50 [01:31<00:22, 2.29s/it]\n[ComfyUI] 82%|████████▏ | 41/50 [01:33<00:20, 2.29s/it]\n[ComfyUI] 84%|████████▍ | 42/50 [01:35<00:18, 2.29s/it]\n[ComfyUI] 86%|████████▌ | 43/50 [01:38<00:16, 2.29s/it]\n[ComfyUI] 88%|████████▊ | 44/50 [01:40<00:13, 2.29s/it]\n[ComfyUI] 90%|█████████ | 45/50 [01:42<00:11, 2.29s/it]\n[ComfyUI] 92%|█████████▏| 46/50 [01:44<00:09, 2.29s/it]\n[ComfyUI] 94%|█████████▍| 47/50 [01:47<00:06, 2.29s/it]\n[ComfyUI] 96%|█████████▌| 48/50 [01:49<00:04, 2.29s/it]\n[ComfyUI] 98%|█████████▊| 49/50 [01:51<00:02, 2.29s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.29s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.28s/it]\n[ComfyUI] Allocated memory: memory=12.300 GB\n[ComfyUI] Max allocated memory: max_memory=15.099 GB\n[ComfyUI] Max reserved memory: max_reserved=16.281 GB\nExecuting node 5, title: HunyuanVideo Decode, class type: HyVideoDecode\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:01<00:01, 1.48s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.25s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.28s/it]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 25.81it/s]\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:00<00:00, 2.53it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:00<00:00, 3.01it/s]\n[ComfyUI] Decoding 
rows: 100%|██████████| 2/2 [00:00<00:00, 2.93it/s]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\nExecuting node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 63.98it/s]\n[ComfyUI] Prompt executed in 141.93 seconds\noutputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 16.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}}\n====================================\nHunyuanVideo_00001.png\nHunyuanVideo_00001.mp4", "metrics": { "predict_time": 153.424505346, "total_time": 390.819639 }, "output": "https://replicate.delivery/xezq/fp8CHi6OBm1GXqUL6kpjENGUp26G5y2H58J0Y4gHDK6DTMEKA/HunyuanVideo_00001.mp4", "started_at": "2025-01-24T18:07:42.181133Z", "status": "succeeded", "urls": { "stream": "https://stream.replicate.com/v1/files/bsvm-iq3k6bosw45ryoicn27ru5ex6yia3j4ojdibfcj4dycdpgvys54a", "get": "https://api.replicate.com/v1/predictions/mbgpdvfk29rm80cmk5xr8x9p38", "cancel": "https://api.replicate.com/v1/predictions/mbgpdvfk29rm80cmk5xr8x9p38/cancel" }, "version": "a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d" }
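The timestamps in the JSON above can be used to split the total time into time spent waiting before the run started and time spent running. A small sketch using only the standard library follows; the helper names `parse_ts` and `timing_breakdown` are illustrative. For this prediction, the run and total durations computed this way agree with the reported `predict_time` and `total_time` to within a microsecond.

```python
from datetime import datetime


def parse_ts(value):
    """Parse an ISO 8601 timestamp ending in 'Z' into a timezone-aware datetime."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def timing_breakdown(created_at, started_at, completed_at):
    """Return (seconds waiting before start, seconds running, total seconds)."""
    created, started, completed = map(
        parse_ts, (created_at, started_at, completed_at)
    )
    return (
        (started - created).total_seconds(),
        (completed - started).total_seconds(),
        (completed - created).total_seconds(),
    )


# Timestamps copied from the prediction JSON above.
waited, ran, total = timing_breakdown(
    "2025-01-24T18:03:44.786000Z",
    "2025-01-24T18:07:42.181133Z",
    "2025-01-24T18:10:15.605639Z",
)
print(f"waited {waited:.3f}s, ran {ran:.3f}s, total {total:.3f}s")
```

Most of the total time for this prediction was spent before the run started, not in sampling itself.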
Prediction
deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d
- ID
- gbdwcqyqjxrma0cmk6htbbkxac
- Status
- Succeeded
- Source
- API
- Hardware
- H100
Input
- seed
- 12345
- steps
- 50
- width
- 640
- height
- 360
- prompt
- A video in the style of HR, HR The video clip depicts a dimly lit jazz club with a cozy, intimate atmosphere. The stage is set with a piano and a saxophone player, both engrossed in their performance. The saxophonist is positioned in the center, playing with a focused expression, while the pianist sits at the back, immersed in his music. The club is filled with patrons seated at tables, engaged in conversation and enjoying the live music. The audience appears to be a mix of men and women, dressed in casual to semi-formal attire. The lighting is warm and subdued, with a neon sign on the right side of the stage reading
- frame_rate
- 16
- num_frames
- 66
- lora_strength
- 1
- guidance_scale
- 6
{ "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip depicts a dimly lit jazz club with a cozy, intimate atmosphere. The stage is set with a piano and a saxophone player, both engrossed in their performance. The saxophonist is positioned in the center, playing with a focused expression, while the pianist sits at the back, immersed in his music. The club is filled with patrons seated at tables, engaged in conversation and enjoying the live music. The audience appears to be a mix of men and women, dressed in casual to semi-formal attire. The lighting is warm and subdued, with a neon sign on the right side of the stage reading ", "frame_rate": 16, "num_frames": 66, "lora_strength": 1, "guidance_scale": 6 }
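Every prompt on this page begins with the trigger phrase "A video in the style of HR, HR". A minimal Python sketch of a helper that enforces this follows; the name `with_trigger` is hypothetical and is not part of the Replicate client.

```python
TRIGGER_PREFIX = "A video in the style of HR, HR"


def with_trigger(prompt: str) -> str:
    """Return the prompt with the trigger phrase prepended unless it is already there."""
    text = prompt.strip()
    if text.startswith(TRIGGER_PREFIX):
        return text
    return f"{TRIGGER_PREFIX} {text}"


print(with_trigger("The video clip depicts a man walking on a rooftop at night"))
```

The result can be passed as the `prompt` input shown above.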
Install Replicate’s Node.js client library:
npm install replicate
Import and set up the client:
import Replicate from "replicate";
import fs from "node:fs";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
  "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
  {
    input: {
      seed: 12345,
      steps: 50,
      width: 640,
      height: 360,
      prompt: "A video in the style of HR, HR The video clip depicts a dimly lit jazz club with a cozy, intimate atmosphere. The stage is set with a piano and a saxophone player, both engrossed in their performance. The saxophonist is positioned in the center, playing with a focused expression, while the pianist sits at the back, immersed in his music. The club is filled with patrons seated at tables, engaged in conversation and enjoying the live music. The audience appears to be a mix of men and women, dressed in casual to semi-formal attire. The lighting is warm and subdued, with a neon sign on the right side of the stage reading ",
      frame_rate: 16,
      num_frames: 66,
      lora_strength: 1,
      guidance_scale: 6
    }
  }
);

// To access the file URL:
console.log(output.url());

// To write the video to disk (the output is an MP4, so use a matching extension):
await fs.promises.writeFile("output.mp4", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
pip install replicate
Import the client:
import replicate
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
    "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
    input={
        "seed": 12345,
        "steps": 50,
        "width": 640,
        "height": 360,
        "prompt": "A video in the style of HR, HR The video clip depicts a dimly lit jazz club with a cozy, intimate atmosphere. The stage is set with a piano and a saxophone player, both engrossed in their performance. The saxophonist is positioned in the center, playing with a focused expression, while the pianist sits at the back, immersed in his music. The club is filled with patrons seated at tables, engaged in conversation and enjoying the live music. The audience appears to be a mix of men and women, dressed in casual to semi-formal attire. The lighting is warm and subdued, with a neon sign on the right side of the stage reading ",
        "frame_rate": 16,
        "num_frames": 66,
        "lora_strength": 1,
        "guidance_scale": 6
    }
)
print(output)
To learn more, take a look at the guide on getting started with Python.
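The Python snippet above only prints the output. A minimal sketch of saving the video to disk follows. It assumes that recent versions of the client return a file-like object with a `read()` method and that older versions return a URL string; the helper name `save_output` is hypothetical.

```python
import urllib.request
from pathlib import Path


def save_output(output, path: str) -> int:
    """Write a prediction output to disk and return the number of bytes written."""
    if hasattr(output, "read"):
        # Assumed case: a file-like output object from a recent client version.
        data = output.read()
    else:
        # Assumed fallback: the output is a URL string pointing at the file.
        with urllib.request.urlopen(str(output)) as response:
            data = response.read()
    return Path(path).write_bytes(data)
```

After `replicate.run(...)` returns, `save_output(output, "output.mp4")` would write the clip next to the script.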
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
    "input": {
      "seed": 12345,
      "steps": 50,
      "width": 640,
      "height": 360,
      "prompt": "A video in the style of HR, HR The video clip depicts a dimly lit jazz club with a cozy, intimate atmosphere. The stage is set with a piano and a saxophone player, both engrossed in their performance. The saxophonist is positioned in the center, playing with a focused expression, while the pianist sits at the back, immersed in his music. The club is filled with patrons seated at tables, engaged in conversation and enjoying the live music. The audience appears to be a mix of men and women, dressed in casual to semi-formal attire. The lighting is warm and subdued, with a neon sign on the right side of the stage reading ",
      "frame_rate": 16,
      "num_frames": 66,
      "lora_strength": 1,
      "guidance_scale": 6
    }
  }' \
  https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
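The same HTTP request can be built with Python's standard library alone. The sketch below mirrors the URL, headers, and body of the curl command; the function names `build_request` and `send` are hypothetical, and nothing is sent unless `send` is called.

```python
import json
import urllib.request

API_URL = "https://api.replicate.com/v1/predictions"
VERSION = (
    "deepfates/hunyuan-her:"
    "a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d"
)


def build_request(token: str, model_input: dict) -> urllib.request.Request:
    """Build the POST request that the curl command sends."""
    body = json.dumps({"version": VERSION, "input": model_input}).encode("utf-8")
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
        "Prefer": "wait",  # same header as in the curl command
    }
    return urllib.request.Request(API_URL, data=body, headers=headers, method="POST")


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON prediction object from the response."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

A call such as `send(build_request(os.environ["REPLICATE_API_TOKEN"], {...}))` would return a prediction object shaped like the one under Output.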
Output
{ "completed_at": "2025-01-24T18:50:51.183010Z", "created_at": "2025-01-24T18:47:19.191000Z", "data_removed": false, "error": null, "id": "gbdwcqyqjxrma0cmk6htbbkxac", "input": { "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip depicts a dimly lit jazz club with a cozy, intimate atmosphere. The stage is set with a piano and a saxophone player, both engrossed in their performance. The saxophonist is positioned in the center, playing with a focused expression, while the pianist sits at the back, immersed in his music. The club is filled with patrons seated at tables, engaged in conversation and enjoying the live music. The audience appears to be a mix of men and women, dressed in casual to semi-formal attire. The lighting is warm and subdued, with a neon sign on the right side of the stage reading ", "frame_rate": 16, "num_frames": 66, "lora_strength": 1, "guidance_scale": 6 }, "logs": "Seed set to: 12345\n⚠️ Adjusted dimensions from 640x360 to 640x368 to satisfy model requirements\n⚠️ Adjusted frame count from 66 to 65 to satisfy model requirements\nUSING REPLICATE WEIGHTS (preferred method)\n🎯 USING REPLICATE WEIGHTS TAR FILE 🎯\n----------------------------------------\n📦 Processing replicate weights tar file...\n🔄 Will rename LoRA to: replicate_b993eada-6919-4267-b32d-d6d697239d7a.safetensors\n📂 Extracting tar contents...\n✅ Found lora_comfyui.safetensors in tar\n✨ Successfully copied LoRA to: ComfyUI/models/loras/replicate_b993eada-6919-4267-b32d-d6d697239d7a.safetensors\n----------------------------------------\nChecking inputs\n====================================\nChecking weights\n✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models\n✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae\n====================================\nRunning workflow\n[ComfyUI] got prompt\nExecuting node 7, title: HunyuanVideo VAE Loader, class type: 
HyVideoVAELoader\nExecuting node 42, title: HunyuanVideo Enhance A Video, class type: HyVideoEnhanceAVideo\nExecuting node 16, title: (Down)Load HunyuanVideo TextEncoder, class type: DownloadAndLoadHyVideoTextEncoder\n[ComfyUI] Loading text encoder model (clipL) from: /src/ComfyUI/models/clip/clip-vit-large-patch14\n[ComfyUI] Text encoder to dtype: torch.float16\n[ComfyUI] Loading tokenizer (clipL) from: /src/ComfyUI/models/clip/clip-vit-large-patch14\n[ComfyUI] Loading text encoder model (llm) from: /src/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer\n[ComfyUI]\n[ComfyUI] Loading checkpoint shards: 0%| | 0/4 [00:00<?, ?it/s]\n[ComfyUI] Loading checkpoint shards: 25%|██▌ | 1/4 [00:00<00:01, 1.74it/s]\n[ComfyUI] Loading checkpoint shards: 50%|█████ | 2/4 [00:01<00:01, 1.73it/s]\n[ComfyUI] Loading checkpoint shards: 75%|███████▌ | 3/4 [00:01<00:00, 1.75it/s]\n[ComfyUI] Loading checkpoint shards: 100%|██████████| 4/4 [00:01<00:00, 2.20it/s]\n[ComfyUI] Text encoder to dtype: torch.float16\n[ComfyUI] Loading tokenizer (llm) from: /src/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer\nExecuting node 30, title: HunyuanVideo TextEncode, class type: HyVideoTextEncode\n[ComfyUI] llm prompt attention_mask shape: torch.Size([1, 161]), masked tokens: 138\n[ComfyUI] clipL prompt attention_mask shape: torch.Size([1, 77]), masked tokens: 77\nExecuting node 41, title: HunyuanVideo Lora Select, class type: HyVideoLoraSelect\nExecuting node 1, title: HunyuanVideo Model Loader, class type: HyVideoModelLoader\n[ComfyUI] model_type FLOW\n[ComfyUI] The config attributes {'use_flow_sigmas': True, 'prediction_type': 'flow_prediction'} were passed to FlowMatchDiscreteScheduler, but are not expected and will be ignored. 
Please verify your scheduler_config.json configuration file.\n[ComfyUI] Using accelerate to load and assign model weights to device...\n[ComfyUI] Loading LoRA: replicate_b993eada-6919-4267-b32d-d6d697239d7a with strength: 1.0\n[ComfyUI] Requested to load HyVideoModel\n[ComfyUI] loaded completely 9.5367431640625e+25 12555.953247070312 True\n[ComfyUI] Input (height, width, video_length) = (368, 640, 65)\nExecuting node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler\n[ComfyUI] The config attributes {'reverse': True, 'solver': 'euler'} were passed to DPMSolverMultistepScheduler, but are not expected and will be ignored. Please verify your scheduler_config.json configuration file.\n[ComfyUI] Sampling 65 frames in 17 latents at 640x368 with 50 inference steps\n[ComfyUI] Scheduler config: FrozenDict([('num_train_timesteps', 1000), ('flow_shift', 9.0), ('reverse', True), ('solver', 'euler'), ('n_tokens', None), ('_use_default_values', ['num_train_timesteps', 'n_tokens'])])\n[ComfyUI]\n[ComfyUI] 0%| | 0/50 [00:00<?, ?it/s]\n[ComfyUI] 2%|▏ | 1/50 [00:02<02:01, 2.48s/it]\n[ComfyUI] 4%|▍ | 2/50 [00:04<01:40, 2.09s/it]\n[ComfyUI] 6%|▌ | 3/50 [00:06<01:42, 2.18s/it]\n[ComfyUI] 8%|▊ | 4/50 [00:08<01:42, 2.23s/it]\n[ComfyUI] 10%|█ | 5/50 [00:11<01:41, 2.25s/it]\n[ComfyUI] 12%|█▏ | 6/50 [00:13<01:39, 2.26s/it]\n[ComfyUI] 14%|█▍ | 7/50 [00:15<01:37, 2.27s/it]\n[ComfyUI] 16%|█▌ | 8/50 [00:18<01:35, 2.28s/it]\n[ComfyUI] 18%|█▊ | 9/50 [00:20<01:33, 2.28s/it]\n[ComfyUI] 20%|██ | 10/50 [00:22<01:31, 2.28s/it]\n[ComfyUI] 22%|██▏ | 11/50 [00:24<01:29, 2.29s/it]\n[ComfyUI] 24%|██▍ | 12/50 [00:27<01:26, 2.29s/it]\n[ComfyUI] 26%|██▌ | 13/50 [00:29<01:24, 2.29s/it]\n[ComfyUI] 28%|██▊ | 14/50 [00:31<01:22, 2.29s/it]\n[ComfyUI] 30%|███ | 15/50 [00:34<01:20, 2.29s/it]\n[ComfyUI] 32%|███▏ | 16/50 [00:36<01:17, 2.29s/it]\n[ComfyUI] 34%|███▍ | 17/50 [00:38<01:15, 2.29s/it]\n[ComfyUI] 36%|███▌ | 18/50 [00:40<01:13, 2.29s/it]\n[ComfyUI] 38%|███▊ | 19/50 [00:43<01:11, 2.29s/it]\n[ComfyUI] 
40%|████ | 20/50 [00:45<01:08, 2.29s/it]\n[ComfyUI] 42%|████▏ | 21/50 [00:47<01:06, 2.29s/it]\n[ComfyUI] 44%|████▍ | 22/50 [00:50<01:04, 2.29s/it]\n[ComfyUI] 46%|████▌ | 23/50 [00:52<01:01, 2.29s/it]\n[ComfyUI] 48%|████▊ | 24/50 [00:54<00:59, 2.29s/it]\n[ComfyUI] 50%|█████ | 25/50 [00:56<00:57, 2.29s/it]\n[ComfyUI] 52%|█████▏ | 26/50 [00:59<00:54, 2.29s/it]\n[ComfyUI] 54%|█████▍ | 27/50 [01:01<00:52, 2.29s/it]\n[ComfyUI] 56%|█████▌ | 28/50 [01:03<00:50, 2.29s/it]\n[ComfyUI] 58%|█████▊ | 29/50 [01:06<00:48, 2.29s/it]\n[ComfyUI] 60%|██████ | 30/50 [01:08<00:45, 2.29s/it]\n[ComfyUI] 62%|██████▏ | 31/50 [01:10<00:43, 2.29s/it]\n[ComfyUI] 64%|██████▍ | 32/50 [01:13<00:41, 2.29s/it]\n[ComfyUI] 66%|██████▌ | 33/50 [01:15<00:38, 2.29s/it]\n[ComfyUI] 68%|██████▊ | 34/50 [01:17<00:36, 2.29s/it]\n[ComfyUI] 70%|███████ | 35/50 [01:19<00:34, 2.29s/it]\n[ComfyUI] 72%|███████▏ | 36/50 [01:22<00:32, 2.29s/it]\n[ComfyUI] 74%|███████▍ | 37/50 [01:24<00:29, 2.29s/it]\n[ComfyUI] 76%|███████▌ | 38/50 [01:26<00:27, 2.29s/it]\n[ComfyUI] 78%|███████▊ | 39/50 [01:29<00:25, 2.29s/it]\n[ComfyUI] 80%|████████ | 40/50 [01:31<00:22, 2.29s/it]\n[ComfyUI] 82%|████████▏ | 41/50 [01:33<00:20, 2.29s/it]\n[ComfyUI] 84%|████████▍ | 42/50 [01:35<00:18, 2.29s/it]\n[ComfyUI] 86%|████████▌ | 43/50 [01:38<00:16, 2.29s/it]\n[ComfyUI] 88%|████████▊ | 44/50 [01:40<00:13, 2.29s/it]\n[ComfyUI] 90%|█████████ | 45/50 [01:42<00:11, 2.29s/it]\n[ComfyUI] 92%|█████████▏| 46/50 [01:45<00:09, 2.29s/it]\n[ComfyUI] 94%|█████████▍| 47/50 [01:47<00:06, 2.29s/it]\n[ComfyUI] 96%|█████████▌| 48/50 [01:49<00:04, 2.29s/it]\n[ComfyUI] 98%|█████████▊| 49/50 [01:52<00:02, 2.29s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.29s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.29s/it]\n[ComfyUI] Allocated memory: memory=12.760 GB\n[ComfyUI] Max allocated memory: max_memory=15.559 GB\n[ComfyUI] Max reserved memory: max_reserved=16.875 GB\nExecuting node 5, title: HunyuanVideo Decode, class type: 
HyVideoDecode\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:01<00:01, 1.49s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.25s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.28s/it]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 28.83it/s]\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:00<00:00, 2.50it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:00<00:00, 3.00it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:00<00:00, 2.91it/s]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\nExecuting node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 84.74it/s]\n[ComfyUI] Prompt executed in 148.27 seconds\noutputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 16.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}}\n====================================\nHunyuanVideo_00001.png\nHunyuanVideo_00001.mp4", "metrics": { "predict_time": 159.015256809, "total_time": 211.99201 }, "output": "https://replicate.delivery/xezq/ZgDI6uQONkpTHloW6CyG03mRdDskmifZFMFZn5JYTQeLMZIUA/HunyuanVideo_00001.mp4", "started_at": "2025-01-24T18:48:12.167753Z", "status": "succeeded", "urls": { "stream": "https://stream.replicate.com/v1/files/bsvm-oteg56agjs4vg6rm4klpynuxncs5qoyswilymvodsqd5a23fdi6a", "get": "https://api.replicate.com/v1/predictions/gbdwcqyqjxrma0cmk6htbbkxac", "cancel": "https://api.replicate.com/v1/predictions/gbdwcqyqjxrma0cmk6htbbkxac/cancel" }, "version": "a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d" }
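The timestamps and metrics in this output can be cross-checked against each other. The sketch below computes the intervals from the three timestamps. That `predict_time` spans `started_at` to `completed_at` and `total_time` spans `created_at` to `completed_at` is inferred from the numbers agreeing here, not taken from documentation.

```python
from datetime import datetime


def parse_ts(value: str) -> datetime:
    """Parse an ISO 8601 timestamp with a trailing 'Z' (UTC)."""
    # Older Python versions do not accept "Z" in fromisoformat, so swap it for an offset.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


created = parse_ts("2025-01-24T18:47:19.191000Z")
started = parse_ts("2025-01-24T18:48:12.167753Z")
completed = parse_ts("2025-01-24T18:50:51.183010Z")

queue_seconds = (started - created).total_seconds()    # waiting before the run started
run_seconds = (completed - started).total_seconds()    # compare with metrics.predict_time
total_seconds = (completed - created).total_seconds()  # compare with metrics.total_time

print(f"queue={queue_seconds:.3f}s run={run_seconds:.3f}s total={total_seconds:.3f}s")
```

For this prediction, roughly 53 seconds pass before the run starts, and the remaining 159 seconds match the reported `predict_time`.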
Prediction
deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d
- ID
- swd69qrqbdrmc0cmn41ap2ke6w
- Status
- Succeeded
- Source
- API
- Hardware
- H100
Input
- seed
- 12345
- steps
- 50
- width
- 640
- height
- 360
- prompt
- A video in the style of HR, HR The video clip depicts a man walking on a rooftop at night
- frame_rate
- 16
- num_frames
- 66
- lora_strength
- 1.3
- guidance_scale
- 6
{ "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip depicts a man walking on a rooftop at night", "frame_rate": 16, "num_frames": 66, "lora_strength": 1.3, "guidance_scale": 6 }
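The logs on this page report that a 640x360 request is adjusted to 640x368, that 66 frames become 65, and that 65 frames are sampled as 17 latents. The sketch below reproduces those numbers. The rules it uses (round dimensions up to a multiple of 16, round the frame count down to the form 4k + 1) are assumptions inferred from these logged values alone, and the function names are hypothetical.

```python
import math


def snap_dimension(value: int, multiple: int = 16) -> int:
    """Round a width or height up to the next multiple (assumed rule)."""
    return math.ceil(value / multiple) * multiple


def snap_frames(num_frames: int, stride: int = 4) -> int:
    """Round a frame count down to the nearest value of the form stride * k + 1 (assumed rule)."""
    return ((num_frames - 1) // stride) * stride + 1


def latent_frames(num_frames: int, stride: int = 4) -> int:
    """Number of latent frames for an already-adjusted frame count."""
    return (num_frames - 1) // stride + 1


print(snap_dimension(640), snap_dimension(360))  # 640 368
print(snap_frames(66), latent_frames(65))        # 65 17
```

Requesting values that already satisfy these constraints, such as `height: 368` and `num_frames: 65`, should avoid the adjustment warnings if the assumed rules hold.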
Install Replicate’s Node.js client library:
npm install replicate
Import and set up the client:
import Replicate from "replicate";
import fs from "node:fs";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
  "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
  {
    input: {
      seed: 12345,
      steps: 50,
      width: 640,
      height: 360,
      prompt: "A video in the style of HR, HR The video clip depicts a man walking on a rooftop at night",
      frame_rate: 16,
      num_frames: 66,
      lora_strength: 1.3,
      guidance_scale: 6
    }
  }
);

// To access the file URL:
console.log(output.url());

// To write the video to disk (the output is an MP4, so use a matching extension):
await fs.promises.writeFile("output.mp4", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
pip install replicate
Import the client:
import replicate
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
    "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
    input={
        "seed": 12345,
        "steps": 50,
        "width": 640,
        "height": 360,
        "prompt": "A video in the style of HR, HR The video clip depicts a man walking on a rooftop at night",
        "frame_rate": 16,
        "num_frames": 66,
        "lora_strength": 1.3,
        "guidance_scale": 6
    }
)
print(output)
To learn more, take a look at the guide on getting started with Python.
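The Node.js example writes the result to disk, while the Python example only prints it. A minimal sketch of the Python equivalent follows, assuming a recent `replicate` client in which `replicate.run` returns a file-like object with a `read()` method; `save_output` and the file name `output.mp4` are hypothetical names chosen for illustration.

```python
def save_output(output, path):
    """Write a file-like prediction output to `path` and return the byte count.

    Assumes `output` exposes read() returning bytes, as the file objects
    returned by recent versions of the replicate client are understood to do.
    """
    data = output.read()
    with open(path, "wb") as f:
        f.write(data)
    return len(data)
```

With the `output` value from the call above, usage would look like `save_output(output, "output.mp4")`.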
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
    "input": {
      "seed": 12345,
      "steps": 50,
      "width": 640,
      "height": 360,
      "prompt": "A video in the style of HR, HR The video clip depicts a man walking on a rooftop at night",
      "frame_rate": 16,
      "num_frames": 66,
      "lora_strength": 1.3,
      "guidance_scale": 6
    }
  }' \
  https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
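The `Prefer: wait` header in the curl request holds the connection open only for a limited time (about a minute, as far as I understand), and the predictions on this page took several minutes end to end. A request may therefore return before the video is ready. The sketch below polls the prediction's `urls.get` endpoint until it reaches a terminal status. The helper names, polling interval, and timeout are assumptions for illustration; the status values are the ones Replicate documents for predictions.

```python
import json
import time
import urllib.request

# Statuses after which a prediction no longer changes.
TERMINAL_STATUSES = {"succeeded", "failed", "canceled"}


def fetch_prediction(get_url, token):
    """GET the prediction JSON from its `urls.get` endpoint."""
    request = urllib.request.Request(
        get_url, headers={"Authorization": f"Bearer {token}"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def wait_for_prediction(get_url, token, fetch=fetch_prediction,
                        interval=5.0, timeout=900.0, sleep=time.sleep):
    """Poll until the prediction finishes or `timeout` seconds elapse.

    `fetch` and `sleep` are injectable so the loop can be exercised
    without network access.
    """
    deadline = time.monotonic() + timeout
    while True:
        prediction = fetch(get_url, token)
        if prediction["status"] in TERMINAL_STATUSES:
            return prediction
        if time.monotonic() >= deadline:
            raise TimeoutError(f"prediction still {prediction['status']}")
        sleep(interval)
```

The `get_url` argument is the `urls.get` value from the JSON that the POST request returns, and the token is the same `REPLICATE_API_TOKEN` used in the curl command.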
Output
{ "completed_at": "2025-01-27T18:30:05.749292Z", "created_at": "2025-01-27T18:24:34.139000Z", "data_removed": false, "error": null, "id": "swd69qrqbdrmc0cmn41ap2ke6w", "input": { "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip depicts a man walking on a rooftop at night", "frame_rate": 16, "num_frames": 66, "lora_strength": 1.3, "guidance_scale": 6 }, "logs": "Seed set to: 12345\n⚠️ Adjusted dimensions from 640x360 to 640x368 to satisfy model requirements\n⚠️ Adjusted frame count from 66 to 65 to satisfy model requirements\n�� USING REPLICATE WEIGHTS (preferred method)\n🎯 USING REPLICATE WEIGHTS TAR FILE 🎯\n----------------------------------------\n📦 Processing replicate weights tar file...\n🔄 Will rename LoRA to: replicate_13f01b69-4afa-4c15-b30b-27e425aec803.safetensors\n📂 Extracting tar contents...\n✅ Found lora_comfyui.safetensors in tar\n✨ Successfully copied LoRA to: ComfyUI/models/loras/replicate_13f01b69-4afa-4c15-b30b-27e425aec803.safetensors\n----------------------------------------\nChecking inputs\n====================================\nChecking weights\n✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models\n✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae\n====================================\nRunning workflow\n[ComfyUI] got prompt\nExecuting node 30, title: HunyuanVideo TextEncode, class type: HyVideoTextEncode\n[ComfyUI] llm prompt attention_mask shape: torch.Size([1, 161]), masked tokens: 22\n[ComfyUI] clipL prompt attention_mask shape: torch.Size([1, 77]), masked tokens: 23\nExecuting node 41, title: HunyuanVideo Lora Select, class type: HyVideoLoraSelect\nExecuting node 1, title: HunyuanVideo Model Loader, class type: HyVideoModelLoader\n[ComfyUI] model_type FLOW\n[ComfyUI] The config attributes {'use_flow_sigmas': True, 'prediction_type': 'flow_prediction'} were passed to FlowMatchDiscreteScheduler, but are not expected and will be 
ignored. Please verify your scheduler_config.json configuration file.\n[ComfyUI] Using accelerate to load and assign model weights to device...\n[ComfyUI] Loading LoRA: replicate_13f01b69-4afa-4c15-b30b-27e425aec803 with strength: 1.3\n[ComfyUI] Requested to load HyVideoModel\n[ComfyUI] loaded completely 9.5367431640625e+25 12555.953247070312 True\n[ComfyUI] Input (height, width, video_length) = (368, 640, 65)\nExecuting node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler\n[ComfyUI] The config attributes {'reverse': True, 'solver': 'euler'} were passed to DPMSolverMultistepScheduler, but are not expected and will be ignored. Please verify your scheduler_config.json configuration file.\n[ComfyUI] Sampling 65 frames in 17 latents at 640x368 with 50 inference steps\n[ComfyUI]\n[ComfyUI] Scheduler config: FrozenDict([('num_train_timesteps', 1000), ('flow_shift', 9.0), ('reverse', True), ('solver', 'euler'), ('n_tokens', None), ('_use_default_values', ['n_tokens', 'num_train_timesteps'])])\n[ComfyUI] 0%| | 0/50 [00:00<?, ?it/s]\n[ComfyUI] 2%|▏ | 1/50 [00:02<01:52, 2.30s/it]\n[ComfyUI] 4%|▍ | 2/50 [00:04<01:36, 2.02s/it]\n[ComfyUI] 6%|▌ | 3/50 [00:06<01:40, 2.15s/it]\n[ComfyUI] 8%|▊ | 4/50 [00:08<01:41, 2.21s/it]\n[ComfyUI] 10%|█ | 5/50 [00:11<01:40, 2.24s/it]\n[ComfyUI] 12%|█▏ | 6/50 [00:13<01:39, 2.26s/it]\n[ComfyUI] 14%|█▍ | 7/50 [00:15<01:37, 2.27s/it]\n[ComfyUI] 16%|█▌ | 8/50 [00:17<01:35, 2.28s/it]\n[ComfyUI] 18%|█▊ | 9/50 [00:20<01:33, 2.29s/it]\n[ComfyUI] 20%|██ | 10/50 [00:22<01:31, 2.29s/it]\n[ComfyUI] 22%|██▏ | 11/50 [00:24<01:29, 2.29s/it]\n[ComfyUI] 24%|██▍ | 12/50 [00:27<01:27, 2.30s/it]\n[ComfyUI] 26%|██▌ | 13/50 [00:29<01:24, 2.30s/it]\n[ComfyUI] 28%|██▊ | 14/50 [00:31<01:22, 2.30s/it]\n[ComfyUI] 30%|███ | 15/50 [00:34<01:20, 2.30s/it]\n[ComfyUI] 32%|███▏ | 16/50 [00:36<01:18, 2.30s/it]\n[ComfyUI] 34%|███▍ | 17/50 [00:38<01:15, 2.30s/it]\n[ComfyUI] 36%|███▌ | 18/50 [00:40<01:13, 2.30s/it]\n[ComfyUI] 38%|███▊ | 19/50 [00:43<01:11, 
2.30s/it]\n[ComfyUI] 40%|████ | 20/50 [00:45<01:09, 2.30s/it]\n[ComfyUI] 42%|████▏ | 21/50 [00:47<01:06, 2.30s/it]\n[ComfyUI] 44%|████▍ | 22/50 [00:50<01:04, 2.30s/it]\n[ComfyUI] 46%|████▌ | 23/50 [00:52<01:02, 2.30s/it]\n[ComfyUI] 48%|████▊ | 24/50 [00:54<00:59, 2.30s/it]\n[ComfyUI] 50%|█████ | 25/50 [00:57<00:57, 2.30s/it]\n[ComfyUI] 52%|█████▏ | 26/50 [00:59<00:55, 2.30s/it]\n[ComfyUI] 54%|█████▍ | 27/50 [01:01<00:52, 2.30s/it]\n[ComfyUI] 56%|█████▌ | 28/50 [01:03<00:50, 2.30s/it]\n[ComfyUI] 58%|█████▊ | 29/50 [01:06<00:48, 2.30s/it]\n[ComfyUI] 60%|██████ | 30/50 [01:08<00:45, 2.30s/it]\n[ComfyUI] 62%|██████▏ | 31/50 [01:10<00:43, 2.30s/it]\n[ComfyUI] 64%|██████▍ | 32/50 [01:13<00:41, 2.30s/it]\n[ComfyUI] 66%|██████▌ | 33/50 [01:15<00:39, 2.30s/it]\n[ComfyUI] 68%|██████▊ | 34/50 [01:17<00:36, 2.30s/it]\n[ComfyUI] 70%|███████ | 35/50 [01:20<00:34, 2.30s/it]\n[ComfyUI] 72%|███████▏ | 36/50 [01:22<00:32, 2.30s/it]\n[ComfyUI] 74%|███████▍ | 37/50 [01:24<00:29, 2.30s/it]\n[ComfyUI] 76%|███████▌ | 38/50 [01:26<00:27, 2.30s/it]\n[ComfyUI] 78%|███████▊ | 39/50 [01:29<00:25, 2.30s/it]\n[ComfyUI] 80%|████████ | 40/50 [01:31<00:22, 2.30s/it]\n[ComfyUI] 82%|████████▏ | 41/50 [01:33<00:20, 2.30s/it]\n[ComfyUI] 84%|████████▍ | 42/50 [01:36<00:18, 2.30s/it]\n[ComfyUI] 86%|████████▌ | 43/50 [01:38<00:16, 2.30s/it]\n[ComfyUI] 88%|████████▊ | 44/50 [01:40<00:13, 2.30s/it]\n[ComfyUI] 90%|█████████ | 45/50 [01:43<00:11, 2.30s/it]\n[ComfyUI] 92%|█████████▏| 46/50 [01:45<00:09, 2.30s/it]\n[ComfyUI] 94%|█████████▍| 47/50 [01:47<00:06, 2.30s/it]\n[ComfyUI] 96%|█████████▌| 48/50 [01:49<00:04, 2.30s/it]\n[ComfyUI] 98%|█████████▊| 49/50 [01:52<00:02, 2.30s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.30s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.29s/it]\n[ComfyUI] Allocated memory: memory=12.300 GB\n[ComfyUI] Max allocated memory: max_memory=15.099 GB\n[ComfyUI] Max reserved memory: max_reserved=16.281 GB\nExecuting node 5, title: HunyuanVideo Decode, class type: 
HyVideoDecode\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:01<00:01, 1.47s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.25s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.28s/it]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 25.84it/s]\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:00<00:00, 2.54it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:00<00:00, 3.01it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:00<00:00, 2.93it/s]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\nExecuting node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 64.99it/s]\n[ComfyUI] Prompt executed in 142.30 seconds\noutputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 16.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}}\n====================================\nHunyuanVideo_00001.png\nHunyuanVideo_00001.mp4", "metrics": { "predict_time": 146.563728387, "total_time": 331.610292 }, "output": "https://replicate.delivery/xezq/27uVtekRwc3MWyVGT8PndknCTCNgOcgOfRleSoYMDHrbVwSoA/HunyuanVideo_00001.mp4", "started_at": "2025-01-27T18:27:39.185563Z", "status": "succeeded", "urls": { "stream": "https://stream.replicate.com/v1/files/bsvm-i24tslloqhsjmyxp2ej26h6tg7gi5gjvdu2elwpa3jjmfglz6qbq", "get": "https://api.replicate.com/v1/predictions/swd69qrqbdrmc0cmn41ap2ke6w", "cancel": "https://api.replicate.com/v1/predictions/swd69qrqbdrmc0cmn41ap2ke6w/cancel" }, "version": "a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d" }
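The logs above show the requested 640x360 frame being adjusted to 640x368 and the 66 requested frames to 65, which are then sampled as 17 latents. One rule consistent with those numbers is that dimensions are rounded up to a multiple of 16 and frame counts are rounded down to the form 4k + 1. The sketch below encodes that guess. It is inferred from this single log only, so the real model code may round differently, and all function names are hypothetical.

```python
def adjust_dimension(value, multiple=16):
    """Round a width or height up to the next multiple (assumed to be 16)."""
    return -(-value // multiple) * multiple


def adjust_frame_count(num_frames, stride=4):
    """Round a frame count down to the nearest value of the form stride*k + 1."""
    return max(1, (num_frames - 1) // stride * stride + 1)


def latent_frame_count(num_frames, stride=4):
    """Number of temporal latents for an already adjusted frame count."""
    return (num_frames - 1) // stride + 1


print(adjust_dimension(640), adjust_dimension(360))   # 640 368
print(adjust_frame_count(66))                         # 65
print(latent_frame_count(65))                         # 17
```

Passing values that already satisfy these constraints (for example `height=368` and `num_frames=65`) should avoid the adjustment warnings, if the assumed rule holds.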
Prediction
deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d
ID: c8d1tr7bcsrmc0cmn4qrdwcwg0
Status: Succeeded
Source: API
Hardware: H100
Input
- seed
- 12345
- steps
- 50
- width
- 640
- height
- 360
- prompt
- A video in the style of HR, HR The video clip depicts A warrior woman with long dark hair standing on a cliff edge, her black cloak billowing in the storm wind
- frame_rate
- 16
- num_frames
- 66
- lora_strength
- 1.2
- guidance_scale
- 6
{ "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip depicts A warrior woman with long dark hair standing on a cliff edge, her black cloak billowing in the storm wind", "frame_rate": 16, "num_frames": 66, "lora_strength": 1.2, "guidance_scale": 6 }
Install Replicate’s Node.js client library:
npm install replicate
Import and set up the client:
import Replicate from "replicate";
import { writeFile } from "node:fs/promises";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
  "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
  {
    input: {
      seed: 12345,
      steps: 50,
      width: 640,
      height: 360,
      prompt: "A video in the style of HR, HR The video clip depicts A warrior woman with long dark hair standing on a cliff edge, her black cloak billowing in the storm wind",
      frame_rate: 16,
      num_frames: 66,
      lora_strength: 1.2,
      guidance_scale: 6
    }
  }
);

// To access the file URL:
console.log(output.url()); //=> "http://example.com"

// To write the video to disk (the promise-based writeFile needs no callback):
await writeFile("output.mp4", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
pip install replicate
Import the client:
import replicate
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
    "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
    input={
        "seed": 12345,
        "steps": 50,
        "width": 640,
        "height": 360,
        "prompt": "A video in the style of HR, HR The video clip depicts A warrior woman with long dark hair standing on a cliff edge, her black cloak billowing in the storm wind",
        "frame_rate": 16,
        "num_frames": 66,
        "lora_strength": 1.2,
        "guidance_scale": 6
    }
)
print(output)
To learn more, take a look at the guide on getting started with Python.
Run deepfates/hunyuan-her using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "deepfates/hunyuan-her:a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d",
    "input": {
      "seed": 12345,
      "steps": 50,
      "width": 640,
      "height": 360,
      "prompt": "A video in the style of HR, HR The video clip depicts A warrior woman with long dark hair standing on a cliff edge, her black cloak billowing in the storm wind",
      "frame_rate": 16,
      "num_frames": 66,
      "lora_strength": 1.2,
      "guidance_scale": 6
    }
  }' \
  https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2025-01-27T19:19:36.739638Z", "created_at": "2025-01-27T19:14:37.542000Z", "data_removed": false, "error": null, "id": "c8d1tr7bcsrmc0cmn4qrdwcwg0", "input": { "seed": 12345, "steps": 50, "width": 640, "height": 360, "prompt": "A video in the style of HR, HR The video clip depicts A warrior woman with long dark hair standing on a cliff edge, her black cloak billowing in the storm wind", "frame_rate": 16, "num_frames": 66, "lora_strength": 1.2, "guidance_scale": 6 }, "logs": "Seed set to: 12345\n⚠️ Adjusted dimensions from 640x360 to 640x368 to satisfy model requirements\n⚠️ Adjusted frame count from 66 to 65 to satisfy model requirements\n�� USING REPLICATE WEIGHTS (preferred method)\n🎯 USING REPLICATE WEIGHTS TAR FILE 🎯\n----------------------------------------\n📦 Processing replicate weights tar file...\n🔄 Will rename LoRA to: replicate_7d0fd4e0-57c1-4091-ab5a-70211710e639.safetensors\n📂 Extracting tar contents...\n✅ Found lora_comfyui.safetensors in tar\n✨ Successfully copied LoRA to: ComfyUI/models/loras/replicate_7d0fd4e0-57c1-4091-ab5a-70211710e639.safetensors\n----------------------------------------\nChecking inputs\n====================================\nChecking weights\n✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models\n✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae\n====================================\nRunning workflow\n[ComfyUI] got prompt\nExecuting node 30, title: HunyuanVideo TextEncode, class type: HyVideoTextEncode\n[ComfyUI] llm prompt attention_mask shape: torch.Size([1, 161]), masked tokens: 36\n[ComfyUI] clipL prompt attention_mask shape: torch.Size([1, 77]), masked tokens: 37\nExecuting node 41, title: HunyuanVideo Lora Select, class type: HyVideoLoraSelect\nExecuting node 1, title: HunyuanVideo Model Loader, class type: HyVideoModelLoader\n[ComfyUI] model_type FLOW\n[ComfyUI] The config attributes {'use_flow_sigmas': True, 'prediction_type': 'flow_prediction'} were 
passed to FlowMatchDiscreteScheduler, but are not expected and will be ignored. Please verify your scheduler_config.json configuration file.\n[ComfyUI] Using accelerate to load and assign model weights to device...\n[ComfyUI] Loading LoRA: replicate_7d0fd4e0-57c1-4091-ab5a-70211710e639 with strength: 1.2\n[ComfyUI] Requested to load HyVideoModel\n[ComfyUI] loaded completely 9.5367431640625e+25 12555.953247070312 True\n[ComfyUI] Input (height, width, video_length) = (368, 640, 65)\n[ComfyUI] The config attributes {'reverse': True, 'solver': 'euler'} were passed to DPMSolverMultistepScheduler, but are not expected and will be ignored. Please verify your scheduler_config.json configuration file.\nExecuting node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler\n[ComfyUI] Sampling 65 frames in 17 latents at 640x368 with 50 inference steps\n[ComfyUI] Scheduler config: FrozenDict([('num_train_timesteps', 1000), ('flow_shift', 9.0), ('reverse', True), ('solver', 'euler'), ('n_tokens', None), ('_use_default_values', ['num_train_timesteps', 'n_tokens'])])\n[ComfyUI]\n[ComfyUI] 0%| | 0/50 [00:00<?, ?it/s]\n[ComfyUI] 2%|▏ | 1/50 [00:02<01:52, 2.30s/it]\n[ComfyUI] 4%|▍ | 2/50 [00:04<01:36, 2.02s/it]\n[ComfyUI] 6%|▌ | 3/50 [00:06<01:40, 2.15s/it]\n[ComfyUI] 8%|▊ | 4/50 [00:08<01:41, 2.21s/it]\n[ComfyUI] 10%|█ | 5/50 [00:11<01:40, 2.24s/it]\n[ComfyUI] 12%|█▏ | 6/50 [00:13<01:39, 2.26s/it]\n[ComfyUI] 14%|█▍ | 7/50 [00:15<01:37, 2.28s/it]\n[ComfyUI] 16%|█▌ | 8/50 [00:17<01:35, 2.29s/it]\n[ComfyUI] 18%|█▊ | 9/50 [00:20<01:33, 2.29s/it]\n[ComfyUI] 20%|██ | 10/50 [00:22<01:31, 2.29s/it]\n[ComfyUI] 22%|██▏ | 11/50 [00:24<01:29, 2.30s/it]\n[ComfyUI] 24%|██▍ | 12/50 [00:27<01:27, 2.30s/it]\n[ComfyUI] 26%|██▌ | 13/50 [00:29<01:25, 2.30s/it]\n[ComfyUI] 28%|██▊ | 14/50 [00:31<01:22, 2.30s/it]\n[ComfyUI] 30%|███ | 15/50 [00:34<01:20, 2.30s/it]\n[ComfyUI] 32%|███▏ | 16/50 [00:36<01:18, 2.30s/it]\n[ComfyUI] 34%|███▍ | 17/50 [00:38<01:15, 2.30s/it]\n[ComfyUI] 36%|███▌ | 18/50 
[00:40<01:13, 2.30s/it]\n[ComfyUI] 38%|███▊ | 19/50 [00:43<01:11, 2.30s/it]\n[ComfyUI] 40%|████ | 20/50 [00:45<01:09, 2.30s/it]\n[ComfyUI] 42%|████▏ | 21/50 [00:47<01:06, 2.30s/it]\n[ComfyUI] 44%|████▍ | 22/50 [00:50<01:04, 2.30s/it]\n[ComfyUI] 46%|████▌ | 23/50 [00:52<01:02, 2.30s/it]\n[ComfyUI] 48%|████▊ | 24/50 [00:54<00:59, 2.30s/it]\n[ComfyUI] 50%|█████ | 25/50 [00:57<00:57, 2.30s/it]\n[ComfyUI] 52%|█████▏ | 26/50 [00:59<00:55, 2.30s/it]\n[ComfyUI] 54%|█████▍ | 27/50 [01:01<00:52, 2.30s/it]\n[ComfyUI] 56%|█████▌ | 28/50 [01:03<00:50, 2.30s/it]\n[ComfyUI] 58%|█████▊ | 29/50 [01:06<00:48, 2.30s/it]\n[ComfyUI] 60%|██████ | 30/50 [01:08<00:46, 2.30s/it]\n[ComfyUI] 62%|██████▏ | 31/50 [01:10<00:43, 2.30s/it]\n[ComfyUI] 64%|██████▍ | 32/50 [01:13<00:41, 2.30s/it]\n[ComfyUI] 66%|██████▌ | 33/50 [01:15<00:39, 2.30s/it]\n[ComfyUI] 68%|██████▊ | 34/50 [01:17<00:36, 2.30s/it]\n[ComfyUI] 70%|███████ | 35/50 [01:20<00:34, 2.30s/it]\n[ComfyUI] 72%|███████▏ | 36/50 [01:22<00:32, 2.30s/it]\n[ComfyUI] 74%|███████▍ | 37/50 [01:24<00:29, 2.30s/it]\n[ComfyUI] 76%|███████▌ | 38/50 [01:26<00:27, 2.30s/it]\n[ComfyUI] 78%|███████▊ | 39/50 [01:29<00:25, 2.30s/it]\n[ComfyUI] 80%|████████ | 40/50 [01:31<00:23, 2.30s/it]\n[ComfyUI] 82%|████████▏ | 41/50 [01:33<00:20, 2.30s/it]\n[ComfyUI] 84%|████████▍ | 42/50 [01:36<00:18, 2.30s/it]\n[ComfyUI] 86%|████████▌ | 43/50 [01:38<00:16, 2.30s/it]\n[ComfyUI] 88%|████████▊ | 44/50 [01:40<00:13, 2.30s/it]\n[ComfyUI] 90%|█████████ | 45/50 [01:43<00:11, 2.30s/it]\n[ComfyUI] 92%|█████████▏| 46/50 [01:45<00:09, 2.30s/it]\n[ComfyUI] 94%|█████████▍| 47/50 [01:47<00:06, 2.30s/it]\n[ComfyUI] 96%|█████████▌| 48/50 [01:50<00:04, 2.30s/it]\n[ComfyUI] 98%|█████████▊| 49/50 [01:52<00:02, 2.30s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.30s/it]\n[ComfyUI] 100%|██████████| 50/50 [01:54<00:00, 2.29s/it]\n[ComfyUI] Allocated memory: memory=12.300 GB\n[ComfyUI] Max allocated memory: max_memory=15.099 GB\n[ComfyUI] Max reserved memory: max_reserved=16.281 
GB\nExecuting node 5, title: HunyuanVideo Decode, class type: HyVideoDecode\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:01<00:01, 1.47s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.26s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:02<00:00, 1.30s/it]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 25.67it/s]\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/2 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 50%|█████ | 1/2 [00:00<00:00, 2.53it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:00<00:00, 2.99it/s]\n[ComfyUI] Decoding rows: 100%|██████████| 2/2 [00:00<00:00, 2.91it/s]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/2 [00:00<?, ?it/s]\nExecuting node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine\n[ComfyUI] Blending tiles: 100%|██████████| 2/2 [00:00<00:00, 64.07it/s]\n[ComfyUI] Prompt executed in 140.05 seconds\noutputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 16.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}}\n====================================\nHunyuanVideo_00001.png\nHunyuanVideo_00001.mp4", "metrics": { "predict_time": 147.810784679, "total_time": 299.197638 }, "output": "https://replicate.delivery/xezq/eG9qqeH6k5oFHEeN8hy8IeKrJzqvaf3ld3D8yunJncgGJHLhC/HunyuanVideo_00001.mp4", "started_at": "2025-01-27T19:17:08.928853Z", "status": "succeeded", "urls": { "stream": "https://stream.replicate.com/v1/files/bsvm-nxix2wydgl2dnjnovs56lvt3hhqrcfjvzo2dp6zx3usz6qjiy6uq", "get": "https://api.replicate.com/v1/predictions/c8d1tr7bcsrmc0cmn4qrdwcwg0", "cancel": "https://api.replicate.com/v1/predictions/c8d1tr7bcsrmc0cmn4qrdwcwg0/cancel" }, "version": "a4289ea2d95ec71b94fcbf243d47e406b473b93e5bb700c5ec8966730d8ca63d" }
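The `metrics` values in this response can be reproduced from its three timestamps: `total_time` matches `completed_at - created_at`, and `predict_time` matches `completed_at - started_at`. The sketch below derives both, plus the time spent before the run started. The function names are hypothetical, and reading the pre-start gap as queueing or setup time is an assumption.

```python
from datetime import datetime


def parse_timestamp(value):
    """Parse an ISO 8601 timestamp ending in 'Z' as an aware UTC datetime."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def prediction_timing(prediction):
    """Return seconds spent waiting, running, and overall for a prediction."""
    created = parse_timestamp(prediction["created_at"])
    started = parse_timestamp(prediction["started_at"])
    completed = parse_timestamp(prediction["completed_at"])
    return {
        "waiting": (started - created).total_seconds(),
        "running": (completed - started).total_seconds(),
        "total": (completed - created).total_seconds(),
    }


# Timestamps copied from the response above.
timing = prediction_timing({
    "created_at": "2025-01-27T19:14:37.542000Z",
    "started_at": "2025-01-27T19:17:08.928853Z",
    "completed_at": "2025-01-27T19:19:36.739638Z",
})
print(timing)
```

For this prediction, roughly half of the total time elapsed before the run started, so `predict_time` alone understates the end-to-end latency a caller sees.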