lucataco / hunyuan-heygen-woman-2
HunyuanVideo finetune of an AI avatar from Heygen
- Public
- 185 runs
-
H100
- Fine-tune
Prediction
lucataco/hunyuan-heygen-woman-2:923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4IDv2yv7zgj5hrmc0cmc39a8ckf2rStatusSucceededSourceWebHardwareH100Total durationCreatedInput
- crf
- 19
- steps
- 30
- width
- 960
- height
- 544
- prompt
- HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The couch is adorned with several throw pillows, each with a black and white geometric pattern. The background includes a wooden chair with a yellow cushion, a wooden side table, and a large mirror with a wooden frame. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression
- lora_url
- flow_shift
- 9
- frame_rate
- 20
- num_frames
- 49
- force_offload
- lora_strength
- 0.9
- guidance_scale
- 6
- denoise_strength
- 1
{ "crf": 19, "steps": 30, "width": 960, "height": 544, "prompt": "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The couch is adorned with several throw pillows, each with a black and white geometric pattern. The background includes a wooden chair with a yellow cushion, a wooden side table, and a large mirror with a wooden frame. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", "lora_url": "", "flow_shift": 9, "frame_rate": 20, "num_frames": 49, "force_offload": true, "lora_strength": 0.9, "guidance_scale": 6, "denoise_strength": 1 }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run lucataco/hunyuan-heygen-woman-2 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "lucataco/hunyuan-heygen-woman-2:923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4", { input: { crf: 19, steps: 30, width: 960, height: 544, prompt: "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The couch is adorned with several throw pillows, each with a black and white geometric pattern. The background includes a wooden chair with a yellow cushion, a wooden side table, and a large mirror with a wooden frame. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", lora_url: "", flow_shift: 9, frame_rate: 20, num_frames: 49, force_offload: true, lora_strength: 0.9, guidance_scale: 6, denoise_strength: 1 } } ); // To access the file URL: console.log(output.url()); //=> "http://example.com" // To write the file to disk: fs.writeFile("my-image.png", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run lucataco/hunyuan-heygen-woman-2 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "lucataco/hunyuan-heygen-woman-2:923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4", input={ "crf": 19, "steps": 30, "width": 960, "height": 544, "prompt": "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The couch is adorned with several throw pillows, each with a black and white geometric pattern. The background includes a wooden chair with a yellow cushion, a wooden side table, and a large mirror with a wooden frame. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", "lora_url": "", "flow_shift": 9, "frame_rate": 20, "num_frames": 49, "force_offload": True, "lora_strength": 0.9, "guidance_scale": 6, "denoise_strength": 1 } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Run lucataco/hunyuan-heygen-woman-2 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "lucataco/hunyuan-heygen-woman-2:923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4", "input": { "crf": 19, "steps": 30, "width": 960, "height": 544, "prompt": "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The couch is adorned with several throw pillows, each with a black and white geometric pattern. The background includes a wooden chair with a yellow cushion, a wooden side table, and a large mirror with a wooden frame. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", "lora_url": "", "flow_shift": 9, "frame_rate": 20, "num_frames": 49, "force_offload": true, "lora_strength": 0.9, "guidance_scale": 6, "denoise_strength": 1 } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2025-01-13T18:03:26.723907Z", "created_at": "2025-01-13T17:59:27.532000Z", "data_removed": false, "error": null, "id": "v2yv7zgj5hrmc0cmc39a8ckf2r", "input": { "crf": 19, "steps": 30, "width": 960, "height": 544, "prompt": "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The couch is adorned with several throw pillows, each with a black and white geometric pattern. The background includes a wooden chair with a yellow cushion, a wooden side table, and a large mirror with a wooden frame. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", "lora_url": "", "flow_shift": 9, "frame_rate": 20, "num_frames": 49, "force_offload": true, "lora_strength": 0.9, "guidance_scale": 6, "denoise_strength": 1 }, "logs": "Random seed set to: 3841941661\nChecking inputs\n====================================\nChecking weights\n✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models\n✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae\n====================================\nRunning workflow\n[ComfyUI] got prompt\nExecuting node 7, title: HunyuanVideo VAE Loader, class type: HyVideoVAELoader\n[ComfyUI] Loading text encoder model (clipL) from: /src/ComfyUI/models/clip/clip-vit-large-patch14\nExecuting node 16, title: (Down)Load HunyuanVideo TextEncoder, class type: DownloadAndLoadHyVideoTextEncoder\n[ComfyUI] Text encoder to dtype: torch.float16\n[ComfyUI] Loading tokenizer (clipL) from: /src/ComfyUI/models/clip/clip-vit-large-patch14\n[ComfyUI] Loading text encoder model (llm) from: /src/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer\n[ComfyUI]\n[ComfyUI] Loading checkpoint shards: 0%| | 0/4 [00:00<?, ?it/s]\n[ComfyUI] Loading checkpoint shards: 25%|██▌ | 1/4 [00:00<00:02, 1.43it/s]\n[ComfyUI] Loading checkpoint shards: 50%|█████ | 2/4 [00:01<00:01, 1.23it/s]\n[ComfyUI] Loading checkpoint shards: 75%|███████▌ | 3/4 [00:02<00:00, 1.13it/s]\n[ComfyUI] Loading checkpoint shards: 100%|██████████| 4/4 [00:02<00:00, 1.57it/s]\n[ComfyUI] Loading checkpoint shards: 100%|██████████| 4/4 [00:02<00:00, 1.42it/s]\n[ComfyUI] Text encoder to dtype: torch.float16\n[ComfyUI] Loading tokenizer (llm) from: /src/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer\nExecuting node 30, title: HunyuanVideo TextEncode, class type: HyVideoTextEncode\n[ComfyUI] llm prompt attention_mask shape: torch.Size([1, 161]), masked tokens: 123\n[ComfyUI] clipL prompt attention_mask shape: torch.Size([1, 77]), masked tokens: 77\nExecuting node 41, title: HunyuanVideo Lora Select, class type: HyVideoLoraSelect\nExecuting node 1, title: HunyuanVideo Model Loader, class type: HyVideoModelLoader\n[ComfyUI] model_type FLOW\n[ComfyUI] Using accelerate to load and assign model weights to device...\n[ComfyUI] Loading LoRA: lora_comfyui with strength: 0.9\n[ComfyUI] Requested to load HyVideoModel\n[ComfyUI] Loading 1 new model\n[ComfyUI] loaded completely 0.0 12555.953247070312 True\n[ComfyUI] Input (height, width, video_length) = (544, 960, 49)\nExecuting node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler\n[ComfyUI] Sampling 49 frames in 13 latents at 960x544 with 30 inference steps\n[ComfyUI] Scheduler config: FrozenDict([('num_train_timesteps', 1000), ('shift', 9.0), ('reverse', True), ('solver', 'euler'), ('n_tokens', None), ('_use_default_values', ['num_train_timesteps', 'n_tokens'])])[ComfyUI]\n[ComfyUI] 0%| | 0/30 [00:00<?, ?it/s]\n[ComfyUI] 3%|▎ | 1/30 [00:04<01:58, 4.09s/it]\n[ComfyUI] 7%|▋ | 2/30 [00:09<02:17, 4.92s/it]\n[ComfyUI] 10%|█ | 3/30 [00:15<02:19, 5.18s/it]\n[ComfyUI] 13%|█▎ | 4/30 [00:20<02:17, 5.31s/it]\n[ComfyUI] 17%|█▋ | 5/30 [00:26<02:14, 5.38s/it]\n[ComfyUI] 20%|██ | 6/30 [00:31<02:10, 5.42s/it]\n[ComfyUI] 23%|██▎ | 7/30 [00:37<02:05, 5.45s/it]\n[ComfyUI] 27%|██▋ | 8/30 [00:42<02:00, 5.46s/it]\n[ComfyUI] 30%|███ | 9/30 [00:48<01:55, 5.48s/it]\n[ComfyUI] 33%|███▎ | 10/30 [00:53<01:49, 5.48s/it]\n[ComfyUI] 37%|███▋ | 11/30 [00:59<01:44, 5.49s/it]\n[ComfyUI] 40%|████ | 12/30 [01:04<01:38, 5.49s/it]\n[ComfyUI] 43%|████▎ | 13/30 [01:10<01:33, 5.50s/it]\n[ComfyUI] 47%|████▋ | 14/30 [01:15<01:27, 5.50s/it]\n[ComfyUI] 50%|█████ | 15/30 [01:21<01:22, 5.50s/it]\n[ComfyUI] 53%|█████▎ | 16/30 [01:26<01:17, 5.50s/it]\n[ComfyUI] 57%|█████▋ | 17/30 [01:32<01:11, 5.51s/it]\n[ComfyUI] 60%|██████ | 18/30 [01:37<01:06, 5.50s/it]\n[ComfyUI] 63%|██████▎ | 19/30 [01:43<01:00, 5.50s/it]\n[ComfyUI] 67%|██████▋ | 20/30 [01:48<00:55, 5.51s/it]\n[ComfyUI] 70%|███████ | 21/30 [01:54<00:49, 5.50s/it]\n[ComfyUI] 73%|███████▎ | 22/30 [01:59<00:44, 5.50s/it]\n[ComfyUI] 77%|███████▋ | 23/30 [02:05<00:38, 5.51s/it]\n[ComfyUI] 80%|████████ | 24/30 [02:10<00:33, 5.51s/it]\n[ComfyUI] 83%|████████▎ | 25/30 [02:16<00:27, 5.51s/it]\n[ComfyUI] 87%|████████▋ | 26/30 [02:21<00:22, 5.50s/it]\n[ComfyUI] 90%|█████████ | 27/30 [02:27<00:16, 5.50s/it]\n[ComfyUI] 93%|█████████▎| 28/30 [02:32<00:11, 5.50s/it]\n[ComfyUI] 97%|█████████▋| 29/30 [02:38<00:05, 5.50s/it]\n[ComfyUI] 100%|██████████| 30/30 [02:43<00:00, 5.50s/it]\n[ComfyUI] 100%|██████████| 30/30 [02:43<00:00, 5.46s/it]\n[ComfyUI] Allocated memory: memory=12.760 GB\n[ComfyUI] Max allocated memory: max_memory=18.839 GB\n[ComfyUI] Max reserved memory: max_reserved=20.719 GB\nExecuting node 5, title: HunyuanVideo Decode, class type: HyVideoDecode\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/3 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 33%|███▎ | 1/3 [00:01<00:03, 1.51s/it]\n[ComfyUI] Decoding rows: 67%|██████▋ | 2/3 [00:03<00:01, 1.62s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 3/3 [00:04<00:00, 1.40s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 3/3 [00:04<00:00, 1.45s/it]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/3 [00:00<?, ?it/s]\n[ComfyUI] Blending tiles: 33%|███▎ | 1/3 [00:00<00:00, 7.82it/s]\nExecuting node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine\n[ComfyUI] Blending tiles: 100%|██████████| 3/3 [00:00<00:00, 18.19it/s]\n[ComfyUI] Prompt executed in 201.71 seconds\noutputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 20.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}}\n====================================\nHunyuanVideo_00001.png\nHunyuanVideo_00001.mp4", "metrics": { "predict_time": 208.047554521, "total_time": 239.191907 }, "output": "https://replicate.delivery/xezq/zTp9KsCdeFyf0EsWVRwCtE1BLaQQDmuvBvP0sunEJBDudwEUA/HunyuanVideo_00001.mp4", "started_at": "2025-01-13T17:59:58.676352Z", "status": "succeeded", "urls": { "stream": "https://stream.replicate.com/v1/files/bcwr-7cue56omya2vzamcrwojmpueok7ergmkfw3byomekeqclu46cdqq", "get": "https://api.replicate.com/v1/predictions/v2yv7zgj5hrmc0cmc39a8ckf2r", "cancel": "https://api.replicate.com/v1/predictions/v2yv7zgj5hrmc0cmc39a8ckf2r/cancel" }, "version": "923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4" }
Generated inRandom seed set to: 3841941661 Checking inputs ==================================== Checking weights ✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models ✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae ==================================== Running workflow [ComfyUI] got prompt Executing node 7, title: HunyuanVideo VAE Loader, class type: HyVideoVAELoader [ComfyUI] Loading text encoder model (clipL) from: /src/ComfyUI/models/clip/clip-vit-large-patch14 Executing node 16, title: (Down)Load HunyuanVideo TextEncoder, class type: DownloadAndLoadHyVideoTextEncoder [ComfyUI] Text encoder to dtype: torch.float16 [ComfyUI] Loading tokenizer (clipL) from: /src/ComfyUI/models/clip/clip-vit-large-patch14 [ComfyUI] Loading text encoder model (llm) from: /src/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer [ComfyUI] [ComfyUI] Loading checkpoint shards: 0%| | 0/4 [00:00<?, ?it/s] [ComfyUI] Loading checkpoint shards: 25%|██▌ | 1/4 [00:00<00:02, 1.43it/s] [ComfyUI] Loading checkpoint shards: 50%|█████ | 2/4 [00:01<00:01, 1.23it/s] [ComfyUI] Loading checkpoint shards: 75%|███████▌ | 3/4 [00:02<00:00, 1.13it/s] [ComfyUI] Loading checkpoint shards: 100%|██████████| 4/4 [00:02<00:00, 1.57it/s] [ComfyUI] Loading checkpoint shards: 100%|██████████| 4/4 [00:02<00:00, 1.42it/s] [ComfyUI] Text encoder to dtype: torch.float16 [ComfyUI] Loading tokenizer (llm) from: /src/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer Executing node 30, title: HunyuanVideo TextEncode, class type: HyVideoTextEncode [ComfyUI] llm prompt attention_mask shape: torch.Size([1, 161]), masked tokens: 123 [ComfyUI] clipL prompt attention_mask shape: torch.Size([1, 77]), masked tokens: 77 Executing node 41, title: HunyuanVideo Lora Select, class type: HyVideoLoraSelect Executing node 1, title: HunyuanVideo Model Loader, class type: HyVideoModelLoader [ComfyUI] model_type FLOW [ComfyUI] Using accelerate to load and assign model weights to device... [ComfyUI] Loading LoRA: lora_comfyui with strength: 0.9 [ComfyUI] Requested to load HyVideoModel [ComfyUI] Loading 1 new model [ComfyUI] loaded completely 0.0 12555.953247070312 True [ComfyUI] Input (height, width, video_length) = (544, 960, 49) Executing node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler [ComfyUI] Sampling 49 frames in 13 latents at 960x544 with 30 inference steps [ComfyUI] Scheduler config: FrozenDict([('num_train_timesteps', 1000), ('shift', 9.0), ('reverse', True), ('solver', 'euler'), ('n_tokens', None), ('_use_default_values', ['num_train_timesteps', 'n_tokens'])])[ComfyUI] [ComfyUI] 0%| | 0/30 [00:00<?, ?it/s] [ComfyUI] 3%|▎ | 1/30 [00:04<01:58, 4.09s/it] [ComfyUI] 7%|▋ | 2/30 [00:09<02:17, 4.92s/it] [ComfyUI] 10%|█ | 3/30 [00:15<02:19, 5.18s/it] [ComfyUI] 13%|█▎ | 4/30 [00:20<02:17, 5.31s/it] [ComfyUI] 17%|█▋ | 5/30 [00:26<02:14, 5.38s/it] [ComfyUI] 20%|██ | 6/30 [00:31<02:10, 5.42s/it] [ComfyUI] 23%|██▎ | 7/30 [00:37<02:05, 5.45s/it] [ComfyUI] 27%|██▋ | 8/30 [00:42<02:00, 5.46s/it] [ComfyUI] 30%|███ | 9/30 [00:48<01:55, 5.48s/it] [ComfyUI] 33%|███▎ | 10/30 [00:53<01:49, 5.48s/it] [ComfyUI] 37%|███▋ | 11/30 [00:59<01:44, 5.49s/it] [ComfyUI] 40%|████ | 12/30 [01:04<01:38, 5.49s/it] [ComfyUI] 43%|████▎ | 13/30 [01:10<01:33, 5.50s/it] [ComfyUI] 47%|████▋ | 14/30 [01:15<01:27, 5.50s/it] [ComfyUI] 50%|█████ | 15/30 [01:21<01:22, 5.50s/it] [ComfyUI] 53%|█████▎ | 16/30 [01:26<01:17, 5.50s/it] [ComfyUI] 57%|█████▋ | 17/30 [01:32<01:11, 5.51s/it] [ComfyUI] 60%|██████ | 18/30 [01:37<01:06, 5.50s/it] [ComfyUI] 63%|██████▎ | 19/30 [01:43<01:00, 5.50s/it] [ComfyUI] 67%|██████▋ | 20/30 [01:48<00:55, 5.51s/it] [ComfyUI] 70%|███████ | 21/30 [01:54<00:49, 5.50s/it] [ComfyUI] 73%|███████▎ | 22/30 [01:59<00:44, 5.50s/it] [ComfyUI] 77%|███████▋ | 23/30 [02:05<00:38, 5.51s/it] [ComfyUI] 80%|████████ | 24/30 [02:10<00:33, 5.51s/it] [ComfyUI] 83%|████████▎ | 25/30 [02:16<00:27, 5.51s/it] [ComfyUI] 87%|████████▋ | 26/30 [02:21<00:22, 5.50s/it] [ComfyUI] 90%|█████████ | 27/30 [02:27<00:16, 5.50s/it] [ComfyUI] 93%|█████████▎| 28/30 [02:32<00:11, 5.50s/it] [ComfyUI] 97%|█████████▋| 29/30 [02:38<00:05, 5.50s/it] [ComfyUI] 100%|██████████| 30/30 [02:43<00:00, 5.50s/it] [ComfyUI] 100%|██████████| 30/30 [02:43<00:00, 5.46s/it] [ComfyUI] Allocated memory: memory=12.760 GB [ComfyUI] Max allocated memory: max_memory=18.839 GB [ComfyUI] Max reserved memory: max_reserved=20.719 GB Executing node 5, title: HunyuanVideo Decode, class type: HyVideoDecode [ComfyUI] [ComfyUI] Decoding rows: 0%| | 0/3 [00:00<?, ?it/s] [ComfyUI] Decoding rows: 33%|███▎ | 1/3 [00:01<00:03, 1.51s/it] [ComfyUI] Decoding rows: 67%|██████▋ | 2/3 [00:03<00:01, 1.62s/it] [ComfyUI] Decoding rows: 100%|██████████| 3/3 [00:04<00:00, 1.40s/it] [ComfyUI] Decoding rows: 100%|██████████| 3/3 [00:04<00:00, 1.45s/it] [ComfyUI] [ComfyUI] Blending tiles: 0%| | 0/3 [00:00<?, ?it/s] [ComfyUI] Blending tiles: 33%|███▎ | 1/3 [00:00<00:00, 7.82it/s] Executing node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine [ComfyUI] Blending tiles: 100%|██████████| 3/3 [00:00<00:00, 18.19it/s] [ComfyUI] Prompt executed in 201.71 seconds outputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 20.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}} ==================================== HunyuanVideo_00001.png HunyuanVideo_00001.mp4
Prediction
lucataco/hunyuan-heygen-woman-2:923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4IDzrppshf3qnrmc0cmc3erg6djpmStatusSucceededSourceWebHardwareH100Total durationCreatedInput
- crf
- 19
- steps
- 50
- width
- 960
- height
- 544
- prompt
- HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression
- lora_url
- flow_shift
- 9
- frame_rate
- 20
- num_frames
- 49
- force_offload
- lora_strength
- 0.9
- guidance_scale
- 6
- denoise_strength
- 1
{ "crf": 19, "steps": 50, "width": 960, "height": 544, "prompt": "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", "lora_url": "", "flow_shift": 9, "frame_rate": 20, "num_frames": 49, "force_offload": true, "lora_strength": 0.9, "guidance_scale": 6, "denoise_strength": 1 }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run lucataco/hunyuan-heygen-woman-2 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "lucataco/hunyuan-heygen-woman-2:923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4", { input: { crf: 19, steps: 50, width: 960, height: 544, prompt: "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", lora_url: "", flow_shift: 9, frame_rate: 20, num_frames: 49, force_offload: true, lora_strength: 0.9, guidance_scale: 6, denoise_strength: 1 } } ); // To access the file URL: console.log(output.url()); //=> "http://example.com" // To write the file to disk: fs.writeFile("my-image.png", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run lucataco/hunyuan-heygen-woman-2 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "lucataco/hunyuan-heygen-woman-2:923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4", input={ "crf": 19, "steps": 50, "width": 960, "height": 544, "prompt": "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", "lora_url": "", "flow_shift": 9, "frame_rate": 20, "num_frames": 49, "force_offload": True, "lora_strength": 0.9, "guidance_scale": 6, "denoise_strength": 1 } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Run lucataco/hunyuan-heygen-woman-2 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "lucataco/hunyuan-heygen-woman-2:923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4", "input": { "crf": 19, "steps": 50, "width": 960, "height": 544, "prompt": "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", "lora_url": "", "flow_shift": 9, "frame_rate": 20, "num_frames": 49, "force_offload": true, "lora_strength": 0.9, "guidance_scale": 6, "denoise_strength": 1 } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2025-01-13T18:17:22.710540Z", "created_at": "2025-01-13T18:12:22.077000Z", "data_removed": false, "error": null, "id": "zrppshf3qnrmc0cmc3erg6djpm", "input": { "crf": 19, "steps": 50, "width": 960, "height": 544, "prompt": "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", "lora_url": "", "flow_shift": 9, "frame_rate": 20, "num_frames": 49, "force_offload": true, "lora_strength": 0.9, "guidance_scale": 6, "denoise_strength": 1 }, "logs": "Random seed set to: 261764370\nChecking inputs\n====================================\nChecking weights\n✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae\n✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models\n====================================\nRunning workflow\n[ComfyUI] got prompt\n[ComfyUI] Input (height, width, video_length) = (544, 960, 49)\nExecuting node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler\n[ComfyUI] Sampling 49 frames in 13 latents at 960x544 with 50 inference steps\n[ComfyUI]\n[ComfyUI] 0%| | 0/50 [00:00<?, ?it/s]\n[ComfyUI] 2%|▏ | 1/50 [00:03<03:15, 3.99s/it]\n[ComfyUI] 4%|▍ | 2/50 [00:09<03:55, 4.90s/it]\n[ComfyUI] 6%|▌ | 3/50 [00:15<04:03, 5.18s/it]\n[ComfyUI] 8%|▊ | 4/50 [00:20<04:04, 5.32s/it]\n[ComfyUI] 10%|█ | 5/50 [00:26<04:02, 5.40s/it]\n[ComfyUI] 12%|█▏ | 6/50 [00:31<03:59, 5.44s/it]\n[ComfyUI] 14%|█▍ | 7/50 [00:37<03:55, 5.47s/it]\n[ComfyUI] 16%|█▌ | 8/50 [00:42<03:50, 5.49s/it]\n[ComfyUI] 18%|█▊ | 9/50 [00:48<03:45, 5.50s/it]\n[ComfyUI] 20%|██ | 10/50 [00:53<03:40, 5.51s/it]\n[ComfyUI] 22%|██▏ | 11/50 [00:59<03:35, 5.52s/it]\n[ComfyUI] 24%|██▍ | 12/50 [01:04<03:29, 5.52s/it]\n[ComfyUI] 26%|██▌ | 13/50 [01:10<03:24, 5.52s/it]\n[ComfyUI] 28%|██▊ | 14/50 [01:15<03:18, 5.53s/it]\n[ComfyUI] 30%|███ | 15/50 [01:21<03:13, 5.53s/it]\n[ComfyUI] 32%|███▏ | 16/50 [01:26<03:08, 5.53s/it]\n[ComfyUI] 34%|███▍ | 17/50 [01:32<03:02, 5.53s/it]\n[ComfyUI] 36%|███▌ | 18/50 [01:38<02:56, 5.53s/it]\n[ComfyUI] 38%|███▊ | 19/50 [01:43<02:51, 5.53s/it]\n[ComfyUI] 40%|████ | 20/50 [01:49<02:45, 5.53s/it]\n[ComfyUI] 42%|████▏ | 21/50 [01:54<02:40, 5.53s/it]\n[ComfyUI] 44%|████▍ | 22/50 [02:00<02:34, 5.53s/it]\n[ComfyUI] 46%|████▌ | 23/50 [02:05<02:29, 5.53s/it]\n[ComfyUI] 48%|████▊ | 24/50 [02:11<02:23, 5.53s/it]\n[ComfyUI] 50%|█████ | 25/50 [02:16<02:18, 5.53s/it]\n[ComfyUI] 52%|█████▏ | 26/50 [02:22<02:12, 5.53s/it]\n[ComfyUI] 54%|█████▍ | 27/50 [02:27<02:07, 5.53s/it]\n[ComfyUI] 56%|█████▌ | 28/50 [02:33<02:01, 5.53s/it]\n[ComfyUI] 58%|█████▊ | 29/50 [02:38<01:56, 5.53s/it]\n[ComfyUI] 60%|██████ | 30/50 [02:44<01:50, 5.53s/it]\n[ComfyUI] 62%|██████▏ | 31/50 [02:49<01:45, 5.53s/it]\n[ComfyUI] 64%|██████▍ | 32/50 [02:55<01:39, 5.53s/it]\n[ComfyUI] 66%|██████▌ | 33/50 [03:00<01:34, 5.53s/it]\n[ComfyUI] 68%|██████▊ | 34/50 [03:06<01:28, 5.53s/it]\n[ComfyUI] 70%|███████ | 35/50 [03:12<01:22, 5.53s/it]\n[ComfyUI] 72%|███████▏ | 36/50 [03:17<01:17, 5.53s/it]\n[ComfyUI] 74%|███████▍ | 37/50 [03:23<01:11, 5.53s/it]\n[ComfyUI] 76%|███████▌ | 38/50 [03:28<01:06, 5.53s/it]\n[ComfyUI] 78%|███████▊ | 39/50 [03:34<01:00, 5.53s/it]\n[ComfyUI] 80%|████████ | 40/50 [03:39<00:55, 5.53s/it]\n[ComfyUI] 82%|████████▏ | 41/50 [03:45<00:49, 5.53s/it]\n[ComfyUI] 84%|████████▍ | 42/50 [03:50<00:44, 5.53s/it]\n[ComfyUI] 86%|████████▌ | 43/50 [03:56<00:38, 5.53s/it]\n[ComfyUI] 88%|████████▊ | 44/50 [04:01<00:33, 5.53s/it]\n[ComfyUI] 90%|█████████ | 45/50 [04:07<00:27, 5.53s/it]\n[ComfyUI] 92%|█████████▏| 46/50 [04:12<00:22, 5.53s/it]\n[ComfyUI] 94%|█████████▍| 47/50 [04:18<00:16, 5.53s/it]\n[ComfyUI] 96%|█████████▌| 48/50 [04:23<00:11, 5.53s/it]\n[ComfyUI] 98%|█████████▊| 49/50 [04:29<00:05, 5.53s/it]\n[ComfyUI] 100%|██████████| 50/50 [04:35<00:00, 5.53s/it]\n[ComfyUI] 100%|██████████| 50/50 [04:35<00:00, 5.50s/it]\n[ComfyUI] Allocated memory: memory=12.301 GB\n[ComfyUI] Max allocated memory: max_memory=18.380 GB\n[ComfyUI] Max reserved memory: max_reserved=21.719 GB\nExecuting node 5, title: HunyuanVideo Decode, class type: HyVideoDecode\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/3 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 33%|███▎ | 1/3 [00:01<00:03, 1.50s/it]\n[ComfyUI] Decoding rows: 67%|██████▋ | 2/3 [00:03<00:01, 1.64s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 3/3 [00:04<00:00, 1.41s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 3/3 [00:04<00:00, 1.46s/it]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/3 [00:00<?, ?it/s]\n[ComfyUI] Blending tiles: 33%|███▎ | 1/3 [00:00<00:00, 7.29it/s]\nExecuting node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine\n[ComfyUI] Blending tiles: 100%|██████████| 3/3 [00:00<00:00, 17.09it/s]\n[ComfyUI] Prompt executed in 289.02 seconds\noutputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 20.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}}\n====================================\nHunyuanVideo_00001.png\nHunyuanVideo_00001.mp4", "metrics": { "predict_time": 300.624867101, "total_time": 300.63354 }, "output": "https://replicate.delivery/xezq/n9W5kwvFlU61FdxZPAqRX2haK26DtSMT0LPbNROb5CmsKMBF/HunyuanVideo_00001.mp4", "started_at": "2025-01-13T18:12:22.085673Z", "status": "succeeded", "urls": { "stream": "https://stream.replicate.com/v1/files/bcwr-jfz55en5zdqvkv3qqkkog6bnagvhqmwxiwvmkz4ivqljsu64epgq", "get": "https://api.replicate.com/v1/predictions/zrppshf3qnrmc0cmc3erg6djpm", "cancel": "https://api.replicate.com/v1/predictions/zrppshf3qnrmc0cmc3erg6djpm/cancel" }, "version": "923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4" }
Generated inRandom seed set to: 261764370 Checking inputs ==================================== Checking weights ✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae ✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models ==================================== Running workflow [ComfyUI] got prompt [ComfyUI] Input (height, width, video_length) = (544, 960, 49) Executing node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler [ComfyUI] Sampling 49 frames in 13 latents at 960x544 with 50 inference steps [ComfyUI] [ComfyUI] 0%| | 0/50 [00:00<?, ?it/s] [ComfyUI] 2%|▏ | 1/50 [00:03<03:15, 3.99s/it] [ComfyUI] 4%|▍ | 2/50 [00:09<03:55, 4.90s/it] [ComfyUI] 6%|▌ | 3/50 [00:15<04:03, 5.18s/it] [ComfyUI] 8%|▊ | 4/50 [00:20<04:04, 5.32s/it] [ComfyUI] 10%|█ | 5/50 [00:26<04:02, 5.40s/it] [ComfyUI] 12%|█▏ | 6/50 [00:31<03:59, 5.44s/it] [ComfyUI] 14%|█▍ | 7/50 [00:37<03:55, 5.47s/it] [ComfyUI] 16%|█▌ | 8/50 [00:42<03:50, 5.49s/it] [ComfyUI] 18%|█▊ | 9/50 [00:48<03:45, 5.50s/it] [ComfyUI] 20%|██ | 10/50 [00:53<03:40, 5.51s/it] [ComfyUI] 22%|██▏ | 11/50 [00:59<03:35, 5.52s/it] [ComfyUI] 24%|██▍ | 12/50 [01:04<03:29, 5.52s/it] [ComfyUI] 26%|██▌ | 13/50 [01:10<03:24, 5.52s/it] [ComfyUI] 28%|██▊ | 14/50 [01:15<03:18, 5.53s/it] [ComfyUI] 30%|███ | 15/50 [01:21<03:13, 5.53s/it] [ComfyUI] 32%|███▏ | 16/50 [01:26<03:08, 5.53s/it] [ComfyUI] 34%|███▍ | 17/50 [01:32<03:02, 5.53s/it] [ComfyUI] 36%|███▌ | 18/50 [01:38<02:56, 5.53s/it] [ComfyUI] 38%|███▊ | 19/50 [01:43<02:51, 5.53s/it] [ComfyUI] 40%|████ | 20/50 [01:49<02:45, 5.53s/it] [ComfyUI] 42%|████▏ | 21/50 [01:54<02:40, 5.53s/it] [ComfyUI] 44%|████▍ | 22/50 [02:00<02:34, 5.53s/it] [ComfyUI] 46%|████▌ | 23/50 [02:05<02:29, 5.53s/it] [ComfyUI] 48%|████▊ | 24/50 [02:11<02:23, 5.53s/it] [ComfyUI] 50%|█████ | 25/50 [02:16<02:18, 5.53s/it] [ComfyUI] 52%|█████▏ | 26/50 [02:22<02:12, 5.53s/it] [ComfyUI] 54%|█████▍ | 27/50 [02:27<02:07, 5.53s/it] [ComfyUI] 56%|█████▌ | 28/50 [02:33<02:01, 5.53s/it] [ComfyUI] 58%|█████▊ | 29/50 [02:38<01:56, 5.53s/it] [ComfyUI] 60%|██████ | 30/50 [02:44<01:50, 5.53s/it] [ComfyUI] 62%|██████▏ | 31/50 [02:49<01:45, 5.53s/it] [ComfyUI] 64%|██████▍ | 32/50 [02:55<01:39, 5.53s/it] [ComfyUI] 66%|██████▌ | 33/50 [03:00<01:34, 5.53s/it] [ComfyUI] 68%|██████▊ | 34/50 [03:06<01:28, 5.53s/it] [ComfyUI] 70%|███████ | 35/50 [03:12<01:22, 5.53s/it] [ComfyUI] 72%|███████▏ | 36/50 [03:17<01:17, 5.53s/it] [ComfyUI] 74%|███████▍ | 37/50 [03:23<01:11, 5.53s/it] [ComfyUI] 76%|███████▌ | 38/50 [03:28<01:06, 5.53s/it] [ComfyUI] 78%|███████▊ | 39/50 [03:34<01:00, 5.53s/it] [ComfyUI] 80%|████████ | 40/50 [03:39<00:55, 5.53s/it] [ComfyUI] 82%|████████▏ | 41/50 [03:45<00:49, 5.53s/it] [ComfyUI] 84%|████████▍ | 42/50 [03:50<00:44, 5.53s/it] [ComfyUI] 86%|████████▌ | 43/50 [03:56<00:38, 5.53s/it] [ComfyUI] 88%|████████▊ | 44/50 [04:01<00:33, 5.53s/it] [ComfyUI] 90%|█████████ | 45/50 [04:07<00:27, 5.53s/it] [ComfyUI] 92%|█████████▏| 46/50 [04:12<00:22, 5.53s/it] [ComfyUI] 94%|█████████▍| 47/50 [04:18<00:16, 5.53s/it] [ComfyUI] 96%|█████████▌| 48/50 [04:23<00:11, 5.53s/it] [ComfyUI] 98%|█████████▊| 49/50 [04:29<00:05, 5.53s/it] [ComfyUI] 100%|██████████| 50/50 [04:35<00:00, 5.53s/it] [ComfyUI] 100%|██████████| 50/50 [04:35<00:00, 5.50s/it] [ComfyUI] Allocated memory: memory=12.301 GB [ComfyUI] Max allocated memory: max_memory=18.380 GB [ComfyUI] Max reserved memory: max_reserved=21.719 GB Executing node 5, title: HunyuanVideo Decode, class type: HyVideoDecode [ComfyUI] [ComfyUI] Decoding rows: 0%| | 0/3 [00:00<?, ?it/s] [ComfyUI] Decoding rows: 33%|███▎ | 1/3 [00:01<00:03, 1.50s/it] [ComfyUI] Decoding rows: 67%|██████▋ | 2/3 [00:03<00:01, 1.64s/it] [ComfyUI] Decoding rows: 100%|██████████| 3/3 [00:04<00:00, 1.41s/it] [ComfyUI] Decoding rows: 100%|██████████| 3/3 [00:04<00:00, 1.46s/it] [ComfyUI] [ComfyUI] Blending tiles: 0%| | 0/3 [00:00<?, ?it/s] [ComfyUI] Blending tiles: 33%|███▎ | 1/3 [00:00<00:00, 7.29it/s] Executing node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine [ComfyUI] Blending tiles: 100%|██████████| 3/3 [00:00<00:00, 17.09it/s] [ComfyUI] Prompt executed in 289.02 seconds outputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 20.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}} ==================================== HunyuanVideo_00001.png HunyuanVideo_00001.mp4
Prediction
lucataco/hunyuan-heygen-woman-2:923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4IDzgm6ws9f1nrme0cmc3fvzqr5jwStatusSucceededSourceWebHardwareH100Total durationCreatedInput
- crf
- 19
- steps
- 30
- width
- 960
- height
- 544
- prompt
- HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression
- lora_url
- flow_shift
- 9
- frame_rate
- 20
- num_frames
- 49
- force_offload
- lora_strength
- 0.9
- guidance_scale
- 6
- denoise_strength
- 1
{ "crf": 19, "steps": 30, "width": 960, "height": 544, "prompt": "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", "lora_url": "", "flow_shift": 9, "frame_rate": 20, "num_frames": 49, "force_offload": true, "lora_strength": 0.9, "guidance_scale": 6, "denoise_strength": 1 }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run lucataco/hunyuan-heygen-woman-2 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "lucataco/hunyuan-heygen-woman-2:923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4", { input: { crf: 19, steps: 30, width: 960, height: 544, prompt: "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", lora_url: "", flow_shift: 9, frame_rate: 20, num_frames: 49, force_offload: true, lora_strength: 0.9, guidance_scale: 6, denoise_strength: 1 } } ); // To access the file URL: console.log(output.url()); //=> "http://example.com" // To write the file to disk: fs.writeFile("my-image.png", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run lucataco/hunyuan-heygen-woman-2 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "lucataco/hunyuan-heygen-woman-2:923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4", input={ "crf": 19, "steps": 30, "width": 960, "height": 544, "prompt": "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", "lora_url": "", "flow_shift": 9, "frame_rate": 20, "num_frames": 49, "force_offload": True, "lora_strength": 0.9, "guidance_scale": 6, "denoise_strength": 1 } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Run lucataco/hunyuan-heygen-woman-2 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "lucataco/hunyuan-heygen-woman-2:923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4", "input": { "crf": 19, "steps": 30, "width": 960, "height": 544, "prompt": "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", "lora_url": "", "flow_shift": 9, "frame_rate": 20, "num_frames": 49, "force_offload": true, "lora_strength": 0.9, "guidance_scale": 6, "denoise_strength": 1 } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2025-01-13T18:18:17.338092Z", "created_at": "2025-01-13T18:13:46.893000Z", "data_removed": false, "error": null, "id": "zgm6ws9f1nrme0cmc3fvzqr5jw", "input": { "crf": 19, "steps": 30, "width": 960, "height": 544, "prompt": "HGW2 woman sitting on a beige couch in a well-decorated room. She is wearing a light-colored, long-sleeved turtleneck top and has long, straight brown hair. The room has a warm and cozy atmosphere, with soft lighting and a comfortable ambiance. The woman appears to be speaking or presenting something, as she is looking directly at the camera with a neutral expression", "lora_url": "", "flow_shift": 9, "frame_rate": 20, "num_frames": 49, "force_offload": true, "lora_strength": 0.9, "guidance_scale": 6, "denoise_strength": 1 }, "logs": "Random seed set to: 3626158721\nChecking inputs\n====================================\nChecking weights\n✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae\n✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models\n====================================\nRunning workflow\n[ComfyUI] got prompt\nExecuting node 7, title: HunyuanVideo VAE Loader, class type: HyVideoVAELoader\nExecuting node 16, title: (Down)Load HunyuanVideo TextEncoder, class type: DownloadAndLoadHyVideoTextEncoder\n[ComfyUI] Loading text encoder model (clipL) from: /src/ComfyUI/models/clip/clip-vit-large-patch14\n[ComfyUI] Text encoder to dtype: torch.float16\n[ComfyUI] Loading tokenizer (clipL) from: /src/ComfyUI/models/clip/clip-vit-large-patch14\n[ComfyUI] Loading text encoder model (llm) from: /src/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer\n[ComfyUI]\n[ComfyUI] Loading checkpoint shards: 0%| | 0/4 [00:00<?, ?it/s]\n[ComfyUI] Loading checkpoint shards: 25%|██▌ | 1/4 [00:00<00:01, 1.61it/s]\n[ComfyUI] Loading checkpoint shards: 50%|█████ | 2/4 [00:01<00:01, 1.63it/s]\n[ComfyUI] Loading checkpoint shards: 75%|███████▌ | 3/4 [00:01<00:00, 1.63it/s]\n[ComfyUI] Loading checkpoint shards: 100%|██████████| 4/4 [00:01<00:00, 2.40it/s]\n[ComfyUI] Loading checkpoint shards: 100%|██████████| 4/4 [00:01<00:00, 2.04it/s]\n[ComfyUI] Text encoder to dtype: torch.float16\n[ComfyUI] Loading tokenizer (llm) from: /src/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer\nExecuting node 30, title: HunyuanVideo TextEncode, class type: HyVideoTextEncode\n[ComfyUI] llm prompt attention_mask shape: torch.Size([1, 161]), masked tokens: 80\n[ComfyUI] clipL prompt attention_mask shape: torch.Size([1, 77]), masked tokens: 77\nExecuting node 41, title: HunyuanVideo Lora Select, class type: HyVideoLoraSelect\nExecuting node 1, title: HunyuanVideo Model Loader, class type: HyVideoModelLoader\n[ComfyUI] model_type FLOW\n[ComfyUI] Using accelerate to load and assign model weights to device...\n[ComfyUI] Loading LoRA: lora_comfyui with strength: 0.9\n[ComfyUI] Requested to load HyVideoModel\n[ComfyUI] Loading 1 new model\n[ComfyUI] loaded completely 0.0 12555.953247070312 True\n[ComfyUI] Input (height, width, video_length) = (544, 960, 49)\nExecuting node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler\n[ComfyUI] Sampling 49 frames in 13 latents at 960x544 with 30 inference steps\n[ComfyUI] Scheduler config: FrozenDict([('num_train_timesteps', 1000), ('shift', 9.0), ('reverse', True), ('solver', 'euler'), ('n_tokens', None), ('_use_default_values', ['n_tokens', 'num_train_timesteps'])])\n[ComfyUI]\n[ComfyUI] 0%| | 0/30 [00:00<?, ?it/s]\n[ComfyUI] 3%|▎ | 1/30 [00:04<02:00, 4.16s/it]\n[ComfyUI] 7%|▋ | 2/30 [00:09<02:18, 4.95s/it]\n[ComfyUI] 10%|█ | 3/30 [00:15<02:20, 5.20s/it]\n[ComfyUI] 13%|█▎ | 4/30 [00:20<02:18, 5.31s/it]\n[ComfyUI] 17%|█▋ | 5/30 [00:26<02:14, 5.38s/it]\n[ComfyUI] 20%|██ | 6/30 [00:31<02:10, 5.42s/it]\n[ComfyUI] 23%|██▎ | 7/30 [00:37<02:05, 5.44s/it]\n[ComfyUI] 27%|██▋ | 8/30 [00:42<02:00, 5.46s/it]\n[ComfyUI] 30%|███ | 9/30 [00:48<01:54, 5.47s/it]\n[ComfyUI] 33%|███▎ | 10/30 [00:53<01:49, 5.48s/it]\n[ComfyUI] 37%|███▋ | 11/30 [00:59<01:44, 5.48s/it]\n[ComfyUI] 40%|████ | 12/30 [01:04<01:38, 5.49s/it]\n[ComfyUI] 43%|████▎ | 13/30 [01:10<01:33, 5.49s/it]\n[ComfyUI] 47%|████▋ | 14/30 [01:15<01:27, 5.49s/it]\n[ComfyUI] 50%|█████ | 15/30 [01:21<01:22, 5.49s/it]\n[ComfyUI] 53%|█████▎ | 16/30 [01:26<01:16, 5.50s/it]\n[ComfyUI] 57%|█████▋ | 17/30 [01:32<01:11, 5.50s/it]\n[ComfyUI] 60%|██████ | 18/30 [01:37<01:05, 5.50s/it]\n[ComfyUI] 63%|██████▎ | 19/30 [01:43<01:00, 5.50s/it]\n[ComfyUI] 67%|██████▋ | 20/30 [01:48<00:54, 5.50s/it]\n[ComfyUI] 70%|███████ | 21/30 [01:54<00:49, 5.50s/it]\n[ComfyUI] 73%|███████▎ | 22/30 [01:59<00:43, 5.50s/it]\n[ComfyUI] 77%|███████▋ | 23/30 [02:05<00:38, 5.50s/it]\n[ComfyUI] 80%|████████ | 24/30 [02:10<00:32, 5.50s/it]\n[ComfyUI] 83%|████████▎ | 25/30 [02:16<00:27, 5.50s/it]\n[ComfyUI] 87%|████████▋ | 26/30 [02:21<00:21, 5.50s/it]\n[ComfyUI] 90%|█████████ | 27/30 [02:27<00:16, 5.50s/it]\n[ComfyUI] 93%|█████████▎| 28/30 [02:32<00:10, 5.50s/it]\n[ComfyUI] 97%|█████████▋| 29/30 [02:38<00:05, 5.50s/it]\n[ComfyUI] 100%|██████████| 30/30 [02:43<00:00, 5.50s/it]\n[ComfyUI] 100%|██████████| 30/30 [02:43<00:00, 5.45s/it]\n[ComfyUI] Allocated memory: memory=12.760 GB\n[ComfyUI] Max allocated memory: max_memory=18.839 GB\n[ComfyUI] Max reserved memory: max_reserved=20.719 GB\nExecuting node 5, title: HunyuanVideo Decode, class type: HyVideoDecode\n[ComfyUI]\n[ComfyUI] Decoding rows: 0%| | 0/3 [00:00<?, ?it/s]\n[ComfyUI] Decoding rows: 33%|███▎ | 1/3 [00:01<00:03, 1.52s/it]\n[ComfyUI] Decoding rows: 67%|██████▋ | 2/3 [00:03<00:01, 1.62s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 3/3 [00:04<00:00, 1.40s/it]\n[ComfyUI] Decoding rows: 100%|██████████| 3/3 [00:04<00:00, 1.45s/it]\n[ComfyUI]\n[ComfyUI] Blending tiles: 0%| | 0/3 [00:00<?, ?it/s]\n[ComfyUI] Blending tiles: 33%|███▎ | 1/3 [00:00<00:00, 7.83it/s]\nExecuting node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine\n[ComfyUI] Blending tiles: 100%|██████████| 3/3 [00:00<00:00, 18.12it/s]\n[ComfyUI] Prompt executed in 200.68 seconds\noutputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 20.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}}\n====================================\nHunyuanVideo_00001.png\nHunyuanVideo_00001.mp4", "metrics": { "predict_time": 209.816128387, "total_time": 270.445092 }, "output": "https://replicate.delivery/xezq/S0NO2cd0l2rbHRHtmKBaWflAgZj2IGvfhXJDRjKMFZMprwEUA/HunyuanVideo_00001.mp4", "started_at": "2025-01-13T18:14:47.521964Z", "status": "succeeded", "urls": { "stream": "https://stream.replicate.com/v1/files/bcwr-vvjieg7d33gciammm6prbzuxqpi3sb5wdddhjnkyyxnlept3zwoq", "get": "https://api.replicate.com/v1/predictions/zgm6ws9f1nrme0cmc3fvzqr5jw", "cancel": "https://api.replicate.com/v1/predictions/zgm6ws9f1nrme0cmc3fvzqr5jw/cancel" }, "version": "923b4f49c7a0a882abb89494ac38fd902ef60640f742eb478bae94f225439ab4" }
Generated inRandom seed set to: 3626158721 Checking inputs ==================================== Checking weights ✅ hunyuan_video_vae_bf16.safetensors exists in ComfyUI/models/vae ✅ hunyuan_video_720_fp8_e4m3fn.safetensors exists in ComfyUI/models/diffusion_models ==================================== Running workflow [ComfyUI] got prompt Executing node 7, title: HunyuanVideo VAE Loader, class type: HyVideoVAELoader Executing node 16, title: (Down)Load HunyuanVideo TextEncoder, class type: DownloadAndLoadHyVideoTextEncoder [ComfyUI] Loading text encoder model (clipL) from: /src/ComfyUI/models/clip/clip-vit-large-patch14 [ComfyUI] Text encoder to dtype: torch.float16 [ComfyUI] Loading tokenizer (clipL) from: /src/ComfyUI/models/clip/clip-vit-large-patch14 [ComfyUI] Loading text encoder model (llm) from: /src/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer [ComfyUI] [ComfyUI] Loading checkpoint shards: 0%| | 0/4 [00:00<?, ?it/s] [ComfyUI] Loading checkpoint shards: 25%|██▌ | 1/4 [00:00<00:01, 1.61it/s] [ComfyUI] Loading checkpoint shards: 50%|█████ | 2/4 [00:01<00:01, 1.63it/s] [ComfyUI] Loading checkpoint shards: 75%|███████▌ | 3/4 [00:01<00:00, 1.63it/s] [ComfyUI] Loading checkpoint shards: 100%|██████████| 4/4 [00:01<00:00, 2.40it/s] [ComfyUI] Loading checkpoint shards: 100%|██████████| 4/4 [00:01<00:00, 2.04it/s] [ComfyUI] Text encoder to dtype: torch.float16 [ComfyUI] Loading tokenizer (llm) from: /src/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer Executing node 30, title: HunyuanVideo TextEncode, class type: HyVideoTextEncode [ComfyUI] llm prompt attention_mask shape: torch.Size([1, 161]), masked tokens: 80 [ComfyUI] clipL prompt attention_mask shape: torch.Size([1, 77]), masked tokens: 77 Executing node 41, title: HunyuanVideo Lora Select, class type: HyVideoLoraSelect Executing node 1, title: HunyuanVideo Model Loader, class type: HyVideoModelLoader [ComfyUI] model_type FLOW [ComfyUI] Using accelerate to load and assign model weights to device... [ComfyUI] Loading LoRA: lora_comfyui with strength: 0.9 [ComfyUI] Requested to load HyVideoModel [ComfyUI] Loading 1 new model [ComfyUI] loaded completely 0.0 12555.953247070312 True [ComfyUI] Input (height, width, video_length) = (544, 960, 49) Executing node 3, title: HunyuanVideo Sampler, class type: HyVideoSampler [ComfyUI] Sampling 49 frames in 13 latents at 960x544 with 30 inference steps [ComfyUI] Scheduler config: FrozenDict([('num_train_timesteps', 1000), ('shift', 9.0), ('reverse', True), ('solver', 'euler'), ('n_tokens', None), ('_use_default_values', ['n_tokens', 'num_train_timesteps'])]) [ComfyUI] [ComfyUI] 0%| | 0/30 [00:00<?, ?it/s] [ComfyUI] 3%|▎ | 1/30 [00:04<02:00, 4.16s/it] [ComfyUI] 7%|▋ | 2/30 [00:09<02:18, 4.95s/it] [ComfyUI] 10%|█ | 3/30 [00:15<02:20, 5.20s/it] [ComfyUI] 13%|█▎ | 4/30 [00:20<02:18, 5.31s/it] [ComfyUI] 17%|█▋ | 5/30 [00:26<02:14, 5.38s/it] [ComfyUI] 20%|██ | 6/30 [00:31<02:10, 5.42s/it] [ComfyUI] 23%|██▎ | 7/30 [00:37<02:05, 5.44s/it] [ComfyUI] 27%|██▋ | 8/30 [00:42<02:00, 5.46s/it] [ComfyUI] 30%|███ | 9/30 [00:48<01:54, 5.47s/it] [ComfyUI] 33%|███▎ | 10/30 [00:53<01:49, 5.48s/it] [ComfyUI] 37%|███▋ | 11/30 [00:59<01:44, 5.48s/it] [ComfyUI] 40%|████ | 12/30 [01:04<01:38, 5.49s/it] [ComfyUI] 43%|████▎ | 13/30 [01:10<01:33, 5.49s/it] [ComfyUI] 47%|████▋ | 14/30 [01:15<01:27, 5.49s/it] [ComfyUI] 50%|█████ | 15/30 [01:21<01:22, 5.49s/it] [ComfyUI] 53%|█████▎ | 16/30 [01:26<01:16, 5.50s/it] [ComfyUI] 57%|█████▋ | 17/30 [01:32<01:11, 5.50s/it] [ComfyUI] 60%|██████ | 18/30 [01:37<01:05, 5.50s/it] [ComfyUI] 63%|██████▎ | 19/30 [01:43<01:00, 5.50s/it] [ComfyUI] 67%|██████▋ | 20/30 [01:48<00:54, 5.50s/it] [ComfyUI] 70%|███████ | 21/30 [01:54<00:49, 5.50s/it] [ComfyUI] 73%|███████▎ | 22/30 [01:59<00:43, 5.50s/it] [ComfyUI] 77%|███████▋ | 23/30 [02:05<00:38, 5.50s/it] [ComfyUI] 80%|████████ | 24/30 [02:10<00:32, 5.50s/it] [ComfyUI] 83%|████████▎ | 25/30 [02:16<00:27, 5.50s/it] [ComfyUI] 87%|████████▋ | 26/30 [02:21<00:21, 5.50s/it] [ComfyUI] 90%|█████████ | 27/30 [02:27<00:16, 5.50s/it] [ComfyUI] 93%|█████████▎| 28/30 [02:32<00:10, 5.50s/it] [ComfyUI] 97%|█████████▋| 29/30 [02:38<00:05, 5.50s/it] [ComfyUI] 100%|██████████| 30/30 [02:43<00:00, 5.50s/it] [ComfyUI] 100%|██████████| 30/30 [02:43<00:00, 5.45s/it] [ComfyUI] Allocated memory: memory=12.760 GB [ComfyUI] Max allocated memory: max_memory=18.839 GB [ComfyUI] Max reserved memory: max_reserved=20.719 GB Executing node 5, title: HunyuanVideo Decode, class type: HyVideoDecode [ComfyUI] [ComfyUI] Decoding rows: 0%| | 0/3 [00:00<?, ?it/s] [ComfyUI] Decoding rows: 33%|███▎ | 1/3 [00:01<00:03, 1.52s/it] [ComfyUI] Decoding rows: 67%|██████▋ | 2/3 [00:03<00:01, 1.62s/it] [ComfyUI] Decoding rows: 100%|██████████| 3/3 [00:04<00:00, 1.40s/it] [ComfyUI] Decoding rows: 100%|██████████| 3/3 [00:04<00:00, 1.45s/it] [ComfyUI] [ComfyUI] Blending tiles: 0%| | 0/3 [00:00<?, ?it/s] [ComfyUI] Blending tiles: 33%|███▎ | 1/3 [00:00<00:00, 7.83it/s] Executing node 34, title: Video Combine 🎥🅥🅗🅢, class type: VHS_VideoCombine [ComfyUI] Blending tiles: 100%|██████████| 3/3 [00:00<00:00, 18.12it/s] [ComfyUI] Prompt executed in 200.68 seconds outputs: {'34': {'gifs': [{'filename': 'HunyuanVideo_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 20.0, 'workflow': 'HunyuanVideo_00001.png', 'fullpath': '/tmp/outputs/HunyuanVideo_00001.mp4'}]}} ==================================== HunyuanVideo_00001.png HunyuanVideo_00001.mp4
Want to make some of these yourself?
Run this model