hvision-nku/storydiffusion
Consistent Self-Attention for Long-Range Image and Video Generation
Prediction
hvision-nku/storydiffusion:39c85f15
ID: m69pbmv63drgm0cf8qnskc27nr
Status: Succeeded
Source: Web
Hardware: A40 (Large)
Input
- num_ids: 3
- sd_model: Unstable
- num_steps: 25
- style_name: Japanese Anime
- comic_style: Classic Comic Style
- image_width: 768
- image_height: 768
- sa32_setting: 0.5
- sa64_setting: 0.5
- output_format: webp
- guidance_scale: 5
- output_quality: 80
- negative_prompt: bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs
- comic_description: at home, read new paper #at home, The newspaper says there is a treasure house in the forest. on the road, near the forest [NC] The car on the road, near the forest #He drives to the forest in search of treasure. [NC]A tiger appeared in the forest, at night very frightened, open mouth, in the forest, at night running very fast, in the forest, at night [NC] A house in the forest, at night #Suddenly, he discovers the treasure house! in the house filled with treasure, laughing, at night #He is overjoyed inside the house.
- style_strength_ratio: 20
- character_description: a man, wearing black suit
{ "num_ids": 3, "sd_model": "Unstable", "num_steps": 25, "style_name": "Japanese Anime", "comic_style": "Classic Comic Style", "image_width": 768, "image_height": 768, "sa32_setting": 0.5, "sa64_setting": 0.5, "output_format": "webp", "guidance_scale": 5, "output_quality": 80, "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs", "comic_description": "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\non the road, near the forest\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\n[NC]A tiger appeared in the forest, at night \nvery frightened, open mouth, in the forest, at night\nrunning very fast, in the forest, at night\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house.", "style_strength_ratio": 20, "character_description": "a man, wearing black suit" }
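The logs recorded further down this page suggest how comic_description is interpreted: one panel per line, a [NC] marker on panels that should not include the character, and a # separating the image prompt from the caption. The sketch below reproduces the final prompt list seen in those logs. It is inferred from the logged output only, not taken from the model's source; the function name build_prompts is hypothetical, and returning an empty caption for panels without a # is an assumption.

```python
def build_prompts(character: str, description: str) -> tuple[list[str], list[str]]:
    """Split a comic description into per-panel prompts and captions.

    Inferred from the logs on this page, not from the model's code.
    """
    prompts: list[str] = []
    captions: list[str] = []
    for line in description.splitlines():
        if not line.strip():
            continue  # skip blank lines
        # Text before '#' is the image prompt, text after it is the caption.
        scene, _, caption = line.partition("#")
        if "[NC]" in scene:
            # "No character" panel: drop the marker, do not prepend the character.
            prompt = scene.replace("[NC]", "")
        else:
            # The logs show the character joined with a bare comma.
            prompt = f"{character},{scene}"
        prompts.append(prompt.strip())
        captions.append(caption.strip())  # assumption: empty when no '#'
    return prompts, captions


if __name__ == "__main__":
    story = (
        "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\n"
        "[NC]A tiger appeared in the forest, at night \n"
        "running very fast, in the forest, at night"
    )
    for p in build_prompts("a man, wearing black suit", story)[0]:
        print(p)
```

With the input above, each non-[NC] panel is prefixed by the character description, which matches the third list printed in the logs.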
Install Replicate’s Node.js client library:
    npm install replicate
Set the REPLICATE_API_TOKEN environment variable:
    export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import and set up the client:
    import Replicate from "replicate";

    const replicate = new Replicate({
      auth: process.env.REPLICATE_API_TOKEN,
    });
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    const output = await replicate.run(
      "hvision-nku/storydiffusion:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
      {
        input: {
          num_ids: 3,
          sd_model: "Unstable",
          num_steps: 25,
          style_name: "Japanese Anime",
          comic_style: "Classic Comic Style",
          image_width: 768,
          image_height: 768,
          sa32_setting: 0.5,
          sa64_setting: 0.5,
          output_format: "webp",
          guidance_scale: 5,
          output_quality: 80,
          negative_prompt: "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
          comic_description: "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\non the road, near the forest\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\n[NC]A tiger appeared in the forest, at night \nvery frightened, open mouth, in the forest, at night\nrunning very fast, in the forest, at night\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house.",
          style_strength_ratio: 20,
          character_description: "a man, wearing black suit"
        }
      }
    );
    console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
    pip install replicate
Set the REPLICATE_API_TOKEN environment variable:
    export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import the client:
    import replicate
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    output = replicate.run(
        "hvision-nku/storydiffusion:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
        input={
            "num_ids": 3,
            "sd_model": "Unstable",
            "num_steps": 25,
            "style_name": "Japanese Anime",
            "comic_style": "Classic Comic Style",
            "image_width": 768,
            "image_height": 768,
            "sa32_setting": 0.5,
            "sa64_setting": 0.5,
            "output_format": "webp",
            "guidance_scale": 5,
            "output_quality": 80,
            "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
            "comic_description": "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\non the road, near the forest\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\n[NC]A tiger appeared in the forest, at night \nvery frightened, open mouth, in the forest, at night\nrunning very fast, in the forest, at night\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house.",
            "style_strength_ratio": 20,
            "character_description": "a man, wearing black suit"
        }
    )
    print(output)
To learn more, take a look at the guide on getting started with Python.
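The prediction recorded on this page returned an output with a "comic" URL and a list of "individual_images" URLs. A minimal sketch of saving those files locally follows, assuming the output is a plain dict of URL strings as shown in the recorded JSON; depending on the client version the values may be file-like objects instead, in which case this would need adapting. The helper names plan_downloads and save_all are hypothetical.

```python
from pathlib import Path
from urllib.parse import urlparse
from urllib.request import urlretrieve


def plan_downloads(output: dict, dest_dir: str) -> list[tuple[str, Path]]:
    """Pair every output URL with a local path named after the remote file."""
    dest = Path(dest_dir)
    # Assumes the two keys seen in this page's recorded output.
    urls = [output["comic"], *output["individual_images"]]
    return [(url, dest / Path(urlparse(url).path).name) for url in urls]


def save_all(output: dict, dest_dir: str) -> list[Path]:
    """Download each planned file; performs network access."""
    Path(dest_dir).mkdir(parents=True, exist_ok=True)
    saved = []
    for url, path in plan_downloads(output, dest_dir):
        urlretrieve(url, str(path))
        saved.append(path)
    return saved
```

Calling save_all(output, "story") after replicate.run would then write comic.webp and out-0.webp through out-7.webp into a story directory, provided the delivery URLs are still valid.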
Set the REPLICATE_API_TOKEN environment variable:
    export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    curl -s -X POST \
      -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
      -H "Content-Type: application/json" \
      -H "Prefer: wait" \
      -d $'{
        "version": "39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
        "input": {
          "num_ids": 3,
          "sd_model": "Unstable",
          "num_steps": 25,
          "style_name": "Japanese Anime",
          "comic_style": "Classic Comic Style",
          "image_width": 768,
          "image_height": 768,
          "sa32_setting": 0.5,
          "sa64_setting": 0.5,
          "output_format": "webp",
          "guidance_scale": 5,
          "output_quality": 80,
          "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
          "comic_description": "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\\non the road, near the forest\\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\\n[NC]A tiger appeared in the forest, at night \\nvery frightened, open mouth, in the forest, at night\\nrunning very fast, in the forest, at night\\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house.",
          "style_strength_ratio": 20,
          "character_description": "a man, wearing black suit"
        }
      }' \
      https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
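For readers who prefer Python without the client library, the same HTTP call can be sketched with the standard library. The endpoint, headers, and version hash are the ones from the curl command; the function name build_request is hypothetical, and the sketch only constructs the request so that it can be inspected before anything is sent.

```python
import json
import urllib.request

API_URL = "https://api.replicate.com/v1/predictions"
VERSION = "39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2"


def build_request(inputs: dict, token: str) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl example."""
    body = json.dumps({"version": VERSION, "input": inputs}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
            "Prefer": "wait",  # same header as the curl example
        },
        method="POST",
    )
```

To send it, pass the result to urllib.request.urlopen and decode the response with json.load; the token would normally come from os.environ["REPLICATE_API_TOKEN"].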
You can run this model locally using Cog. First, install Cog:
    brew install cog
If you don’t have Homebrew, there are other installation options available.
Run this to download the model and run it in your local environment:
    cog predict r8.im/cjwbw/storydiffusion@sha256:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2 \
      -i 'num_ids=3' \
      -i 'sd_model="Unstable"' \
      -i 'num_steps=25' \
      -i 'style_name="Japanese Anime"' \
      -i 'comic_style="Classic Comic Style"' \
      -i 'image_width=768' \
      -i 'image_height=768' \
      -i 'sa32_setting=0.5' \
      -i 'sa64_setting=0.5' \
      -i 'output_format="webp"' \
      -i 'guidance_scale=5' \
      -i 'output_quality=80' \
      -i 'negative_prompt="bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs"' \
      -i $'comic_description="at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\\non the road, near the forest\\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\\n[NC]A tiger appeared in the forest, at night \\nvery frightened, open mouth, in the forest, at night\\nrunning very fast, in the forest, at night\\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house."' \
      -i 'style_strength_ratio=20' \
      -i 'character_description="a man, wearing black suit"'
To learn more, take a look at the Cog documentation.
Run this to download the model and run it in your local environment:
    docker run -d -p 5000:5000 --gpus=all r8.im/cjwbw/storydiffusion@sha256:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2

    curl -s -X POST \
      -H "Content-Type: application/json" \
      -d $'{
        "input": {
          "num_ids": 3,
          "sd_model": "Unstable",
          "num_steps": 25,
          "style_name": "Japanese Anime",
          "comic_style": "Classic Comic Style",
          "image_width": 768,
          "image_height": 768,
          "sa32_setting": 0.5,
          "sa64_setting": 0.5,
          "output_format": "webp",
          "guidance_scale": 5,
          "output_quality": 80,
          "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
          "comic_description": "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\\non the road, near the forest\\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\\n[NC]A tiger appeared in the forest, at night \\nvery frightened, open mouth, in the forest, at night\\nrunning very fast, in the forest, at night\\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house.",
          "style_strength_ratio": 20,
          "character_description": "a man, wearing black suit"
        }
      }' \
      http://localhost:5000/predictions
To learn more, take a look at the Cog documentation.
Output
{ "completed_at": "2024-05-04T23:24:09.982389Z", "created_at": "2024-05-04T23:23:26.107000Z", "data_removed": false, "error": null, "id": "m69pbmv63drgm0cf8qnskc27nr", "input": { "num_ids": 3, "sd_model": "Unstable", "num_steps": 25, "style_name": "Japanese Anime", "comic_style": "Classic Comic Style", "image_width": 768, "image_height": 768, "sa32_setting": 0.5, "sa64_setting": 0.5, "output_format": "webp", "guidance_scale": 5, "output_quality": 80, "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs", "comic_description": "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\non the road, near the forest\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\n[NC]A tiger appeared in the forest, at night \nvery frightened, open mouth, in the forest, at night\nrunning very fast, in the forest, at night\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house.", "style_strength_ratio": 20, "character_description": "a man, wearing black suit" }, "logs": "['at home, read new paper #at home, The newspaper says there is a treasure house in the forest.', 'on the road, near the forest', '[NC] The car on the road, near the forest #He drives to the forest in search of treasure.', '[NC]A tiger appeared in the forest, at night ', 'very frightened, open mouth, in the forest, at night', 'running very fast, in the forest, at night', '[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!', 'in the house filled with treasure, laughing, at night #He is overjoyed inside the house.']\n['a man, 
wearing black suit,at home, read new paper #at home, The newspaper says there is a treasure house in the forest.', 'a man, wearing black suit,on the road, near the forest', ' The car on the road, near the forest #He drives to the forest in search of treasure.', 'A tiger appeared in the forest, at night ', 'a man, wearing black suit,very frightened, open mouth, in the forest, at night', 'a man, wearing black suit,running very fast, in the forest, at night', ' A house in the forest, at night #Suddenly, he discovers the treasure house!', 'a man, wearing black suit,in the house filled with treasure, laughing, at night #He is overjoyed inside the house.']\n['a man, wearing black suit,at home, read new paper', 'a man, wearing black suit,on the road, near the forest', 'The car on the road, near the forest', 'A tiger appeared in the forest, at night', 'a man, wearing black suit,very frightened, open mouth, in the forest, at night', 'a man, wearing black suit,running very fast, in the forest, at night', 'A house in the forest, at night', 'a man, wearing black suit,in the house filled with treasure, laughing, at night']\nUsing seed: 58753\nSuccessfully load paired self-attention\nNumber of the processor : 36\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:07, 3.11it/s]\n 8%|▊ | 2/25 [00:00<00:07, 3.10it/s]\n 12%|█▏ | 3/25 [00:00<00:07, 3.09it/s]\n 16%|█▌ | 4/25 [00:01<00:06, 3.08it/s]\n 20%|██ | 5/25 [00:01<00:06, 3.08it/s]\n 24%|██▍ | 6/25 [00:02<00:07, 2.60it/s]\n 28%|██▊ | 7/25 [00:02<00:07, 2.35it/s]\n 32%|███▏ | 8/25 [00:03<00:07, 2.23it/s]\n 36%|███▌ | 9/25 [00:03<00:07, 2.23it/s]\n 40%|████ | 10/25 [00:04<00:06, 2.19it/s]\n 44%|████▍ | 11/25 [00:04<00:06, 2.13it/s]\n 48%|████▊ | 12/25 [00:05<00:06, 2.12it/s]\n 52%|█████▏ | 13/25 [00:05<00:05, 2.04it/s]\n 56%|█████▌ | 14/25 [00:06<00:05, 2.01it/s]\n 60%|██████ | 15/25 [00:06<00:05, 1.97it/s]\n 64%|██████▍ | 16/25 [00:07<00:04, 2.00it/s]\n 68%|██████▊ | 17/25 [00:07<00:03, 2.06it/s]\n 72%|███████▏ | 18/25 
[00:08<00:03, 2.05it/s]\n 76%|███████▌ | 19/25 [00:08<00:02, 2.06it/s]\n 80%|████████ | 20/25 [00:08<00:02, 2.08it/s]\n 84%|████████▍ | 21/25 [00:09<00:01, 2.03it/s]\n 88%|████████▊ | 22/25 [00:10<00:01, 1.97it/s]\n 92%|█████████▏| 23/25 [00:10<00:01, 1.96it/s]\n 96%|█████████▌| 24/25 [00:11<00:00, 1.97it/s]\n100%|██████████| 25/25 [00:11<00:00, 1.93it/s]\n100%|██████████| 25/25 [00:11<00:00, 2.15it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:03, 6.46it/s]\n 8%|▊ | 2/25 [00:00<00:03, 6.44it/s]\n 12%|█▏ | 3/25 [00:00<00:03, 6.46it/s]\n 16%|█▌ | 4/25 [00:00<00:03, 6.45it/s]\n 20%|██ | 5/25 [00:00<00:03, 6.44it/s]\n 24%|██▍ | 6/25 [00:00<00:03, 5.70it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 5.33it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 5.27it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 5.09it/s]\n 40%|████ | 10/25 [00:01<00:02, 5.08it/s]\n 44%|████▍ | 11/25 [00:02<00:02, 4.98it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 5.07it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.97it/s]\n 56%|█████▌ | 14/25 [00:02<00:02, 4.78it/s]\n 60%|██████ | 15/25 [00:02<00:02, 4.86it/s]\n 64%|██████▍ | 16/25 [00:03<00:01, 4.80it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.79it/s]\n 72%|███████▏ | 18/25 [00:03<00:01, 4.94it/s]\n 76%|███████▌ | 19/25 [00:03<00:01, 4.89it/s]\n 80%|████████ | 20/25 [00:03<00:01, 4.98it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.73it/s]\n 88%|████████▊ | 22/25 [00:04<00:00, 4.59it/s]\n 92%|█████████▏| 23/25 [00:04<00:00, 4.58it/s]\n 96%|█████████▌| 24/25 [00:04<00:00, 4.48it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.48it/s]\n100%|██████████| 25/25 [00:05<00:00, 5.00it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:03, 6.51it/s]\n 8%|▊ | 2/25 [00:00<00:03, 6.48it/s]\n 12%|█▏ | 3/25 [00:00<00:03, 6.44it/s]\n 16%|█▌ | 4/25 [00:00<00:03, 6.42it/s]\n 20%|██ | 5/25 [00:00<00:03, 6.42it/s]\n 24%|██▍ | 6/25 [00:00<00:03, 5.99it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 5.59it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 5.15it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 5.12it/s]\n 40%|████ | 10/25 
[00:01<00:02, 5.13it/s]\n 44%|████▍ | 11/25 [00:01<00:02, 5.06it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 4.99it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.95it/s]\n 56%|█████▌ | 14/25 [00:02<00:02, 4.94it/s]\n 60%|██████ | 15/25 [00:02<00:02, 4.89it/s]\n 64%|██████▍ | 16/25 [00:03<00:01, 4.82it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.95it/s]\n 72%|███████▏ | 18/25 [00:03<00:01, 4.89it/s]\n 76%|███████▌ | 19/25 [00:03<00:01, 4.73it/s]\n 80%|████████ | 20/25 [00:03<00:01, 4.67it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.54it/s]\n 88%|████████▊ | 22/25 [00:04<00:00, 4.44it/s]\n 92%|█████████▏| 23/25 [00:04<00:00, 4.51it/s]\n 96%|█████████▌| 24/25 [00:04<00:00, 4.44it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.39it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.97it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:03, 6.52it/s]\n 8%|▊ | 2/25 [00:00<00:03, 6.46it/s]\n 12%|█▏ | 3/25 [00:00<00:03, 6.43it/s]\n 16%|█▌ | 4/25 [00:00<00:03, 6.42it/s]\n 20%|██ | 5/25 [00:00<00:03, 6.42it/s]\n 24%|██▍ | 6/25 [00:00<00:03, 5.95it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 5.50it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 5.28it/s]\n 36%|███▌ | 9/25 [00:01<00:02, 5.35it/s]\n 40%|████ | 10/25 [00:01<00:02, 5.02it/s]\n 44%|████▍ | 11/25 [00:02<00:02, 4.83it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 4.83it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.89it/s]\n 56%|█████▌ | 14/25 [00:02<00:02, 4.90it/s]\n 60%|██████ | 15/25 [00:02<00:02, 4.77it/s]\n 64%|██████▍ | 16/25 [00:03<00:01, 4.72it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.94it/s]\n 72%|███████▏ | 18/25 [00:03<00:01, 4.94it/s]\n 76%|███████▌ | 19/25 [00:03<00:01, 4.78it/s]\n 80%|████████ | 20/25 [00:03<00:01, 4.66it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.62it/s]\n 88%|████████▊ | 22/25 [00:04<00:00, 4.52it/s]\n 92%|█████████▏| 23/25 [00:04<00:00, 4.42it/s]\n 96%|█████████▌| 24/25 [00:04<00:00, 4.39it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.36it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.94it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 
[00:00<00:03, 6.49it/s]\n 8%|▊ | 2/25 [00:00<00:03, 6.43it/s]\n 12%|█▏ | 3/25 [00:00<00:03, 6.42it/s]\n 16%|█▌ | 4/25 [00:00<00:03, 6.42it/s]\n 20%|██ | 5/25 [00:00<00:03, 6.42it/s]\n 24%|██▍ | 6/25 [00:00<00:03, 5.68it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 5.58it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 5.31it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 5.07it/s]\n 40%|████ | 10/25 [00:01<00:02, 5.06it/s]\n 44%|████▍ | 11/25 [00:01<00:02, 5.14it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 5.06it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 5.08it/s]\n 56%|█████▌ | 14/25 [00:02<00:02, 5.08it/s]\n 60%|██████ | 15/25 [00:02<00:02, 4.97it/s]\n 64%|██████▍ | 16/25 [00:02<00:01, 4.97it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.83it/s]\n 72%|███████▏ | 18/25 [00:03<00:01, 4.72it/s]\n 76%|███████▌ | 19/25 [00:03<00:01, 4.70it/s]\n 80%|████████ | 20/25 [00:03<00:01, 4.76it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.64it/s]\n 88%|████████▊ | 22/25 [00:04<00:00, 4.60it/s]\n 92%|█████████▏| 23/25 [00:04<00:00, 4.54it/s]\n 96%|█████████▌| 24/25 [00:04<00:00, 4.49it/s]\n100%|██████████| 25/25 [00:04<00:00, 4.59it/s]\n100%|██████████| 25/25 [00:04<00:00, 5.03it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:03, 6.48it/s]\n 8%|▊ | 2/25 [00:00<00:03, 6.44it/s]\n 12%|█▏ | 3/25 [00:00<00:03, 6.42it/s]\n 16%|█▌ | 4/25 [00:00<00:03, 6.42it/s]\n 20%|██ | 5/25 [00:00<00:03, 6.41it/s]\n 24%|██▍ | 6/25 [00:00<00:03, 5.72it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 5.46it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 5.19it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 5.16it/s]\n 40%|████ | 10/25 [00:01<00:02, 5.12it/s]\n 44%|████▍ | 11/25 [00:01<00:02, 5.31it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 5.03it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 5.00it/s]\n 56%|█████▌ | 14/25 [00:02<00:02, 4.92it/s]\n 60%|██████ | 15/25 [00:02<00:02, 4.98it/s]\n 64%|██████▍ | 16/25 [00:03<00:01, 4.95it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.84it/s]\n 72%|███████▏ | 18/25 [00:03<00:01, 4.75it/s]\n 76%|███████▌ | 19/25 [00:03<00:01, 4.83it/s]\n 80%|████████ | 
20/25 [00:03<00:01, 4.96it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.75it/s]\n 88%|████████▊ | 22/25 [00:04<00:00, 4.58it/s]\n 92%|█████████▏| 23/25 [00:04<00:00, 4.55it/s]\n 96%|█████████▌| 24/25 [00:04<00:00, 4.52it/s]\n100%|██████████| 25/25 [00:04<00:00, 4.47it/s]\n100%|██████████| 25/25 [00:04<00:00, 5.02it/s]\n4 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85B45EB850>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85B436F5D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A5916450>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A5916C10>]]\n0 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85B45EB850>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85B436F5D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A5916450>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A5916C10>], [<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7D82950>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A575A6D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C18790>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C18BD0>]]\n[[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85B45EB850>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85B436F5D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A5916450>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A5916C10>], [<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7D82950>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A575A6D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C18790>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C18BD0>]]\n1 (7, 650)\n0 (124, 721)\n1 (56, 636)\n0 (89, 717)", "metrics": { "predict_time": 43.83848, "total_time": 43.875389 }, "output": { "comic": "https://replicate.delivery/pbxt/3YTlIyPjEKrkI1aXozvAuaw8Z4fS1nVaiWbfbk78TdUWWHxSA/comic.webp", "individual_images": [ "https://replicate.delivery/pbxt/5fFxtbwTjqwfeINf9ELZyH7RhndQGmeuE9JBI7jFIFq5y6IWC/out-0.webp", 
"https://replicate.delivery/pbxt/TYsTSU7VHRooMlhStPdK6W5YUmtETxo7CPcCj5QDrPOm1RsE/out-1.webp", "https://replicate.delivery/pbxt/acrWGHqqtqKaCpmjehzeFcghZj3YxmOieLxiTmz0Z1BxsOilA/out-2.webp", "https://replicate.delivery/pbxt/3PNbSoXffrnVCU0SW4SulmtTPeyWls6breoZ3dsEJdrjZdELB/out-3.webp", "https://replicate.delivery/pbxt/stdforbmgz10LSfOT7EJuS5yPFv5jn006HAKRJefgqIhZdELB/out-4.webp", "https://replicate.delivery/pbxt/QbNDf0CycEzwDKbSpscNBXLfic3bvTmWUcGX1AfdIEkysOilA/out-5.webp", "https://replicate.delivery/pbxt/hTuyxKCKY2JbGBnVCv2RzXOXSvknpGf2bexActYGJZDZWHxSA/out-6.webp", "https://replicate.delivery/pbxt/DKBlGMFJeWz0H6EOXIhtar6SMusbxN86bciM3xzS8ejZWHxSA/out-7.webp" ] }, "started_at": "2024-05-04T23:23:26.143909Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/m69pbmv63drgm0cf8qnskc27nr", "cancel": "https://api.replicate.com/v1/predictions/m69pbmv63drgm0cf8qnskc27nr/cancel" }, "version": "39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2" }
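The "metrics" values in the JSON above appear to follow directly from its timestamps: total_time matches completed_at minus created_at, and predict_time matches completed_at minus started_at. The short check below works this out for the recorded prediction. That relationship is an observation about this one record, not a documented guarantee, and the helper name seconds_between is hypothetical.

```python
from datetime import datetime


def seconds_between(start: str, end: str) -> float:
    """Difference in seconds between two ISO 8601 timestamps ending in 'Z'."""
    # Replace the 'Z' suffix so fromisoformat also works on older Python versions.
    t0 = datetime.fromisoformat(start.replace("Z", "+00:00"))
    t1 = datetime.fromisoformat(end.replace("Z", "+00:00"))
    return (t1 - t0).total_seconds()


created_at = "2024-05-04T23:23:26.107000Z"
started_at = "2024-05-04T23:23:26.143909Z"
completed_at = "2024-05-04T23:24:09.982389Z"

total_time = seconds_between(created_at, completed_at)    # compare with metrics.total_time
predict_time = seconds_between(started_at, completed_at)  # compare with metrics.predict_time
print(total_time, predict_time)
```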
Prediction
hvision-nku/storydiffusion:39c85f15
ID: m5dwv3cfvhrgg0cf8qtv7yhvc4
Status: Succeeded
Source: Web
Hardware: A40 (Large)
Input
- num_ids: 4
- sd_model: RealVision
- num_steps: 25
- style_name: Line art
- comic_style: Classic Comic Style
- image_width: 768
- image_height: 768
- sa32_setting: 0.3
- sa64_setting: 0.5
- output_format: webp
- guidance_scale: 5
- output_quality: 80
- negative_prompt: bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs
- comic_description: at home #at home, began to go to drawing sitting alone on a park bench. reading a book on a park bench. [NC]A squirrel approaches, peeking over the bench. look around in the park. # She looks around and enjoys the beauty of nature. [NC]leaf falls from the tree, landing on the sketchbook. picks up the leaf, examining its details closely. [NC]The brown squirrel appear. is very happy # She is very happy to see the squirrel again [NC]The brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker
- style_strength_ratio: 20
- character_description: a girl, wearing white shirt, black skirt, black tie, yellow hair
{
  "num_ids": 4,
  "sd_model": "RealVision",
  "num_steps": 25,
  "style_name": "Line art",
  "comic_style": "Classic Comic Style",
  "image_width": 768,
  "image_height": 768,
  "sa32_setting": 0.3,
  "sa64_setting": 0.5,
  "output_format": "webp",
  "guidance_scale": 5,
  "output_quality": 80,
  "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
  "comic_description": "at home #at home, began to go to drawing\nsitting alone on a park bench.\nreading a book on a park bench.\n[NC]A squirrel approaches, peeking over the bench. \nlook around in the park. # She looks around and enjoys the beauty of nature.\n[NC]leaf falls from the tree, landing on the sketchbook.\npicks up the leaf, examining its details closely.\n[NC]The brown squirrel appear.\nis very happy # She is very happy to see the squirrel again\n[NC]The brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker",
  "style_strength_ratio": 20,
  "character_description": "a girl, wearing white shirt, black skirt, black tie, yellow hair"
}
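The `logs` field of this prediction's output prints the story lines three times: as entered, with the character description prefixed, and with the captions removed. Judging only from those logs, `comic_description` seems to hold one frame per line, `[NC]` seems to mark a frame rendered without the main character, and text after `#` seems to be a caption rather than part of the prompt. The Python sketch below mirrors that inferred behavior. `build_prompts` is a hypothetical helper, not the model's actual code, and returning `None` for a missing caption is an assumption.

```python
def build_prompts(character_description, comic_description):
    """Return (prompts, captions), one entry per story frame.

    Inferred from the prediction logs; not the model's own implementation.
    """
    prompts, captions = [], []
    for line in comic_description.split("\n"):
        if "[NC]" in line:
            # "No character" frame: drop the marker, do not add the description.
            text = line.replace("[NC]", "")
        else:
            # The logs show the description joined with a bare comma.
            text = character_description + "," + line
        # Everything after the first "#" is treated as the caption.
        prompt, _, caption = text.partition("#")
        prompts.append(prompt.strip())
        captions.append(caption.strip() or None)  # None when absent (assumption)
    return prompts, captions


character = "a girl, wearing white shirt, black skirt, black tie, yellow hair"
story = (
    "at home #at home, began to go to drawing\n"
    "[NC]A squirrel approaches, peeking over the bench. \n"
    "is very happy # She is very happy to see the squirrel again"
)
prompts, captions = build_prompts(character, story)
for prompt, caption in zip(prompts, captions):
    print(repr(prompt), "|", repr(caption))
```

For the three sample lines, the resulting prompts match the corresponding entries in the third list printed in the logs.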
Install Replicate’s Node.js client library:
    npm install replicate
Set the REPLICATE_API_TOKEN environment variable:
    export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import and set up the client:
    import Replicate from "replicate";

    const replicate = new Replicate({
      auth: process.env.REPLICATE_API_TOKEN,
    });
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    const output = await replicate.run(
      "hvision-nku/storydiffusion:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
      {
        input: {
          num_ids: 4,
          sd_model: "RealVision",
          num_steps: 25,
          style_name: "Line art",
          comic_style: "Classic Comic Style",
          image_width: 768,
          image_height: 768,
          sa32_setting: 0.3,
          sa64_setting: 0.5,
          output_format: "webp",
          guidance_scale: 5,
          output_quality: 80,
          negative_prompt: "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
          comic_description: "at home #at home, began to go to drawing\nsitting alone on a park bench.\nreading a book on a park bench.\n[NC]A squirrel approaches, peeking over the bench. \nlook around in the park. # She looks around and enjoys the beauty of nature.\n[NC]leaf falls from the tree, landing on the sketchbook.\npicks up the leaf, examining its details closely.\n[NC]The brown squirrel appear.\nis very happy # She is very happy to see the squirrel again\n[NC]The brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker",
          style_strength_ratio: 20,
          character_description: "a girl, wearing white shirt, black skirt, black tie, yellow hair"
        }
      }
    );
    console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
    pip install replicate
Set the REPLICATE_API_TOKEN environment variable:
    export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import the client:
    import replicate
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    output = replicate.run(
        "hvision-nku/storydiffusion:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
        input={
            "num_ids": 4,
            "sd_model": "RealVision",
            "num_steps": 25,
            "style_name": "Line art",
            "comic_style": "Classic Comic Style",
            "image_width": 768,
            "image_height": 768,
            "sa32_setting": 0.3,
            "sa64_setting": 0.5,
            "output_format": "webp",
            "guidance_scale": 5,
            "output_quality": 80,
            "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
            "comic_description": "at home #at home, began to go to drawing\nsitting alone on a park bench.\nreading a book on a park bench.\n[NC]A squirrel approaches, peeking over the bench. \nlook around in the park. # She looks around and enjoys the beauty of nature.\n[NC]leaf falls from the tree, landing on the sketchbook.\npicks up the leaf, examining its details closely.\n[NC]The brown squirrel appear.\nis very happy # She is very happy to see the squirrel again\n[NC]The brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker",
            "style_strength_ratio": 20,
            "character_description": "a girl, wearing white shirt, black skirt, black tie, yellow hair"
        }
    )
    print(output)
To learn more, take a look at the guide on getting started with Python.
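The prediction output shown on this page is an object with a `comic` URL and an `individual_images` list of URLs. As a rough sketch, assuming `output` arrives as a plain dict of URL strings in that shape (the client may wrap files differently depending on its version, which is not handled here), the files could be mapped to local names and saved with the standard library. `output_files` and `download_all` are hypothetical helper names, and the target directory is an arbitrary choice.

```python
from pathlib import Path, PurePosixPath
from urllib.parse import urlparse
from urllib.request import urlretrieve


def output_files(output):
    """Pair every output URL with the filename at the end of its path."""
    urls = [output["comic"], *output["individual_images"]]
    return [(url, PurePosixPath(urlparse(url).path).name) for url in urls]


def download_all(output, target_dir="storydiffusion_out"):
    """Save the comic and the individual frames into target_dir."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    for url, name in output_files(output):
        urlretrieve(url, str(target / name))


# Shape taken from the output of the prediction on this page (list shortened).
example_output = {
    "comic": "https://replicate.delivery/pbxt/kb51zJAT4LZ3DBrDTdxG3r8zvWo6GoKTQtV74S5UvtnP4RsE/comic.webp",
    "individual_images": [
        "https://replicate.delivery/pbxt/kbWjLatKuY58NRfu6A3nPsffTHWeVcovKcpbi20fK639H8IWC/out-0.webp",
        "https://replicate.delivery/pbxt/eSE9XNP9UTwWEK291xXBWVnPfADGTbIC4enr1ia4A7FACPilA/out-1.webp",
    ],
}
print([name for _, name in output_files(example_output)])
```

Calling `download_all(output)` after `replicate.run(...)` would then fetch the files; it is left uncalled here because it needs network access.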
Set the REPLICATE_API_TOKEN environment variable:
    export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    curl -s -X POST \
      -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
      -H "Content-Type: application/json" \
      -H "Prefer: wait" \
      -d $'{
        "version": "39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
        "input": {
          "num_ids": 4,
          "sd_model": "RealVision",
          "num_steps": 25,
          "style_name": "Line art",
          "comic_style": "Classic Comic Style",
          "image_width": 768,
          "image_height": 768,
          "sa32_setting": 0.3,
          "sa64_setting": 0.5,
          "output_format": "webp",
          "guidance_scale": 5,
          "output_quality": 80,
          "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
          "comic_description": "at home #at home, began to go to drawing\\nsitting alone on a park bench.\\nreading a book on a park bench.\\n[NC]A squirrel approaches, peeking over the bench. \\nlook around in the park. # She looks around and enjoys the beauty of nature.\\n[NC]leaf falls from the tree, landing on the sketchbook.\\npicks up the leaf, examining its details closely.\\n[NC]The brown squirrel appear.\\nis very happy # She is very happy to see the squirrel again\\n[NC]The brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker",
          "style_strength_ratio": 20,
          "character_description": "a girl, wearing white shirt, black skirt, black tie, yellow hair"
        }
      }' \
      https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
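The same HTTP request can also be assembled without a client library. The following is a minimal sketch using only Python's standard library, with the endpoint, headers, and version hash copied from the curl command above. `build_request` and `create_prediction` are hypothetical helper names, and the two-field input passed at the end is only a placeholder to show the request shape, not a complete set of inputs.

```python
import json
import os
import urllib.request

API_URL = "https://api.replicate.com/v1/predictions"
VERSION = "39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2"


def build_request(token, model_input):
    """Build the POST request that the curl command sends."""
    body = json.dumps({"version": VERSION, "input": model_input}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
            "Prefer": "wait",  # same header as in the curl example
        },
    )


def create_prediction(model_input):
    """Send the request and return the decoded JSON response (needs network)."""
    token = os.environ["REPLICATE_API_TOKEN"]
    with urllib.request.urlopen(build_request(token, model_input)) as response:
        return json.load(response)


# Build (but do not send) a request to inspect what would go over the wire.
request = build_request("<paste-your-token-here>", {"num_ids": 4, "sd_model": "RealVision"})
print(request.get_method(), request.full_url)
```

Keeping request construction separate from sending makes it possible to inspect headers and body before any network call is made.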
You can run this model locally using Cog. First, install Cog:
    brew install cog
If you don’t have Homebrew, there are other installation options available.
Run this to download the model and run it in your local environment:
    cog predict r8.im/cjwbw/storydiffusion@sha256:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2 \
      -i 'num_ids=4' \
      -i 'sd_model="RealVision"' \
      -i 'num_steps=25' \
      -i 'style_name="Line art"' \
      -i 'comic_style="Classic Comic Style"' \
      -i 'image_width=768' \
      -i 'image_height=768' \
      -i 'sa32_setting=0.3' \
      -i 'sa64_setting=0.5' \
      -i 'output_format="webp"' \
      -i 'guidance_scale=5' \
      -i 'output_quality=80' \
      -i 'negative_prompt="bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs"' \
      -i $'comic_description="at home #at home, began to go to drawing\\nsitting alone on a park bench.\\nreading a book on a park bench.\\n[NC]A squirrel approaches, peeking over the bench. \\nlook around in the park. # She looks around and enjoys the beauty of nature.\\n[NC]leaf falls from the tree, landing on the sketchbook.\\npicks up the leaf, examining its details closely.\\n[NC]The brown squirrel appear.\\nis very happy # She is very happy to see the squirrel again\\n[NC]The brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker"' \
      -i 'style_strength_ratio=20' \
      -i 'character_description="a girl, wearing white shirt, black skirt, black tie, yellow hair"'
To learn more, take a look at the Cog documentation.
Run this to download the model and run it in your local environment:
    docker run -d -p 5000:5000 --gpus=all r8.im/cjwbw/storydiffusion@sha256:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2

    curl -s -X POST \
      -H "Content-Type: application/json" \
      -d $'{
        "input": {
          "num_ids": 4,
          "sd_model": "RealVision",
          "num_steps": 25,
          "style_name": "Line art",
          "comic_style": "Classic Comic Style",
          "image_width": 768,
          "image_height": 768,
          "sa32_setting": 0.3,
          "sa64_setting": 0.5,
          "output_format": "webp",
          "guidance_scale": 5,
          "output_quality": 80,
          "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
          "comic_description": "at home #at home, began to go to drawing\\nsitting alone on a park bench.\\nreading a book on a park bench.\\n[NC]A squirrel approaches, peeking over the bench. \\nlook around in the park. # She looks around and enjoys the beauty of nature.\\n[NC]leaf falls from the tree, landing on the sketchbook.\\npicks up the leaf, examining its details closely.\\n[NC]The brown squirrel appear.\\nis very happy # She is very happy to see the squirrel again\\n[NC]The brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker",
          "style_strength_ratio": 20,
          "character_description": "a girl, wearing white shirt, black skirt, black tie, yellow hair"
        }
      }' \
      http://localhost:5000/predictions
To learn more, take a look at the Cog documentation.
Output
{ "completed_at": "2024-05-04T23:35:30.504012Z", "created_at": "2024-05-04T23:34:32.156000Z", "data_removed": false, "error": null, "id": "m5dwv3cfvhrgg0cf8qtv7yhvc4", "input": { "num_ids": 4, "sd_model": "RealVision", "num_steps": 25, "style_name": "Line art", "comic_style": "Classic Comic Style", "image_width": 768, "image_height": 768, "sa32_setting": 0.3, "sa64_setting": 0.5, "output_format": "webp", "guidance_scale": 5, "output_quality": 80, "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs", "comic_description": "at home #at home, began to go to drawing\nsitting alone on a park bench.\nreading a book on a park bench.\n[NC]A squirrel approaches, peeking over the bench. \nlook around in the park. # She looks around and enjoys the beauty of nature.\n[NC]leaf falls from the tree, landing on the sketchbook.\npicks up the leaf, examining its details closely.\n[NC]The brown squirrel appear.\nis very happy # She is very happy to see the squirrel again\n[NC]The brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker", "style_strength_ratio": 20, "character_description": "a girl, wearing white shirt, black skirt, black tie, yellow hair" }, "logs": "['at home #at home, began to go to drawing', 'sitting alone on a park bench.', 'reading a book on a park bench.', '[NC]A squirrel approaches, peeking over the bench. ', 'look around in the park. # She looks around and enjoys the beauty of nature.', '[NC]leaf falls from the tree, landing on the sketchbook.', 'picks up the leaf, examining its details closely.', '[NC]The brown squirrel appear.', 'is very happy # She is very happy to see the squirrel again', '[NC]The brown squirrel takes the cracker and scampers up a tree. 
# She gives the squirrel cracker']\n['a girl, wearing white shirt, black skirt, black tie, yellow hair,at home #at home, began to go to drawing', 'a girl, wearing white shirt, black skirt, black tie, yellow hair,sitting alone on a park bench.', 'a girl, wearing white shirt, black skirt, black tie, yellow hair,reading a book on a park bench.', 'A squirrel approaches, peeking over the bench. ', 'a girl, wearing white shirt, black skirt, black tie, yellow hair,look around in the park. # She looks around and enjoys the beauty of nature.', 'leaf falls from the tree, landing on the sketchbook.', 'a girl, wearing white shirt, black skirt, black tie, yellow hair,picks up the leaf, examining its details closely.', 'The brown squirrel appear.', 'a girl, wearing white shirt, black skirt, black tie, yellow hair,is very happy # She is very happy to see the squirrel again', 'The brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker']\n['a girl, wearing white shirt, black skirt, black tie, yellow hair,at home', 'a girl, wearing white shirt, black skirt, black tie, yellow hair,sitting alone on a park bench.', 'a girl, wearing white shirt, black skirt, black tie, yellow hair,reading a book on a park bench.', 'A squirrel approaches, peeking over the bench.', 'a girl, wearing white shirt, black skirt, black tie, yellow hair,look around in the park.', 'leaf falls from the tree, landing on the sketchbook.', 'a girl, wearing white shirt, black skirt, black tie, yellow hair,picks up the leaf, examining its details closely.', 'The brown squirrel appear.', 'a girl, wearing white shirt, black skirt, black tie, yellow hair,is very happy', 'The brown squirrel takes the cracker and scampers up a tree.']\nUsing seed: 1486\nSuccessfully load paired self-attention\nNumber of the processor : 36\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:10, 2.26it/s]\n 8%|▊ | 2/25 [00:00<00:10, 2.25it/s]\n 12%|█▏ | 3/25 [00:01<00:09, 2.25it/s]\n 16%|█▌ | 4/25 
[00:01<00:09, 2.25it/s]\n 20%|██ | 5/25 [00:02<00:08, 2.25it/s]\n 24%|██▍ | 6/25 [00:02<00:10, 1.87it/s]\n 28%|██▊ | 7/25 [00:03<00:10, 1.73it/s]\n 32%|███▏ | 8/25 [00:04<00:10, 1.65it/s]\n 36%|███▌ | 9/25 [00:04<00:10, 1.59it/s]\n 40%|████ | 10/25 [00:05<00:09, 1.53it/s]\n 44%|████▍ | 11/25 [00:06<00:09, 1.48it/s]\n 48%|████▊ | 12/25 [00:07<00:08, 1.48it/s]\n 52%|█████▏ | 13/25 [00:07<00:08, 1.45it/s]\n 56%|█████▌ | 14/25 [00:08<00:07, 1.45it/s]\n 60%|██████ | 15/25 [00:09<00:07, 1.42it/s]\n 64%|██████▍ | 16/25 [00:09<00:06, 1.41it/s]\n 68%|██████▊ | 17/25 [00:10<00:05, 1.41it/s]\n 72%|███████▏ | 18/25 [00:11<00:04, 1.43it/s]\n 76%|███████▌ | 19/25 [00:12<00:04, 1.40it/s]\n 80%|████████ | 20/25 [00:12<00:03, 1.40it/s]\n 84%|████████▍ | 21/25 [00:13<00:02, 1.34it/s]\n 88%|████████▊ | 22/25 [00:14<00:02, 1.30it/s]\n 92%|█████████▏| 23/25 [00:15<00:01, 1.29it/s]\n 96%|█████████▌| 24/25 [00:16<00:00, 1.28it/s]\n100%|██████████| 25/25 [00:16<00:00, 1.25it/s]\n100%|██████████| 25/25 [00:16<00:00, 1.48it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:03, 6.19it/s]\n 8%|▊ | 2/25 [00:00<00:03, 6.15it/s]\n 12%|█▏ | 3/25 [00:00<00:03, 6.15it/s]\n 16%|█▌ | 4/25 [00:00<00:03, 6.14it/s]\n 20%|██ | 5/25 [00:00<00:03, 6.11it/s]\n 24%|██▍ | 6/25 [00:01<00:03, 5.46it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 4.94it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 4.57it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.49it/s]\n 40%|████ | 10/25 [00:01<00:03, 4.46it/s]\n 44%|████▍ | 11/25 [00:02<00:03, 4.53it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 4.49it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.45it/s]\n 56%|█████▌ | 14/25 [00:02<00:02, 4.49it/s]\n 60%|██████ | 15/25 [00:03<00:02, 4.53it/s]\n 64%|██████▍ | 16/25 [00:03<00:02, 4.44it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.45it/s]\n 72%|███████▏ | 18/25 [00:03<00:01, 4.38it/s]\n 76%|███████▌ | 19/25 [00:04<00:01, 4.33it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.27it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.24it/s]\n 88%|████████▊ | 22/25 [00:04<00:00, 
4.07it/s]\n 92%|█████████▏| 23/25 [00:05<00:00, 3.96it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 3.94it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.87it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.49it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:03, 6.17it/s]\n 8%|▊ | 2/25 [00:00<00:03, 6.16it/s]\n 12%|█▏ | 3/25 [00:00<00:03, 6.16it/s]\n 16%|█▌ | 4/25 [00:00<00:03, 6.14it/s]\n 20%|██ | 5/25 [00:00<00:03, 6.13it/s]\n 24%|██▍ | 6/25 [00:01<00:03, 5.28it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 4.89it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 4.69it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.56it/s]\n 40%|████ | 10/25 [00:01<00:03, 4.48it/s]\n 44%|████▍ | 11/25 [00:02<00:03, 4.36it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 4.36it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.46it/s]\n 56%|█████▌ | 14/25 [00:02<00:02, 4.55it/s]\n 60%|██████ | 15/25 [00:03<00:02, 4.42it/s]\n 64%|██████▍ | 16/25 [00:03<00:01, 4.51it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.55it/s]\n 72%|███████▏ | 18/25 [00:03<00:01, 4.48it/s]\n 76%|███████▌ | 19/25 [00:04<00:01, 4.37it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.37it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.15it/s]\n 88%|████████▊ | 22/25 [00:04<00:00, 4.07it/s]\n 92%|█████████▏| 23/25 [00:05<00:00, 4.01it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 3.92it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.85it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.48it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:03, 6.19it/s]\n 8%|▊ | 2/25 [00:00<00:03, 6.15it/s]\n 12%|█▏ | 3/25 [00:00<00:03, 6.13it/s]\n 16%|█▌ | 4/25 [00:00<00:03, 6.12it/s]\n 20%|██ | 5/25 [00:00<00:03, 6.12it/s]\n 24%|██▍ | 6/25 [00:01<00:03, 5.08it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 4.70it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 4.57it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.77it/s]\n 40%|████ | 10/25 [00:01<00:03, 4.63it/s]\n 44%|████▍ | 11/25 [00:02<00:02, 4.70it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 4.81it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.65it/s]\n 56%|█████▌ | 14/25 [00:02<00:02, 4.79it/s]\n 
60%|██████ | 15/25 [00:03<00:02, 4.84it/s]\n 64%|██████▍ | 16/25 [00:03<00:01, 4.60it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.44it/s]\n 72%|███████▏ | 18/25 [00:03<00:01, 4.44it/s]\n 76%|███████▌ | 19/25 [00:03<00:01, 4.46it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.45it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.29it/s]\n 88%|████████▊ | 22/25 [00:04<00:00, 4.11it/s]\n 92%|█████████▏| 23/25 [00:04<00:00, 4.07it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 3.94it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.91it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.57it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:03, 6.17it/s]\n 8%|▊ | 2/25 [00:00<00:03, 6.13it/s]\n 12%|█▏ | 3/25 [00:00<00:03, 6.12it/s]\n 16%|█▌ | 4/25 [00:00<00:03, 6.12it/s]\n 20%|██ | 5/25 [00:00<00:03, 6.12it/s]\n 24%|██▍ | 6/25 [00:01<00:03, 5.51it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 5.24it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 4.93it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.59it/s]\n 40%|████ | 10/25 [00:01<00:03, 4.52it/s]\n 44%|████▍ | 11/25 [00:02<00:03, 4.37it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 4.39it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.35it/s]\n 56%|█████▌ | 14/25 [00:02<00:02, 4.35it/s]\n 60%|██████ | 15/25 [00:03<00:02, 4.34it/s]\n 64%|██████▍ | 16/25 [00:03<00:02, 4.48it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.42it/s]\n 72%|███████▏ | 18/25 [00:03<00:01, 4.26it/s]\n 76%|███████▌ | 19/25 [00:04<00:01, 4.44it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.37it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.26it/s]\n 88%|████████▊ | 22/25 [00:04<00:00, 4.13it/s]\n 92%|█████████▏| 23/25 [00:05<00:00, 4.07it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 3.95it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.90it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.51it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:03, 6.15it/s]\n 8%|▊ | 2/25 [00:00<00:03, 6.11it/s]\n 12%|█▏ | 3/25 [00:00<00:03, 6.11it/s]\n 16%|█▌ | 4/25 [00:00<00:03, 6.11it/s]\n 20%|██ | 5/25 [00:00<00:03, 6.10it/s]\n 24%|██▍ | 6/25 [00:01<00:03, 
5.45it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 5.09it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 4.83it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.74it/s]\n 40%|████ | 10/25 [00:01<00:03, 4.58it/s]\n 44%|████▍ | 11/25 [00:02<00:03, 4.63it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 4.47it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.54it/s]\n 56%|█████▌ | 14/25 [00:02<00:02, 4.41it/s]\n 60%|██████ | 15/25 [00:03<00:02, 4.24it/s]\n 64%|██████▍ | 16/25 [00:03<00:02, 4.30it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.16it/s]\n 72%|███████▏ | 18/25 [00:03<00:01, 4.16it/s]\n 76%|███████▌ | 19/25 [00:04<00:01, 4.18it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.20it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.12it/s]\n 88%|████████▊ | 22/25 [00:04<00:00, 3.99it/s]\n 92%|█████████▏| 23/25 [00:05<00:00, 3.96it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 3.87it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.82it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.44it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:03, 6.17it/s]\n 8%|▊ | 2/25 [00:00<00:03, 6.13it/s]\n 12%|█▏ | 3/25 [00:00<00:03, 6.12it/s]\n 16%|█▌ | 4/25 [00:00<00:03, 6.11it/s]\n 20%|██ | 5/25 [00:00<00:03, 6.11it/s]\n 24%|██▍ | 6/25 [00:01<00:03, 5.22it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 4.70it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 4.45it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.41it/s]\n 40%|████ | 10/25 [00:02<00:03, 4.40it/s]\n 44%|████▍ | 11/25 [00:02<00:03, 4.39it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 4.40it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.51it/s]\n 56%|█████▌ | 14/25 [00:02<00:02, 4.42it/s]\n 60%|██████ | 15/25 [00:03<00:02, 4.50it/s]\n 64%|██████▍ | 16/25 [00:03<00:01, 4.50it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.37it/s]\n 72%|███████▏ | 18/25 [00:03<00:01, 4.36it/s]\n 76%|███████▌ | 19/25 [00:04<00:01, 4.37it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.36it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.15it/s]\n 88%|████████▊ | 22/25 [00:04<00:00, 4.14it/s]\n 92%|█████████▏| 23/25 [00:05<00:00, 4.03it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 
3.93it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.87it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.46it/s]\n6 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A36C4810>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A6EE7BD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A3D70850>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A5820C10>]]\n0 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A36C4810>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A6EE7BD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A3D70850>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A5820C10>], [<PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A3528410>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A5210>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A66D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A5CD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A4410>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A7E55A10>]]\n[[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A36C4810>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A6EE7BD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A3D70850>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A5820C10>], [<PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A3528410>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A5210>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A66D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A5CD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A4410>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A7E55A10>]]\n0 (83, 712)\n0 (115, 712)\n1 (30, 641)\n0 (136, 718)", "metrics": { "predict_time": 58.30952, "total_time": 58.348012 }, "output": { "comic": "https://replicate.delivery/pbxt/kb51zJAT4LZ3DBrDTdxG3r8zvWo6GoKTQtV74S5UvtnP4RsE/comic.webp", "individual_images": [ 
"https://replicate.delivery/pbxt/kbWjLatKuY58NRfu6A3nPsffTHWeVcovKcpbi20fK639H8IWC/out-0.webp", "https://replicate.delivery/pbxt/eSE9XNP9UTwWEK291xXBWVnPfADGTbIC4enr1ia4A7FACPilA/out-1.webp", "https://replicate.delivery/pbxt/Ze04d4jR4fix0USFIrXfp3p85jXWuAzPz4rgfzCfcqwHI8IWC/out-2.webp", "https://replicate.delivery/pbxt/21cjVTK86ULNIlIAlNfXh4FXf6DdYU1XKDWkFGqjQ8yAhHxSA/out-3.webp", "https://replicate.delivery/pbxt/T8f8flamuPjoWk5J1DcBk7vfEI1x0x8RR76DPbFb6TxACPilA/out-4.webp", "https://replicate.delivery/pbxt/V12yP6wHKYpmId8FX8vjkAwUz0cnfG8Polo74uMLxAmgwjYJA/out-5.webp", "https://replicate.delivery/pbxt/GOCaZrZzlRbPC9oLgRpCypCZS0t7MGn1fDQkj9yhh8vgwjYJA/out-6.webp", "https://replicate.delivery/pbxt/PEt9c7zSSZKPOBsl2eMnuFBSpVyJJig3FeOvJL5meRADCPilA/out-7.webp", "https://replicate.delivery/pbxt/JpGM444lfsxVdiAeGd5OePCzhgzghrarlrHVBvWYexzIEeIWC/out-8.webp", "https://replicate.delivery/pbxt/BuYXgeVo4TztVawo7Vj80cdf1BUu4hj28tDEkGlAzsLChHxSA/out-9.webp" ] }, "started_at": "2024-05-04T23:34:32.194492Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/m5dwv3cfvhrgg0cf8qtv7yhvc4", "cancel": "https://api.replicate.com/v1/predictions/m5dwv3cfvhrgg0cf8qtv7yhvc4/cancel" }, "version": "39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2" }
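The `metrics` in this output can be cross-checked against its timestamps. For this prediction, `predict_time` equals `completed_at` minus `started_at`, and `total_time` equals `completed_at` minus `created_at`. That is an observation from this one output, not a documented definition. A small Python sketch of the arithmetic, where `parse_utc` is a hypothetical helper:

```python
from datetime import datetime


def parse_utc(timestamp):
    """Parse an ISO 8601 timestamp with a trailing "Z" (UTC)."""
    # Older Python versions of fromisoformat() do not accept "Z" directly.
    return datetime.fromisoformat(timestamp.replace("Z", "+00:00"))


# Values copied from the output JSON above.
created = parse_utc("2024-05-04T23:34:32.156000Z")
started = parse_utc("2024-05-04T23:34:32.194492Z")
completed = parse_utc("2024-05-04T23:35:30.504012Z")

predict_seconds = (completed - started).total_seconds()
total_seconds = (completed - created).total_seconds()
print(predict_seconds, total_seconds)
```

The two differences agree with the reported `predict_time` (58.30952) and `total_time` (58.348012); the gap of roughly 0.04 s between them is the time between creation and start.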
84%|████████▍ | 21/25 [00:04<00:00, 4.15it/s] 88%|████████▊ | 22/25 [00:04<00:00, 4.07it/s] 92%|█████████▏| 23/25 [00:05<00:00, 4.01it/s] 96%|█████████▌| 24/25 [00:05<00:00, 3.92it/s] 100%|██████████| 25/25 [00:05<00:00, 3.85it/s] 100%|██████████| 25/25 [00:05<00:00, 4.48it/s] 0%| | 0/25 [00:00<?, ?it/s] 4%|▍ | 1/25 [00:00<00:03, 6.19it/s] 8%|▊ | 2/25 [00:00<00:03, 6.15it/s] 12%|█▏ | 3/25 [00:00<00:03, 6.13it/s] 16%|█▌ | 4/25 [00:00<00:03, 6.12it/s] 20%|██ | 5/25 [00:00<00:03, 6.12it/s] 24%|██▍ | 6/25 [00:01<00:03, 5.08it/s] 28%|██▊ | 7/25 [00:01<00:03, 4.70it/s] 32%|███▏ | 8/25 [00:01<00:03, 4.57it/s] 36%|███▌ | 9/25 [00:01<00:03, 4.77it/s] 40%|████ | 10/25 [00:01<00:03, 4.63it/s] 44%|████▍ | 11/25 [00:02<00:02, 4.70it/s] 48%|████▊ | 12/25 [00:02<00:02, 4.81it/s] 52%|█████▏ | 13/25 [00:02<00:02, 4.65it/s] 56%|█████▌ | 14/25 [00:02<00:02, 4.79it/s] 60%|██████ | 15/25 [00:03<00:02, 4.84it/s] 64%|██████▍ | 16/25 [00:03<00:01, 4.60it/s] 68%|██████▊ | 17/25 [00:03<00:01, 4.44it/s] 72%|███████▏ | 18/25 [00:03<00:01, 4.44it/s] 76%|███████▌ | 19/25 [00:03<00:01, 4.46it/s] 80%|████████ | 20/25 [00:04<00:01, 4.45it/s] 84%|████████▍ | 21/25 [00:04<00:00, 4.29it/s] 88%|████████▊ | 22/25 [00:04<00:00, 4.11it/s] 92%|█████████▏| 23/25 [00:04<00:00, 4.07it/s] 96%|█████████▌| 24/25 [00:05<00:00, 3.94it/s] 100%|██████████| 25/25 [00:05<00:00, 3.91it/s] 100%|██████████| 25/25 [00:05<00:00, 4.57it/s] 0%| | 0/25 [00:00<?, ?it/s] 4%|▍ | 1/25 [00:00<00:03, 6.17it/s] 8%|▊ | 2/25 [00:00<00:03, 6.13it/s] 12%|█▏ | 3/25 [00:00<00:03, 6.12it/s] 16%|█▌ | 4/25 [00:00<00:03, 6.12it/s] 20%|██ | 5/25 [00:00<00:03, 6.12it/s] 24%|██▍ | 6/25 [00:01<00:03, 5.51it/s] 28%|██▊ | 7/25 [00:01<00:03, 5.24it/s] 32%|███▏ | 8/25 [00:01<00:03, 4.93it/s] 36%|███▌ | 9/25 [00:01<00:03, 4.59it/s] 40%|████ | 10/25 [00:01<00:03, 4.52it/s] 44%|████▍ | 11/25 [00:02<00:03, 4.37it/s] 48%|████▊ | 12/25 [00:02<00:02, 4.39it/s] 52%|█████▏ | 13/25 [00:02<00:02, 4.35it/s] 56%|█████▌ | 14/25 [00:02<00:02, 4.35it/s] 60%|██████ 
| 15/25 [00:03<00:02, 4.34it/s] 64%|██████▍ | 16/25 [00:03<00:02, 4.48it/s] 68%|██████▊ | 17/25 [00:03<00:01, 4.42it/s] 72%|███████▏ | 18/25 [00:03<00:01, 4.26it/s] 76%|███████▌ | 19/25 [00:04<00:01, 4.44it/s] 80%|████████ | 20/25 [00:04<00:01, 4.37it/s] 84%|████████▍ | 21/25 [00:04<00:00, 4.26it/s] 88%|████████▊ | 22/25 [00:04<00:00, 4.13it/s] 92%|█████████▏| 23/25 [00:05<00:00, 4.07it/s] 96%|█████████▌| 24/25 [00:05<00:00, 3.95it/s] 100%|██████████| 25/25 [00:05<00:00, 3.90it/s] 100%|██████████| 25/25 [00:05<00:00, 4.51it/s] 0%| | 0/25 [00:00<?, ?it/s] 4%|▍ | 1/25 [00:00<00:03, 6.15it/s] 8%|▊ | 2/25 [00:00<00:03, 6.11it/s] 12%|█▏ | 3/25 [00:00<00:03, 6.11it/s] 16%|█▌ | 4/25 [00:00<00:03, 6.11it/s] 20%|██ | 5/25 [00:00<00:03, 6.10it/s] 24%|██▍ | 6/25 [00:01<00:03, 5.45it/s] 28%|██▊ | 7/25 [00:01<00:03, 5.09it/s] 32%|███▏ | 8/25 [00:01<00:03, 4.83it/s] 36%|███▌ | 9/25 [00:01<00:03, 4.74it/s] 40%|████ | 10/25 [00:01<00:03, 4.58it/s] 44%|████▍ | 11/25 [00:02<00:03, 4.63it/s] 48%|████▊ | 12/25 [00:02<00:02, 4.47it/s] 52%|█████▏ | 13/25 [00:02<00:02, 4.54it/s] 56%|█████▌ | 14/25 [00:02<00:02, 4.41it/s] 60%|██████ | 15/25 [00:03<00:02, 4.24it/s] 64%|██████▍ | 16/25 [00:03<00:02, 4.30it/s] 68%|██████▊ | 17/25 [00:03<00:01, 4.16it/s] 72%|███████▏ | 18/25 [00:03<00:01, 4.16it/s] 76%|███████▌ | 19/25 [00:04<00:01, 4.18it/s] 80%|████████ | 20/25 [00:04<00:01, 4.20it/s] 84%|████████▍ | 21/25 [00:04<00:00, 4.12it/s] 88%|████████▊ | 22/25 [00:04<00:00, 3.99it/s] 92%|█████████▏| 23/25 [00:05<00:00, 3.96it/s] 96%|█████████▌| 24/25 [00:05<00:00, 3.87it/s] 100%|██████████| 25/25 [00:05<00:00, 3.82it/s] 100%|██████████| 25/25 [00:05<00:00, 4.44it/s] 0%| | 0/25 [00:00<?, ?it/s] 4%|▍ | 1/25 [00:00<00:03, 6.17it/s] 8%|▊ | 2/25 [00:00<00:03, 6.13it/s] 12%|█▏ | 3/25 [00:00<00:03, 6.12it/s] 16%|█▌ | 4/25 [00:00<00:03, 6.11it/s] 20%|██ | 5/25 [00:00<00:03, 6.11it/s] 24%|██▍ | 6/25 [00:01<00:03, 5.22it/s] 28%|██▊ | 7/25 [00:01<00:03, 4.70it/s] 32%|███▏ | 8/25 [00:01<00:03, 4.45it/s] 
36%|███▌ | 9/25 [00:01<00:03, 4.41it/s] 40%|████ | 10/25 [00:02<00:03, 4.40it/s] 44%|████▍ | 11/25 [00:02<00:03, 4.39it/s] 48%|████▊ | 12/25 [00:02<00:02, 4.40it/s] 52%|█████▏ | 13/25 [00:02<00:02, 4.51it/s] 56%|█████▌ | 14/25 [00:02<00:02, 4.42it/s] 60%|██████ | 15/25 [00:03<00:02, 4.50it/s] 64%|██████▍ | 16/25 [00:03<00:01, 4.50it/s] 68%|██████▊ | 17/25 [00:03<00:01, 4.37it/s] 72%|███████▏ | 18/25 [00:03<00:01, 4.36it/s] 76%|███████▌ | 19/25 [00:04<00:01, 4.37it/s] 80%|████████ | 20/25 [00:04<00:01, 4.36it/s] 84%|████████▍ | 21/25 [00:04<00:00, 4.15it/s] 88%|████████▊ | 22/25 [00:04<00:00, 4.14it/s] 92%|█████████▏| 23/25 [00:05<00:00, 4.03it/s] 96%|█████████▌| 24/25 [00:05<00:00, 3.93it/s] 100%|██████████| 25/25 [00:05<00:00, 3.87it/s] 100%|██████████| 25/25 [00:05<00:00, 4.46it/s] 6 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A36C4810>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A6EE7BD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A3D70850>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A5820C10>]] 0 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A36C4810>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A6EE7BD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A3D70850>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A5820C10>], [<PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A3528410>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A5210>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A66D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A5CD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A4410>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A7E55A10>]] [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A36C4810>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A6EE7BD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A3D70850>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A5820C10>], [<PIL.Image.Image 
image mode=RGB size=788x788 at 0x7F22A3528410>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A5210>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A66D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A5CD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A40A4410>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F22A7E55A10>]] 0 (83, 712) 0 (115, 712) 1 (30, 641) 0 (136, 718)
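The three lists at the start of this log appear to show how `comic_description` is preprocessed: the raw panel lines, then the lines with the character description prepended (lines marked `[NC]` are left without it), then the same prompts with everything after `#` dropped. The following is a minimal Python sketch of that behaviour, inferred only from the log output and not taken from the model's source; the function name `expand_prompts` is hypothetical.

```python
def expand_prompts(comic_description, character_description):
    """Mimic the three prompt lists printed at the top of the log.

    Inferred from log output only; the model's real code may differ.
    """
    raw = comic_description.split("\n")
    with_character = []
    for line in raw:
        if "[NC]" in line:
            # "[NC]" panels: drop the marker, do not add the character.
            with_character.append(line.replace("[NC]", ""))
        else:
            # Other panels get the character description and a comma prepended.
            with_character.append(character_description + "," + line)
    # Text after "#" looks like a caption; keep only the part before it.
    cleaned = [p.split("#")[0].strip() for p in with_character]
    return raw, with_character, cleaned


raw, with_character, cleaned = expand_prompts(
    "at home #at home, began to go to drawing\n[NC]The brown squirrel appear.",
    "a girl, wearing white shirt",
)
print(cleaned)
```

The comma without a following space mirrors the log; whether whitespace after `[NC]` is trimmed at this stage is an assumption.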
Prediction
hvision-nku/storydiffusion:39c85f15
ID: wq1tk3y125rgg0cf8mmrd6rfg4
Status: Succeeded
Source: Web
Hardware: A40 (Large)
Input
- num_ids: 3
- sd_model: RealVision
- num_steps: 25
- style_name: Comic book
- comic_style: Four Pannel
- image_width: 768
- image_height: 768
- sa32_setting: 0.5
- sa64_setting: 0.5
- output_format: webp
- guidance_scale: 5
- output_quality: 80
- negative_prompt: bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs
- comic_description: Directing traffic on the road. walking on the streets. Chasing a man on the street. At the police station.
- style_strength_ratio: 20
- character_description: a policeman img, wearing a white shirt
{ "num_ids": 3, "sd_model": "RealVision", "num_steps": 25, "ref_image": "https://replicate.delivery/pbxt/KrVIf310BZASLnFwfnDHR2R4Zwienkks9jS3IBDAKi9puhb9/yann-lecun2.jpeg", "style_name": "Comic book", "comic_style": "Four Pannel", "image_width": 768, "image_height": 768, "sa32_setting": 0.5, "sa64_setting": 0.5, "output_format": "webp", "guidance_scale": 5, "output_quality": 80, "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs", "comic_description": "Directing traffic on the road. \nwalking on the streets.\nChasing a man on the street.\nAt the police station.", "style_strength_ratio": 20, "character_description": "a policeman img, wearing a white shirt" }
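The input above passes the whole comic as one `comic_description` string with one panel per line. As a rough sketch, the payload could be assembled in Python from a list of panel prompts as shown below. The `DEFAULTS` values are simply copied from this prediction and are not necessarily the model's schema defaults; `build_input` is a hypothetical helper name.

```python
import json

# Values copied from the prediction shown on this page (assumed, not schema defaults).
DEFAULTS = {
    "num_ids": 3,
    "num_steps": 25,
    "image_width": 768,
    "image_height": 768,
    "sa32_setting": 0.5,
    "sa64_setting": 0.5,
    "guidance_scale": 5,
    "style_strength_ratio": 20,
    "output_format": "webp",
    "output_quality": 80,
}


def build_input(panels, character_description, **overrides):
    """Build an input dict with one panel prompt per line of comic_description."""
    payload = dict(DEFAULTS)
    payload["character_description"] = character_description
    payload["comic_description"] = "\n".join(panels)
    payload.update(overrides)  # e.g. sd_model, style_name, comic_style, ref_image
    return payload


payload = build_input(
    [
        "Directing traffic on the road.",
        "walking on the streets.",
        "Chasing a man on the street.",
        "At the police station.",
    ],
    "a policeman img, wearing a white shirt",
    sd_model="RealVision",
    style_name="Comic book",
    comic_style="Four Pannel",
)
print(json.dumps(payload, indent=2))
```

Keeping the panels as a list until the last step avoids hand-writing `\n` escapes, which differ between the JSON, shell, and Python snippets on this page.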
Install Replicate’s Node.js client library:
    npm install replicate
Set the REPLICATE_API_TOKEN environment variable:
    export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import and set up the client:
    import Replicate from "replicate";

    const replicate = new Replicate({
      auth: process.env.REPLICATE_API_TOKEN,
    });
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    const output = await replicate.run(
      "hvision-nku/storydiffusion:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
      {
        input: {
          num_ids: 3,
          sd_model: "RealVision",
          num_steps: 25,
          ref_image: "https://replicate.delivery/pbxt/KrVIf310BZASLnFwfnDHR2R4Zwienkks9jS3IBDAKi9puhb9/yann-lecun2.jpeg",
          style_name: "Comic book",
          comic_style: "Four Pannel",
          image_width: 768,
          image_height: 768,
          sa32_setting: 0.5,
          sa64_setting: 0.5,
          output_format: "webp",
          guidance_scale: 5,
          output_quality: 80,
          negative_prompt: "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
          comic_description: "Directing traffic on the road. \nwalking on the streets.\nChasing a man on the street.\nAt the police station.",
          style_strength_ratio: 20,
          character_description: "a policeman img, wearing a white shirt"
        }
      }
    );
    console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
    pip install replicate
Set the REPLICATE_API_TOKEN environment variable:
    export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import the client:
    import replicate
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    output = replicate.run(
        "hvision-nku/storydiffusion:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
        input={
            "num_ids": 3,
            "sd_model": "RealVision",
            "num_steps": 25,
            "ref_image": "https://replicate.delivery/pbxt/KrVIf310BZASLnFwfnDHR2R4Zwienkks9jS3IBDAKi9puhb9/yann-lecun2.jpeg",
            "style_name": "Comic book",
            "comic_style": "Four Pannel",
            "image_width": 768,
            "image_height": 768,
            "sa32_setting": 0.5,
            "sa64_setting": 0.5,
            "output_format": "webp",
            "guidance_scale": 5,
            "output_quality": 80,
            "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
            "comic_description": "Directing traffic on the road. \nwalking on the streets.\nChasing a man on the street.\nAt the police station.",
            "style_strength_ratio": 20,
            "character_description": "a policeman img, wearing a white shirt"
        }
    )
    print(output)
To learn more, take a look at the guide on getting started with Python.
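Assuming `output` has the shape of the `output` field in the prediction JSON shown under Output on this page (a `comic` URL plus a list of `individual_images` URLs), the files could be listed and saved roughly as follows. Newer versions of the Python client may return file-like objects rather than plain URL strings, so treat this as a sketch. `list_outputs` and `save_outputs` are hypothetical helper names, and the `example.com` URLs are placeholders.

```python
import os
import urllib.request
from urllib.parse import urlparse


def list_outputs(output):
    """Return (filename, url) pairs for the comic page and each panel image."""
    urls = [output["comic"], *output["individual_images"]]
    return [(os.path.basename(urlparse(u).path), u) for u in urls]


def save_outputs(output, directory="."):
    """Download every output file into `directory` (needs network access)."""
    os.makedirs(directory, exist_ok=True)
    for name, url in list_outputs(output):
        urllib.request.urlretrieve(url, os.path.join(directory, name))


# Placeholder data in the same shape as the prediction's "output" field.
example = {
    "comic": "https://example.com/a/comic.webp",
    "individual_images": [
        "https://example.com/b/out-0.webp",
        "https://example.com/c/out-1.webp",
    ],
}
print(list_outputs(example))
```

With a real result, `save_outputs(output, "comic_out")` would write `comic.webp` and the `out-N.webp` panels into `comic_out`.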
Set the REPLICATE_API_TOKEN environment variable:
    export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    curl -s -X POST \
      -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
      -H "Content-Type: application/json" \
      -H "Prefer: wait" \
      -d $'{
        "version": "39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
        "input": {
          "num_ids": 3,
          "sd_model": "RealVision",
          "num_steps": 25,
          "ref_image": "https://replicate.delivery/pbxt/KrVIf310BZASLnFwfnDHR2R4Zwienkks9jS3IBDAKi9puhb9/yann-lecun2.jpeg",
          "style_name": "Comic book",
          "comic_style": "Four Pannel",
          "image_width": 768,
          "image_height": 768,
          "sa32_setting": 0.5,
          "sa64_setting": 0.5,
          "output_format": "webp",
          "guidance_scale": 5,
          "output_quality": 80,
          "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
          "comic_description": "Directing traffic on the road. \\nwalking on the streets.\\nChasing a man on the street.\\nAt the police station.",
          "style_strength_ratio": 20,
          "character_description": "a policeman img, wearing a white shirt"
        }
      }' \
      https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
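The same HTTP request can be put together from Python's standard library. The sketch below only builds the request, using the endpoint, headers, and version hash from the curl command above; the input is trimmed to a single field for brevity, and `build_request` is a hypothetical helper name. Sending is left commented out so the example runs without a token or network access.

```python
import json
import os
import urllib.request

API_URL = "https://api.replicate.com/v1/predictions"
VERSION = "39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2"


def build_request(model_input, token):
    """Build (but do not send) the POST request that creates a prediction."""
    body = json.dumps({"version": VERSION, "input": model_input}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
            "Prefer": "wait",  # same header as in the curl example
        },
    )


token = os.environ.get("REPLICATE_API_TOKEN", "<paste-your-token-here>")
req = build_request({"num_ids": 3}, token)  # trimmed input, for illustration only
print(req.get_method(), req.full_url)

# To actually send it (requires a valid token and the full input):
# with urllib.request.urlopen(req) as resp:
#     prediction = json.load(resp)
```

Serializing the body with `json.dumps` also takes care of the `\n` escapes in `comic_description`, which the curl version has to double inside `$'...'`.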
You can run this model locally using Cog. First, install Cog:
    brew install cog
If you don’t have Homebrew, there are other installation options available.
Run this to download the model and run it in your local environment:
    cog predict r8.im/cjwbw/storydiffusion@sha256:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2 \
      -i 'num_ids=3' \
      -i 'sd_model="RealVision"' \
      -i 'num_steps=25' \
      -i 'ref_image="https://replicate.delivery/pbxt/KrVIf310BZASLnFwfnDHR2R4Zwienkks9jS3IBDAKi9puhb9/yann-lecun2.jpeg"' \
      -i 'style_name="Comic book"' \
      -i 'comic_style="Four Pannel"' \
      -i 'image_width=768' \
      -i 'image_height=768' \
      -i 'sa32_setting=0.5' \
      -i 'sa64_setting=0.5' \
      -i 'output_format="webp"' \
      -i 'guidance_scale=5' \
      -i 'output_quality=80' \
      -i 'negative_prompt="bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs"' \
      -i $'comic_description="Directing traffic on the road. \\nwalking on the streets.\\nChasing a man on the street.\\nAt the police station."' \
      -i 'style_strength_ratio=20' \
      -i 'character_description="a policeman img, wearing a white shirt"'
To learn more, take a look at the Cog documentation.
Run this to download the model and run it in your local environment:
    docker run -d -p 5000:5000 --gpus=all r8.im/cjwbw/storydiffusion@sha256:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2

    curl -s -X POST \
      -H "Content-Type: application/json" \
      -d $'{
        "input": {
          "num_ids": 3,
          "sd_model": "RealVision",
          "num_steps": 25,
          "ref_image": "https://replicate.delivery/pbxt/KrVIf310BZASLnFwfnDHR2R4Zwienkks9jS3IBDAKi9puhb9/yann-lecun2.jpeg",
          "style_name": "Comic book",
          "comic_style": "Four Pannel",
          "image_width": 768,
          "image_height": 768,
          "sa32_setting": 0.5,
          "sa64_setting": 0.5,
          "output_format": "webp",
          "guidance_scale": 5,
          "output_quality": 80,
          "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
          "comic_description": "Directing traffic on the road. \\nwalking on the streets.\\nChasing a man on the street.\\nAt the police station.",
          "style_strength_ratio": 20,
          "character_description": "a policeman img, wearing a white shirt"
        }
      }' \
      http://localhost:5000/predictions
To learn more, take a look at the Cog documentation.
Output
{ "completed_at": "2024-05-04T19:53:48.102022Z", "created_at": "2024-05-04T19:51:55.409000Z", "data_removed": false, "error": null, "id": "wq1tk3y125rgg0cf8mmrd6rfg4", "input": { "num_ids": 3, "sd_model": "RealVision", "num_steps": 25, "ref_image": "https://replicate.delivery/pbxt/KrVIf310BZASLnFwfnDHR2R4Zwienkks9jS3IBDAKi9puhb9/yann-lecun2.jpeg", "style_name": "Comic book", "comic_style": "Four Pannel", "image_width": 768, "image_height": 768, "sa32_setting": 0.5, "sa64_setting": 0.5, "output_format": "webp", "guidance_scale": 5, "output_quality": 80, "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs", "comic_description": "Directing traffic on the road. \nwalking on the streets.\nChasing a man on the street.\nAt the police station.", "style_strength_ratio": 20, "character_description": "a policeman img, wearing a white shirt" }, "logs": "['Directing traffic on the road. ', 'walking on the streets.', 'Chasing a man on the street.', 'At the police station.']\n['a policeman img, wearing a white shirt,Directing traffic on the road. ', 'a policeman img, wearing a white shirt,walking on the streets.', 'a policeman img, wearing a white shirt,Chasing a man on the street.', 'a policeman img, wearing a white shirt,At the police station.']\n['a policeman img, wearing a white shirt,Directing traffic on the road.', 'a policeman img, wearing a white shirt,walking on the streets.', 'a policeman img, wearing a white shirt,Chasing a man on the street.', 'a policeman img, wearing a white shirt,At the police station.']\nUsing seed: 46051\nSuccessfully load paired self-attention\nNumber of the processor : 36\ncomic a policeman, wearing a white shirt, directing traffic on the road.. 
graphic illustration, comic art, graphic novel art, vibrant, highly detailed\ncomic a policeman, wearing a white shirt, walking on the streets.. graphic illustration, comic art, graphic novel art, vibrant, highly detailed\ncomic a policeman, wearing a white shirt, chasing a man on the street.. graphic illustration, comic art, graphic novel art, vibrant, highly detailed\ntorch.Size([3, 77, 2048])\ntorch.Size([3, 77, 2048])\ntorch.Size([3, 77, 2048])\ntorch.Size([3, 1280])\ntorch.Size([3, 1280])\ntorch.Size([3, 1280])\ntorch.Size([3, 4, 96, 96])\ntorch.Size([6, 6])\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:09, 2.60it/s]\n 8%|▊ | 2/25 [00:00<00:08, 2.86it/s]\n 12%|█▏ | 3/25 [00:01<00:07, 2.94it/s]\n 16%|█▌ | 4/25 [00:01<00:07, 2.99it/s]\n 20%|██ | 5/25 [00:01<00:06, 3.02it/s]\n 24%|██▍ | 6/25 [00:02<00:07, 2.54it/s]\n 28%|██▊ | 7/25 [00:02<00:07, 2.34it/s]\n 32%|███▏ | 8/25 [00:03<00:07, 2.21it/s]\n 36%|███▌ | 9/25 [00:03<00:07, 2.25it/s]\n 40%|████ | 10/25 [00:04<00:06, 2.15it/s]\n 44%|████▍ | 11/25 [00:04<00:06, 2.06it/s]\n 48%|████▊ | 12/25 [00:05<00:06, 2.04it/s]\n 52%|█████▏ | 13/25 [00:05<00:05, 2.06it/s]\n 56%|█████▌ | 14/25 [00:06<00:05, 2.10it/s]\n 60%|██████ | 15/25 [00:06<00:04, 2.10it/s]\n 64%|██████▍ | 16/25 [00:07<00:04, 2.08it/s]\n 68%|██████▊ | 17/25 [00:07<00:03, 2.07it/s]\n 72%|███████▏ | 18/25 [00:08<00:03, 2.05it/s]\n 76%|███████▌ | 19/25 [00:08<00:02, 2.10it/s]\n 80%|████████ | 20/25 [00:08<00:02, 2.13it/s]\n 84%|████████▍ | 21/25 [00:09<00:01, 2.05it/s]\n 88%|████████▊ | 22/25 [00:10<00:01, 2.02it/s]\n 92%|█████████▏| 23/25 [00:10<00:01, 1.96it/s]\n 96%|█████████▌| 24/25 [00:11<00:00, 1.93it/s]\n100%|██████████| 25/25 [00:11<00:00, 1.95it/s]\n100%|██████████| 25/25 [00:11<00:00, 2.16it/s]\ncomic a policeman, wearing a white shirt, at the police station.. 
graphic illustration, comic art, graphic novel art, vibrant, highly detailed\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 4, 96, 96])\ntorch.Size([2, 6])\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:05, 4.62it/s]\n 8%|▊ | 2/25 [00:00<00:04, 5.44it/s]\n 12%|█▏ | 3/25 [00:00<00:03, 5.74it/s]\n 16%|█▌ | 4/25 [00:00<00:03, 5.92it/s]\n 20%|██ | 5/25 [00:00<00:03, 6.04it/s]\n 24%|██▍ | 6/25 [00:01<00:03, 5.37it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 5.29it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 5.16it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.96it/s]\n 40%|████ | 10/25 [00:01<00:03, 4.90it/s]\n 44%|████▍ | 11/25 [00:02<00:02, 4.83it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 4.74it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.80it/s]\n 56%|█████▌ | 14/25 [00:02<00:02, 4.76it/s]\n 60%|██████ | 15/25 [00:02<00:02, 4.76it/s]\n 64%|██████▍ | 16/25 [00:03<00:01, 4.68it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.60it/s]\n 72%|███████▏ | 18/25 [00:03<00:01, 4.68it/s]\n 76%|███████▌ | 19/25 [00:03<00:01, 4.57it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.80it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.66it/s]\n 88%|████████▊ | 22/25 [00:04<00:00, 4.50it/s]\n 92%|█████████▏| 23/25 [00:04<00:00, 4.48it/s]\n 96%|█████████▌| 24/25 [00:04<00:00, 4.50it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.39it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.81it/s]\n0 (115, 712)\n0 (172, 712)\n0 (114, 712)\n0 (192, 718)", "metrics": { "predict_time": 23.627901, "total_time": 112.693022 }, "output": { "comic": "https://replicate.delivery/pbxt/JqvHQajVBV7xExDIfe3vTMDNqn5cnuQM4JqOYNKNjPSJRExSA/comic.webp", "individual_images": [ "https://replicate.delivery/pbxt/yKzeien5Ukke7JhzKpg4CrnQRVHkERgvgK6nPo2HAbhViIilA/out-0.webp", "https://replicate.delivery/pbxt/uimbUi9fsVSTYKrZgylxe4LvVI6ypWGeAohrgCC8np0WiIilA/out-1.webp", 
"https://replicate.delivery/pbxt/ePmX5ffgqbBr1pe0PyPbeqevhyERBgu24Ww8nnlUY9D0SERsE/out-2.webp", "https://replicate.delivery/pbxt/US3Uqhh5GR7fVK66XYOboI6uXL8DSyOPSjyIcULbRfALRExSA/out-3.webp" ] }, "started_at": "2024-05-04T19:53:24.474121Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/wq1tk3y125rgg0cf8mmrd6rfg4", "cancel": "https://api.replicate.com/v1/predictions/wq1tk3y125rgg0cf8mmrd6rfg4/cancel" }, "version": "7e763eb4ef9e4d0e9a1991cb7467c54e7688292c4cba4a57321810d12fef2852" }
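The `metrics` in this prediction can be cross-checked against its timestamps. For this particular run, `predict_time` matches `completed_at - started_at` and `total_time` matches `completed_at - created_at`; the remaining time passed before the run started (presumably queueing or model startup, which the JSON does not say). A short Python check using the values from the JSON above:

```python
from datetime import datetime


def parse_ts(value):
    # Swap the trailing "Z" for an explicit offset so that
    # datetime.fromisoformat also accepts it on Python versions before 3.11.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


created = parse_ts("2024-05-04T19:51:55.409000Z")
started = parse_ts("2024-05-04T19:53:24.474121Z")
completed = parse_ts("2024-05-04T19:53:48.102022Z")

wait_before_start = (started - created).total_seconds()
predict_time = (completed - started).total_seconds()
total_time = (completed - created).total_seconds()

print(f"wait before start: {wait_before_start:.3f} s")
print(f"predict_time:      {predict_time:.3f} s")
print(f"total_time:        {total_time:.3f} s")
```

Roughly 89 of the 113 seconds elapsed before the run started, so the reported `predict_time` of about 23.6 seconds is the better figure for estimating generation speed.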
Prediction
hvision-nku/storydiffusion:39c85f15
ID: h4bpbm4qzdrgj0cf8qvb9e75bc
Status: Succeeded
Source: Web
Hardware: A40 (Large)
Input
- num_ids: 3
- sd_model: Unstable
- num_steps: 25
- style_name: Japanese Anime
- comic_style: Classic Comic Style
- image_width: 768
- image_height: 768
- sa32_setting: 0.5
- sa64_setting: 0.5
- output_format: webp
- guidance_scale: 5
- output_quality: 80
- negative_prompt: bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs
- comic_description: at home, began to go to drawing sitting alone on a park bench. reading a book on a park bench. A squirrel approaches, peeking over the bench. look around in the park. # She looks around and enjoys the beauty of nature. leaf falls from the tree, landing on the sketchbook. picks up the leaf, examining its details closely. The brown squirrel appear. She is very happy to see the squirrel again The brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker
- style_strength_ratio: 20
- character_description: a woman img, wearing a white T-shirt, blue loose hair
{ "num_ids": 3, "sd_model": "Unstable", "num_steps": 25, "ref_image": "https://replicate.delivery/pbxt/KrYifi1qCInaAdHZqMlSY0BN5O3vegkLfU8fRCBKu3iqbKXL/1.jpeg", "style_name": "Japanese Anime", "comic_style": "Classic Comic Style", "image_width": 768, "image_height": 768, "sa32_setting": 0.5, "sa64_setting": 0.5, "output_format": "webp", "guidance_scale": 5, "output_quality": 80, "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs", "comic_description": "at home, began to go to drawing\nsitting alone on a park bench.\nreading a book on a park bench.\nA squirrel approaches, peeking over the bench. \nlook around in the park. # She looks around and enjoys the beauty of nature.\nleaf falls from the tree, landing on the sketchbook.\npicks up the leaf, examining its details closely.\nThe brown squirrel appear.\nShe is very happy to see the squirrel again\nThe brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker", "style_strength_ratio": 20, "character_description": "a woman img, wearing a white T-shirt, blue loose hair" }
Install Replicate’s Node.js client library:
    npm install replicate
Set the REPLICATE_API_TOKEN environment variable:
    export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import and set up the client:
    import Replicate from "replicate";

    const replicate = new Replicate({
      auth: process.env.REPLICATE_API_TOKEN,
    });
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
    const output = await replicate.run(
      "hvision-nku/storydiffusion:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
      {
        input: {
          num_ids: 3,
          sd_model: "Unstable",
          num_steps: 25,
          ref_image: "https://replicate.delivery/pbxt/KrYifi1qCInaAdHZqMlSY0BN5O3vegkLfU8fRCBKu3iqbKXL/1.jpeg",
          style_name: "Japanese Anime",
          comic_style: "Classic Comic Style",
          image_width: 768,
          image_height: 768,
          sa32_setting: 0.5,
          sa64_setting: 0.5,
          output_format: "webp",
          guidance_scale: 5,
          output_quality: 80,
          negative_prompt: "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
          comic_description: "at home, began to go to drawing\nsitting alone on a park bench.\nreading a book on a park bench.\nA squirrel approaches, peeking over the bench. \nlook around in the park. # She looks around and enjoys the beauty of nature.\nleaf falls from the tree, landing on the sketchbook.\npicks up the leaf, examining its details closely.\nThe brown squirrel appear.\nShe is very happy to see the squirrel again\nThe brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker",
          style_strength_ratio: 20,
          character_description: "a woman img, wearing a white T-shirt, blue loose hair"
        }
      }
    );
    console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
pip install replicate
Set the REPLICATE_API_TOKEN environment variable:
export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import the client:
import replicate
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
    "hvision-nku/storydiffusion:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
    input={
        "num_ids": 3,
        "sd_model": "Unstable",
        "num_steps": 25,
        "ref_image": "https://replicate.delivery/pbxt/KrYifi1qCInaAdHZqMlSY0BN5O3vegkLfU8fRCBKu3iqbKXL/1.jpeg",
        "style_name": "Japanese Anime",
        "comic_style": "Classic Comic Style",
        "image_width": 768,
        "image_height": 768,
        "sa32_setting": 0.5,
        "sa64_setting": 0.5,
        "output_format": "webp",
        "guidance_scale": 5,
        "output_quality": 80,
        "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
        "comic_description": "at home, began to go to drawing\nsitting alone on a park bench.\nreading a book on a park bench.\nA squirrel approaches, peeking over the bench. \nlook around in the park. # She looks around and enjoys the beauty of nature.\nleaf falls from the tree, landing on the sketchbook.\npicks up the leaf, examining its details closely.\nThe brown squirrel appear.\nShe is very happy to see the squirrel again\nThe brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker",
        "style_strength_ratio": 20,
        "character_description": "a woman img, wearing a white T-shirt, blue loose hair"
    }
)
print(output)
To learn more, take a look at the guide on getting started with Python.
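The comic_description input above packs one panel per line, and some lines carry a caption after a "#". The logs in the example output suggest that the text after "#" is removed before the prompt reaches the image model. The following is a minimal sketch of two helpers for composing and splitting such a string. The function names are hypothetical, and the "#" convention is inferred from the example input and logs, not taken from the model's schema or source code.

```python
def build_comic_description(panels):
    """Join (prompt, caption) pairs into one newline-separated string."""
    lines = []
    for prompt, caption in panels:
        # Assumed convention: an optional caption follows the prompt after " # ".
        lines.append(f"{prompt} # {caption}" if caption else prompt)
    return "\n".join(lines)


def split_panel(line):
    """Split one panel line into (prompt, caption); caption is None if absent."""
    prompt, sep, caption = line.partition("#")
    return prompt.strip(), (caption.strip() if sep else None)


panels = [
    ("sitting alone on a park bench.", None),
    ("look around in the park.", "She looks around and enjoys the beauty of nature."),
]
description = build_comic_description(panels)
print(description)
print([split_panel(line) for line in description.split("\n")])
```

The resulting string can be passed as the "comic_description" value in the input dictionary. Keeping panels as pairs makes it easier to edit a single caption without touching the rest of the story.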
Set the REPLICATE_API_TOKEN environment variable:
export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
    "input": {
      "num_ids": 3,
      "sd_model": "Unstable",
      "num_steps": 25,
      "ref_image": "https://replicate.delivery/pbxt/KrYifi1qCInaAdHZqMlSY0BN5O3vegkLfU8fRCBKu3iqbKXL/1.jpeg",
      "style_name": "Japanese Anime",
      "comic_style": "Classic Comic Style",
      "image_width": 768,
      "image_height": 768,
      "sa32_setting": 0.5,
      "sa64_setting": 0.5,
      "output_format": "webp",
      "guidance_scale": 5,
      "output_quality": 80,
      "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
      "comic_description": "at home, began to go to drawing\\nsitting alone on a park bench.\\nreading a book on a park bench.\\nA squirrel approaches, peeking over the bench. \\nlook around in the park. # She looks around and enjoys the beauty of nature.\\nleaf falls from the tree, landing on the sketchbook.\\npicks up the leaf, examining its details closely.\\nThe brown squirrel appear.\\nShe is very happy to see the squirrel again\\nThe brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker",
      "style_strength_ratio": 20,
      "character_description": "a woman img, wearing a white T-shirt, blue loose hair"
    }
  }' \
  https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
You can run this model locally using Cog. First, install Cog:
brew install cog
If you don’t have Homebrew, there are other installation options available.
Run this to download the model and run it in your local environment:
cog predict r8.im/cjwbw/storydiffusion@sha256:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2 \
  -i 'num_ids=3' \
  -i 'sd_model="Unstable"' \
  -i 'num_steps=25' \
  -i 'ref_image="https://replicate.delivery/pbxt/KrYifi1qCInaAdHZqMlSY0BN5O3vegkLfU8fRCBKu3iqbKXL/1.jpeg"' \
  -i 'style_name="Japanese Anime"' \
  -i 'comic_style="Classic Comic Style"' \
  -i 'image_width=768' \
  -i 'image_height=768' \
  -i 'sa32_setting=0.5' \
  -i 'sa64_setting=0.5' \
  -i 'output_format="webp"' \
  -i 'guidance_scale=5' \
  -i 'output_quality=80' \
  -i 'negative_prompt="bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs"' \
  -i $'comic_description="at home, began to go to drawing\\nsitting alone on a park bench.\\nreading a book on a park bench.\\nA squirrel approaches, peeking over the bench. \\nlook around in the park. # She looks around and enjoys the beauty of nature.\\nleaf falls from the tree, landing on the sketchbook.\\npicks up the leaf, examining its details closely.\\nThe brown squirrel appear.\\nShe is very happy to see the squirrel again\\nThe brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker"' \
  -i 'style_strength_ratio=20' \
  -i 'character_description="a woman img, wearing a white T-shirt, blue loose hair"'
To learn more, take a look at the Cog documentation.
Run this to download the model and run it in your local environment:
docker run -d -p 5000:5000 --gpus=all r8.im/cjwbw/storydiffusion@sha256:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2
curl -s -X POST \
  -H "Content-Type: application/json" \
  -d $'{
    "input": {
      "num_ids": 3,
      "sd_model": "Unstable",
      "num_steps": 25,
      "ref_image": "https://replicate.delivery/pbxt/KrYifi1qCInaAdHZqMlSY0BN5O3vegkLfU8fRCBKu3iqbKXL/1.jpeg",
      "style_name": "Japanese Anime",
      "comic_style": "Classic Comic Style",
      "image_width": 768,
      "image_height": 768,
      "sa32_setting": 0.5,
      "sa64_setting": 0.5,
      "output_format": "webp",
      "guidance_scale": 5,
      "output_quality": 80,
      "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
      "comic_description": "at home, began to go to drawing\\nsitting alone on a park bench.\\nreading a book on a park bench.\\nA squirrel approaches, peeking over the bench. \\nlook around in the park. # She looks around and enjoys the beauty of nature.\\nleaf falls from the tree, landing on the sketchbook.\\npicks up the leaf, examining its details closely.\\nThe brown squirrel appear.\\nShe is very happy to see the squirrel again\\nThe brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker",
      "style_strength_ratio": 20,
      "character_description": "a woman img, wearing a white T-shirt, blue loose hair"
    }
  }' \
  http://localhost:5000/predictions
To learn more, take a look at the Cog documentation.
Output
{ "completed_at": "2024-05-04T23:36:42.902541Z", "created_at": "2024-05-04T23:35:39.771000Z", "data_removed": false, "error": null, "id": "h4bpbm4qzdrgj0cf8qvb9e75bc", "input": { "num_ids": 3, "sd_model": "Unstable", "num_steps": 25, "ref_image": "https://replicate.delivery/pbxt/KrYifi1qCInaAdHZqMlSY0BN5O3vegkLfU8fRCBKu3iqbKXL/1.jpeg", "style_name": "Japanese Anime", "comic_style": "Classic Comic Style", "image_width": 768, "image_height": 768, "sa32_setting": 0.5, "sa64_setting": 0.5, "output_format": "webp", "guidance_scale": 5, "output_quality": 80, "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs", "comic_description": "at home, began to go to drawing\nsitting alone on a park bench.\nreading a book on a park bench.\nA squirrel approaches, peeking over the bench. \nlook around in the park. # She looks around and enjoys the beauty of nature.\nleaf falls from the tree, landing on the sketchbook.\npicks up the leaf, examining its details closely.\nThe brown squirrel appear.\nShe is very happy to see the squirrel again\nThe brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker", "style_strength_ratio": 20, "character_description": "a woman img, wearing a white T-shirt, blue loose hair" }, "logs": "['at home, began to go to drawing', 'sitting alone on a park bench.', 'reading a book on a park bench.', 'A squirrel approaches, peeking over the bench. ', 'look around in the park. 
# She looks around and enjoys the beauty of nature.', 'leaf falls from the tree, landing on the sketchbook.', 'picks up the leaf, examining its details closely.', 'The brown squirrel appear.', 'She is very happy to see the squirrel again', 'The brown squirrel takes the cracker and scampers up a tree. # She gives the squirrel cracker']\n['a woman img, wearing a white T-shirt, blue loose hair,at home, began to go to drawing', 'a woman img, wearing a white T-shirt, blue loose hair,sitting alone on a park bench.', 'a woman img, wearing a white T-shirt, blue loose hair,reading a book on a park bench.', 'a woman img, wearing a white T-shirt, blue loose hair,A squirrel approaches, peeking over the bench. ', 'a woman img, wearing a white T-shirt, blue loose hair,look around in the park. # She looks around and enjoys the beauty of nature.', 'a woman img, wearing a white T-shirt, blue loose hair,leaf falls from the tree, landing on the sketchbook.', 'a woman img, wearing a white T-shirt, blue loose hair,picks up the leaf, examining its details closely.', 'a woman img, wearing a white T-shirt, blue loose hair,The brown squirrel appear.', 'a woman img, wearing a white T-shirt, blue loose hair,She is very happy to see the squirrel again', 'a woman img, wearing a white T-shirt, blue loose hair,The brown squirrel takes the cracker and scampers up a tree. 
# She gives the squirrel cracker']\n['a woman img, wearing a white T-shirt, blue loose hair,at home, began to go to drawing', 'a woman img, wearing a white T-shirt, blue loose hair,sitting alone on a park bench.', 'a woman img, wearing a white T-shirt, blue loose hair,reading a book on a park bench.', 'a woman img, wearing a white T-shirt, blue loose hair,A squirrel approaches, peeking over the bench.', 'a woman img, wearing a white T-shirt, blue loose hair,look around in the park.', 'a woman img, wearing a white T-shirt, blue loose hair,leaf falls from the tree, landing on the sketchbook.', 'a woman img, wearing a white T-shirt, blue loose hair,picks up the leaf, examining its details closely.', 'a woman img, wearing a white T-shirt, blue loose hair,The brown squirrel appear.', 'a woman img, wearing a white T-shirt, blue loose hair,She is very happy to see the squirrel again', 'a woman img, wearing a white T-shirt, blue loose hair,The brown squirrel takes the cracker and scampers up a tree.']\nUsing seed: 57629\nSuccessfully load paired self-attention\nNumber of the processor : 36\nanime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, at home, began to go to drawing. created by japanese anime studio. highly emotional. best quality, high resolution\nanime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, sitting alone on a park bench.. created by japanese anime studio. highly emotional. best quality, high resolution\nanime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, reading a book on a park bench.. created by japanese anime studio. highly emotional. 
best quality, high resolution\ntorch.Size([3, 77, 2048])\ntorch.Size([3, 77, 2048])\ntorch.Size([3, 77, 2048])\ntorch.Size([3, 1280])\ntorch.Size([3, 1280])\ntorch.Size([3, 1280])\ntorch.Size([3, 4, 96, 96])\ntorch.Size([6, 6])\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:09, 2.55it/s]\n 8%|▊ | 2/25 [00:00<00:09, 2.55it/s]\n 12%|█▏ | 3/25 [00:01<00:08, 2.56it/s]\n 16%|█▌ | 4/25 [00:01<00:08, 2.57it/s]\n 20%|██ | 5/25 [00:01<00:07, 2.56it/s]\n 24%|██▍ | 6/25 [00:02<00:08, 2.25it/s]\n 28%|██▊ | 7/25 [00:03<00:08, 2.07it/s]\n 32%|███▏ | 8/25 [00:03<00:08, 1.99it/s]\n 36%|███▌ | 9/25 [00:04<00:07, 2.01it/s]\n 40%|████ | 10/25 [00:04<00:07, 1.90it/s]\n 44%|████▍ | 11/25 [00:05<00:07, 1.86it/s]\n 48%|████▊ | 12/25 [00:05<00:06, 1.86it/s]\n 52%|█████▏ | 13/25 [00:06<00:06, 1.83it/s]\n 56%|█████▌ | 14/25 [00:06<00:06, 1.82it/s]\n 60%|██████ | 15/25 [00:07<00:05, 1.84it/s]\n 64%|██████▍ | 16/25 [00:08<00:04, 1.82it/s]\n 68%|██████▊ | 17/25 [00:08<00:04, 1.78it/s]\n 72%|███████▏ | 18/25 [00:09<00:03, 1.81it/s]\n 76%|███████▌ | 19/25 [00:09<00:03, 1.80it/s]\n 80%|████████ | 20/25 [00:10<00:02, 1.84it/s]\n 84%|████████▍ | 21/25 [00:10<00:02, 1.78it/s]\n 88%|████████▊ | 22/25 [00:11<00:01, 1.76it/s]\n 92%|█████████▏| 23/25 [00:11<00:01, 1.73it/s]\n 96%|█████████▌| 24/25 [00:12<00:00, 1.73it/s]\n100%|██████████| 25/25 [00:13<00:00, 1.75it/s]\n100%|██████████| 25/25 [00:13<00:00, 1.90it/s]\nanime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, a squirrel approaches, peeking over the bench.. created by japanese anime studio. highly emotional. 
best quality, high resolution\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 4, 96, 96])\ntorch.Size([2, 6])\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:04, 5.17it/s]\n 8%|▊ | 2/25 [00:00<00:04, 5.24it/s]\n 12%|█▏ | 3/25 [00:00<00:04, 5.27it/s]\n 16%|█▌ | 4/25 [00:00<00:04, 5.25it/s]\n 20%|██ | 5/25 [00:00<00:03, 5.25it/s]\n 24%|██▍ | 6/25 [00:01<00:03, 4.89it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 4.71it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 4.45it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.26it/s]\n 40%|████ | 10/25 [00:02<00:03, 4.17it/s]\n 44%|████▍ | 11/25 [00:02<00:03, 4.20it/s]\n 48%|████▊ | 12/25 [00:02<00:03, 4.16it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.21it/s]\n 56%|█████▌ | 14/25 [00:03<00:02, 4.14it/s]\n 60%|██████ | 15/25 [00:03<00:02, 4.03it/s]\n 64%|██████▍ | 16/25 [00:03<00:02, 4.04it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.05it/s]\n 72%|███████▏ | 18/25 [00:04<00:01, 4.20it/s]\n 76%|███████▌ | 19/25 [00:04<00:01, 4.23it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.26it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.14it/s]\n 88%|████████▊ | 22/25 [00:05<00:00, 4.03it/s]\n 92%|█████████▏| 23/25 [00:05<00:00, 3.93it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 3.92it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.86it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.25it/s]\nanime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, look around in the park.. created by japanese anime studio. highly emotional. 
best quality, high resolution\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 4, 96, 96])\ntorch.Size([2, 6])\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:04, 5.11it/s]\n 8%|▊ | 2/25 [00:00<00:04, 5.18it/s]\n 12%|█▏ | 3/25 [00:00<00:04, 5.20it/s]\n 16%|█▌ | 4/25 [00:00<00:04, 5.21it/s]\n 20%|██ | 5/25 [00:00<00:03, 5.21it/s]\n 24%|██▍ | 6/25 [00:01<00:03, 4.85it/s]\n 28%|██▊ | 7/25 [00:01<00:04, 4.48it/s]\n 32%|███▏ | 8/25 [00:01<00:04, 4.23it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.17it/s]\n 40%|████ | 10/25 [00:02<00:03, 4.14it/s]\n 44%|████▍ | 11/25 [00:02<00:03, 4.17it/s]\n 48%|████▊ | 12/25 [00:02<00:03, 4.13it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.24it/s]\n 56%|█████▌ | 14/25 [00:03<00:02, 4.31it/s]\n 60%|██████ | 15/25 [00:03<00:02, 4.45it/s]\n 64%|██████▍ | 16/25 [00:03<00:02, 4.26it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.34it/s]\n 72%|███████▏ | 18/25 [00:04<00:01, 4.33it/s]\n 76%|███████▌ | 19/25 [00:04<00:01, 4.38it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.29it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.13it/s]\n 88%|████████▊ | 22/25 [00:05<00:00, 4.10it/s]\n 92%|█████████▏| 23/25 [00:05<00:00, 4.05it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 4.03it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.96it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.31it/s]\nanime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, leaf falls from the tree, landing on the sketchbook.. created by japanese anime studio. highly emotional. 
best quality, high resolution\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 4, 96, 96])\ntorch.Size([2, 6])\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:04, 5.17it/s]\n 8%|▊ | 2/25 [00:00<00:04, 5.17it/s]\n 12%|█▏ | 3/25 [00:00<00:04, 5.20it/s]\n 16%|█▌ | 4/25 [00:00<00:04, 5.20it/s]\n 20%|██ | 5/25 [00:00<00:03, 5.23it/s]\n 24%|██▍ | 6/25 [00:01<00:03, 4.77it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 4.57it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 4.44it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.42it/s]\n 40%|████ | 10/25 [00:02<00:03, 4.28it/s]\n 44%|████▍ | 11/25 [00:02<00:03, 4.26it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 4.39it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.30it/s]\n 56%|█████▌ | 14/25 [00:03<00:02, 4.38it/s]\n 60%|██████ | 15/25 [00:03<00:02, 4.40it/s]\n 64%|██████▍ | 16/25 [00:03<00:02, 4.24it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.26it/s]\n 72%|███████▏ | 18/25 [00:04<00:01, 4.22it/s]\n 76%|███████▌ | 19/25 [00:04<00:01, 4.21it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.22it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.10it/s]\n 88%|████████▊ | 22/25 [00:05<00:00, 4.01it/s]\n 92%|█████████▏| 23/25 [00:05<00:00, 3.99it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 3.94it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.88it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.31it/s]\nanime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, picks up the leaf, examining its details closely.. created by japanese anime studio. highly emotional. 
best quality, high resolution\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 4, 96, 96])\ntorch.Size([2, 6])\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:04, 5.19it/s]\n 8%|▊ | 2/25 [00:00<00:04, 5.19it/s]\n 12%|█▏ | 3/25 [00:00<00:04, 5.20it/s]\n 16%|█▌ | 4/25 [00:00<00:04, 5.22it/s]\n 20%|██ | 5/25 [00:00<00:03, 5.21it/s]\n 24%|██▍ | 6/25 [00:01<00:04, 4.68it/s]\n 28%|██▊ | 7/25 [00:01<00:04, 4.44it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 4.34it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.31it/s]\n 40%|████ | 10/25 [00:02<00:03, 4.37it/s]\n 44%|████▍ | 11/25 [00:02<00:03, 4.38it/s]\n 48%|████▊ | 12/25 [00:02<00:02, 4.40it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.27it/s]\n 56%|█████▌ | 14/25 [00:03<00:02, 4.27it/s]\n 60%|██████ | 15/25 [00:03<00:02, 4.22it/s]\n 64%|██████▍ | 16/25 [00:03<00:02, 4.20it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.20it/s]\n 72%|███████▏ | 18/25 [00:04<00:01, 4.22it/s]\n 76%|███████▌ | 19/25 [00:04<00:01, 4.22it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.22it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.12it/s]\n 88%|████████▊ | 22/25 [00:05<00:00, 4.00it/s]\n 92%|█████████▏| 23/25 [00:05<00:00, 3.93it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 3.88it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.82it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.27it/s]\nanime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, the brown squirrel appear.. created by japanese anime studio. highly emotional. 
best quality, high resolution\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 4, 96, 96])\ntorch.Size([2, 6])\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:04, 5.23it/s]\n 8%|▊ | 2/25 [00:00<00:04, 5.24it/s]\n 12%|█▏ | 3/25 [00:00<00:04, 5.23it/s]\n 16%|█▌ | 4/25 [00:00<00:04, 5.23it/s]\n 20%|██ | 5/25 [00:00<00:03, 5.24it/s]\n 24%|██▍ | 6/25 [00:01<00:04, 4.66it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 4.50it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 4.36it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.24it/s]\n 40%|████ | 10/25 [00:02<00:03, 4.22it/s]\n 44%|████▍ | 11/25 [00:02<00:03, 4.25it/s]\n 48%|████▊ | 12/25 [00:02<00:03, 4.20it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.18it/s]\n 56%|█████▌ | 14/25 [00:03<00:02, 4.14it/s]\n 60%|██████ | 15/25 [00:03<00:02, 4.12it/s]\n 64%|██████▍ | 16/25 [00:03<00:02, 4.13it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.23it/s]\n 72%|███████▏ | 18/25 [00:04<00:01, 4.26it/s]\n 76%|███████▌ | 19/25 [00:04<00:01, 4.37it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.18it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.01it/s]\n 88%|████████▊ | 22/25 [00:05<00:00, 3.89it/s]\n 92%|█████████▏| 23/25 [00:05<00:00, 3.89it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 3.81it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.89it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.24it/s]\nanime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, she is very happy to see the squirrel again. created by japanese anime studio. highly emotional. 
best quality, high resolution\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 4, 96, 96])\ntorch.Size([2, 6])\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:04, 5.21it/s]\n 8%|▊ | 2/25 [00:00<00:04, 5.21it/s]\n 12%|█▏ | 3/25 [00:00<00:04, 5.21it/s]\n 16%|█▌ | 4/25 [00:00<00:04, 5.13it/s]\n 20%|██ | 5/25 [00:00<00:03, 5.15it/s]\n 24%|██▍ | 6/25 [00:01<00:04, 4.60it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 4.54it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 4.44it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.37it/s]\n 40%|████ | 10/25 [00:02<00:03, 4.29it/s]\n 44%|████▍ | 11/25 [00:02<00:03, 4.19it/s]\n 48%|████▊ | 12/25 [00:02<00:03, 4.24it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.20it/s]\n 56%|█████▌ | 14/25 [00:03<00:02, 4.12it/s]\n 60%|██████ | 15/25 [00:03<00:02, 4.14it/s]\n 64%|██████▍ | 16/25 [00:03<00:02, 4.21it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.18it/s]\n 72%|███████▏ | 18/25 [00:04<00:01, 4.29it/s]\n 76%|███████▌ | 19/25 [00:04<00:01, 4.44it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.28it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.07it/s]\n 88%|████████▊ | 22/25 [00:05<00:00, 3.95it/s]\n 92%|█████████▏| 23/25 [00:05<00:00, 3.91it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 3.82it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.78it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.24it/s]\nanime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, the brown squirrel takes the cracker and scampers up a tree.. created by japanese anime studio. highly emotional. 
best quality, high resolution\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 77, 2048])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 1280])\ntorch.Size([1, 4, 96, 96])\ntorch.Size([2, 6])\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:04, 5.18it/s]\n 8%|▊ | 2/25 [00:00<00:04, 5.21it/s]\n 12%|█▏ | 3/25 [00:00<00:04, 5.22it/s]\n 16%|█▌ | 4/25 [00:00<00:04, 5.22it/s]\n 20%|██ | 5/25 [00:00<00:03, 5.22it/s]\n 24%|██▍ | 6/25 [00:01<00:03, 4.95it/s]\n 28%|██▊ | 7/25 [00:01<00:03, 4.62it/s]\n 32%|███▏ | 8/25 [00:01<00:03, 4.48it/s]\n 36%|███▌ | 9/25 [00:01<00:03, 4.30it/s]\n 40%|████ | 10/25 [00:02<00:03, 4.14it/s]\n 44%|████▍ | 11/25 [00:02<00:03, 4.05it/s]\n 48%|████▊ | 12/25 [00:02<00:03, 4.05it/s]\n 52%|█████▏ | 13/25 [00:02<00:02, 4.23it/s]\n 56%|█████▌ | 14/25 [00:03<00:02, 4.38it/s]\n 60%|██████ | 15/25 [00:03<00:02, 4.27it/s]\n 64%|██████▍ | 16/25 [00:03<00:02, 4.21it/s]\n 68%|██████▊ | 17/25 [00:03<00:01, 4.22it/s]\n 72%|███████▏ | 18/25 [00:04<00:01, 4.21it/s]\n 76%|███████▌ | 19/25 [00:04<00:01, 4.14it/s]\n 80%|████████ | 20/25 [00:04<00:01, 4.15it/s]\n 84%|████████▍ | 21/25 [00:04<00:00, 4.08it/s]\n 88%|████████▊ | 22/25 [00:05<00:00, 4.01it/s]\n 92%|█████████▏| 23/25 [00:05<00:00, 3.94it/s]\n 96%|█████████▌| 24/25 [00:05<00:00, 3.87it/s]\n100%|██████████| 25/25 [00:05<00:00, 3.89it/s]\n100%|██████████| 25/25 [00:05<00:00, 4.26it/s]\n6 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C39CD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A8C864D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C78A90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C4A590>]]\n0 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C39CD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A8C864D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C78A90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C4A590>], [<PIL.Image.Image image mode=RGB size=788x788 at 
0x7F85A7C84E50>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C1A050>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C53B10>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C51E10>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C53D90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A5F8A310>]]\n[[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C39CD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A8C864D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C78A90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C4A590>], [<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C84E50>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C1A050>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C53B10>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C51E10>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C53D90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A5F8A310>]]\n0 (97, 712)\n1 (18, 648)\n1 (30, 641)\n0 (136, 718)", "metrics": { "predict_time": 63.081754, "total_time": 63.131541 }, "output": { "comic": "https://replicate.delivery/pbxt/VdUGelocnvS0dKfpiz73sKqNo3xXpmgbchUGtvtYzwvHiHxSA/comic.webp", "individual_images": [ "https://replicate.delivery/pbxt/JmDzXdexnXWdJyauochgMUieGYeKBRTErTqpFjNO57FREPilA/out-0.webp", "https://replicate.delivery/pbxt/0tcpPbWpdZpjO9ANerF9fe3f9afnwQc24Ezr1zJKHLgBR8IWC/out-1.webp", "https://replicate.delivery/pbxt/413Bih4QLLKcOlmamsH9elwiINvAFtoOmeUyfdO3ydKQEPilA/out-2.webp", "https://replicate.delivery/pbxt/xAzA4Bx3edWfa01DjyrKmqoueFwLa2qTmZjDZWtFfrWnIeIWC/out-3.webp", "https://replicate.delivery/pbxt/RmYPipvbHmLLMVuDFKDVGBgrqvRoOKY9zRu1NK3lHfoExjYJA/out-4.webp", "https://replicate.delivery/pbxt/Tar169T7BHKKApcjZNyB0jshlbceK3WqNsIHjfCgEQtJiHxSA/out-5.webp", "https://replicate.delivery/pbxt/k1MazWljBXIeZiasegcYudM2xwChbC22XTdlu7uIHs2JiHxSA/out-6.webp", 
"https://replicate.delivery/pbxt/DdIqJMmAniZbPRLpzH5fdt3IZc13eypkwMOb3tfmuQ0UEPilA/out-7.webp", "https://replicate.delivery/pbxt/Ik9lWX2fwWw3YyeNhNkw0CHNH1FSWqMi9EG6F7liJ3cKiHxSA/out-8.webp", "https://replicate.delivery/pbxt/oRUBMe44Ed38diDAJfmoxVv6tXNeh7fF4rifylgsi7yRR8IWC/out-9.webp" ] }, "started_at": "2024-05-04T23:35:39.820787Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/h4bpbm4qzdrgj0cf8qvb9e75bc", "cancel": "https://api.replicate.com/v1/predictions/h4bpbm4qzdrgj0cf8qvb9e75bc/cancel" }, "version": "39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2" }
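The prediction object above carries timestamps, a status, and an "output" object with a "comic" URL and a list of "individual_images". As a rough sketch of how such a response could be summarized, the snippet below uses only the Python standard library. The helper names parse_ts and summarize are hypothetical, and the sample dictionary is a trimmed copy of the response above that keeps only the first two image URLs.

```python
from datetime import datetime


def parse_ts(ts):
    """Parse an ISO 8601 timestamp ending in 'Z' into an aware datetime."""
    # Replace the trailing 'Z' so that older Python versions accept the string.
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))


def summarize(prediction):
    """Return status, wall-clock duration, and output counts for a prediction."""
    created = parse_ts(prediction["created_at"])
    completed = parse_ts(prediction["completed_at"])
    output = prediction.get("output") or {}
    return {
        "status": prediction["status"],
        "elapsed_seconds": (completed - created).total_seconds(),
        "comic": output.get("comic"),
        "panel_count": len(output.get("individual_images", [])),
    }


# Trimmed copy of the response shown above (only two of the ten image URLs kept).
sample = {
    "created_at": "2024-05-04T23:35:39.771000Z",
    "completed_at": "2024-05-04T23:36:42.902541Z",
    "status": "succeeded",
    "output": {
        "comic": "https://replicate.delivery/pbxt/VdUGelocnvS0dKfpiz73sKqNo3xXpmgbchUGtvtYzwvHiHxSA/comic.webp",
        "individual_images": [
            "https://replicate.delivery/pbxt/JmDzXdexnXWdJyauochgMUieGYeKBRTErTqpFjNO57FREPilA/out-0.webp",
            "https://replicate.delivery/pbxt/0tcpPbWpdZpjO9ANerF9fe3f9afnwQc24Ezr1zJKHLgBR8IWC/out-1.webp",
        ],
    },
}

summary = summarize(sample)
print(summary)
```

For this response, the difference between "completed_at" and "created_at" matches the "total_time" value reported under "metrics".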
best quality, high resolution torch.Size([1, 77, 2048]) torch.Size([1, 77, 2048]) torch.Size([1, 77, 2048]) torch.Size([1, 1280]) torch.Size([1, 1280]) torch.Size([1, 1280]) torch.Size([1, 4, 96, 96]) torch.Size([2, 6]) 0%| | 0/25 [00:00<?, ?it/s] 4%|▍ | 1/25 [00:00<00:04, 5.17it/s] 8%|▊ | 2/25 [00:00<00:04, 5.17it/s] 12%|█▏ | 3/25 [00:00<00:04, 5.20it/s] 16%|█▌ | 4/25 [00:00<00:04, 5.20it/s] 20%|██ | 5/25 [00:00<00:03, 5.23it/s] 24%|██▍ | 6/25 [00:01<00:03, 4.77it/s] 28%|██▊ | 7/25 [00:01<00:03, 4.57it/s] 32%|███▏ | 8/25 [00:01<00:03, 4.44it/s] 36%|███▌ | 9/25 [00:01<00:03, 4.42it/s] 40%|████ | 10/25 [00:02<00:03, 4.28it/s] 44%|████▍ | 11/25 [00:02<00:03, 4.26it/s] 48%|████▊ | 12/25 [00:02<00:02, 4.39it/s] 52%|█████▏ | 13/25 [00:02<00:02, 4.30it/s] 56%|█████▌ | 14/25 [00:03<00:02, 4.38it/s] 60%|██████ | 15/25 [00:03<00:02, 4.40it/s] 64%|██████▍ | 16/25 [00:03<00:02, 4.24it/s] 68%|██████▊ | 17/25 [00:03<00:01, 4.26it/s] 72%|███████▏ | 18/25 [00:04<00:01, 4.22it/s] 76%|███████▌ | 19/25 [00:04<00:01, 4.21it/s] 80%|████████ | 20/25 [00:04<00:01, 4.22it/s] 84%|████████▍ | 21/25 [00:04<00:00, 4.10it/s] 88%|████████▊ | 22/25 [00:05<00:00, 4.01it/s] 92%|█████████▏| 23/25 [00:05<00:00, 3.99it/s] 96%|█████████▌| 24/25 [00:05<00:00, 3.94it/s] 100%|██████████| 25/25 [00:05<00:00, 3.88it/s] 100%|██████████| 25/25 [00:05<00:00, 4.31it/s] anime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, picks up the leaf, examining its details closely.. created by japanese anime studio. highly emotional. 
best quality, high resolution torch.Size([1, 77, 2048]) torch.Size([1, 77, 2048]) torch.Size([1, 77, 2048]) torch.Size([1, 1280]) torch.Size([1, 1280]) torch.Size([1, 1280]) torch.Size([1, 4, 96, 96]) torch.Size([2, 6]) 0%| | 0/25 [00:00<?, ?it/s] 4%|▍ | 1/25 [00:00<00:04, 5.19it/s] 8%|▊ | 2/25 [00:00<00:04, 5.19it/s] 12%|█▏ | 3/25 [00:00<00:04, 5.20it/s] 16%|█▌ | 4/25 [00:00<00:04, 5.22it/s] 20%|██ | 5/25 [00:00<00:03, 5.21it/s] 24%|██▍ | 6/25 [00:01<00:04, 4.68it/s] 28%|██▊ | 7/25 [00:01<00:04, 4.44it/s] 32%|███▏ | 8/25 [00:01<00:03, 4.34it/s] 36%|███▌ | 9/25 [00:01<00:03, 4.31it/s] 40%|████ | 10/25 [00:02<00:03, 4.37it/s] 44%|████▍ | 11/25 [00:02<00:03, 4.38it/s] 48%|████▊ | 12/25 [00:02<00:02, 4.40it/s] 52%|█████▏ | 13/25 [00:02<00:02, 4.27it/s] 56%|█████▌ | 14/25 [00:03<00:02, 4.27it/s] 60%|██████ | 15/25 [00:03<00:02, 4.22it/s] 64%|██████▍ | 16/25 [00:03<00:02, 4.20it/s] 68%|██████▊ | 17/25 [00:03<00:01, 4.20it/s] 72%|███████▏ | 18/25 [00:04<00:01, 4.22it/s] 76%|███████▌ | 19/25 [00:04<00:01, 4.22it/s] 80%|████████ | 20/25 [00:04<00:01, 4.22it/s] 84%|████████▍ | 21/25 [00:04<00:00, 4.12it/s] 88%|████████▊ | 22/25 [00:05<00:00, 4.00it/s] 92%|█████████▏| 23/25 [00:05<00:00, 3.93it/s] 96%|█████████▌| 24/25 [00:05<00:00, 3.88it/s] 100%|██████████| 25/25 [00:05<00:00, 3.82it/s] 100%|██████████| 25/25 [00:05<00:00, 4.27it/s] anime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, the brown squirrel appear.. created by japanese anime studio. highly emotional. 
best quality, high resolution torch.Size([1, 77, 2048]) torch.Size([1, 77, 2048]) torch.Size([1, 77, 2048]) torch.Size([1, 1280]) torch.Size([1, 1280]) torch.Size([1, 1280]) torch.Size([1, 4, 96, 96]) torch.Size([2, 6]) 0%| | 0/25 [00:00<?, ?it/s] 4%|▍ | 1/25 [00:00<00:04, 5.23it/s] 8%|▊ | 2/25 [00:00<00:04, 5.24it/s] 12%|█▏ | 3/25 [00:00<00:04, 5.23it/s] 16%|█▌ | 4/25 [00:00<00:04, 5.23it/s] 20%|██ | 5/25 [00:00<00:03, 5.24it/s] 24%|██▍ | 6/25 [00:01<00:04, 4.66it/s] 28%|██▊ | 7/25 [00:01<00:03, 4.50it/s] 32%|███▏ | 8/25 [00:01<00:03, 4.36it/s] 36%|███▌ | 9/25 [00:01<00:03, 4.24it/s] 40%|████ | 10/25 [00:02<00:03, 4.22it/s] 44%|████▍ | 11/25 [00:02<00:03, 4.25it/s] 48%|████▊ | 12/25 [00:02<00:03, 4.20it/s] 52%|█████▏ | 13/25 [00:02<00:02, 4.18it/s] 56%|█████▌ | 14/25 [00:03<00:02, 4.14it/s] 60%|██████ | 15/25 [00:03<00:02, 4.12it/s] 64%|██████▍ | 16/25 [00:03<00:02, 4.13it/s] 68%|██████▊ | 17/25 [00:03<00:01, 4.23it/s] 72%|███████▏ | 18/25 [00:04<00:01, 4.26it/s] 76%|███████▌ | 19/25 [00:04<00:01, 4.37it/s] 80%|████████ | 20/25 [00:04<00:01, 4.18it/s] 84%|████████▍ | 21/25 [00:04<00:00, 4.01it/s] 88%|████████▊ | 22/25 [00:05<00:00, 3.89it/s] 92%|█████████▏| 23/25 [00:05<00:00, 3.89it/s] 96%|█████████▌| 24/25 [00:05<00:00, 3.81it/s] 100%|██████████| 25/25 [00:05<00:00, 3.89it/s] 100%|██████████| 25/25 [00:05<00:00, 4.24it/s] anime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, she is very happy to see the squirrel again. created by japanese anime studio. highly emotional. 
best quality, high resolution torch.Size([1, 77, 2048]) torch.Size([1, 77, 2048]) torch.Size([1, 77, 2048]) torch.Size([1, 1280]) torch.Size([1, 1280]) torch.Size([1, 1280]) torch.Size([1, 4, 96, 96]) torch.Size([2, 6]) 0%| | 0/25 [00:00<?, ?it/s] 4%|▍ | 1/25 [00:00<00:04, 5.21it/s] 8%|▊ | 2/25 [00:00<00:04, 5.21it/s] 12%|█▏ | 3/25 [00:00<00:04, 5.21it/s] 16%|█▌ | 4/25 [00:00<00:04, 5.13it/s] 20%|██ | 5/25 [00:00<00:03, 5.15it/s] 24%|██▍ | 6/25 [00:01<00:04, 4.60it/s] 28%|██▊ | 7/25 [00:01<00:03, 4.54it/s] 32%|███▏ | 8/25 [00:01<00:03, 4.44it/s] 36%|███▌ | 9/25 [00:01<00:03, 4.37it/s] 40%|████ | 10/25 [00:02<00:03, 4.29it/s] 44%|████▍ | 11/25 [00:02<00:03, 4.19it/s] 48%|████▊ | 12/25 [00:02<00:03, 4.24it/s] 52%|█████▏ | 13/25 [00:02<00:02, 4.20it/s] 56%|█████▌ | 14/25 [00:03<00:02, 4.12it/s] 60%|██████ | 15/25 [00:03<00:02, 4.14it/s] 64%|██████▍ | 16/25 [00:03<00:02, 4.21it/s] 68%|██████▊ | 17/25 [00:03<00:01, 4.18it/s] 72%|███████▏ | 18/25 [00:04<00:01, 4.29it/s] 76%|███████▌ | 19/25 [00:04<00:01, 4.44it/s] 80%|████████ | 20/25 [00:04<00:01, 4.28it/s] 84%|████████▍ | 21/25 [00:04<00:00, 4.07it/s] 88%|████████▊ | 22/25 [00:05<00:00, 3.95it/s] 92%|█████████▏| 23/25 [00:05<00:00, 3.91it/s] 96%|█████████▌| 24/25 [00:05<00:00, 3.82it/s] 100%|██████████| 25/25 [00:05<00:00, 3.78it/s] 100%|██████████| 25/25 [00:05<00:00, 4.24it/s] anime artwork illustrating a woman, wearing a white t - shirt, blue loose hair, the brown squirrel takes the cracker and scampers up a tree.. created by japanese anime studio. highly emotional. 
best quality, high resolution torch.Size([1, 77, 2048]) torch.Size([1, 77, 2048]) torch.Size([1, 77, 2048]) torch.Size([1, 1280]) torch.Size([1, 1280]) torch.Size([1, 1280]) torch.Size([1, 4, 96, 96]) torch.Size([2, 6]) 0%| | 0/25 [00:00<?, ?it/s] 4%|▍ | 1/25 [00:00<00:04, 5.18it/s] 8%|▊ | 2/25 [00:00<00:04, 5.21it/s] 12%|█▏ | 3/25 [00:00<00:04, 5.22it/s] 16%|█▌ | 4/25 [00:00<00:04, 5.22it/s] 20%|██ | 5/25 [00:00<00:03, 5.22it/s] 24%|██▍ | 6/25 [00:01<00:03, 4.95it/s] 28%|██▊ | 7/25 [00:01<00:03, 4.62it/s] 32%|███▏ | 8/25 [00:01<00:03, 4.48it/s] 36%|███▌ | 9/25 [00:01<00:03, 4.30it/s] 40%|████ | 10/25 [00:02<00:03, 4.14it/s] 44%|████▍ | 11/25 [00:02<00:03, 4.05it/s] 48%|████▊ | 12/25 [00:02<00:03, 4.05it/s] 52%|█████▏ | 13/25 [00:02<00:02, 4.23it/s] 56%|█████▌ | 14/25 [00:03<00:02, 4.38it/s] 60%|██████ | 15/25 [00:03<00:02, 4.27it/s] 64%|██████▍ | 16/25 [00:03<00:02, 4.21it/s] 68%|██████▊ | 17/25 [00:03<00:01, 4.22it/s] 72%|███████▏ | 18/25 [00:04<00:01, 4.21it/s] 76%|███████▌ | 19/25 [00:04<00:01, 4.14it/s] 80%|████████ | 20/25 [00:04<00:01, 4.15it/s] 84%|████████▍ | 21/25 [00:04<00:00, 4.08it/s] 88%|████████▊ | 22/25 [00:05<00:00, 4.01it/s] 92%|█████████▏| 23/25 [00:05<00:00, 3.94it/s] 96%|█████████▌| 24/25 [00:05<00:00, 3.87it/s] 100%|██████████| 25/25 [00:05<00:00, 3.89it/s] 100%|██████████| 25/25 [00:05<00:00, 4.26it/s] 6 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C39CD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A8C864D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C78A90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C4A590>]] 0 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C39CD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A8C864D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C78A90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C4A590>], [<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C84E50>, <PIL.Image.Image image mode=RGB size=788x788 at 
0x7F85A7C1A050>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C53B10>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C51E10>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C53D90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A5F8A310>]] [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C39CD0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A8C864D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C78A90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C4A590>], [<PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C84E50>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C1A050>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C53B10>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C51E10>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A7C53D90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F85A5F8A310>]] 0 (97, 712) 1 (18, 648) 1 (30, 641) 0 (136, 718)
Prediction
hvision-nku/storydiffusion:39c85f15
ID: x0mfgjfatdrgm0cf8q6v5rbb3m
Status: Succeeded
Source: Web
Hardware: A100 (40GB)
Input
- num_ids: 3
- sd_model: Unstable
- num_steps: 25
- style_name: Disney Charactor
- comic_style: Classic Comic Style
- image_width: 768
- image_height: 768
- sa32_setting: 0.7
- sa64_setting: 0.7
- output_format: webp
- guidance_scale: 5
- output_quality: 80
- negative_prompt: bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs
- comic_description: at home, read new paper #at home, The newspaper says there is a treasure house in the forest. on the road, near the forest [NC] The car on the road, near the forest #He drives to the forest in search of treasure. [NC]A tiger appeared in the forest, at night very frightened, open mouth, in the forest, at night running very fast, in the forest, at night [NC] A house in the forest, at night #Suddenly, he discovers the treasure house! in the house filled with treasure, laughing, at night #He is overjoyed inside the house.
- style_strength_ratio: 20
- character_description: a man, wearing black suit
{ "num_ids": 3, "sd_model": "Unstable", "num_steps": 25, "style_name": "Disney Charactor", "comic_style": "Classic Comic Style", "image_width": 768, "image_height": 768, "sa32_setting": 0.7, "sa64_setting": 0.7, "output_format": "webp", "guidance_scale": 5, "output_quality": 80, "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs", "comic_description": "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\non the road, near the forest\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\n[NC]A tiger appeared in the forest, at night \nvery frightened, open mouth, in the forest, at night\nrunning very fast, in the forest, at night\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house.", "style_strength_ratio": 20, "character_description": "a man, wearing black suit" }
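The logs of this prediction show how comic_description becomes one prompt per panel: the text is split into lines, lines marked [NC] are used without the character description, every other line is prefixed with character_description, and anything after # is dropped from the prompt (it reads like the panel caption). The Python sketch below reproduces only that logged behaviour. build_prompts is a hypothetical helper name, not a function taken from the model's code, and treating the text after # as a caption is an assumption.

```python
def build_prompts(character_description, comic_description):
    """Split a comic description into (prompts, captions), one entry per panel.

    Inferred from the prediction logs only; not the model's actual code.
    """
    prompts, captions = [], []
    for line in comic_description.splitlines():
        if not line.strip():
            continue  # ignore blank lines
        # Text after '#' is kept out of the prompt (assumed to be the caption).
        text, _, caption = line.partition("#")
        if "[NC]" in text:
            # [NC] panels are generated without the character description.
            prompt = text.replace("[NC]", "")
        else:
            prompt = f"{character_description},{text}"
        prompts.append(prompt.strip())
        captions.append(caption.strip())
    return prompts, captions


description = (
    "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\n"
    "on the road, near the forest\n"
    "[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\n"
    "[NC]A tiger appeared in the forest, at night \n"
    "very frightened, open mouth, in the forest, at night\n"
    "running very fast, in the forest, at night\n"
    "[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\n"
    "in the house filled with treasure, laughing, at night #He is overjoyed inside the house."
)
prompts, captions = build_prompts("a man, wearing black suit", description)
for prompt in prompts:
    print(prompt)
```

With the inputs above, the printed prompts match the third list in this prediction's logs, which is a quick way to check a description before spending GPU time on it.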
Install Replicate’s Node.js client library:
npm install replicate
Set the REPLICATE_API_TOKEN environment variable:
export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import and set up the client:
import Replicate from "replicate";
const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
  "hvision-nku/storydiffusion:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
  {
    input: {
      num_ids: 3,
      sd_model: "Unstable",
      num_steps: 25,
      style_name: "Disney Charactor",
      comic_style: "Classic Comic Style",
      image_width: 768,
      image_height: 768,
      sa32_setting: 0.7,
      sa64_setting: 0.7,
      output_format: "webp",
      guidance_scale: 5,
      output_quality: 80,
      negative_prompt: "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
      comic_description: "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\non the road, near the forest\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\n[NC]A tiger appeared in the forest, at night \nvery frightened, open mouth, in the forest, at night\nrunning very fast, in the forest, at night\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house.",
      style_strength_ratio: 20,
      character_description: "a man, wearing black suit"
    }
  }
);
console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:
pip install replicate
Set the REPLICATE_API_TOKEN environment variable:
export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import the client:
import replicate
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
    "hvision-nku/storydiffusion:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
    input={
        "num_ids": 3,
        "sd_model": "Unstable",
        "num_steps": 25,
        "style_name": "Disney Charactor",
        "comic_style": "Classic Comic Style",
        "image_width": 768,
        "image_height": 768,
        "sa32_setting": 0.7,
        "sa64_setting": 0.7,
        "output_format": "webp",
        "guidance_scale": 5,
        "output_quality": 80,
        "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
        "comic_description": "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\non the road, near the forest\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\n[NC]A tiger appeared in the forest, at night \nvery frightened, open mouth, in the forest, at night\nrunning very fast, in the forest, at night\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house.",
        "style_strength_ratio": 20,
        "character_description": "a man, wearing black suit"
    }
)
print(output)
To learn more, take a look at the guide on getting started with Python.
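The Output section of this prediction shows the result as an object with a "comic" URL and an "individual_images" list of URLs. As a rough sketch, the panels could be saved locally as follows. It assumes the output really is a dict of plain URL strings as in that JSON (some client versions may return other object types), and plan_downloads, download_all and the storydiffusion_out directory are hypothetical names chosen for illustration.

```python
import os
from urllib.parse import urlparse
from urllib.request import urlretrieve


def plan_downloads(output, dest_dir="storydiffusion_out"):
    """Return (url, local_path) pairs for the comic page and every panel."""
    urls = [output["comic"], *output["individual_images"]]
    return [
        (url, os.path.join(dest_dir, os.path.basename(urlparse(url).path)))
        for url in urls
    ]


def download_all(output, dest_dir="storydiffusion_out"):
    """Fetch every planned file; requires network access when called."""
    os.makedirs(dest_dir, exist_ok=True)
    for url, path in plan_downloads(output, dest_dir):
        urlretrieve(url, path)


# Two of the URLs from the Output section, used here only to show the mapping.
sample_output = {
    "comic": "https://replicate.delivery/pbxt/2Dfo7bS5xtRVaKetjG3dBAmfvi0U47R0fxznTearmUZyH3IWC/comic.webp",
    "individual_images": [
        "https://replicate.delivery/pbxt/jbMHRLctL3oePSecrNlTpEYlSusbISeqG36PsO80qH6ejbELB/out-0.webp",
    ],
}
plan = plan_downloads(sample_output)
for url, path in plan:
    print(path)
```

Calling download_all(output) after replicate.run would then write comic.webp and the out-N.webp files into the chosen directory.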
Set the REPLICATE_API_TOKEN environment variable:
export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run hvision-nku/storydiffusion using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2",
    "input": {
      "num_ids": 3,
      "sd_model": "Unstable",
      "num_steps": 25,
      "style_name": "Disney Charactor",
      "comic_style": "Classic Comic Style",
      "image_width": 768,
      "image_height": 768,
      "sa32_setting": 0.7,
      "sa64_setting": 0.7,
      "output_format": "webp",
      "guidance_scale": 5,
      "output_quality": 80,
      "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
      "comic_description": "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\\non the road, near the forest\\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\\n[NC]A tiger appeared in the forest, at night \\nvery frightened, open mouth, in the forest, at night\\nrunning very fast, in the forest, at night\\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house.",
      "style_strength_ratio": 20,
      "character_description": "a man, wearing black suit"
    }
  }' \
  https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
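For readers who prefer to stay in Python without the client library, the same HTTP call can be sketched with the standard library. The endpoint, headers and version hash are the ones from the curl command above; build_request is a hypothetical helper, and the request is only constructed here, not sent.

```python
import json
import os
import urllib.request

API_URL = "https://api.replicate.com/v1/predictions"
VERSION = "39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2"


def build_request(token, model_input, wait=True):
    """Build (but do not send) the POST request shown in the curl example."""
    body = json.dumps({"version": VERSION, "input": model_input}).encode("utf-8")
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
    if wait:
        # Same "Prefer: wait" header as the curl command.
        headers["Prefer"] = "wait"
    return urllib.request.Request(API_URL, data=body, headers=headers, method="POST")


request = build_request(
    os.environ.get("REPLICATE_API_TOKEN", "<paste-your-token-here>"),
    {
        "num_ids": 3,
        "style_name": "Disney Charactor",
        "character_description": "a man, wearing black suit",
        "comic_description": "at home, read new paper\non the road, near the forest\n[NC] The car on the road, near the forest",
    },
)
print(request.get_method(), request.full_url)
```

Sending it would be a matter of passing request to urllib.request.urlopen and decoding the JSON response. The input here is shortened for readability; the remaining fields from the curl example can be added to the dict in the same way.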
You can run this model locally using Cog. First, install Cog:
brew install cog
If you don’t have Homebrew, there are other installation options available.
Run this to download the model and run it in your local environment:
cog predict r8.im/cjwbw/storydiffusion@sha256:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2 \
  -i 'num_ids=3' \
  -i 'sd_model="Unstable"' \
  -i 'num_steps=25' \
  -i 'style_name="Disney Charactor"' \
  -i 'comic_style="Classic Comic Style"' \
  -i 'image_width=768' \
  -i 'image_height=768' \
  -i 'sa32_setting=0.7' \
  -i 'sa64_setting=0.7' \
  -i 'output_format="webp"' \
  -i 'guidance_scale=5' \
  -i 'output_quality=80' \
  -i 'negative_prompt="bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs"' \
  -i $'comic_description="at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\\non the road, near the forest\\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\\n[NC]A tiger appeared in the forest, at night \\nvery frightened, open mouth, in the forest, at night\\nrunning very fast, in the forest, at night\\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house."' \
  -i 'style_strength_ratio=20' \
  -i 'character_description="a man, wearing black suit"'
To learn more, take a look at the Cog documentation.
Run this to download the model and run it in your local environment:
docker run -d -p 5000:5000 --gpus=all r8.im/cjwbw/storydiffusion@sha256:39c85f153f00e4e9328cb3035b94559a8ec66170eb4c0618c07b16528bf20ac2
curl -s -X POST \
  -H "Content-Type: application/json" \
  -d $'{
    "input": {
      "num_ids": 3,
      "sd_model": "Unstable",
      "num_steps": 25,
      "style_name": "Disney Charactor",
      "comic_style": "Classic Comic Style",
      "image_width": 768,
      "image_height": 768,
      "sa32_setting": 0.7,
      "sa64_setting": 0.7,
      "output_format": "webp",
      "guidance_scale": 5,
      "output_quality": 80,
      "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs",
      "comic_description": "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\\non the road, near the forest\\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\\n[NC]A tiger appeared in the forest, at night \\nvery frightened, open mouth, in the forest, at night\\nrunning very fast, in the forest, at night\\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house.",
      "style_strength_ratio": 20,
      "character_description": "a man, wearing black suit"
    }
  }' \
  http://localhost:5000/predictions
To learn more, take a look at the Cog documentation.
Output
{ "completed_at": "2024-05-04T22:52:48.274267Z", "created_at": "2024-05-04T22:51:14.003000Z", "data_removed": false, "error": null, "id": "x0mfgjfatdrgm0cf8q6v5rbb3m", "input": { "num_ids": 3, "sd_model": "Unstable", "num_steps": 25, "style_name": "Disney Charactor", "comic_style": "Classic Comic Style", "image_width": 768, "image_height": 768, "sa32_setting": 0.7, "sa64_setting": 0.7, "output_format": "webp", "guidance_scale": 5, "output_quality": 80, "negative_prompt": "bad anatomy, bad hands, missing fingers, extra fingers, three hands, three legs, bad arms, missing legs, missing arms, poorly drawn face, bad face, fused face, cloned face, three crus, fused feet, fused thigh, extra crus, ugly fingers, horn, cartoon, cg, 3d, unreal, animate, amputation, disconnected limbs", "comic_description": "at home, read new paper #at home, The newspaper says there is a treasure house in the forest.\non the road, near the forest\n[NC] The car on the road, near the forest #He drives to the forest in search of treasure.\n[NC]A tiger appeared in the forest, at night \nvery frightened, open mouth, in the forest, at night\nrunning very fast, in the forest, at night\n[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!\nin the house filled with treasure, laughing, at night #He is overjoyed inside the house.", "style_strength_ratio": 20, "character_description": "a man, wearing black suit" }, "logs": "['at home, read new paper #at home, The newspaper says there is a treasure house in the forest.', 'on the road, near the forest', '[NC] The car on the road, near the forest #He drives to the forest in search of treasure.', '[NC]A tiger appeared in the forest, at night ', 'very frightened, open mouth, in the forest, at night', 'running very fast, in the forest, at night', '[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!', 'in the house filled with treasure, laughing, at night #He is overjoyed inside the house.']\n['a man, 
wearing black suit,at home, read new paper #at home, The newspaper says there is a treasure house in the forest.', 'a man, wearing black suit,on the road, near the forest', ' The car on the road, near the forest #He drives to the forest in search of treasure.', 'A tiger appeared in the forest, at night ', 'a man, wearing black suit,very frightened, open mouth, in the forest, at night', 'a man, wearing black suit,running very fast, in the forest, at night', ' A house in the forest, at night #Suddenly, he discovers the treasure house!', 'a man, wearing black suit,in the house filled with treasure, laughing, at night #He is overjoyed inside the house.']\n['a man, wearing black suit,at home, read new paper', 'a man, wearing black suit,on the road, near the forest', 'The car on the road, near the forest', 'A tiger appeared in the forest, at night', 'a man, wearing black suit,very frightened, open mouth, in the forest, at night', 'a man, wearing black suit,running very fast, in the forest, at night', 'A house in the forest, at night', 'a man, wearing black suit,in the house filled with treasure, laughing, at night']\nUsing seed: 26420\nSuccessfully load paired self-attention\nNumber of the processor : 36\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:19, 1.21it/s]\n 8%|▊ | 2/25 [00:00<00:10, 2.27it/s]\n 12%|█▏ | 3/25 [00:01<00:07, 3.14it/s]\n 16%|█▌ | 4/25 [00:01<00:05, 3.82it/s]\n 20%|██ | 5/25 [00:01<00:04, 4.34it/s]\n 24%|██▍ | 6/25 [00:01<00:04, 4.09it/s]\n 28%|██▊ | 7/25 [00:02<00:04, 4.00it/s]\n 32%|███▏ | 8/25 [00:02<00:04, 3.92it/s]\n 36%|███▌ | 9/25 [00:02<00:04, 3.89it/s]\n 40%|████ | 10/25 [00:02<00:03, 3.88it/s]\n 44%|████▍ | 11/25 [00:03<00:03, 3.88it/s]\n 48%|████▊ | 12/25 [00:03<00:03, 3.87it/s]\n 52%|█████▏ | 13/25 [00:03<00:03, 3.89it/s]\n 56%|█████▌ | 14/25 [00:03<00:02, 3.90it/s]\n 60%|██████ | 15/25 [00:04<00:02, 3.82it/s]\n 64%|██████▍ | 16/25 [00:04<00:02, 3.69it/s]\n 68%|██████▊ | 17/25 [00:04<00:02, 3.60it/s]\n 72%|███████▏ | 18/25 
[00:05<00:01, 3.58it/s]\n 76%|███████▌ | 19/25 [00:05<00:01, 3.64it/s]\n 80%|████████ | 20/25 [00:05<00:01, 3.58it/s]\n 84%|████████▍ | 21/25 [00:05<00:01, 3.48it/s]\n 88%|████████▊ | 22/25 [00:06<00:00, 3.41it/s]\n 92%|█████████▏| 23/25 [00:06<00:00, 3.36it/s]\n 96%|█████████▌| 24/25 [00:06<00:00, 3.34it/s]\n100%|██████████| 25/25 [00:07<00:00, 3.32it/s]\n100%|██████████| 25/25 [00:07<00:00, 3.52it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 4%|▍ | 1/25 [00:00<00:03, 6.92it/s]\n 12%|█▏ | 3/25 [00:00<00:02, 9.37it/s]\n 20%|██ | 5/25 [00:00<00:01, 10.09it/s]\n 24%|██▍ | 6/25 [00:00<00:02, 9.26it/s]\n 28%|██▊ | 7/25 [00:00<00:02, 8.76it/s]\n 32%|███▏ | 8/25 [00:00<00:02, 8.46it/s]\n 36%|███▌ | 9/25 [00:01<00:01, 8.45it/s]\n 40%|████ | 10/25 [00:01<00:01, 8.32it/s]\n 44%|████▍ | 11/25 [00:01<00:01, 8.12it/s]\n 48%|████▊ | 12/25 [00:01<00:01, 8.09it/s]\n 52%|█████▏ | 13/25 [00:01<00:01, 8.04it/s]\n 56%|█████▌ | 14/25 [00:01<00:01, 7.89it/s]\n 60%|██████ | 15/25 [00:01<00:01, 7.93it/s]\n 64%|██████▍ | 16/25 [00:01<00:01, 7.98it/s]\n 68%|██████▊ | 17/25 [00:02<00:00, 8.38it/s]\n 72%|███████▏ | 18/25 [00:02<00:00, 8.23it/s]\n 76%|███████▌ | 19/25 [00:02<00:00, 8.21it/s]\n 80%|████████ | 20/25 [00:02<00:00, 7.95it/s]\n 84%|████████▍ | 21/25 [00:02<00:00, 7.68it/s]\n 88%|████████▊ | 22/25 [00:02<00:00, 7.74it/s]\n 92%|█████████▏| 23/25 [00:02<00:00, 7.51it/s]\n 96%|█████████▌| 24/25 [00:02<00:00, 7.40it/s]\n100%|██████████| 25/25 [00:03<00:00, 7.49it/s]\n100%|██████████| 25/25 [00:03<00:00, 8.12it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 8%|▊ | 2/25 [00:00<00:02, 10.77it/s]\n 16%|█▌ | 4/25 [00:00<00:01, 10.83it/s]\n 24%|██▍ | 6/25 [00:00<00:01, 10.10it/s]\n 32%|███▏ | 8/25 [00:00<00:01, 9.13it/s]\n 36%|███▌ | 9/25 [00:00<00:01, 9.03it/s]\n 40%|████ | 10/25 [00:01<00:01, 8.84it/s]\n 44%|████▍ | 11/25 [00:01<00:01, 8.62it/s]\n 48%|████▊ | 12/25 [00:01<00:01, 8.66it/s]\n 52%|█████▏ | 13/25 [00:01<00:01, 8.59it/s]\n 56%|█████▌ | 14/25 [00:01<00:01, 8.49it/s]\n 60%|██████ | 15/25 [00:01<00:01, 
8.50it/s]\n 64%|██████▍ | 16/25 [00:01<00:01, 8.32it/s]\n 68%|██████▊ | 17/25 [00:01<00:00, 8.21it/s]\n 72%|███████▏ | 18/25 [00:02<00:00, 8.37it/s]\n 76%|███████▌ | 19/25 [00:02<00:00, 8.27it/s]\n 80%|████████ | 20/25 [00:02<00:00, 8.33it/s]\n 84%|████████▍ | 21/25 [00:02<00:00, 7.99it/s]\n 88%|████████▊ | 22/25 [00:02<00:00, 7.77it/s]\n 92%|█████████▏| 23/25 [00:02<00:00, 7.54it/s]\n 96%|█████████▌| 24/25 [00:02<00:00, 7.50it/s]\n100%|██████████| 25/25 [00:02<00:00, 7.48it/s]\n100%|██████████| 25/25 [00:02<00:00, 8.45it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 8%|▊ | 2/25 [00:00<00:02, 10.80it/s]\n 16%|█▌ | 4/25 [00:00<00:01, 10.82it/s]\n 24%|██▍ | 6/25 [00:00<00:01, 10.16it/s]\n 32%|███▏ | 8/25 [00:00<00:01, 9.63it/s]\n 36%|███▌ | 9/25 [00:00<00:01, 9.26it/s]\n 40%|████ | 10/25 [00:01<00:01, 8.89it/s]\n 44%|████▍ | 11/25 [00:01<00:01, 8.54it/s]\n 48%|████▊ | 12/25 [00:01<00:01, 8.53it/s]\n 52%|█████▏ | 13/25 [00:01<00:01, 8.35it/s]\n 56%|█████▌ | 14/25 [00:01<00:01, 8.32it/s]\n 60%|██████ | 15/25 [00:01<00:01, 8.33it/s]\n 64%|██████▍ | 16/25 [00:01<00:01, 8.30it/s]\n 68%|██████▊ | 17/25 [00:01<00:00, 8.20it/s]\n 72%|███████▏ | 18/25 [00:02<00:00, 8.11it/s]\n 76%|███████▌ | 19/25 [00:02<00:00, 7.93it/s]\n 80%|████████ | 20/25 [00:02<00:00, 8.07it/s]\n 84%|████████▍ | 21/25 [00:02<00:00, 8.03it/s]\n 88%|████████▊ | 22/25 [00:02<00:00, 7.81it/s]\n 92%|█████████▏| 23/25 [00:02<00:00, 7.57it/s]\n 96%|█████████▌| 24/25 [00:02<00:00, 7.58it/s]\n100%|██████████| 25/25 [00:02<00:00, 7.50it/s]\n100%|██████████| 25/25 [00:02<00:00, 8.42it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 8%|▊ | 2/25 [00:00<00:02, 10.73it/s]\n 16%|█▌ | 4/25 [00:00<00:01, 10.83it/s]\n 24%|██▍ | 6/25 [00:00<00:01, 10.47it/s]\n 32%|███▏ | 8/25 [00:00<00:01, 9.35it/s]\n 36%|███▌ | 9/25 [00:00<00:01, 8.95it/s]\n 40%|████ | 10/25 [00:01<00:01, 8.58it/s]\n 44%|████▍ | 11/25 [00:01<00:01, 8.33it/s]\n 48%|████▊ | 12/25 [00:01<00:01, 8.53it/s]\n 52%|█████▏ | 13/25 [00:01<00:01, 8.63it/s]\n 56%|█████▌ | 14/25 
[00:01<00:01, 8.46it/s]\n 60%|██████ | 15/25 [00:01<00:01, 8.44it/s]\n 64%|██████▍ | 16/25 [00:01<00:01, 8.30it/s]\n 68%|██████▊ | 17/25 [00:01<00:00, 8.19it/s]\n 72%|███████▏ | 18/25 [00:02<00:00, 8.03it/s]\n 76%|███████▌ | 19/25 [00:02<00:00, 7.97it/s]\n 80%|████████ | 20/25 [00:02<00:00, 7.92it/s]\n 84%|████████▍ | 21/25 [00:02<00:00, 7.75it/s]\n 88%|████████▊ | 22/25 [00:02<00:00, 7.63it/s]\n 92%|█████████▏| 23/25 [00:02<00:00, 7.57it/s]\n 96%|█████████▌| 24/25 [00:02<00:00, 7.51it/s]\n100%|██████████| 25/25 [00:02<00:00, 7.44it/s]\n100%|██████████| 25/25 [00:02<00:00, 8.36it/s]\n 0%| | 0/25 [00:00<?, ?it/s]\n 8%|▊ | 2/25 [00:00<00:02, 10.76it/s]\n 16%|█▌ | 4/25 [00:00<00:01, 10.82it/s]\n 24%|██▍ | 6/25 [00:00<00:01, 10.03it/s]\n 32%|███▏ | 8/25 [00:00<00:01, 9.16it/s]\n 36%|███▌ | 9/25 [00:00<00:01, 8.95it/s]\n 40%|████ | 10/25 [00:01<00:01, 8.91it/s]\n 44%|████▍ | 11/25 [00:01<00:01, 8.43it/s]\n 48%|████▊ | 12/25 [00:01<00:01, 8.45it/s]\n 52%|█████▏ | 13/25 [00:01<00:01, 8.20it/s]\n 56%|█████▌ | 14/25 [00:01<00:01, 8.23it/s]\n 60%|██████ | 15/25 [00:01<00:01, 8.03it/s]\n 64%|██████▍ | 16/25 [00:01<00:01, 7.92it/s]\n 68%|██████▊ | 17/25 [00:01<00:01, 7.95it/s]\n 72%|███████▏ | 18/25 [00:02<00:00, 8.04it/s]\n 76%|███████▌ | 19/25 [00:02<00:00, 8.03it/s]\n 80%|████████ | 20/25 [00:02<00:00, 7.96it/s]\n 84%|████████▍ | 21/25 [00:02<00:00, 7.78it/s]\n 88%|████████▊ | 22/25 [00:02<00:00, 7.52it/s]\n 92%|█████████▏| 23/25 [00:02<00:00, 7.40it/s]\n 96%|█████████▌| 24/25 [00:02<00:00, 7.29it/s]\n100%|██████████| 25/25 [00:03<00:00, 7.41it/s]\n100%|██████████| 25/25 [00:03<00:00, 8.26it/s]\n4 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93E64690>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C9428B0D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C95F0EA90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C94235110>]]\n0 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93E64690>, <PIL.Image.Image image mode=RGB size=788x788 at 
0x7F5C9428B0D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C95F0EA90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C94235110>], [<PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C941E5A50>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93CC64D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93CC7E90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93CC7ED0>]]\n[[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93E64690>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C9428B0D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C95F0EA90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C94235110>], [<PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C941E5A50>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93CC64D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93CC7E90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93CC7ED0>]]\n1 (7, 650)\n0 (124, 721)\n1 (56, 636)\n0 (1, 712)\nPipelines loaded with `dtype=torch.float16` cannot run with `cpu` device. It is not recommended to move them to `cpu` as running them will fail. Please make sure to use an accelerator to run the pipeline in inference, due to the lack of support for`float16` operations on this device in PyTorch. Please, remove the `torch_dtype=torch.float16` argument, or use another device for inference.\nPipelines loaded with `dtype=torch.float16` cannot run with `cpu` device. It is not recommended to move them to `cpu` as running them will fail. Please make sure to use an accelerator to run the pipeline in inference, due to the lack of support for`float16` operations on this device in PyTorch. Please, remove the `torch_dtype=torch.float16` argument, or use another device for inference.\nPipelines loaded with `dtype=torch.float16` cannot run with `cpu` device. It is not recommended to move them to `cpu` as running them will fail. 
Please make sure to use an accelerator to run the pipeline in inference, due to the lack of support for`float16` operations on this device in PyTorch. Please, remove the `torch_dtype=torch.float16` argument, or use another device for inference.\nPipelines loaded with `dtype=torch.float16` cannot run with `cpu` device. It is not recommended to move them to `cpu` as running them will fail. Please make sure to use an accelerator to run the pipeline in inference, due to the lack of support for`float16` operations on this device in PyTorch. Please, remove the `torch_dtype=torch.float16` argument, or use another device for inference.", "metrics": { "predict_time": 32.411101, "total_time": 94.271267 }, "output": { "comic": "https://replicate.delivery/pbxt/2Dfo7bS5xtRVaKetjG3dBAmfvi0U47R0fxznTearmUZyH3IWC/comic.webp", "individual_images": [ "https://replicate.delivery/pbxt/jbMHRLctL3oePSecrNlTpEYlSusbISeqG36PsO80qH6ejbELB/out-0.webp", "https://replicate.delivery/pbxt/qfRozduVXqwPSSAApNiGHvAnMYcMWnxVR5ozsJlaEfsfxNilA/out-1.webp", "https://replicate.delivery/pbxt/wKSIYJY2IU4iNR1x2IK6BcgiMsvTGWOmUQKmcI02MoxPuRsE/out-2.webp", "https://replicate.delivery/pbxt/J2qFUX6L8Xa5BxkjfNhQj1ENK7PkesDNvIL9eHr3p9OejbELB/out-3.webp", "https://replicate.delivery/pbxt/KGrZeFia1PwHV6bT1SI2q3KK8DtNPQH0Cs1fctwN0X9fxNilA/out-4.webp", "https://replicate.delivery/pbxt/NMz3LLKBl37ODNNYTu6RehpsVPALgJfGLH4UNPNx3zCfxNilA/out-5.webp", "https://replicate.delivery/pbxt/Ny3mrUcfxORrICxnRICp2MLmiVEmKQaUwSeQcDc8yQ6fxNilA/out-6.webp", "https://replicate.delivery/pbxt/YI6MJsi9QEZVAtAjyOsnU58bTckiUtjEFtjn7besmmGgcjYJA/out-7.webp" ] }, "started_at": "2024-05-04T22:52:15.863166Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/x0mfgjfatdrgm0cf8q6v5rbb3m", "cancel": "https://api.replicate.com/v1/predictions/x0mfgjfatdrgm0cf8q6v5rbb3m/cancel" }, "version": "d0faed1e8b5779d4d060c5bb0afac8e52c704700f7bee9e4341b0684a8feda04" }
['at home, read new paper #at home, The newspaper says there is a treasure house in the forest.', 'on the road, near the forest', '[NC] The car on the road, near the forest #He drives to the forest in search of treasure.', '[NC]A tiger appeared in the forest, at night ', 'very frightened, open mouth, in the forest, at night', 'running very fast, in the forest, at night', '[NC] A house in the forest, at night #Suddenly, he discovers the treasure house!', 'in the house filled with treasure, laughing, at night #He is overjoyed inside the house.']
['a man, wearing black suit,at home, read new paper #at home, The newspaper says there is a treasure house in the forest.', 'a man, wearing black suit,on the road, near the forest', ' The car on the road, near the forest #He drives to the forest in search of treasure.', 'A tiger appeared in the forest, at night ', 'a man, wearing black suit,very frightened, open mouth, in the forest, at night', 'a man, wearing black suit,running very fast, in the forest, at night', ' A house in the forest, at night #Suddenly, he discovers the treasure house!', 'a man, wearing black suit,in the house filled with treasure, laughing, at night #He is overjoyed inside the house.']
['a man, wearing black suit,at home, read new paper', 'a man, wearing black suit,on the road, near the forest', 'The car on the road, near the forest', 'A tiger appeared in the forest, at night', 'a man, wearing black suit,very frightened, open mouth, in the forest, at night', 'a man, wearing black suit,running very fast, in the forest, at night', 'A house in the forest, at night', 'a man, wearing black suit,in the house filled with treasure, laughing, at night']
Using seed: 26420
Successfully load paired self-attention
Number of the processor : 36
100%|██████████| 25/25 [00:07<00:00, 3.52it/s]
100%|██████████| 25/25 [00:03<00:00, 8.12it/s]
100%|██████████| 25/25 [00:02<00:00, 8.45it/s]
100%|██████████| 25/25 [00:02<00:00, 8.42it/s]
100%|██████████| 25/25 [00:02<00:00, 8.36it/s]
100%|██████████| 25/25 [00:03<00:00, 8.26it/s]
4 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93E64690>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C9428B0D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C95F0EA90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C94235110>]]
0 [[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93E64690>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C9428B0D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C95F0EA90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C94235110>], [<PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C941E5A50>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93CC64D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93CC7E90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93CC7ED0>]]
[[<PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93E64690>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C9428B0D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C95F0EA90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C94235110>], [<PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C941E5A50>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93CC64D0>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93CC7E90>, <PIL.Image.Image image mode=RGB size=788x788 at 0x7F5C93CC7ED0>]]
1 (7, 650)
0 (124, 721)
1 (56, 636)
0 (1, 712)
Pipelines loaded with `dtype=torch.float16` cannot run with `cpu` device. It is not recommended to move them to `cpu` as running them will fail. Please make sure to use an accelerator to run the pipeline in inference, due to the lack of support for `float16` operations on this device in PyTorch. Please, remove the `torch_dtype=torch.float16` argument, or use another device for inference.
(This warning appears four times in the log.)
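The three lists printed at the start of the log suggest how each line of comic_description becomes a prompt. Lines marked [NC] lose the marker and do not get the character description. All other lines get character_description prepended with a comma. Finally, anything after # (the caption) is dropped and the result is trimmed. The Python sketch below reproduces that behaviour as inferred from the log output only; it is not the model's actual code, and the names build_prompts and NO_CHARACTER are hypothetical.

```python
from typing import List, Tuple

# Marker seen in the input; assumed here to mean "no character in this panel".
NO_CHARACTER = "[NC]"


def build_prompts(
    comic_description: str, character_description: str
) -> Tuple[List[str], List[str], List[str]]:
    """Return (raw lines, lines with character, final image prompts)."""
    # One panel per input line, kept exactly as written.
    raw = comic_description.splitlines()

    # Strip the marker from [NC] lines; prefix every other line
    # with the character description, joined by a bare comma.
    with_character = [
        line.replace(NO_CHARACTER, "")
        if NO_CHARACTER in line
        else f"{character_description},{line}"
        for line in raw
    ]

    # Keep only the text before '#', then trim surrounding whitespace.
    image_prompts = [p.split("#")[0].strip() for p in with_character]
    return raw, with_character, image_prompts


if __name__ == "__main__":
    description = (
        "at home, read new paper #at home, The newspaper says there is "
        "a treasure house in the forest.\n"
        "[NC] The car on the road, near the forest "
        "#He drives to the forest in search of treasure."
    )
    for prompt in build_prompts(description, "a man, wearing black suit")[2]:
        print(prompt)
```

Applied to the comic_description and character_description of this run, the third returned list should match the third list in the log, which is presumably what the pipeline receives as image prompts.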