fofr / batch-image-captioning

A wrapper model for captioning multiple images using GPT, Claude or Gemini, useful for lora training

  • Public
  • 1.1K runs
  • CPU
  • GitHub
  • License
  • Prediction

    fofr/batch-image-captioning:d0adb15f
    ID
    8zj4ygh84srg80ch9e19xw111m
    Status
    Succeeded
    Source
    Web
    Hardware
    CPU
    Total duration
    Created

    Input

    model
    gpt-4o-2024-08-06
    max_dimension
    1024
    system_prompt
    Write a four sentence caption for this image. In the first sentence describe the style and type (painting, photo, etc) of the image. Describe in the remaining sentences the contents and composition of the image. Only use language that would be used to prompt a text to image model. Do not include usage. Comma separate keywords rather than using "or". Precise composition is important. Avoid phrases like "conveys a sense of" and "capturing the", just use the terms themselves. Good examples are: "Photo of an alien woman with a glowing halo standing on top of a mountain, wearing a white robe and silver mask in the futuristic style with futuristic design, sky background, soft lighting, dynamic pose, a sense of future technology, a science fiction movie scene rendered in the Unreal Engine." "A scene from the cartoon series Masters of the Universe depicts Man-At-Arms wearing a gray helmet and gray armor with red gloves. He is holding an iron bar above his head while looking down on Orko, a pink blob character. Orko is sitting behind Man-At-Arms facing left on a chair. Both characters are standing near each other, with Orko inside a yellow chestplate over a blue shirt and black pants. The scene is drawn in the style of the Masters of the Universe cartoon series." "An emoji, digital illustration, playful, whimsical. A cartoon zombie character with green skin and tattered clothes reaches forward with two hands, they have green skin, messy hair, an open mouth and gaping teeth, one eye is half closed."
    caption_prefix
     
    caption_suffix
     
    message_prompt
    Caption this image please
    openai_api_key
    ████████████████████

    This value was redacted after being sent to the model.

    image_zip_archive
    Archive.zip
    resize_images_for_captioning

    Output

    Generated in
  • Prediction

    fofr/batch-image-captioning:d0adb15f
    ID
    h6t1p9ahv1rgc0ch9e3v9h2d2m
    Status
    Succeeded
    Source
    Web
    Hardware
    CPU
    Total duration
    Created

    Input

    model
    gpt-4o-2024-08-06
    max_dimension
    1024
    system_prompt
    Write a four sentence caption for this image. In the first sentence describe the style and type (painting, photo, etc) of the image. Describe in the remaining sentences the contents and composition of the image. Only use language that would be used to prompt a text to image model. Do not include usage. Comma separate keywords rather than using "or". Precise composition is important. Avoid phrases like "conveys a sense of" and "capturing the", just use the terms themselves. Good examples are: "Photo of an alien woman with a glowing halo standing on top of a mountain, wearing a white robe and silver mask in the futuristic style with futuristic design, sky background, soft lighting, dynamic pose, a sense of future technology, a science fiction movie scene rendered in the Unreal Engine." "A scene from the cartoon series Masters of the Universe depicts Man-At-Arms wearing a gray helmet and gray armor with red gloves. He is holding an iron bar above his head while looking down on Orko, a pink blob character. Orko is sitting behind Man-At-Arms facing left on a chair. Both characters are standing near each other, with Orko inside a yellow chestplate over a blue shirt and black pants. The scene is drawn in the style of the Masters of the Universe cartoon series." "An emoji, digital illustration, playful, whimsical. A cartoon zombie character with green skin and tattered clothes reaches forward with two hands, they have green skin, messy hair, an open mouth and gaping teeth, one eye is half closed."
    caption_prefix
    a photo of a phone in a toaster
    caption_suffix
     
    message_prompt
    Caption this image please
    openai_api_key
    ████████████████████

    This value was redacted after being sent to the model.

    image_zip_archive
    Archive.zip
    resize_images_for_captioning

    Output

    Generated in

Want to make some of these yourself?

Run this model