lucataco / llama-3-vision-alpha

Projection module trained to add vision capabilties to Llama 3 using SigLIP

  • Public
  • 5.5K runs
  • L40S
  • GitHub
  • Paper
  • License
  • Prediction

    lucataco/llama-3-vision-alpha:79f9f341
    ID
    8kzht6ne5hrgm0cf5ybv5cmzkc
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    prompt
    Describe the image

    Output

    The image is of a young girl with short, curly hair and bright blue eyes. She is wearing a white dress with a golden belt around her waist and has a mischievous grin on her face. She is holding a giant hamburger in her hand, with a big bite taken out of it. Her eyes are shining with excitement and her cheeks are flushed from eating. She looks like she's in the middle of a fun and carefree moment, enjoying her meal and the company of her friends. The background is a warm and cozy setting, with a wooden table and chairs, and a festive atmosphere that suggests a holiday or special occasion. The overall vibe is playful and joyful, capturing the carefree spirit of childhood.
    Generated in
  • Prediction

    lucataco/llama-3-vision-alpha:79f9f341
    ID
    jecmkssk3drgp0cf5ye9mvcmhc
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    prompt
    What type of food is the girl holding? answer briefly

    Output

    A hamburger!
    Generated in
  • Prediction

    lucataco/llama-3-vision-alpha:79f9f341
    ID
    9ypnhbzfd5rgp0cf5yea6f9m84
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    prompt
    What color is the woman's hair? answer briefly

    Output

    It s white!
    Generated in
  • Prediction

    lucataco/llama-3-vision-alpha:79f9f341
    ID
    xr5yqyfvx5rgg0cjzm4903jh9g
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    prompt
    Describe the image

    Output

    The image is of a young girl with short, curly hair and bright blue eyes. She has a sweet and cheerful face, with a hint of mischief in her eyes. She is holding a giant hamburger in her hand, with a big bite taken out of it. She looks like she's enjoying every bite of her meal, with a satisfied expression on her face. She is sitting at a table in a medieval - style tavern, surrounded by wooden tables and chairs, with a fireplace crackling in the background. The atmosphere is cozy and inviting, with a warm glow emanating from the fireplace. The girl looks like she's in her element, enjoying a hearty meal with friends and family.
    Generated in

Want to make some of these yourself?

Run this model