lucataco / apollo-3b

Apollo 3B - An Exploration of Video Understanding in Large Multimodal Models

  • Public
  • 120 runs
  • GitHub
  • Weights
  • Paper
  • License
Iterate in playground
  • Prediction

    lucataco/apollo-3b:ed5ec3233f7a4331a2b0d33b8a02e62057a335e1a03107d56db9fac10cf3ce55
    ID
    65eyxmwszdrm80ckt4zajjt3rm
    Status
    Succeeded
    Source
    Web
    Hardware
    L40S
    Total duration
    Created
    by @lucataco

    Input

    top_p
    0.7
    video
    prompt
    Describe this video in detail
    temperature
    0.4
    max_new_tokens
    256

    Output

    The video features a lone astronaut in a white spacesuit, equipped with a helmet and gloves, standing on the moon's surface. The backdrop is dominated by a large, detailed image of the moon, set against a starry space. The astronaut begins to run across the lunar terrain, leaving footprints behind. As he runs, the camera angle shifts to reveal more of the moon's rugged landscape. The astronaut continues his run until he reaches the edge of the frame, where he leaps into the vast expanse of space, floating away from the moon. Throughout the sequence, the moon remains a constant and prominent feature in the background, emphasizing the astronaut's journey into the cosmos. The video captures the astronaut's solitary trek across the moon's surface and his subsequent leap into the unknown, symbolizing humanity's boundless curiosity and spirit of exploration. The astronaut's actions are depicted with precision and grace, highlighting the beauty and isolation of space travel. The video concludes with the astronaut floating freely in space, surrounded by the endless void of space, underscoring the awe-inspiring scale and mystery of the universe. The astronaut's journey serves as a powerful metaphor for human ambition and the quest for knowledge, encapsulating the essence of space exploration. The
    Generated in
  • Prediction

    lucataco/apollo-3b:ed5ec3233f7a4331a2b0d33b8a02e62057a335e1a03107d56db9fac10cf3ce55
    ID
    fxdyrfwv65rma0ckt5csgvf0cg
    Status
    Succeeded
    Source
    Web
    Hardware
    L40S
    Total duration
    Created

    Input

    top_p
    0.7
    video
    prompt
    What are the two people discussing?
    temperature
    0.4
    max_new_tokens
    256

    Output

    The video does not provide specific details about what they are discussing. However, it can be inferred that they are engaged in a professional conversation. One person is holding a pen and appears to be writing or taking notes while the other person is gesturing with their hand. The woman is also pointing towards something off-screen, indicating a possible discussion about something outside of the frame. The man is focused on his laptop, suggesting he might be referring to information displayed on the screen. Overall, the interaction seems to be a collaborative effort between the two individuals. The exact nature of their discussion is not clear from the video alone. However, based on the actions and body language, it can be inferred that they are discussing something important or significant. The woman's gestures and pointing suggest she is emphasizing a particular point or idea, while the man's focus on his laptop indicates he is considering or referencing information displayed on the screen. The overall atmosphere suggests a productive and focused discussion. The video does not provide any additional context or details about the topic being discussed. Therefore, it is difficult to determine the exact nature of their conversation. However, based on the actions and body language, it can be inferred that they are discussing something important or significant. The woman's gestures and pointing suggest she is emphasizing a
    Generated in
  • Prediction

    lucataco/apollo-3b:ed5ec3233f7a4331a2b0d33b8a02e62057a335e1a03107d56db9fac10cf3ce55
    ID
    t7104vtxcsrmc0ckwkjvkqex4g
    Status
    Succeeded
    Source
    Web
    Hardware
    L40S
    Total duration
    Created

    Input

    top_p
    0.7
    video
    prompt
    Describe this video in detail
    temperature
    0.4
    max_new_tokens
    256

    Output

    The video features a golden retriever dog enjoying a slice of watermelon on a wooden table. The dog is seen licking and nibbling at the juicy, red watermelon with black seeds, set against a blurred green background that suggests an outdoor setting. The dog's focused expression highlights its enjoyment of the treat. As the video progresses, the dog continues to savor the watermelon, maintaining its attention on the delicious piece in front of it. The serene and natural atmosphere underscores the dog's contentment as it relishes its meal. The video wraps up with the dog still engaged with the watermelon, emphasizing its satisfaction and delight. Throughout the clip, the dog's movements are slow and deliberate, underscoring the peaceful and enjoyable moment shared between the pet and its treat. The video concludes with the dog still holding the watermelon slice in its mouth, showcasing its contentment and satisfaction. The consistent background and the dog's actions create a cohesive narrative of a simple yet joyful moment of pet enjoyment. The video captures the essence of a relaxed and happy outdoor experience for the dog. The final frame shows the dog still holding the watermelon slice, highlighting its contentment and satisfaction. The background remains blurred, keeping the focus on the dog
    Generated in

Want to make some of these yourself?

Run this model