lucataco / apollo-7b

Apollo 7B - An Exploration of Video Understanding in Large Multimodal Models

  • Public
  • 2.5K runs
  • L40S
  • GitHub
  • Weights
  • Paper
  • License

Input

*file

Input video file

string
Shift + Return to add a new line

Question or prompt about the video

Default: "Describe this video in detail"

number
(minimum: 0.1, maximum: 2)

Sampling temperature

Default: 0.4

integer
(minimum: 32, maximum: 1024)

Maximum number of tokens to generate

Default: 256

number
(minimum: 0, maximum: 1)

Top-p sampling probability

Default: 0.7

Output

The video features an astronaut in a white spacesuit walking on the moon's surface. The background showcases a large, detailed moon against a starry sky. As the astronaut walks, they begin to run and eventually leap into the air, floating above the moon's rocky terrain. The scene transitions to the astronaut drifting away from the moon, with the lunar landscape and the moon itself visible in the background. The video concludes with the astronaut continuing to float in space, gazing at the moon.
Generated in

Run time and cost

This model costs approximately $0.0014 to run on Replicate, or 714 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 2 seconds. The predict time for this model varies significantly based on the inputs.

Readme

This model doesn't have a readme.