lucataco / bulk-video-caption

Video Preprocessing tool for captioning multiple videos using GPT, Claude or Gemini


Run time and cost

This model runs on CPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

Batch video caption

A Cog model for batch video captioning using AI models from OpenAI, Anthropic, and Google

Features

  • Process multiple videos from a ZIP archive
  • Supports MOV and MP4 files
  • Customizable caption prefixes and suffixes
  • Support for multiple AI models:
    • OpenAI: GPT-4 and variants
    • Anthropic: Claude-3.5, Claude-3 variants
    • Google: Gemini-1.5 variants
  • Flexible system prompts
  • Error handling and retry mechanism
  • Output as a ZIP file containing captions whose filenames match the video filenames, plus an optional CSV summary
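
The output format described above could be sketched roughly as follows. This is only an illustration, not the model's actual implementation: the function name `package_captions`, its parameters, the `.txt` extension for caption files, and the `captions.csv` name and columns are all assumptions made for the example.

```python
import csv
import io
import zipfile
from pathlib import PurePosixPath


def package_captions(captions, out_zip, prefix="", suffix="", include_csv=True):
    """Write one caption file per video into a ZIP, plus an optional CSV summary.

    `captions` maps a video filename (e.g. "clip01.mp4") to its raw caption.
    `out_zip` may be a path or a writable binary file-like object.
    All names here are hypothetical and chosen for illustration only.
    """
    rows = []
    with zipfile.ZipFile(out_zip, "w", zipfile.ZIP_DEFLATED) as zf:
        for video_name, caption in sorted(captions.items()):
            # Apply the user-supplied prefix and suffix to the raw caption.
            text = f"{prefix}{caption}{suffix}"
            # Caption file shares the video's base name: clip01.mp4 -> clip01.txt
            txt_name = PurePosixPath(video_name).with_suffix(".txt").name
            zf.writestr(txt_name, text)
            rows.append((video_name, text))
        if include_csv:
            # Build the CSV in memory, then store it as one more ZIP entry.
            buf = io.StringIO()
            writer = csv.writer(buf)
            writer.writerow(["filename", "caption"])
            writer.writerows(rows)
            zf.writestr("captions.csv", buf.getvalue())
    return [name for name, _ in rows]
```

Calling `package_captions({"clip01.mp4": "a dog runs"}, "captions.zip", prefix="VIDEO: ")` would, under these assumptions, produce a ZIP containing `clip01.txt` and `captions.csv`.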