lucataco / bulk-video-caption

Video preprocessing tool for captioning multiple videos using GPT, Claude, or Gemini


Run time and cost

This model runs on CPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

Batch video caption

A Cog model for batch video captioning using AI models from OpenAI, Anthropic, and Google

Features

Process multiple videos from a ZIP archive
Supports MOV and MP4 files
Customizable caption prefixes and suffixes
Support for multiple AI models:
- OpenAI: GPT-4 and variants
- Anthropic: Claude-3.5, Claude-3 variants
- Google: Gemini-1.5 variants
Flexible system prompts
Error handling and retry mechanism
Output as a ZIP file containing caption files that match the video filenames, plus an optional CSV summary
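
The input and output handling described in the feature list can be sketched roughly as follows. This is a minimal illustration and not the model's actual code: `caption_zip`, `with_retries`, the `captioner` callable, the `captions.csv` name, and the CSV column names are all hypothetical, and the real captioning call to OpenAI, Anthropic, or Google is replaced by whatever function you pass in.

```python
import csv
import io
import time
import zipfile
from pathlib import PurePosixPath
from typing import Callable

VIDEO_EXTENSIONS = {".mov", ".mp4"}  # formats listed in the feature list


def with_retries(fn: Callable[[], str], attempts: int = 3, delay: float = 0.0) -> str:
    """Call fn until it succeeds or attempts run out (assumed retry policy)."""
    for attempt in range(1, attempts + 1):
        try:
            return fn()
        except Exception:
            if attempt == attempts:
                raise  # give up and surface the last error
            time.sleep(delay)
    raise ValueError("attempts must be at least 1")


def caption_zip(
    zip_bytes: bytes,
    captioner: Callable[[str, bytes], str],  # hypothetical stand-in for the AI call
    prefix: str = "",
    suffix: str = "",
    include_csv: bool = True,
) -> bytes:
    """Caption every video in a ZIP and return a ZIP of matching .txt files."""
    rows = []
    out = io.BytesIO()
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as src, zipfile.ZipFile(out, "w") as dst:
        for name in sorted(src.namelist()):
            path = PurePosixPath(name)
            # Skip directories, hidden files, and anything that is not MOV/MP4.
            if path.name.startswith(".") or path.suffix.lower() not in VIDEO_EXTENSIONS:
                continue
            data = src.read(name)
            caption = with_retries(lambda: captioner(path.name, data))
            caption = f"{prefix}{caption}{suffix}"
            # Caption file shares the video's base name: clip.mp4 -> clip.txt
            dst.writestr(f"{path.stem}.txt", caption)
            rows.append((path.name, caption))
        if include_csv:
            buf = io.StringIO()
            writer = csv.writer(buf)
            writer.writerow(["filename", "caption"])  # assumed column names
            writer.writerows(rows)
            dst.writestr("captions.csv", buf.getvalue())
    return out.getvalue()
```

Passing the captioner in as a plain callable keeps the archive handling independent of any particular provider, so the same sketch could wrap a GPT, Claude, or Gemini call.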