lucataco/bulk-video-caption:bd610b3c

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.

| Field | Type | Default value | Description |
|---|---|---|---|
| video_zip_archive | string | | ZIP archive containing videos to process |
| include_csv | boolean | True | Whether to include CSV in output |
| caption_prefix | string | | Optional prefix for video captions |
| caption_suffix | string | | Optional suffix for video captions |
| frames_to_extract | integer | 2 | Number of frames to extract from each video for analysis |
| system_prompt | string | Analyze these frames from a video and write a detailed caption. Describe the type of video (e.g., animation, live-action footage, etc.). Focus on consistent elements across frames and any notable motion or action. Describe the main subjects, setting, and overall mood of the video. Use clear, descriptive language suitable for text-to-video generation. | System prompt for caption generation |
| model | string (enum) | gpt-4o | AI model to use for captioning. Options: gpt-4o, gpt-4o-mini, gpt-4-turbo, claude-3-5-sonnet-20240620, claude-3-opus-20240229, claude-3-sonnet-20240229, claude-3-haiku-20240307, gemini-1.5-pro, gemini-1.5-flash |
| openai_api_key | string | | API key for OpenAI |
| anthropic_api_key | string | | API key for Anthropic |
| google_generativeai_api_key | string | | API key for Google Generative AI |
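
As an illustration of how the fields above fit together, here is a minimal Python sketch that assembles an input payload. The helper name `build_input` is hypothetical. The sketch makes two assumptions the schema does not state: that `video_zip_archive` is passed as a URL string, and that each model family reads only the API key field of its own provider (gpt models use `openai_api_key`, claude models use `anthropic_api_key`, gemini models use `google_generativeai_api_key`).

```python
from typing import Optional

# Model names copied from the "model" enum in the input schema.
MODELS = (
    "gpt-4o", "gpt-4o-mini", "gpt-4-turbo",
    "claude-3-5-sonnet-20240620", "claude-3-opus-20240229",
    "claude-3-sonnet-20240229", "claude-3-haiku-20240307",
    "gemini-1.5-pro", "gemini-1.5-flash",
)

# ASSUMPTION: the provider key field is chosen by model-name prefix.
KEY_FIELD_BY_PREFIX = {
    "gpt": "openai_api_key",
    "claude": "anthropic_api_key",
    "gemini": "google_generativeai_api_key",
}


def build_input(
    video_zip_archive: str,
    api_key: str,
    model: str = "gpt-4o",
    frames_to_extract: int = 2,
    include_csv: bool = True,
    caption_prefix: Optional[str] = None,
    caption_suffix: Optional[str] = None,
) -> dict:
    """Build an input payload (hypothetical helper, not part of the model)."""
    if model not in MODELS:
        raise ValueError(f"unknown model {model!r}; expected one of {MODELS}")

    # "claude-3-opus-20240229" -> "claude", "gpt-4o" -> "gpt", and so on.
    key_field = KEY_FIELD_BY_PREFIX[model.split("-", 1)[0]]

    payload = {
        "video_zip_archive": video_zip_archive,  # ASSUMPTION: a URL string
        "model": model,
        "frames_to_extract": frames_to_extract,
        "include_csv": include_csv,
        key_field: api_key,
    }
    # Omit optional fields that were not supplied so the defaults apply.
    if caption_prefix is not None:
        payload["caption_prefix"] = caption_prefix
    if caption_suffix is not None:
        payload["caption_suffix"] = caption_suffix
    return payload
```

The resulting dictionary could then be passed as the `input` of a prediction request, for example through the `replicate` Python client's `replicate.run(..., input=payload)`. `system_prompt` is left out here, so its default applies.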

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{"format": "uri", "title": "Output", "type": "string"}
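
Since the output is a single URI string, a caller might check it before fetching the file. The sketch below is one way to do that with the Python standard library. The function names and the default destination filename are hypothetical, and it assumes the URI is an http(s) link, which the schema does not guarantee. The schema also does not say what the file contains.

```python
import urllib.request
from urllib.parse import urlparse


def is_http_uri(output: object) -> bool:
    """Return True if output is a string that parses as an http(s) URL."""
    if not isinstance(output, str):
        return False
    parts = urlparse(output)
    # ASSUMPTION: only http(s) links are treated as downloadable here.
    return parts.scheme in ("http", "https") and bool(parts.netloc)


def download_output(output: str, dest: str = "output_file") -> str:
    """Download the file behind the output URI and return the local path."""
    if not is_http_uri(output):
        raise ValueError(f"not a downloadable URI: {output!r}")
    urllib.request.urlretrieve(output, dest)  # performs a network request
    return dest
```

Keeping the check separate from the download lets the caller reject an unexpected response without making a network request.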