lucataco/bulk-video-caption:bd610b3c – Run with an API on Replicate

You're looking at a specific version of this model.

These are the input fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.

Field	Type	Default value	Description
video_zip_archive	string		ZIP archive containing videos to process
include_csv	boolean	True	Whether to include CSV in output
caption_prefix	string		Optional prefix for video captions
caption_suffix	string		Optional suffix for video captions
frames_to_extract	integer	2	Number of frames to extract from each video for analysis
system_prompt	string	Analyze these frames from a video and write a detailed caption. Describe the type of video (e.g., animation, live-action footage, etc.). Focus on consistent elements across frames and any notable motion or action. Describe the main subjects, setting, and overall mood of the video. Use clear, descriptive language suitable for text-to-video generation.	System prompt for caption generation
model	string (enum)	gpt-4o (options: gpt-4o, gpt-4o-mini, gpt-4-turbo, claude-3-5-sonnet-20240620, claude-3-opus-20240229, claude-3-sonnet-20240229, claude-3-haiku-20240307, gemini-1.5-pro, gemini-1.5-flash)	AI model to use for captioning
openai_api_key	string		API key for OpenAI
anthropic_api_key	string		API key for Anthropic
google_generativeai_api_key	string		API key for Google Generative AI
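
As a rough illustration of how the fields above fit together, the sketch below builds an input payload in Python. It makes two assumptions that this page does not state: that `video_zip_archive` takes a URL pointing at the ZIP file, and that each model family needs only the API key of its own provider (`gpt-*` with `openai_api_key`, `claude-*` with `anthropic_api_key`, `gemini-*` with `google_generativeai_api_key`). The helper `build_input`, the example URL, and the placeholder key are hypothetical names used for illustration only.

```python
# Sketch only: builds an input payload from the fields listed in the table.
# Nothing here calls the Replicate API.

ALLOWED_MODELS = [
    "gpt-4o", "gpt-4o-mini", "gpt-4-turbo",
    "claude-3-5-sonnet-20240620", "claude-3-opus-20240229",
    "claude-3-sonnet-20240229", "claude-3-haiku-20240307",
    "gemini-1.5-pro", "gemini-1.5-flash",
]

# Assumption: the key field to send is determined by the model name prefix.
KEY_FIELD_BY_PREFIX = {
    "gpt-": "openai_api_key",
    "claude-": "anthropic_api_key",
    "gemini-": "google_generativeai_api_key",
}


def build_input(video_zip_archive, api_key, model="gpt-4o", **overrides):
    """Return a dict of input fields; omitted fields fall back to their defaults."""
    if model not in ALLOWED_MODELS:
        raise ValueError(f"unsupported model: {model!r}")
    # Pick the key field whose prefix matches the chosen model.
    key_field = next(
        field for prefix, field in KEY_FIELD_BY_PREFIX.items()
        if model.startswith(prefix)
    )
    payload = {
        "video_zip_archive": video_zip_archive,
        "model": model,
        key_field: api_key,
    }
    # Optional fields such as frames_to_extract or caption_prefix.
    payload.update(overrides)
    return payload


payload = build_input(
    "https://example.com/videos.zip",   # hypothetical archive URL
    "placeholder-key",                  # hypothetical API key
    model="claude-3-haiku-20240307",
    frames_to_extract=4,
    caption_prefix="A video of ",
)
print(sorted(payload))
```

A dict like this could then be passed as the `input` argument when running the model through a Replicate client. Fields left out of the dict, such as `include_csv` and `system_prompt`, keep the defaults shown in the table.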

This is the shape of the response you’ll get when you run this model with an API.

Schema

{"format": "uri", "title": "Output", "type": "string"}
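
The schema says the output is a single string containing a URI. A minimal sketch of handling such a value follows, assuming the URI is an ordinary `http` or `https` link to a downloadable file. This page does not say what the file contains, so the helper `local_name_for` and the example URI are hypothetical.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse


def local_name_for(output_uri, fallback="output.bin"):
    """Check that the output looks like a URI and derive a local file name."""
    parsed = urlparse(output_uri)
    # Assumption: a usable output has both a scheme and a host.
    if not parsed.scheme or not parsed.netloc:
        raise ValueError(f"not a usable URI: {output_uri!r}")
    # Use the last path segment as the file name, if there is one.
    name = PurePosixPath(parsed.path).name
    return name or fallback


example_output = "https://example.com/files/captions.zip"  # hypothetical URI
print(local_name_for(example_output))
```

The returned name can serve as the destination path when downloading the file, for example with `urllib.request.urlretrieve`.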