These models provide utility functions for working with media like images, audio, and video. They serve as convenient building blocks for media processing pipelines and workflows.
Some highlights:
Featured models


fictions-ai/autocaption
Automatically add captions to a video
Updated 1 year, 10 months ago
60.7K runs


charlesmccarthy/addwatermark
Add a watermark to your videos using the power of Replicate brought to you from your friends at FullJourney.AI
Updated 1 year, 10 months ago
1.1M runs


falcons-ai/nsfw_image_detection
Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification
Updated 1 year, 11 months ago
65.2M runs
Recommended Models
If you need quick, low-overhead processing, such as extracting frames or audio, lucataco/frame-extractor and lucataco/extract-audio are among the faster options. These utilities perform simple transformations, so they typically run faster than more complex generation models.
Keep in mind that performance still depends on input file size and format.
For more advanced workflows, models like fictions-ai/autocaption, charlesmccarthy/addwatermark, and falcons-ai/nsfw_image_detection add extra functionality such as captioning, watermarking, or filtering content.
If your workflow involves bulk processing or automation, combining lightweight extractors with these more feature-rich utilities can give you a solid balance between speed and capability.
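As a rough illustration of combining a lightweight extractor with a more feature-rich utility, the sketch below chains lucataco/frame-extractor into falcons-ai/nsfw_image_detection. The function name `extract_then_classify`, the `Runner` type, and the input keys `"video"` and `"image"` are assumptions made for this example; check each model's page for its actual input schema.

```python
from typing import Any, Callable, Dict

# A runner takes a model reference and an input dict and returns the model output.
# Passing it in keeps the pipeline independent of any particular client library.
Runner = Callable[[str, Dict[str, Any]], Any]


def extract_then_classify(video_url: str, run: Runner) -> Dict[str, Any]:
    """Grab one frame with a lightweight extractor, then pass it to a classifier."""
    # The input keys "video" and "image" are assumptions, not confirmed schemas.
    frame = run("lucataco/frame-extractor", {"video": video_url})
    verdict = run("falcons-ai/nsfw_image_detection", {"image": frame})
    return {"frame": frame, "verdict": verdict}
```

With the Replicate Python client installed, `run` could be something like `lambda ref, inputs: replicate.run(ref, input=inputs)`; community models may need a version suffix on the model reference. For bulk jobs, the same function can be called in a loop over a list of video URLs.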
For low-level media manipulation:
Utility models usually return:
You can package your own processing script or pipeline with Cog and publish it to Replicate under the Media Utilities collection.
Clearly define your input/output types (e.g., video → frames), set versioning, and configure sharing or pricing if needed.
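One way to pin down a video → frames contract before wrapping it in a Cog predictor is a small typed function like the sketch below. The `FrameExtractionInput` class, the `build_ffmpeg_command` helper, and the choice of ffmpeg are illustrative assumptions, not something the collection prescribes.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import List


@dataclass(frozen=True)
class FrameExtractionInput:
    """Hypothetical input contract: one video file plus a sampling rate."""
    video: Path
    fps: float = 1.0  # frames to keep per second of video


def build_ffmpeg_command(inp: FrameExtractionInput, out_dir: Path) -> List[str]:
    """Return the ffmpeg argument list that writes numbered PNG frames to out_dir."""
    if inp.fps <= 0:
        raise ValueError("fps must be positive")
    return [
        "ffmpeg",
        "-i", str(inp.video),                # input video
        "-vf", f"fps={inp.fps}",             # sample frames at the requested rate
        str(out_dir / "frame_%05d.png"),     # numbered output pattern
    ]
```

In a packaged model, the predictor's `predict` method would accept the video as its declared input, run a command like this one, and return the resulting frame files as its declared output.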
Many models in the Media Utilities collection support commercial use, but licenses vary. Check each model’s card for attribution requirements or restrictions before using them in production workflows.
Recommended Models

nicolascoutureau/video-utils
Updated 2 weeks, 3 days ago
5.2M runs


fofr/color-matcher
Color match and white balance fixes for images
Updated 2 months, 2 weeks ago
103.5K runs


lucataco/extract-audio
Simple tool to extract audio from a video file
Updated 2 months, 3 weeks ago
787 runs


lucataco/video-merge
Simple tool to merge together separate video snippets
Updated 3 months, 3 weeks ago
8.9K runs


lucataco/frame-extractor
Extract the first or last frame from any video file as a high-quality image
Updated 7 months, 1 week ago
642.7K runs


lucataco/merge-img
Simple tool to merge a foreground and background image
Updated 9 months, 3 weeks ago
2.9K runs


lucataco/video-split
Simple tool to split apart a video into snippets
Updated 10 months, 2 weeks ago
144 runs


midllle/material-maker
AI generated Normal maps, Displacement maps, and Roughness maps
Updated 1 year, 7 months ago
203 runs


lucataco/mvsep-mdx23-music-separation
Model for Sound demixing challenge 2023: Music Demixing Track - MDX'23
Updated 1 year, 7 months ago
21.8K runs


lucataco/depth-anything-video
Depth Anything on full video files
Updated 1 year, 8 months ago
564 runs


lucataco/img-and-audio2video
Take an image and an audio file and create a video clip
Updated 1 year, 9 months ago
11.7K runs


fofr/toolkit
Video toolkit – convert, make GIFs, extract audio
Updated 1 year, 9 months ago
10.8K runs


jd7h/edit-video-by-editing-text
A pipeline for superfast video editing! Make cuts to a video by editing its transcript.
Updated 1 year, 10 months ago
814 runs


chigozienri/image-urls-to-video
Take a list of image URLs as frames and output a video
Updated 1 year, 11 months ago
1.2K runs


fofr/controlnet-preprocessors
Canny, soft edge, depth, lineart, segmentation, pose, etc
Updated 1 year, 11 months ago
42.5K runs


fofr/audio-to-waveform
Create a waveform video from audio
Updated 2 years, 4 months ago
383.7K runs


fofr/video-to-frames
Split a video into frames
Updated 2 years, 4 months ago
23.3K runs


fofr/frames-to-video
Convert a set of frames to a video
Updated 2 years, 4 months ago
1.6K runs