These models provide utility functions for working with media like images, audio, and video. They serve as convenient building blocks for media processing pipelines and workflows.
Some highllights:
Featured models

fictions-ai/autocaptionAutomatically add captions to a video
Updated 2 years, 1 month ago
82.3K runs

charlesmccarthy/addwatermarkAdd a watermark to your videos using the power of Replicate brought to you from your friends at FullJourney.AI
Updated 2 years, 1 month ago
1.4M runs

falcons-ai/nsfw_image_detectionFine-Tuned Vision Transformer (ViT) for NSFW Image Classification
Updated 2 years, 2 months ago
74.9M runs
Recommended Models
If you need quick, low-overhead processing—like extracting frames or audio—models such as lucataco/frame-extractor and lucataco/extract-audio are some of the speedier options. These utilities focus on simple transformations, so they typically run faster than more complex generation models.
Keep in mind that performance still depends on input file size and format.
For more advanced workflows, models like fictions-ai/autocaption, charlesmccarthy/addwatermark, and falcons-ai/nsfw_image_detection add extra functionality such as captioning, watermarking, or filtering content.
If your workflow involves bulk processing or automation, combining lightweight extractors with these more feature-rich utilities can give you a solid balance between speed and capability.
For low-level media manipulation:
Utility models usually return:
You can package your own processing script or pipeline with Cog and publish it to Replicate under the Media Utilities collection.
Clearly define your input/output types (e.g., video → frames), set versioning, and configure sharing or pricing if needed.
Many models in the Media Utilities collection support commercial use, but licenses vary. Check each model’s card for attribution requirements or restrictions before using them in production workflows.
Recommended Models
nicolascoutureau/video-utilsUpdated 1 month, 3 weeks ago
10.6M runs

fofr/color-matcherColor match and white balance fixes for images
Updated 5 months, 2 weeks ago
206.9K runs

lucataco/extract-audioSimple tool to extract audio from a video file
Updated 5 months, 3 weeks ago
4.2K runs

lucataco/video-mergeSimple tool to merge together separate video snippets
Updated 6 months, 3 weeks ago
19.7K runs

lucataco/frame-extractorExtract the first or last frame from any video file as a high-quality image
Updated 10 months, 1 week ago
847.2K runs

lucataco/merge-imgSimple tool to merge a foreground and background image
Updated 1 year ago
3K runs

lucataco/video-splitSimple tool to split apart a video into snippets
Updated 1 year, 1 month ago
162 runs

midllle/material-makerAI generated Normal maps, Displacement maps, and Roughness maps
Updated 1 year, 10 months ago
218 runs

lucataco/mvsep-mdx23-music-separationModel for Sound demixing challenge 2023: Music Demixing Track - MDX'23
Updated 1 year, 10 months ago
23.1K runs

lucataco/depth-anything-videoDepth Anything on full video files
Updated 1 year, 11 months ago
634 runs

lucataco/img-and-audio2videoTake an image and an audio file and create a video clip
Updated 2 years ago
14K runs

fofr/toolkitVideo toolkit – convert, make GIFs, extract audio
Updated 2 years ago
22.7K runs

jd7h/edit-video-by-editing-textA pipeline for superfast video editing! Make cuts to a video by editing its transcript.
Updated 2 years, 1 month ago
820 runs

chigozienri/image-urls-to-videoTake a list of image URLs as frames and output a video
Updated 2 years, 2 months ago
1.2K runs

fofr/controlnet-preprocessorsCanny, soft edge, depth, lineart, segmentation, pose, etc
Updated 2 years, 2 months ago
43.1K runs

fofr/audio-to-waveformCreate a waveform video from audio
Updated 2 years, 7 months ago
384.1K runs

fofr/video-to-framesSplit a video into frames
Updated 2 years, 7 months ago
25.8K runs

fofr/frames-to-videoConvert a set of frames to a video
Updated 2 years, 7 months ago
1.7K runs