These models provide utility functions for working with media like images, audio, and video. They serve as convenient building blocks for media processing pipelines and workflows.
Some highlights:
Featured models

fictions-ai/autocaption: Automatically add captions to a video
Updated 1 year, 11 months ago
68.3K runs

charlesmccarthy/addwatermark: Add a watermark to your videos using the power of Replicate brought to you from your friends at FullJourney.AI
Updated 2 years ago
1.3M runs

falcons-ai/nsfw_image_detection: Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification
Updated 2 years ago
67.8M runs
Recommended Models
If you need quick, low-overhead processing, such as extracting frames or audio, lucataco/frame-extractor and lucataco/extract-audio are among the faster options. These utilities perform simple transformations, so they typically run faster than more complex generation models.
Keep in mind that performance still depends on input file size and format.
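As a rough illustration, an extractor can be called through the Replicate Python client's `replicate.run`. The sketch below assumes the model takes a `video` input; that field name is a guess, so check the model's API schema before relying on it. Community models may also need a pinned version in the form `owner/name:version`. The `run` parameter is a hypothetical hook that lets the function be exercised without network access.

```python
from typing import Any, Callable, Optional


def extract_frame(
    video_url: str,
    model: str = "lucataco/frame-extractor",
    run: Optional[Callable[..., Any]] = None,
) -> Any:
    """Call a frame-extractor model on a video URL and return its raw output."""
    if run is None:
        # Imported lazily; needs `pip install replicate` and REPLICATE_API_TOKEN set.
        import replicate

        run = replicate.run
    # "video" is an assumed input name, not confirmed against the model's schema.
    return run(model, input={"video": video_url})
```

Calling `extract_frame("https://example.com/clip.mp4")` would then return whatever the model emits, typically a reference to the extracted image.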
For more advanced workflows, models like fictions-ai/autocaption, charlesmccarthy/addwatermark, and falcons-ai/nsfw_image_detection add extra functionality such as captioning, watermarking, or filtering content.
If your workflow involves bulk processing or automation, combining lightweight extractors with these more feature-rich utilities can give you a solid balance between speed and capability.
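One way to combine a lightweight extractor with a heavier utility in a batch job is sketched below: each video has one frame extracted, and that frame is then passed to the NSFW classifier. The input names (`video`, `image`) and the assumption that the classifier returns a plain label string are illustrative guesses, not taken from the models' documentation. `run` is expected to have the same call shape as `replicate.run`.

```python
from typing import Any, Callable, Dict, Iterable, List

Runner = Callable[..., Any]  # same call shape as replicate.run(ref, input={...})


def screen_videos(video_urls: Iterable[str], run: Runner) -> Dict[str, List[str]]:
    """Extract one frame per video, classify it, and group video URLs by label."""
    results: Dict[str, List[str]] = {}
    for url in video_urls:
        # Step 1: lightweight extractor (assumed input name: "video").
        frame = run("lucataco/frame-extractor", input={"video": url})
        # Step 2: classifier on the extracted frame (assumed input name: "image").
        label = run("falcons-ai/nsfw_image_detection", input={"image": frame})
        # The classifier output is assumed to be a plain label string.
        results.setdefault(str(label), []).append(url)
    return results
```

With the real client this would be invoked as `screen_videos(urls, run=replicate.run)`. Running the cheap extraction first keeps the more expensive classifier working on a single image per video rather than the full file.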
For low-level media manipulation:
Utility models usually return:
You can package your own processing script or pipeline with Cog and publish it to Replicate under the Media Utilities collection.
Clearly define your input/output types (e.g., video → frames), set versioning, and configure sharing or pricing if needed.
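A minimal sketch of what a video → frames predictor might look like with Cog's Python interface (`BasePredictor`, `Input`, `Path`). It assumes `ffmpeg` is available in the container; the helper name `build_ffmpeg_command`, the `fps` parameter, and the output file pattern are illustrative choices, not part of any existing model. The import guard only exists so the helper can be tried without Cog installed.

```python
import subprocess
import tempfile
from pathlib import Path as LocalPath
from typing import List


def build_ffmpeg_command(video: str, out_dir: str, fps: int) -> List[str]:
    """Return an ffmpeg command that writes `fps` frames per second as PNG files."""
    pattern = str(LocalPath(out_dir) / "frame_%05d.png")
    return ["ffmpeg", "-i", video, "-vf", f"fps={fps}", pattern]


try:
    from cog import BasePredictor, Input, Path
except ImportError:
    BasePredictor = None  # Cog not installed; only the helper above is usable

if BasePredictor is not None:

    class Predictor(BasePredictor):
        def predict(
            self,
            video: Path = Input(description="Input video file"),
            fps: int = Input(description="Frames to extract per second", default=1, ge=1),
        ) -> List[Path]:
            """video → frames: run ffmpeg and return the extracted images."""
            out_dir = tempfile.mkdtemp()
            subprocess.run(build_ffmpeg_command(str(video), out_dir, fps), check=True)
            return [Path(p) for p in sorted(LocalPath(out_dir).glob("frame_*.png"))]
```

Typed inputs and a typed return value are what let Cog generate the model's input/output schema, which is why the signature is spelled out explicitly.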
Many models in the Media Utilities collection support commercial use, but licenses vary. Check each model’s card for attribution requirements or restrictions before using them in production workflows.
Recommended Models
nicolascoutureau/video-utils
Updated 1 week, 1 day ago
7.5M runs

fofr/color-matcher: Color match and white balance fixes for images
Updated 4 months ago
155.4K runs

lucataco/extract-audio: Simple tool to extract audio from a video file
Updated 4 months ago
3.3K runs

lucataco/video-merge: Simple tool to merge together separate video snippets
Updated 5 months ago
13.7K runs

lucataco/frame-extractor: Extract the first or last frame from any video file as a high-quality image
Updated 8 months, 3 weeks ago
741.5K runs

lucataco/merge-img: Simple tool to merge a foreground and background image
Updated 11 months, 1 week ago
3K runs

lucataco/video-split: Simple tool to split apart a video into snippets
Updated 11 months, 4 weeks ago
150 runs

midllle/material-maker: AI generated Normal maps, Displacement maps, and Roughness maps
Updated 1 year, 8 months ago
210 runs

lucataco/mvsep-mdx23-music-separation: Model for Sound demixing challenge 2023: Music Demixing Track - MDX'23
Updated 1 year, 8 months ago
22.4K runs

lucataco/depth-anything-video: Depth Anything on full video files
Updated 1 year, 10 months ago
606 runs

lucataco/img-and-audio2video: Take an image and an audio file and create a video clip
Updated 1 year, 10 months ago
12.7K runs

fofr/toolkit: Video toolkit – convert, make GIFs, extract audio
Updated 1 year, 11 months ago
13K runs

jd7h/edit-video-by-editing-text: A pipeline for superfast video editing! Make cuts to a video by editing its transcript.
Updated 2 years ago
818 runs

chigozienri/image-urls-to-video: Take a list of image URLs as frames and output a video
Updated 2 years ago
1.2K runs

fofr/controlnet-preprocessors: Canny, soft edge, depth, lineart, segmentation, pose, etc
Updated 2 years, 1 month ago
42.8K runs

fofr/audio-to-waveform: Create a waveform video from audio
Updated 2 years, 5 months ago
383.9K runs

fofr/video-to-frames: Split a video into frames
Updated 2 years, 6 months ago
24.4K runs

fofr/frames-to-video: Convert a set of frames to a video
Updated 2 years, 6 months ago
1.7K runs